acid sequence diversity: Topics by Science.gov

Sample records for acid sequence diversity

Trinucleotide cassettes increase diversity of T7 phage-displayed peptide library.

PubMed

Krumpe, Lauren R H; Schumacher, Kathryn M; McMahon, James B; Makowski, Lee; Mori, Toshiyuki

2007-10-05

Amino acid sequence diversity is introduced into a phage-displayed peptide library by randomizing library oligonucleotide DNA. We recently evaluated the diversity of peptide libraries displayed on T7 lytic phage and M13 filamentous phage and showed that T7 phage can display a more diverse amino acid sequence repertoire due to differing processes of viral morphogenesis. In this study, we evaluated and compared the diversity of a 12-mer T7 phage-displayed peptide library randomized using codon-corrected trinucleotide cassettes with a T7 and an M13 12-mer phage-displayed peptide library constructed using the degenerate codon randomization method. We herein demonstrate that the combination of trinucleotide cassette amino acid codon randomization and T7 phage display construction methods resulted in a significant enhancement to the functional diversity of a 12-mer peptide library. This novel library exhibited superior amino acid uniformity and order-of-magnitude increases in amino acid sequence diversity as compared to degenerate codon randomized peptide libraries. Comparative analyses of the biophysical characteristics of the 12-mer peptide libraries revealed the trinucleotide cassette-randomized library to be a unique resource. The combination of T7 phage display and trinucleotide cassette randomization resulted in a novel resource for the potential isolation of binding peptides for new and previously studied molecular targets.
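The uniformity advantage of trinucleotide cassettes over degenerate codons can be illustrated with a short calculation. This is a minimal sketch, not the authors' analysis: it assumes the degenerate scheme is NNK (N = any base, K = G or T), which the abstract does not specify, and it idealizes the cassette mix as one codon per amino acid in equal proportion.

```python
from collections import Counter
from itertools import product

BASES = "TCAG"
# Standard genetic code, with codons enumerated in TCAG order at each position.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)}

# Assumption: the degenerate randomization scheme is NNK (K = G or T).
nnk_codons = [a + b + k for a in BASES for b in BASES for k in "GT"]
nnk_counts = Counter(CODON_TABLE[c] for c in nnk_codons)

# Chance that a randomized 12-mer contains no stop codon under NNK.
stop_fraction = nnk_counts["*"] / len(nnk_codons)
p_full_length_12mer = (1 - stop_fraction) ** 12

# Idealized trinucleotide cassette: 20 codons, one per amino acid, no stops.
cassette_freq = 1 / 20

for aa in sorted(nnk_counts):
    print(aa, "NNK:", round(nnk_counts[aa] / len(nnk_codons), 4),
          "cassette:", 0.0 if aa == "*" else cassette_freq)
print("P(no stop codon in a 12-mer, NNK):", p_full_length_12mer)
```

Under NNK, leucine, serine and arginine are each encoded by three codons while most residues have one, so the per-position amino acid distribution is skewed before any biological selection acts on the library.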
New Insight Into the Diversity of SemiSWEET Sugar Transporters and the Homologs in Prokaryotes

PubMed Central

Jia, Baolei; Hao, Lujiang; Xuan, Yuan Hu; Jeon, Che Ok

2018-01-01

Sugars will eventually be exported transporters (SWEETs) and SemiSWEETs represent a family of sugar transporters in eukaryotes and prokaryotes, respectively. SWEETs contain seven transmembrane helices (TMHs), while SemiSWEETs contain three. The functions of SemiSWEETs are less studied. In this perspective article, we analyzed the diversity and conservation of SemiSWEETs and further proposed the possible functions. 1,922 SemiSWEET homologs were retrieved from the UniProt database, which is not proportional to the sequenced prokaryotic genomes. However, these proteins are very diverse in sequences and can be classified into 19 clusters when >50% sequence identity is required. Moreover, a gene context analysis indicated that several SemiSWEETs are located in the operons that are related to diverse carbohydrate metabolism. Several proteins with seven TMHs can be found in bacteria, and sequence alignment suggested that these proteins in bacteria may be formed by the duplication and fusion. Multiple sequence alignments showed that the amino acids for sugar translocation are still conserved and coevolved, although the sequences show diversity. Among them, the functions of a few amino acids are still not clear. These findings highlight the challenges that exist in SemiSWEETs and provide future researchers the foundation to explore these uncharted areas. PMID:29872447
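Grouping homologs by a sequence identity threshold can be sketched as follows. The abstract does not name the clustering tool used, so this is only an illustration under stated assumptions: sequences are taken to be pre-aligned to equal length, identity is the fraction of matching positions, and `greedy_cluster` is a hypothetical helper that assigns each sequence to the first cluster whose representative it matches above the threshold.

```python
def identity(a: str, b: str) -> float:
    """Fraction of identical positions between two equal-length aligned sequences."""
    if len(a) != len(b):
        raise ValueError("sequences must be aligned to the same length")
    return sum(x == y for x, y in zip(a, b)) / len(a)


def greedy_cluster(seqs, threshold=0.5):
    """Greedy clustering: the first member of each cluster is its representative."""
    clusters = []
    for s in seqs:
        for c in clusters:
            if identity(s, c[0]) > threshold:
                c.append(s)
                break
        else:
            # No existing representative is similar enough: start a new cluster.
            clusters.append([s])
    return clusters


# Toy sequences (not SemiSWEETs), used only to exercise the functions.
toy = ["MKTLLVAG", "MKTLLVSG", "MRTLIVAG", "QWERTYIP", "QWERTYLP"]
clusters = greedy_cluster(toy)
for i, c in enumerate(clusters, 1):
    print("cluster", i, c)
```

Greedy assignment depends on input order, so a real analysis would normally sort sequences (for example by length) and use a dedicated clustering program.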
Sequence diversity within the reovirus S2 gene: reovirus genes reassort in nature, and their termini are predicted to form a panhandle motif.

PubMed Central

Chapell, J D; Goral, M I; Rodgers, S E; dePamphilis, C W; Dermody, T S

1994-01-01

To better understand genetic diversity within mammalian reoviruses, we determined S2 nucleotide and deduced sigma 2 amino acid sequences of nine reovirus strains and compared these sequences with those of prototype strains of the three reovirus serotypes. The S2 gene and sigma 2 protein are highly conserved among the four type 1, one type 2, and seven type 3 strains studied. Phylogenetic analyses based on S2 nucleotide sequences of the 12 reovirus strains indicate that diversity within the S2 gene is independent of viral serotype. Additionally, we found marked topological differences between phylogenetic trees generated from S1 and S2 gene nucleotide sequences of the seven type 3 strains. These results demonstrate that reovirus S1 and S2 genes have distinct evolutionary histories, thus providing phylogenetic evidence for lateral transfer of reovirus genes in nature. When variability among the 12 sigma 2-encoding S2 nucleotide sequences was analyzed at synonymous positions, we found that approximately 60 nucleotides at the 5' terminus and 30 nucleotides at the 3' terminus were markedly conserved in comparison with other sigma 2-encoding regions of S2. Predictions of RNA secondary structures indicate that the more conserved S2 sequences participate in the formation of an extended region of duplex RNA interrupted by a pair of stem-loops. Among the 12 deduced sigma 2 amino acid sequences examined, substitutions were observed at only 11% of amino acid positions. This finding suggests that constraints on the structure or function of sigma 2, perhaps in part because of its location in the virion core, have limited sequence diversity within this protein. PMID:8289378
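A figure such as "substitutions at only 11% of amino acid positions" is the fraction of alignment columns containing more than one residue. The sketch below shows that calculation on a toy alignment; the sequences are invented for illustration and are not sigma 2 sequences.

```python
def variable_fraction(alignment):
    """Fraction of alignment columns at which at least two sequences differ."""
    length = len(alignment[0])
    if any(len(s) != length for s in alignment):
        raise ValueError("all sequences must have the same aligned length")
    variable = sum(len(set(column)) > 1 for column in zip(*alignment))
    return variable / length


# Toy alignment of four sequences, five positions each.
toy_alignment = ["MSKLA", "MSKLA", "MTKLA", "MSKIA"]
fraction = variable_fraction(toy_alignment)
print(fraction)
```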
Crimean-Congo Hemorrhagic Fever

DTIC Science & Technology

2004-01-01

aminocaproic acid were also indicated. Much emphasis was also placed on preventing reinfection, including the necessity of removing blood crusts from...The sequence is approximately 60% identical both at the nucleotide and amino acid levels to the L segment of Dugbe virus, the only other Nairovirus...However, more recent data based on nucleic acid sequence analysis have revealed extensive genetic diversity. The first published CCHFV sequence
The Diversity Present in 5140 Human Mitochondrial Genomes

PubMed Central

Pereira, Luísa; Freitas, Fernando; Fernandes, Verónica; Pereira, Joana B.; Costa, Marta D.; Costa, Stephanie; Máximo, Valdemar; Macaulay, Vincent; Rocha, Ricardo; Samuels, David C.

2009-01-01

We analyzed the current status (as of the end of August 2008) of human mitochondrial genomes deposited in GenBank, amounting to 5140 complete or coding-region sequences, in order to present an overall picture of the diversity present in the mitochondrial DNA of the global human population. To perform this task, we developed mtDNA-GeneSyn, a computer tool that identifies and exhaustively classifies the diversity present in large genetic data sets. The diversity observed in the 5140 human mitochondrial genomes was compared with all possible transitions and transversions from the standard human mitochondrial reference genome. This comparison showed that tRNA and rRNA secondary structures have a large effect in limiting the diversity of the human mitochondrial sequences, whereas for the protein-coding genes there is a bias toward less variation at the second codon positions. The analysis of the observed amino acid variations showed a tolerance of variations that convert between the amino acids V, I, A, M, and T. This defines a group of amino acids with similar chemical properties that can interconvert by a single transition. PMID:19426953
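The comparison against "all possible transitions and transversions" rests on a simple classification of single-base changes. The sketch below is not mtDNA-GeneSyn; it is a minimal illustration with hypothetical helper names that enumerates every possible substitution from a reference and labels each one.

```python
PURINES = {"A", "G"}
PYRIMIDINES = {"C", "T"}


def classify(ref: str, alt: str) -> str:
    """Label a single-base change as a transition or a transversion."""
    if ref == alt:
        return "identical"
    if {ref, alt} <= PURINES or {ref, alt} <= PYRIMIDINES:
        return "transition"      # purine<->purine or pyrimidine<->pyrimidine
    return "transversion"        # purine<->pyrimidine


def all_single_substitutions(reference: str):
    """Yield (position, ref, alt, class) for every possible single-base change."""
    for pos, ref in enumerate(reference):
        for alt in "ACGT":
            if alt != ref:
                yield pos, ref, alt, classify(ref, alt)


possible = list(all_single_substitutions("ACGT"))
n_transitions = sum(kind == "transition" for *_, kind in possible)
n_transversions = sum(kind == "transversion" for *_, kind in possible)
print(n_transitions, n_transversions)

# Example from the amino acid group in the abstract: ATT (Ile) -> GTT (Val)
# differs by an A-to-G change at the first codon position.
print(classify("A", "G"))
```

Each site has one possible transition and two possible transversions, which is the baseline against which observed variation can be compared.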
High-Throughput rRNA Gene Sequencing Reveals High and Complex Bacterial Diversity Associated with Brazilian Coffee Bean Fermentation

PubMed Central

Vinícius de Melo, Gilberto

2018-01-01

Summary: Coffee bean fermentation is a spontaneous, on-farm process involving the action of different microbial groups, including bacteria and fungi. In this study, a high-throughput sequencing approach was employed to study the diversity and dynamics of bacteria associated with Brazilian coffee bean fermentation. The total DNA from fermenting coffee samples was extracted at different time points, and the 16S rRNA gene with segments around the V4 variable region was sequenced on an Illumina high-throughput platform. Using this approach, the presence of over eighty bacterial genera was determined, many of which have been detected for the first time during coffee bean fermentation, including Fructobacillus, Pseudonocardia, Pedobacter, Sphingomonas and Hymenobacter. The presence of Fructobacillus suggests an influence of these bacteria on fructose metabolism during coffee fermentation. Temporal analysis showed a strong dominance of lactic acid bacteria with over 97% of read sequences at the end of fermentation, mainly represented by Leuconostoc and Lactococcus. Metabolism of lactic acid bacteria was associated with the high formation of lactic acid during fermentation, as determined by HPLC analysis. The results reported in this study confirm the underestimation of bacterial diversity associated with coffee fermentation. New microbial groups reported in this study may be explored as functional starter cultures for on-farm coffee processing.
A frequency-based linguistic approach to protein decoding and design: Simple concepts, diverse applications, and the SCS Package

PubMed Central

Motomura, Kenta; Nakamura, Morikazu; Otaki, Joji M.

2013-01-01

Protein structure and function information is coded in amino acid sequences. However, the relationship between primary sequences and three-dimensional structures and functions remains enigmatic. Our approach to this fundamental biochemistry problem is based on the frequencies of short constituent sequences (SCSs) or words. A protein amino acid sequence is considered analogous to an English sentence, where SCSs are equivalent to words. Availability scores, which are defined as real SCS frequencies in the non-redundant amino acid database relative to their probabilistically expected frequencies, demonstrate the biological usage bias of SCSs. As a result, this frequency-based linguistic approach is expected to have diverse applications, such as secondary structure specifications by structure-specific SCSs and immunological adjuvants with rare or non-existent SCSs. Linguistic similarities (e.g., wide ranges of scale-free distributions) and dissimilarities (e.g., behaviors of low-rank samples) between proteins and the natural English language have been revealed in the rank-frequency relationships of SCSs or words. We have developed a web server, the SCS Package, which contains five applications for analyzing protein sequences based on the linguistic concept. These tools have the potential to assist researchers in deciphering structurally and functionally important protein sites, species-specific sequences, and functional relationships between SCSs. The SCS Package also provides researchers with a tool to construct amino acid sequences de novo based on the idiomatic usage of SCSs. PMID:24688703
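The availability score described above, observed word frequency relative to expected frequency, can be sketched in a few lines. This is an illustration only, not the SCS Package: it assumes the expected frequency of a word is the product of its single-residue frequencies, and the exact definition used by the authors may differ. The function name and toy sequences are hypothetical.

```python
from collections import Counter
from math import prod


def availability_scores(sequences, k=2):
    """Observed k-mer frequency divided by the frequency expected from residue composition."""
    residues = Counter("".join(sequences))
    total_residues = sum(residues.values())
    # Count overlapping k-mers within each sequence (never across sequence boundaries).
    words = Counter(s[i:i + k] for s in sequences for i in range(len(s) - k + 1))
    total_words = sum(words.values())
    scores = {}
    for word, count in words.items():
        observed = count / total_words
        # Assumption: residues occur independently, so expected = product of frequencies.
        expected = prod(residues[r] / total_residues for r in word)
        scores[word] = observed / expected
    return scores


# Toy two-letter alphabet, used only to make the arithmetic easy to follow.
scores = availability_scores(["AAAB", "ABAB"], k=2)
for word, score in sorted(scores.items()):
    print(word, round(score, 3))
```

A score above 1 marks a word used more often than composition alone predicts; a word that never occurs simply has no entry, which corresponds to the "non-existent SCSs" mentioned in the abstract.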
Ultra-deep sequencing reveals high prevalence and broad structural diversity of hepatitis B surface antigen mutations in a global population

PubMed Central

Gencay, Mikael; Hübner, Kirsten; Gohl, Peter; Seffner, Anja; Weizenegger, Michael; Neofytos, Dionysios; Batrla, Richard; Woeste, Andreas; Kim, Hyon-suk; Westergaard, Gaston; Reinsch, Christine; Brill, Eva; Thu Thuy, Pham Thi; Hoang, Bui Huu; Sonderup, Mark; Spearman, C. Wendy; Pabinger, Stephan; Gautier, Jérémie; Brancaccio, Giuseppina; Fasano, Massimo; Santantonio, Teresa; Gaeta, Giovanni B.; Nauck, Markus; Kaminski, Wolfgang E.

2017-01-01

The diversity of the hepatitis B surface antigen (HBsAg) has a significant impact on the performance of diagnostic screening tests and the clinical outcome of hepatitis B infection. Neutralizing or diagnostic antibodies against the HBsAg are directed towards its highly conserved major hydrophilic region (MHR), in particular towards its “a” determinant subdomain. Here, we explored, on a global scale, the genetic diversity of the HBsAg MHR in a large, multi-ethnic cohort of randomly selected subjects with HBV infection from four continents. A total of 1553 HBsAg positive blood samples of subjects originating from 20 different countries across Africa, America, Asia and central Europe were characterized for amino acid variation in the MHR. Using highly sensitive ultra-deep sequencing, we found 72.8% of the successfully sequenced subjects (n = 1391) demonstrated amino acid sequence variation in the HBsAg MHR. This indicates that the global variation frequency in the HBsAg MHR is threefold higher than previously reported. The majority of the amino acid mutations were found in the HBV genotypes B (28.9%) and C (25.4%). Collectively, we identified 345 distinct amino acid mutations in the MHR. Among these, we report 62 previously unknown mutations, which extends the worldwide pool of currently known HBsAg MHR mutations by 22%. Importantly, topological analysis identified the “a” determinant upstream flanking region as the structurally most diverse subdomain of the HBsAg MHR. The highest prevalence of “a” determinant region mutations was observed in subjects from Asia, followed by the African, American and European cohorts, respectively. Finally, we found that more than half (59.3%) of all HBV subjects investigated carried multiple MHR mutations. Together, this worldwide ultra-deep sequencing based genotyping study reveals that the global prevalence and structural complexity of variation in the hepatitis B surface antigen have, to date, been significantly underappreciated. PMID:28472040
High levels of MHC class II allelic diversity in lake trout from Lake Superior

USGS Publications Warehouse

Dorschner, M.O.; Duris, T.; Bronte, C.R.; Burnham-Curtis, M. K.; Phillips, R.B.

2000-01-01

Sequence variation in a 216 bp portion of the major histocompatibility complex (MHC) II B1 domain was examined in 74 individual lake trout (Salvelinus namaycush) from different locations in Lake Superior. Forty-three alleles were obtained which encoded 71-72 amino acids of the mature protein. These sequences were compared with previous data obtained from five Pacific salmon species and Atlantic salmon using the same primers. Although all of the lake trout alleles clustered together in the neighbor-joining analysis of amino acid sequences, one amino acid allelic lineage was shared with Atlantic salmon (Salmo salar), a species in another genus which probably diverged from Salvelinus more than 10-20 million years ago. As shown previously in other salmonids, the level of nonsynonymous nucleotide substitution (d(N)) exceeded the level of synonymous substitution (d(S)). The level of nucleotide diversity at the MHC class II B1 locus was considerably higher in lake trout than in the Pacific salmon (genus Oncorhynchus). These results are consistent with the hypothesis that lake trout colonized Lake Superior from more than one refuge following the Wisconsin glaciation. Recent population bottlenecks may have reduced nucleotide diversity in Pacific salmon populations.
DNA tetrominoes: the construction of DNA nanostructures using self-organised heterogeneous deoxyribonucleic acids shapes.

PubMed

Ong, Hui San; Rahim, Mohd Syafiq; Firdaus-Raih, Mohd; Ramlan, Effirul Ikhwan

2015-01-01

The unique programmability of nucleic acids offers an alternative route to constructing excitable and functional nanostructures. This work introduces an autonomous protocol to construct DNA Tetris shapes (L-Shape, B-Shape, T-Shape and I-Shape) using modular DNA blocks. The protocol exploits the rich number of sequence combinations available from the nucleic acid alphabets, thus allowing for diversity to be applied in designing various DNA nanostructures. Instead of a deterministic set of sequences corresponding to a particular design, the protocol promotes a large pool of DNA shapes that can assemble to conform to any desired structures. By utilising evolutionary programming in the design stage, DNA blocks are subjected to processes such as sequence insertion, deletion and base shifting in order to enrich the diversity of the resulting shapes based on a set of cascading filters. The optimisation algorithm allows mutation to be exerted indefinitely on the candidate sequences until these sequences comply with all four fitness criteria. Generated candidates from the protocol are in agreement with the filter cascades and thermodynamic simulation. Further validation using gel electrophoresis indicated the formation of the designed shapes, thus supporting the plausibility of constructing DNA nanostructures in a more hierarchical, modular, and interchangeable manner.
Sequences Of Amino Acids For Human Serum Albumin

NASA Technical Reports Server (NTRS)

Carter, Daniel C.

1992-01-01

Sequences of amino acids defined for use in making polypeptides one-third to one-sixth as large as parent human serum albumin molecule. Smaller, chemically stable peptides have diverse applications including service as artificial human serum and as active components of biosensors and chromatographic matrices. In applications involving production of artificial sera from new sequences, little or no concern about viral contaminants. Smaller genetically engineered polypeptides more easily expressed and produced in large quantities, making commercial isolation and production more feasible and profitable.
Genetic diversity of the merozoite surface protein-3 gene in Plasmodium falciparum populations in Thailand.

PubMed

Pattaradilokrat, Sittiporn; Sawaswong, Vorthon; Simpalipan, Phumin; Kaewthamasorn, Morakot; Siripoon, Napaporn; Harnyuttanakorn, Pongchai

2016-10-21

An effective malaria vaccine is an urgently needed tool to fight against human malaria, the most deadly parasitic disease of humans. One promising candidate is the merozoite surface protein-3 (MSP-3) of Plasmodium falciparum. This antigenic protein, encoded by the merozoite surface protein (msp-3) gene, is polymorphic and classified according to size into the two allelic types of K1 and 3D7. A recent study revealed that both the K1 and 3D7 alleles co-circulated within P. falciparum populations in Thailand, but the extent of the sequence diversity and variation within each allelic type remains largely unknown. The msp-3 gene was sequenced from 59 P. falciparum samples collected from five endemic areas (Mae Hong Son, Kanchanaburi, Ranong, Trat and Ubon Ratchathani) in Thailand and analysed for nucleotide sequence diversity, haplotype diversity and deduced amino acid sequence diversity. The gene was also subject to population genetic analysis (Fst) and neutrality tests (Tajima's D, Fu and Li's D* and Fu and Li's F* tests) to determine any signature of selection. The sequence analyses revealed eight unique DNA haplotypes and seven amino acid sequence variants, with a haplotype and nucleotide diversity of 0.828 and 0.049, respectively. Neutrality tests indicated that the polymorphism detected in the alanine heptad repeat region of MSP-3 was maintained by positive diversifying selection, suggesting its role as a potential target of protective immune responses and supporting its role as a vaccine candidate. Comparison of MSP-3 variants among parasite populations in Thailand, India and Nigeria also inferred a close genetic relationship between P. falciparum populations in Asia. This study revealed the extent of the msp-3 gene diversity in P. falciparum in Thailand, providing the fundamental basis for the better design of future blood stage malaria vaccines against P. falciparum.
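Haplotype diversity and nucleotide diversity, the two summary statistics reported above, can be computed from aligned sequences as sketched below. The abstract does not state which estimators or software were used; this sketch assumes Nei's haplotype diversity, H = n/(n-1) * (1 - sum of squared haplotype frequencies), and nucleotide diversity as the mean number of pairwise differences per site. The toy sequences are invented and are not msp-3 data.

```python
from collections import Counter
from itertools import combinations


def haplotype_diversity(seqs):
    """Nei's haplotype diversity for a sample of aligned sequences."""
    n = len(seqs)
    freqs = [count / n for count in Counter(seqs).values()]
    return n / (n - 1) * (1 - sum(p * p for p in freqs))


def nucleotide_diversity(seqs):
    """Average number of pairwise differences per site (pi)."""
    length = len(seqs[0])
    pairs = list(combinations(seqs, 2))
    differences = sum(sum(a != b for a, b in zip(s, t)) for s, t in pairs)
    return differences / (len(pairs) * length)


# Toy sample: four aligned sequences of eight sites, three distinct haplotypes.
sample = ["ACGTACGT", "ACGTACGT", "ACGTACGA", "ACCTACGA"]
h = haplotype_diversity(sample)
pi = nucleotide_diversity(sample)
print(round(h, 4), round(pi, 4))
```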
Insights into the diversity of eukaryotes in acid mine drainage biofilm communities.

PubMed

Baker, Brett J; Tyson, Gene W; Goosherst, Lindsey; Banfield, Jillian F

2009-04-01

Microscopic eukaryotes are known to have important ecosystem functions, but their diversity in most environments remains vastly unexplored. Here we analyzed an 18S rRNA gene library from a subsurface iron- and sulfur-oxidizing microbial community growing in highly acidic (pH < 0.9) runoff within the Richmond Mine at Iron Mountain (northern California). Phylogenetic analysis revealed that the majority (68%) of the sequences belonged to fungi. Protists falling into the deeply branching lineage named the acidophilic protist clade (APC) and the class Heterolobosea were also present. The APC group represents kingdom-level novelty, with <76% sequence similarity to 18S rRNA gene sequences of organisms from other environments. Fluorescently labeled oligonucleotide rRNA probes were designed to target each of these groups in biofilm samples, enabling abundance and morphological characterization. Results revealed that the populations vary significantly with the habitat and no group is ubiquitous. Surprisingly, many of the eukaryotic lineages (with the exception of the APC) are closely related to neutrophiles, suggesting that they recently adapted to this extreme environment. Molecular analyses presented here confirm that the number of eukaryotic species associated with the acid mine drainage (AMD) communities is low. This finding is consistent with previous results showing a limited diversity of archaea, bacteria, and viruses in AMD environments and suggests that the environmental pressures and interplay between the members of these communities limit species diversity at all trophic levels.
Prevalence, distribution, and sequence diversity of hmwA among commensal and otitis media non-typeable Haemophilus influenzae.

PubMed

Davis, Gregg S; Patel, May; Hammond, James; Zhang, Lixin; Dawid, Suzanne; Marrs, Carl F; Gilsdorf, Janet R

2014-12-01

Nontypeable Haemophilus influenzae (NTHi) are Gram-negative coccobacilli that colonize the human pharynx, their only known natural reservoir. Adherence to the host epithelium facilitates NTHi colonization and marks one of the first steps in NTHi pathogenesis. Epithelial cell attachment is mediated, in part, by a pair of high molecular weight (HMW) adhesins that are highly immunogenic, antigenically diverse, and display a wide range of amino acid diversity both within and between isolates. In this study, the prevalence of hmwA, which encodes the HMW adhesin, was determined for a collection of 170 NTHi isolates recovered from the middle ears of children with otitis media (OM isolates) or throats or nasopharynges of healthy children (commensal isolates) from Finland, Israel, and the U.S. Overall, hmwA was detected in 61% of NTHi isolates and was significantly more prevalent (P=0.004) among OM isolates than among commensal isolates; the prevalence ratio comparing hmwA prevalence among ear isolates with that of commensal isolates was 1.47 (95% CI (1.12, 1.92)). Ninety-five percent (98/103) of the hmwA-positive NTHi isolates possessed two hmw loci. To advance our understanding of hmwA binding sequence diversity, we determined the DNA sequence of the hmwA binding region of 33 isolates from this collection. The average amino acid identity across all hmwA sequences was 62%. Phylogenetic analyses of the hmwA binding revealed four distinct sequence clusters, and the majority of hmwA sequences (83%) belonged to one of two dominant sequence clusters. hmwA sequences did not cluster by chromosomal location, geographic region, or disease status. Copyright © 2014 Elsevier B.V. All rights reserved.
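A prevalence ratio with a 95% confidence interval, as quoted above, is commonly obtained with the log (Katz) method. The abstract does not give the per-group counts or the method used, so the numbers below are hypothetical, chosen only to demonstrate the arithmetic, and do not reproduce the study's data.

```python
from math import exp, log, sqrt


def prevalence_ratio(pos_a, n_a, pos_b, n_b, z=1.96):
    """Prevalence ratio of group A versus group B with a log-method confidence interval."""
    ratio = (pos_a / n_a) / (pos_b / n_b)
    # Standard error of ln(ratio) for two independent binomial proportions.
    se = sqrt(1 / pos_a - 1 / n_a + 1 / pos_b - 1 / n_b)
    lower = exp(log(ratio) - z * se)
    upper = exp(log(ratio) + z * se)
    return ratio, lower, upper


# Hypothetical counts: 60 of 80 positives in group A, 45 of 90 in group B.
ratio, lower, upper = prevalence_ratio(60, 80, 45, 90)
print(round(ratio, 2), round(lower, 2), round(upper, 2))
```

An interval that excludes 1, as in the published result, indicates that the difference in prevalence between the two groups is unlikely to be due to chance alone.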
A Score of the Ability of a Three-Dimensional Protein Model to Retrieve Its Own Sequence as a Quantitative Measure of Its Quality and Appropriateness

PubMed Central

Martínez-Castilla, León P.; Rodríguez-Sotres, Rogelio

2010-01-01

Background: Despite the remarkable progress of bioinformatics, how the primary structure of a protein leads to a three-dimensional fold, and in turn determines its function remains an elusive question. Alignments of sequences with known function can be used to identify proteins with the same or similar function with high success. However, identification of function-related and structure-related amino acid positions is only possible after a detailed study of every protein. Folding pattern diversity seems to be much narrower than sequence diversity, and the amino acid sequences of natural proteins have evolved under a selective pressure comprising structural and functional requirements acting in parallel. Principal Findings: The approach described in this work begins by generating a large number of amino acid sequences using ROSETTA [Dantas G et al. (2003) J Mol Biol 332:449–460], a program with notable robustness in the assignment of amino acids to a known three-dimensional structure. The resulting sequence-sets showed no conservation of amino acids at active sites, or protein-protein interfaces. Hidden Markov models built from the resulting sequence sets were used to search sequence databases. Surprisingly, the models retrieved from the database sequences belonged to proteins with the same or a very similar function. Given an appropriate cutoff, the rate of false positives was zero. According to our results, this protocol, here referred to as Rd.HMM, detects fine structural details on the folding patterns, that seem to be tightly linked to the fitness of a structural framework for a specific biological function. Conclusion: Because the sequence of the native protein used to create the Rd.HMM model was always amongst the top hits, the procedure is a reliable tool to score, very accurately, the quality and appropriateness of computer-modeled 3D-structures, without the need for spectroscopy data. However, Rd.HMM is very sensitive to the conformational features of the models' backbone. PMID:20830209
The diversity of the orthoreoviruses: molecular taxonomy and phylogenetic divides.

USDA-ARS's Scientific Manuscript database

The family Reoviridae is a diverse group of viruses with double-stranded ribonucleic acid (RNA) genomes contained within icosahedral, layered protein capsids. Within the Reoviridae, the Orthoreovirus genus includes viruses that infect reptiles, birds and mammals (including humans). Recent sequencing...
RTS,S/AS01 malaria vaccine mismatch observed among Plasmodium falciparum isolates from southern and central Africa and globally.

PubMed

Pringle, Julia C; Carpi, Giovanna; Almagro-Garcia, Jacob; Zhu, Sha Joe; Kobayashi, Tamaki; Mulenga, Modest; Bobanga, Thierry; Chaponda, Mike; Moss, William J; Norris, Douglas E

2018-04-26

The RTS,S/AS01 malaria vaccine encompasses the central repeats and C-terminal of Plasmodium falciparum circumsporozoite protein (PfCSP). Although no Phase II clinical trial studies observed evidence of strain-specific immunity, recent studies show a decrease in vaccine efficacy against non-vaccine strain parasites. In light of goals to reduce malaria morbidity, anticipating the effectiveness of RTS,S/AS01 is critical to planning widespread vaccine introduction. We deep sequenced C-terminal Pfcsp from 77 individuals living along the international border in Luapula Province, Zambia and Haut-Katanga Province, the Democratic Republic of the Congo (DRC) and compared translated amino acid haplotypes to the 3D7 vaccine strain. Only 5.2% of the 193 PfCSP sequences from the Zambia-DRC border region matched 3D7 at all 84 amino acids. To further contextualize the genetic diversity sampled in this study with global PfCSP diversity, we analyzed an additional 3,809 Pfcsp sequences from the Pf3k database and constructed a haplotype network representing 15 countries from Africa and Asia. The diversity observed in our samples was similar to the diversity observed in the global haplotype network. These observations underscore the need for additional research assessing genetic diversity in P. falciparum and the impact of PfCSP diversity on RTS,S/AS01 efficacy.

Comparative genomics of citric-acid producing Aspergillus niger ATCC 1015 versus enzyme-producing CBS 513.88

DOE Office of Scientific and Technical Information (OSTI.GOV)

Andersen, Mikael R.; Salazar, Margarita; Schaap, Peter

2011-06-01

The filamentous fungus Aspergillus niger exhibits great diversity in its phenotype. It is found globally, both as marine and terrestrial strains, produces both organic acids and hydrolytic enzymes in high amounts, and some isolates exhibit pathogenicity. Although the genome of an industrial enzyme-producing A. niger strain (CBS 513.88) has already been sequenced, the versatility and diversity of this species compel additional exploration. We therefore undertook whole genome sequencing of the acidogenic A. niger wild type strain (ATCC 1015), and produced a genome sequence of very high quality. Only 15 gaps are present in the sequence and half the telomeric regions have been elucidated. Moreover, sequence information from ATCC 1015 was utilized to improve the genome sequence of CBS 513.88. Chromosome-level comparisons uncovered several genome rearrangements, deletions, a clear case of strain-specific horizontal gene transfer, and identification of 0.8 megabase of novel sequence. Single nucleotide polymorphisms per kilobase (SNPs/kb) between the two strains were found to be exceptionally high (average: 7.8, maximum: 160 SNPs/kb). High variation within the species was confirmed with exo-metabolite profiling and phylogenetics. Detailed lists of alleles were generated, and genotypic differences were observed to accumulate in metabolic pathways essential to acid production and protein synthesis. A transcriptome analysis revealed up-regulation of the electron transport chain, specifically the alternative oxidative pathway in ATCC 1015, while CBS 513.88 showed significant up-regulation of genes associated with biosynthesis of amino acids that are abundant in glucoamylase A, tRNA-synthases and protein transporters.
A statistical view of FMRFamide neuropeptide diversity.

PubMed

Espinoza, E; Carrigan, M; Thomas, S G; Shaw, G; Edison, A S

2000-01-01

FMRFamide-like peptide (FLP) amino acid sequences have been collected and statistically analyzed. FLP amino acid composition as a function of position in the peptide is graphically presented for several major phyla. Results of total amino acid composition and frequencies of pairs of FLP amino acids have been computed and compared with corresponding values from the entire GenBank protein sequence database. The data for pairwise distributions of amino acids should help in future structure-function studies of FLPs. To aid in future peptide discovery, a computer program and search protocol was developed to identify FLPs from the GenBank protein database without the use of keywords.
A Robust and Versatile Method of Combinatorial Chemical Synthesis of Gene Libraries via Hierarchical Assembly of Partially Randomized Modules

PubMed Central

Popova, Blagovesta; Schubert, Steffen; Bulla, Ingo; Buchwald, Daniela; Kramer, Wilfried

2015-01-01

A major challenge in gene library generation is to guarantee a large functional size and diversity that significantly increases the chances of selecting different functional protein variants. The use of trinucleotides mixtures for controlled randomization results in superior library diversity and offers the ability to specify the type and distribution of the amino acids at each position. Here we describe the generation of a high diversity gene library using tHisF of the hyperthermophile Thermotoga maritima as a scaffold. Combining various rational criteria with contingency, we targeted 26 selected codons of the thisF gene sequence for randomization at a controlled level. We have developed a novel method of creating full-length gene libraries by combinatorial assembly of smaller sub-libraries. Full-length libraries of high diversity can easily be assembled on demand from smaller and much less diverse sub-libraries, which circumvent the notoriously troublesome long-term archivation and repeated proliferation of high diversity ensembles of phages or plasmids. We developed a generally applicable software tool for sequence analysis of mutated gene sequences that provides efficient assistance for analysis of library diversity. Finally, practical utility of the library was demonstrated in principle by assessment of the conformational stability of library members and isolating protein variants with HisF activity from it. Our approach integrates a number of features of nucleic acids synthetic chemistry, biochemistry and molecular genetics to a coherent, flexible and robust method of combinatorial gene synthesis. PMID:26355961
A Robust and Versatile Method of Combinatorial Chemical Synthesis of Gene Libraries via Hierarchical Assembly of Partially Randomized Modules.

PubMed

Popova, Blagovesta; Schubert, Steffen; Bulla, Ingo; Buchwald, Daniela; Kramer, Wilfried

2015-01-01

A major challenge in gene library generation is to guarantee a large functional size and diversity that significantly increases the chances of selecting different functional protein variants. The use of trinucleotides mixtures for controlled randomization results in superior library diversity and offers the ability to specify the type and distribution of the amino acids at each position. Here we describe the generation of a high diversity gene library using tHisF of the hyperthermophile Thermotoga maritima as a scaffold. Combining various rational criteria with contingency, we targeted 26 selected codons of the thisF gene sequence for randomization at a controlled level. We have developed a novel method of creating full-length gene libraries by combinatorial assembly of smaller sub-libraries. Full-length libraries of high diversity can easily be assembled on demand from smaller and much less diverse sub-libraries, which circumvent the notoriously troublesome long-term archivation and repeated proliferation of high diversity ensembles of phages or plasmids. We developed a generally applicable software tool for sequence analysis of mutated gene sequences that provides efficient assistance for analysis of library diversity. Finally, practical utility of the library was demonstrated in principle by assessment of the conformational stability of library members and isolating protein variants with HisF activity from it. Our approach integrates a number of features of nucleic acids synthetic chemistry, biochemistry and molecular genetics to a coherent, flexible and robust method of combinatorial gene synthesis.
Sequence Diversity Diagram for comparative analysis of multiple sequence alignments.

PubMed

Sakai, Ryo; Aerts, Jan

2014-01-01

The sequence logo is a graphical representation of a set of aligned sequences, commonly used to depict conservation of amino acid or nucleotide sequences. Although it effectively communicates the amount of information present at every position, this visual representation falls short when the domain task is to compare between two or more sets of aligned sequences. We present a new visual presentation called a Sequence Diversity Diagram and validate our design choices with a case study. Our software was developed using the open-source program called Processing. It loads multiple sequence alignment FASTA files and a configuration file, which can be modified as needed to change the visualization. The redesigned figure improves on the visual comparison of two or more sets, and it additionally encodes information on sequential position conservation. In our case study of the adenylate kinase lid domain, the Sequence Diversity Diagram reveals unexpected patterns and new insights, for example the identification of subgroups within the protein subfamily. Our future work will integrate this visual encoding into interactive visualization tools to support higher level data exploration tasks.
Defining Electron Bifurcation in the Electron-Transferring Flavoprotein Family.

PubMed

Garcia Costas, Amaya M; Poudel, Saroj; Miller, Anne-Frances; Schut, Gerrit J; Ledbetter, Rhesa N; Fixen, Kathryn R; Seefeldt, Lance C; Adams, Michael W W; Harwood, Caroline S; Boyd, Eric S; Peters, John W

2017-11-01

Electron bifurcation is the coupling of exergonic and endergonic redox reactions to simultaneously generate (or utilize) low- and high-potential electrons. It is the third recognized form of energy conservation in biology and was recently described for select electron-transferring flavoproteins (Etfs). Etfs are flavin-containing heterodimers best known for donating electrons derived from fatty acid and amino acid oxidation to an electron transfer respiratory chain via Etf-quinone oxidoreductase. Canonical examples contain a flavin adenine dinucleotide (FAD) that is involved in electron transfer, as well as a non-redox-active AMP. However, Etfs demonstrated to bifurcate electrons contain a second FAD in place of the AMP. To expand our understanding of the functional variety and metabolic significance of Etfs and to identify amino acid sequence motifs that potentially enable electron bifurcation, we compiled 1,314 Etf protein sequences from genome sequence databases and subjected them to informatic and structural analyses. Etfs were identified in diverse archaea and bacteria, and they clustered into five distinct well-supported groups, based on their amino acid sequences. Gene neighborhood analyses indicated that these Etf group designations largely correspond to putative differences in functionality. Etfs with the demonstrated ability to bifurcate were found to form one group, suggesting that distinct conserved amino acid sequence motifs enable this capability. Indeed, structural modeling and sequence alignments revealed that identifying residues occur in the NADH- and FAD-binding regions of bifurcating Etfs. Collectively, a new classification scheme for Etf proteins that delineates putative bifurcating versus nonbifurcating members is presented and suggests that Etf-mediated bifurcation is associated with surprisingly diverse enzymes. 
IMPORTANCE Electron bifurcation has recently been recognized as an electron transfer mechanism used by microorganisms to maximize energy conservation. Bifurcating enzymes couple thermodynamically unfavorable reactions with thermodynamically favorable reactions in an overall spontaneous process. Here we show that the electron-transferring flavoprotein (Etf) enzyme family exhibits far greater diversity than previously recognized, and we provide a phylogenetic analysis that clearly delineates bifurcating versus nonbifurcating members of this family. Structural modeling of proteins within these groups reveals key differences between the bifurcating and nonbifurcating Etfs. Copyright © 2017 American Society for Microbiology.
Defining Electron Bifurcation in the Electron-Transferring Flavoprotein Family

PubMed Central

Garcia Costas, Amaya M.; Poudel, Saroj; Miller, Anne-Frances; Schut, Gerrit J.; Ledbetter, Rhesa N.; Seefeldt, Lance C.; Adams, Michael W. W.

2017-01-01

ABSTRACT Electron bifurcation is the coupling of exergonic and endergonic redox reactions to simultaneously generate (or utilize) low- and high-potential electrons. It is the third recognized form of energy conservation in biology and was recently described for select electron-transferring flavoproteins (Etfs). Etfs are flavin-containing heterodimers best known for donating electrons derived from fatty acid and amino acid oxidation to an electron transfer respiratory chain via Etf-quinone oxidoreductase. Canonical examples contain a flavin adenine dinucleotide (FAD) that is involved in electron transfer, as well as a non-redox-active AMP. However, Etfs demonstrated to bifurcate electrons contain a second FAD in place of the AMP. To expand our understanding of the functional variety and metabolic significance of Etfs and to identify amino acid sequence motifs that potentially enable electron bifurcation, we compiled 1,314 Etf protein sequences from genome sequence databases and subjected them to informatic and structural analyses. Etfs were identified in diverse archaea and bacteria, and they clustered into five distinct well-supported groups, based on their amino acid sequences. Gene neighborhood analyses indicated that these Etf group designations largely correspond to putative differences in functionality. Etfs with the demonstrated ability to bifurcate were found to form one group, suggesting that distinct conserved amino acid sequence motifs enable this capability. Indeed, structural modeling and sequence alignments revealed that identifying residues occur in the NADH- and FAD-binding regions of bifurcating Etfs. Collectively, a new classification scheme for Etf proteins that delineates putative bifurcating versus nonbifurcating members is presented and suggests that Etf-mediated bifurcation is associated with surprisingly diverse enzymes. 
IMPORTANCE Electron bifurcation has recently been recognized as an electron transfer mechanism used by microorganisms to maximize energy conservation. Bifurcating enzymes couple thermodynamically unfavorable reactions with thermodynamically favorable reactions in an overall spontaneous process. Here we show that the electron-transferring flavoprotein (Etf) enzyme family exhibits far greater diversity than previously recognized, and we provide a phylogenetic analysis that clearly delineates bifurcating versus nonbifurcating members of this family. Structural modeling of proteins within these groups reveals key differences between the bifurcating and nonbifurcating Etfs. PMID:28808132
High-Throughput Ligand Discovery Reveals a Sitewise Gradient of Diversity in Broadly Evolved Hydrophilic Fibronectin Domains

PubMed Central

Woldring, Daniel R.; Holec, Patrick V.; Zhou, Hong; Hackel, Benjamin J.

2015-01-01

Discovering new binding function via a combinatorial library in small protein scaffolds requires balance between appropriate mutations to introduce favorable intermolecular interactions while maintaining intramolecular integrity. Sitewise constraints exist in a non-spatial gradient from diverse to conserved in evolved antibody repertoires; yet non-antibody scaffolds generally do not implement this strategy in combinatorial libraries. Despite the fact that biased amino acid distributions, typically elevated in tyrosine, serine, and glycine, have gained wider use in synthetic scaffolds, these distributions are still predominantly applied uniformly to diversified sites. While select sites in fibronectin domains and DARPins have shown benefit from sitewise designs, they have not been deeply evaluated. Inspired by this disparity between diversity distributions in natural libraries and synthetic scaffold libraries, we hypothesized that binders resulting from discovery and evolution would exhibit a non-spatial, sitewise gradient of amino acid diversity. To identify sitewise diversities consistent with efficient evolution in the context of a hydrophilic fibronectin domain, >10^5 binders to six targets were evolved and sequenced. Evolutionarily favorable amino acid distributions at 25 sites reveal Shannon entropies (range: 0.3–3.9; median: 2.1; standard deviation: 1.1) supporting the diversity gradient hypothesis. Sitewise constraints in evolved sequences are consistent with complementarity, stability, and consensus biases. Implementation of sitewise constrained diversity enables direct selection of nanomolar affinity binders validating an efficient strategy to balance inter- and intra-molecular interaction demands at each site. PMID:26383268
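A sitewise Shannon entropy of the kind reported above can be computed per alignment column, as in the sketch below. The abstract does not state the logarithm base or the exact procedure, so base 2 (bits) is an assumption here, and the function name `sitewise_entropy` and the toy sequences are hypothetical. Under that assumption the maximum for 20 amino acids is log2(20), about 4.32 bits.

```python
import math
from collections import Counter


def sitewise_entropy(sequences):
    """Shannon entropy (bits, assumed base 2) of each column of equal-length sequences."""
    if not sequences:
        raise ValueError("need at least one sequence")
    length = len(sequences[0])
    if any(len(seq) != length for seq in sequences):
        raise ValueError("sequences must be aligned to the same length")
    entropies = []
    for column in zip(*sequences):
        total = len(column)
        counts = Counter(column)
        # H = sum over residues of p * log2(1/p); a fully conserved site gives 0.
        entropies.append(sum((n / total) * math.log2(total / n) for n in counts.values()))
    return entropies


# Toy alignment (hypothetical; NOT data from the study).
sequences = ["ACDA", "ACEA", "AGFA", "AGGA"]
entropies = sitewise_entropy(sequences)
print(entropies)  # prints [0.0, 1.0, 2.0, 0.0]
```

Sites with entropy near zero are conserved, while values approaching the maximum indicate nearly uniform use of all residues.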
Sequence and phylogenetic analysis of chicken anaemia virus obtained from backyard and commercial chickens in Nigeria.

PubMed

Oluwayelu, D O; Todd, D; Olaleye, O D

2008-12-01

This work reports the first molecular analysis study of chicken anaemia virus (CAV) in backyard chickens in Africa using molecular cloning and sequence analysis to characterize CAV strains obtained from commercial chickens and Nigerian backyard chickens. Partial VP1 gene sequences were determined for three CAVs from commercial chickens and for six CAV variants present in samples from a backyard chicken. Multiple alignment analysis revealed that the 6% and 4% nucleotide diversity obtained respectively for the commercial and backyard chicken strains translated to only 2% amino acid diversity for each breed. Overall, the amino acid composition of Nigerian CAVs was found to be highly conserved. Since the partial VP1 gene sequences of two backyard chicken cloned CAV strains (NGR/CI-8 and NGR/CI-9) were almost identical and evolutionarily closely related to the commercial chicken strains NGR-1, and NGR-4 and NGR-5, respectively, we concluded that CAV infections had crossed the farm boundary.
Fatty Acid Diversity is Not Associated with Neutral Genetic Diversity in Native Populations of the Biodiesel Plant Jatropha curcas L.

PubMed

Martínez-Díaz, Yesenia; González-Rodríguez, Antonio; Rico-Ponce, Héctor Rómulo; Rocha-Ramírez, Víctor; Ovando-Medina, Isidro; Espinosa-García, Francisco J

2017-01-01

Jatropha curcas L. (Euphorbiaceae) is a shrub native to Mexico and Central America, which produces seeds with a high oil content that can be converted to biodiesel. The genetic diversity of this plant has been widely studied, but it is not known whether the diversity of the seed oil chemical composition correlates with neutral genetic diversity. The total seed oil content and the diversity of profiles of fatty acids and phorbol esters were quantified, and the genetic diversity obtained from simple sequence repeats was analyzed in native populations of J. curcas in Mexico. Using the fatty acid profiles, a discriminant analysis recognized three groups of individuals according to geographical origin. Bayesian assignment analysis revealed two genetic groups, while the genetic structure of the populations could not be explained by isolation-by-distance. Genetic and fatty acid profile data were not correlated based on a Mantel test. Also, phorbol ester content and genetic diversity were not associated. Multiple linear regression analysis showed that total oil content was associated with altitude and seasonality of temperature. The content of unsaturated fatty acids was associated with altitude. Therefore, the cultivation planning of J. curcas should take into account chemical variation related to environmental factors. © 2017 Wiley-VHCA AG, Zurich, Switzerland.
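For readers unfamiliar with the Mantel test mentioned above, the sketch below shows one common form: the Pearson correlation between the off-diagonal entries of two distance matrices, with a one-sided permutation p-value obtained by jointly shuffling the rows and columns of one matrix. The abstract does not say which variant or software the authors used, so the function name `mantel`, the permutation count, and the toy matrices are all assumptions made for illustration.

```python
import random
from itertools import combinations


def _pearson(x, y):
    """Pearson correlation of two equal-length lists."""
    n = len(x)
    mean_x, mean_y = sum(x) / n, sum(y) / n
    sxy = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    sxx = sum((a - mean_x) ** 2 for a in x)
    syy = sum((b - mean_y) ** 2 for b in y)
    return sxy / (sxx * syy) ** 0.5


def mantel(d1, d2, permutations=999, seed=0):
    """Return (r, one-sided p-value) for two symmetric distance matrices."""
    n = len(d1)
    pairs = list(combinations(range(n), 2))  # upper-triangle index pairs
    x = [d1[i][j] for i, j in pairs]
    observed = _pearson(x, [d2[i][j] for i, j in pairs])
    rng = random.Random(seed)
    order = list(range(n))
    hits = 0
    for _ in range(permutations):
        rng.shuffle(order)  # relabel the objects of the second matrix
        y = [d2[order[i]][order[j]] for i, j in pairs]
        if _pearson(x, y) >= observed:
            hits += 1
    return observed, (hits + 1) / (permutations + 1)


# Toy distance matrix (hypothetical; NOT data from the study).
dist = [[0, 1, 2, 3], [1, 0, 1, 2], [2, 1, 0, 1], [3, 2, 1, 0]]
r, p = mantel(dist, dist)
print(round(r, 6), p)
```

A non-significant p-value, as the abstract reports for genetic versus fatty acid profile distances, would indicate no detectable association between the two matrices.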
Sequence diversity and evolution of antimicrobial peptides in invertebrates.

PubMed

Tassanakajon, Anchalee; Somboonwiwat, Kunlaya; Amparyup, Piti

2015-02-01

Antimicrobial peptides (AMPs) are evolutionarily ancient molecules that act as the key components in the invertebrate innate immunity against invading pathogens. Several AMPs have been identified and characterized in invertebrates, and found to display considerable diversity in their amino acid sequence, structure and biological activity. AMP genes appear to have rapidly evolved, which might have arisen from the co-evolutionary arms race between host and pathogens, and enabled organisms to survive in different microbial environments. Here, the sequence diversity of invertebrate AMPs (defensins, cecropins, crustins and anti-lipopolysaccharide factors) are presented to provide a better understanding of the evolution pattern of these peptides that play a major role in host defense mechanisms. Copyright © 2014 Elsevier Ltd. All rights reserved.
The genetic diversity of merozoite surface antigen 1 (MSA-1) among Babesia bovis detected from cattle populations in Thailand, Brazil and Ghana.

PubMed

Nagano, Daisuke; Sivakumar, Thillaiampalam; De De Macedo, Alane Caine Costa; Inpankaew, Tawin; Alhassan, Andy; Igarashi, Ikuo; Yokoyama, Naoaki

2013-11-01

In the present study, we screened blood DNA samples obtained from cattle bred in Brazil (n=164) and Ghana (n=80) for Babesia bovis using a diagnostic PCR assay and found prevalences of 14.6% and 46.3%, respectively. Subsequently, the genetic diversity of B. bovis in Thailand, Brazil and Ghana was analyzed, based on the DNA sequence of merozoite surface antigen-1 (MSA-1). In Thailand, MSA-1 sequences were relatively conserved and found in a single clade of the phylogram, while Brazilian MSA-1 sequences showed high genetic diversity and were dispersed across three different clades. In contrast, the sequences from Ghanaian samples were detected in two different clades, one of which contained only a single Ghanaian sequence. The identities among the MSA-1 sequences from Thailand, Brazil and Ghana were 99.0-100%, 57.5-99.4% and 60.3-100%, respectively, while the similarities among the deduced MSA-1 amino acid sequences within the respective countries were 98.4-100%, 59.4-99.7% and 58.7-100%, respectively. These observations suggested that the genetic diversity of B. bovis based on MSA-1 sequences was higher in Brazil and Ghana than in Thailand. The current data highlight the importance of conducting extensive studies on the genetic diversity of B. bovis before designing immune control strategies in each surveyed country.
Characterization of the Genetic Diversity of Acid Lime (Citrus aurantifolia (Christm.) Swingle) Cultivars of Eastern Nepal Using Inter-Simple Sequence Repeat Markers.

PubMed

Munankarmi, Nabin Narayan; Rana, Neesha; Bhattarai, Tribikram; Shrestha, Ram Lal; Joshi, Bal Krishna; Baral, Bikash; Shrestha, Sangita

2018-06-12

Acid lime (Citrus aurantifolia (Christm.) Swingle) is an important fruit crop, which has high commercial value and is cultivated in 60 out of the 77 districts representing all geographical landscapes of Nepal. A lack of improved high-yielding varieties, infestation with various diseases, and pests, as well as poor management practices might have contributed to its extremely reduced productivity, which necessitates a reliable understanding of genetic diversity in existing cultivars. Hereby, we aim to characterize the genetic diversity of acid lime cultivars cultivated at three different agro-ecological gradients of eastern Nepal, employing PCR-based inter-simple sequence repeat (ISSR) markers. Altogether, 21 polymorphic ISSR markers were used to assess the genetic diversity in 60 acid lime cultivars sampled from different geographical locations. Analysis of binary data matrix was performed on the basis of bands obtained, and principal coordinate analysis and phenogram construction were performed using different computer algorithms. ISSR profiling yielded 234 amplicons, of which 87.18% were polymorphic. The number of amplified fragments ranged from 7–18, with amplicon size ranging from ca. 250–3200 bp. The Numerical Taxonomy and Multivariate System (NTSYS)-based cluster analysis using the unweighted pair group method of arithmetic averages (UPGMA) algorithm and Dice similarity coefficient separated 60 cultivars into two major and three minor clusters. Genetic diversity analysis using Popgene ver. 1.32 revealed the highest percentage of polymorphic bands (PPB), Nei’s genetic diversity (H), and Shannon’s information index (I) for the Terai zone (PPB = 69.66%; H = 0.215; I = 0.325), and the lowest of all three for the high hill zone (PPB = 55.13%; H = 0.173; I = 0.262).
Thus, our data indicate that the ISSR marker has been successfully employed for evaluating the genetic diversity of Nepalese acid lime cultivars and has furnished valuable information on intrinsic genetic diversity and the relationship between cultivars that might be useful in acid lime breeding and conservation programs in Nepal.
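Two of the quantities named in this abstract, the Dice similarity coefficient between binary band profiles and the percentage of polymorphic bands, are simple enough to sketch directly. The code below is an illustration only and does not reproduce NTSYS or Popgene; the function names and the toy band profiles (1 = band present, 0 = absent) are hypothetical.

```python
def dice_similarity(x, y):
    """Dice coefficient 2a / (2a + b + c) for two binary band profiles."""
    if len(x) != len(y):
        raise ValueError("profiles must score the same loci")
    shared = sum(1 for a, b in zip(x, y) if a == 1 and b == 1)
    total = sum(x) + sum(y)  # equals 2a + b + c
    if total == 0:
        raise ValueError("neither profile has any bands")
    return 2 * shared / total


def percent_polymorphic(profiles):
    """Percentage of loci whose band is present in some but not all profiles."""
    loci = list(zip(*profiles))
    polymorphic = sum(1 for locus in loci if len(set(locus)) > 1)
    return 100 * polymorphic / len(loci)


# Toy band profiles (hypothetical; NOT data from the study).
cultivar_a = [1, 1, 0, 1, 0]
cultivar_b = [1, 0, 0, 1, 1]
cultivar_c = [1, 1, 1, 1, 0]

print(dice_similarity(cultivar_a, cultivar_b))
print(percent_polymorphic([cultivar_a, cultivar_b, cultivar_c]))  # prints 60.0
```

On the figures reported in the abstract, 87.18% of 234 amplicons corresponds to 204 polymorphic bands.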
Characterization of fatty acid-producing wastewater microbial communities using next generation sequencing technologies

EPA Science Inventory

While wastewater represents a viable source of bacterial biodiesel production, very little is known on the composition of these microbial communities. We studied the taxonomic diversity and succession of microbial communities in bioreactors accumulating fatty acids using 454-pyro...
Comparative genomics of citric-acid-producing Aspergillus niger ATCC 1015 versus enzyme-producing CBS 513.88

PubMed Central

Andersen, Mikael R.; Salazar, Margarita P.; Schaap, Peter J.; van de Vondervoort, Peter J.I.; Culley, David; Thykaer, Jette; Frisvad, Jens C.; Nielsen, Kristian F.; Albang, Richard; Albermann, Kaj; Berka, Randy M.; Braus, Gerhard H.; Braus-Stromeyer, Susanna A.; Corrochano, Luis M.; Dai, Ziyu; van Dijck, Piet W.M.; Hofmann, Gerald; Lasure, Linda L.; Magnuson, Jon K.; Menke, Hildegard; Meijer, Martin; Meijer, Susan L.; Nielsen, Jakob B.; Nielsen, Michael L.; van Ooyen, Albert J.J.; Pel, Herman J.; Poulsen, Lars; Samson, Rob A.; Stam, Hein; Tsang, Adrian; van den Brink, Johannes M.; Atkins, Alex; Aerts, Andrea; Shapiro, Harris; Pangilinan, Jasmyn; Salamov, Asaf; Lou, Yigong; Lindquist, Erika; Lucas, Susan; Grimwood, Jane; Grigoriev, Igor V.; Kubicek, Christian P.; Martinez, Diego; van Peij, Noël N.M.E.; Roubos, Johannes A.; Nielsen, Jens; Baker, Scott E.

2011-01-01

The filamentous fungus Aspergillus niger exhibits great diversity in its phenotype. It is found globally, both as marine and terrestrial strains, produces both organic acids and hydrolytic enzymes in high amounts, and some isolates exhibit pathogenicity. Although the genome of an industrial enzyme-producing A. niger strain (CBS 513.88) has already been sequenced, the versatility and diversity of this species compel additional exploration. We therefore undertook whole-genome sequencing of the acidogenic A. niger wild-type strain (ATCC 1015) and produced a genome sequence of very high quality. Only 15 gaps are present in the sequence, and half the telomeric regions have been elucidated. Moreover, sequence information from ATCC 1015 was used to improve the genome sequence of CBS 513.88. Chromosome-level comparisons uncovered several genome rearrangements, deletions, a clear case of strain-specific horizontal gene transfer, and identification of 0.8 Mb of novel sequence. Single nucleotide polymorphisms per kilobase (SNPs/kb) between the two strains were found to be exceptionally high (average: 7.8, maximum: 160 SNPs/kb). High variation within the species was confirmed with exo-metabolite profiling and phylogenetics. Detailed lists of alleles were generated, and genotypic differences were observed to accumulate in metabolic pathways essential to acid production and protein synthesis. A transcriptome analysis supported up-regulation of genes associated with biosynthesis of amino acids that are abundant in glucoamylase A, tRNA-synthases, and protein transporters in the protein producing CBS 513.88 strain. Our results and data sets from this integrative systems biology analysis resulted in a snapshot of fungal evolution and will support further optimization of cell factories based on filamentous fungi. PMID:21543515
Diversity of Functionally Permissive Sequences in the Receptor-Binding Site of Influenza Hemagglutinin.

PubMed

Wu, Nicholas C; Xie, Jia; Zheng, Tianqing; Nycholat, Corwin M; Grande, Geramie; Paulson, James C; Lerner, Richard A; Wilson, Ian A

2017-06-14

Influenza A virus hemagglutinin (HA) initiates viral entry by engaging host receptor sialylated glycans via its receptor-binding site (RBS). The amino acid sequence of the RBS naturally varies across avian and human influenza virus subtypes and is also evolvable. However, functional sequence diversity in the RBS has not been fully explored. Here, we performed a large-scale mutational analysis of the RBS of A/WSN/33 (H1N1) and A/Hong Kong/1/1968 (H3N2) HAs. Many replication-competent mutants not yet observed in nature were identified, including some that could escape from an RBS-targeted broadly neutralizing antibody. This functional sequence diversity is made possible by pervasive epistasis in the RBS 220-loop and can be buffered by avidity in viral receptor binding. Overall, our study reveals that the HA RBS can accommodate a much greater range of sequence diversity than previously thought, which has significant implications for the complex evolutionary interrelationships between receptor specificity and immune escape. Copyright © 2017 Elsevier Inc. All rights reserved.
Evidence of Divergent Amino Acid Usage in Comparative Analyses of R5- and X4-Associated HIV-1 Vpr Sequences

PubMed Central

Antell, Gregory C.; Zhong, Wen; Kercher, Katherine; Passic, Shendra; Williams, Jean; Liu, Yucheng; James, Tony; Jacobson, Jeffrey M.; Szep, Zsofia

2017-01-01

Vpr is an HIV-1 accessory protein that plays numerous roles during viral replication, some of which are cell type dependent. To test the hypothesis that HIV-1 tropism extends beyond the envelope into the vpr gene, studies were performed to identify the associations between coreceptor usage and Vpr variation in HIV-1-infected patients. Colinear HIV-1 Env-V3 and Vpr amino acid sequences were obtained from the LANL HIV-1 sequence database and from well-suppressed patients in the Drexel/Temple Medicine CNS AIDS Research and Eradication Study (CARES) Cohort. Genotypic classification of Env-V3 sequences as X4 (CXCR4-utilizing) or R5 (CCR5-utilizing) was used to group colinear Vpr sequences. To reveal the sequences associated with a specific coreceptor usage genotype, Vpr amino acid sequences were assessed for amino acid diversity and Jensen-Shannon divergence between the two groups. Five amino acid alphabets were used to comprehensively examine the impact of amino acid substitutions involving side chains with similar physicochemical properties. Positions 36, 37, 41, 89, and 96 of Vpr were characterized by statistically significant divergence across multiple alphabets when X4 and R5 sequence groups were compared. In addition, consensus amino acid switches were found at positions 37 and 41 in comparisons of the R5 and X4 sequence populations. These results suggest an evolutionary link between Vpr and gp120 in HIV-1-infected patients. PMID:28620613
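The Jensen-Shannon divergence used above to compare two sequence groups at a given position can be sketched as follows. This is a minimal illustration, assuming base-2 logarithms (so the value lies between 0 and 1) and plain residue frequencies without pseudocounts; the abstract does not specify these details, and the function names and toy residue columns are hypothetical.

```python
import math
from collections import Counter


def distribution(residues):
    """Relative frequency of each residue in one alignment column."""
    counts = Counter(residues)
    total = sum(counts.values())
    return {aa: n / total for aa, n in counts.items()}


def kl_divergence(p, q):
    """Kullback-Leibler divergence D(p || q) in bits; q must cover p's support."""
    return sum(pv * math.log2(pv / q[aa]) for aa, pv in p.items() if pv > 0)


def js_divergence(residues_a, residues_b):
    """Jensen-Shannon divergence between the residue distributions of two groups."""
    p, q = distribution(residues_a), distribution(residues_b)
    mixture = {aa: 0.5 * (p.get(aa, 0.0) + q.get(aa, 0.0)) for aa in set(p) | set(q)}
    return 0.5 * kl_divergence(p, mixture) + 0.5 * kl_divergence(q, mixture)


# Toy columns for one position in two groups (hypothetical; NOT data from the study).
r5_column = "RRRRIIRR"
x4_column = "IIIIRIII"

print(js_divergence(r5_column, x4_column))
print(js_divergence("AAAA", "AAAA"))  # identical groups, prints 0.0
print(js_divergence("AAAA", "TTTT"))  # disjoint groups, prints 1.0
```

Repeating the calculation after mapping residues onto a reduced alphabet (for example, grouping residues by side-chain properties) is one way to mirror the multiple-alphabet comparison the abstract describes.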
Genetic diversity of pneumococcal surface protein A in invasive pneumococcal isolates from Korean children, 1991-2016.

PubMed

Yun, Ki Wook; Choi, Eun Hwa; Lee, Hoan Jong

2017-01-01

Pneumococcal surface protein A (PspA) is an important virulence factor of pneumococci and has been investigated as a primary component of a capsular serotype-independent pneumococcal vaccine. Thus, we sought to determine the genetic diversity of PspA to explore its potential as a vaccine candidate. Among the 190 invasive pneumococcal isolates collected from Korean children between 1991 and 2016, two (1.1%) isolates were found to have no pspA by multiple polymerase chain reactions. The full length pspA genes from 185 pneumococcal isolates were sequenced. The length of pspA varied, ranging from 1,719 to 2,301 base pairs with 55.7-100% nucleotide identity. Based on the sequences of the clade-defining regions, 68.7% and 49.7% were in PspA family 2 and clade 3/family 2, respectively. PspA clade types were correlated with genotypes using multilocus sequence typing and divided into several subclades based on diversity analysis of the N-terminal α-helical regions, which showed nucleotide sequence identities of 45.7-100% and amino acid sequence identities of 23.1-100%. Putative antigenicity plots were also diverse among individual clades and subclades. The differences in antigenicity patterns were concentrated within the N-terminal 120 amino acids. In conclusion, the N-terminal α-helical domain, which is known to be the major immunogenic portion of PspA, is genetically variable and should be further evaluated for antigenic differences and cross-reactivity between various PspA types from pneumococcal isolates.
Size and sequence polymorphisms in the glutamate-rich protein gene of the human malaria parasite Plasmodium falciparum in Thailand.

PubMed

Pattaradilokrat, Sittiporn; Trakoolsoontorn, Chawinya; Simpalipan, Phumin; Warrit, Natapot; Kaewthamasorn, Morakot; Harnyuttanakorn, Pongchai

2018-01-22

The glutamate-rich protein (GLURP) of the malaria parasite Plasmodium falciparum is a key surface antigen that serves as a component of a clinical vaccine. Moreover, the GLURP gene is also employed routinely as a genetic marker for malarial genotyping in epidemiological studies. While extensive size polymorphisms in GLURP are well recorded, the extent of the sequence diversity of this gene is rarely investigated. The present study aimed to explore the genetic diversity of GLURP in natural populations of P. falciparum. The polymorphic C-terminal repetitive R2 region of GLURP sequences from 65 P. falciparum isolates in Thailand were generated and combined with the data from 103 worldwide isolates to generate a GLURP database. The collection comprised 168 alleles, encoding 105 unique GLURP subtypes, characterized by 18 types of amino acid repeat units (AAU). Of these, 28 GLURP subtypes, formed by 10 AAU types, were detected in P. falciparum in Thailand. Among them, 19 GLURP subtypes and 2 AAU types are described for the first time in the Thai parasite population. The AAU sequences were highly conserved, which is likely due to negative selection. Standard Fst analysis revealed the shared distributions of GLURP types among the P. falciparum populations, providing evidence of gene flow among the different demographic populations. Sequence diversity causing size variations in GLURP was detected in Thai P. falciparum populations and was caused by non-synonymous substitutions in repeat units and some insertions/deletions of aspartic acid or glutamic acid codons between repeat units. The P. falciparum population structure based on GLURP showed promising implications for the development of GLURP-based vaccines and for monitoring vaccine efficacy.
The primary structure of the thymidine kinase gene of fish lymphocystis disease virus.

PubMed

Schnitzler, P; Handermann, M; Szépe, O; Darai, G

1991-06-01

The DNA nucleotide sequence of the thymidine kinase (TK) gene of fish lymphocystis disease virus (FLDV), which has been localized between map coordinates 0.678 and 0.688 of the viral genome, was determined. The analysis of the DNA nucleotide sequence located between the recognition sites of HindIII (0.669 map unit; nucleotide position 1) and AccI (nucleotide position 2032) revealed the presence of an open reading frame of 954 bp on the lower strand of this region between nucleotide positions 1868 (ATG) and 915 (TAA). It encodes a protein of 318 amino acid residues. The evolutionary relationships of the TK gene of FLDV to the other known TK genes were investigated using the method of progressive sequence alignment. These analyses revealed a high degree of diversity between the protein sequence of FLDV TK gene and the amino acid composition of other TKs tested. However, significant conservation was detected in several regions of the FLDV TK protein when compared to the amino acid sequences of the TKs of African swine fever virus, fowlpox virus, Shope fibroma virus, and vaccinia virus and to the amino acid sequences of the cellular cytoplasmic TK of chicken, mouse, and man.
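The open reading frame length quoted in this abstract can be checked from its coordinates. The sketch below assumes the positions 1868 and 915 are inclusive endpoints on the lower strand; the helper name `orf_span` is hypothetical. The abstract does not state whether the TAA stop codon is counted within the 954 bp, so the example checks only the length arithmetic.

```python
def orf_span(start: int, end: int) -> int:
    """Length in bp of a feature given inclusive coordinates.

    Works for either strand, since only the distance between the
    two endpoints matters.
    """
    return abs(start - end) + 1


length_bp = orf_span(1868, 915)           # coordinates taken from the abstract
codons, remainder = divmod(length_bp, 3)  # an ORF should be a whole number of codons

print(length_bp, codons, remainder)
```

A remainder of zero confirms that the quoted span is a whole number of codons.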

Survey of duckweed diversity in Lake Chao and total fatty acid, triacylglycerol, profiles of representative strains.

PubMed

Tang, J; Li, Y; Ma, J; Cheng, J J

2015-09-01

Lemnaceae (duckweeds) are widely distributed aquatic flowering plants. Their high growth rate, starch content and suitability for bioremediation make them potential feedstock for biofuels. However, few natural duckweed resources have been investigated in China, and there is no information about total fatty acid (TFA) and triacylglycerol (TAG) composition of duckweeds from China. Here, the genetic diversity of a natural duckweed population collected from Lake Chao, China, was investigated using multilocus sequence typing (MLST). The 54 strains were categorised into four species in four genera, representing 12 distinct sequence types. Strains representing Lemna aequinoctialis and Spirodela polyrhiza were predominant. Interestingly, a surprisingly high degree of genetic diversification within L. aequinoctialis was observed. The four duckweed species revealed a uniform fatty acid composition, with three fatty acids, palmitic acid, linoleic acid and linolenic acid, accounting for more than 80% of the TFA. The TFA in biomass varied among species, ranging from 1.05% (of dry weight, DW) for L. punctata and S. polyrhiza to 1.62% for Wolffia globosa. The four duckweed species contained similar TAG contents, 0.02% mg · DW(-1). The fatty acid profiles of TAG were different from those of TFA, and also varied among the four species. The survey investigated the genetic diversity of duckweeds from Lake Chao, and provides an initial insight into TFA and TAG of four duckweed species, indicating that intraspecific and interspecific variations exist in the content and composition of both TFA and TAG in comparison with other studies. © 2015 German Botanical Society and The Royal Botanical Society of the Netherlands.
Diversity of virus-host systems in hypersaline Lake Retba, Senegal.

PubMed

Sime-Ngando, Télesphore; Lucas, Soizick; Robin, Agnès; Tucker, Kimberly Pause; Colombet, Jonathan; Bettarel, Yvan; Desmond, Elie; Gribaldo, Simonetta; Forterre, Patrick; Breitbart, Mya; Prangishvili, David

2011-08-01

Remarkable morphological diversity of virus-like particles was observed by transmission electron microscopy in a hypersaline water sample from Lake Retba, Senegal. The majority of particles morphologically resembled hyperthermophilic archaeal DNA viruses isolated from extreme geothermal environments. Some hypersaline viral morphotypes have not been previously observed in nature, and less than 1% of observed particles had a head-and-tail morphology, which is typical for bacterial DNA viruses. Culture-independent analysis of the microbial diversity in the sample suggested the dominance of extremely halophilic archaea. Few of the 16S sequences corresponded to known archaeal genera (Haloquadratum, Halorubrum and Natronomonas), whereas the majority represented novel archaeal clades. Three sequences corresponded to a new basal lineage of the haloarchaea. Bacteria belonged to four major phyla, consistent with the known diversity in saline environments. Metagenomic sequencing of DNA from the purified virus-like particles revealed very few similarities to the NCBI non-redundant database at either the nucleotide or amino acid level. Some of the identifiable virus sequences were most similar to previously described haloarchaeal viruses, but no sequence similarities were found to archaeal viruses from extreme geothermal environments. A large proportion of the sequences had similarity to previously sequenced viral metagenomes from solar salterns. © 2010 Society for Applied Microbiology and Blackwell Publishing Ltd.
Self-sequencing of amino acids and origins of polyfunctional protocells

NASA Technical Reports Server (NTRS)

Fox, S. W.

1984-01-01

The role of proteins in the origin of living things is discussed. It has been experimentally established that amino acids can sequence themselves under simulated geological conditions with highly nonrandom products which accordingly contain diverse information. Multiple copies of each type of macromolecule are formed, resulting in greater power for any protoenzymic molecule than would accrue from a single copy of each type. Thermal proteins are readily incorporated into laboratory protocells. The experimental evidence for original polyfunctional protocells is discussed.
Unexpected fungal communities in the Rehai thermal springs of Tengchong influenced by abiotic factors.

PubMed

Liu, Kai-Hui; Ding, Xiao-Wei; Salam, Nimaichand; Zhang, Bo; Tang, Xiao-Fei; Deng, Baiwan; Li, Wen-Jun

2018-05-01

Fungal communities represent an indispensable part of the geothermal spring ecosystem; however, studies on fungal communities within hot springs are still scant. Here, we used Illumina HiSeq 2500 sequencing to detect fungal community diversity in extremely acidic hot springs (pH < 4) and neutral and alkaline springs (pH > 6) of Tengchong, as indicated by the presence of over 0.75 million valid reads. These sequences were phylogenetically assigned to 5 fungal phyla, 67 orders, and 375 genera, indicating unexpected fungal diversity in the hot springs. Genera such as Penicillium, Entyloma, and Cladosporium dominated the fungal community in the acidic geothermal springs, while groups such as Penicillium, Engyodontium, and Schizophyllum controlled the fungal assemblages in the alkaline hot springs. The alpha-diversity indices and the abundant fungal taxa were significantly correlated with physicochemical factors of the hot springs, particularly pH, temperature, and concentrations of Fe²⁺, NH₄⁺, NO₂⁻, and S²⁻, suggesting that the diversity and distribution of fungal assemblages can be influenced by the complex environmental factors of hot springs.
Photosynthesis within Mars' volcanic craters?: Insights from Cerro Negro Volcano, Nicaragua

NASA Astrophysics Data System (ADS)

Rogers, K. L.; Hynek, B. M.; McCollom, T. M.

2011-12-01

Discrete locales of sulfate-rich bedrocks exist on Mars and in many cases represent the products of acid-sulfate alteration of martian basalt. In some places, the products have been attributed to hydrothermal processes from local volcanism. In order to evaluate the habitability of such an environment, we are investigating the geochemical and biological composition of active fumaroles at Cerro Negro Volcano, Nicaragua, where fresh basaltic cinders similar in composition to martian basalts are altered by acidic, sulfur-bearing gases. Temperatures at active fumaroles can reach as high as 400°C and the pH of the steam ranges from <0 to 5. Adjacent to some fumaroles, silica is being precipitated from condensing steam on the crater walls and endolithic photosynthetic mats are found at 1-2 cm depth within these silica deposits. We have analyzed one of these mats, Monkey Cheek (T=65°C, pH ~4.5), for both Archaeal and Bacterial diversity. Cloning of PCR-amplified 16S rRNA genes reveals a diverse community of Bacteria, with eight phyla represented. The most common bacterial sequences belonged to the Cyanobacteria and Ktedonobacteria; however, Actinobacteria, alpha-Proteobacteria and Acidobacteria were also identified. Many of the cyanobacterial sequences were similar to those of the eukaryotic Cyanidiales, red algae that inhabit acidic, geothermal environments. Many of the sequences related to Ktedonobacteria and Actinobacteria have also been found in acid mine drainage environments. The Archaeal community was far less diverse, with sequences matching those of unclassified Desulfurococcales and unclassified Thermoprotei. These sequences were more distant from isolated species than the bacterial sequences. Similar bacterial and archaeal communities have been found in hot spring environments in Yellowstone National Park, Greenland, Iceland, New Zealand and Costa Rica. 
Some of Mars' volcanoes were active for billions of years and by analogy to Cerro Negro, may have hosted photosynthetic organisms that could have been preserved in alteration mineral assemblages. Even on a generally cold and dry Mars, volcanic craters likely provided long-lived warm and wet conditions and should be a key target for future exploration assessing habitability.
The domestication of the probiotic bacterium Lactobacillus acidophilus

PubMed Central

Bull, Matthew J.; Jolley, Keith A.; Bray, James E.; Aerts, Maarten; Vandamme, Peter; Maiden, Martin C. J.; Marchesi, Julian R.; Mahenthiralingam, Eshwar

2014-01-01

Lactobacillus acidophilus is a Gram-positive lactic acid bacterium that has had widespread historical use in the dairy industry and more recently as a probiotic. Although L. acidophilus has been designated as safe for human consumption, increasing commercial regulation and clinical demands for probiotic validation have resulted in a need to understand its genetic diversity. By drawing on large, well-characterised collections of lactic acid bacteria, we examined L. acidophilus isolates spanning 92 years and including multiple strains in current commercial use. Analysis of the whole genome sequence data set (34 isolate genomes) demonstrated L. acidophilus was a low diversity, monophyletic species with commercial isolates essentially identical at the sequence level. Our results indicate that commercial use has domesticated L. acidophilus with genetically stable, invariant strains being consumed globally by the human population. PMID:25425319
Diversity analysis of lactic acid bacteria in takju, Korean rice wine.

PubMed

Jin, Jianbo; Kim, So-Young; Jin, Qing; Eom, Hyun-Ju; Han, Nam Soo

2008-10-01

To investigate the lactic acid bacterial population in Korean traditional rice wines, biotyping was performed using cell morphology and whole-cell protein pattern analysis by SDS-PAGE, and then the isolates were identified by 16S rRNA sequencing analysis. Based on the morphological characteristics, 103 LAB isolates were detected in wine samples, characterized by whole-cell protein pattern analysis, and they were then divided into 18 patterns. By gene sequencing of 16S rRNA, the isolates were identified as Lactobacillus paracasei, Lb. arizonensis, Lb. plantarum, Lb. harbinensis, Lb. parabuchneri, Lb. brevis, and Lb. hilgardii when listed by their frequency of occurrence. It was found that the difference in bacterial diversity between rice and grape wines depends on the raw materials, especially the composition of starch and glucose.
T7 lytic phage-displayed peptide libraries: construction and diversity characterization.

PubMed

Krumpe, Lauren R H; Mori, Toshiyuki

2014-01-01

In this chapter, we describe the construction of T7 bacteriophage (phage)-displayed peptide libraries and the diversity analyses of random amino acid sequences obtained from the libraries. We used commercially available reagents, Novagen's T7Select system, to construct the libraries. Using a combination of biotinylated extension primer and streptavidin-coupled magnetic beads, we were able to prepare library DNA without applying gel purification, resulting in extremely high ligation efficiencies. Further, we describe the use of bioinformatics tools to characterize library diversity. Amino acid frequency and positional amino acid diversity and hydropathy are estimated using the REceptor LIgand Contacts website http://relic.bio.anl.gov. Peptide net charge analysis and peptide hydropathy analysis are conducted using the Genetics Computer Group Wisconsin Package computational tools. A comprehensive collection of the estimated number of recombinants and titers of T7 phage-displayed peptide libraries constructed in our lab is included.
Diversity of the P2 protein among nontypeable Haemophilus influenzae isolates.

PubMed Central

Bell, J; Grass, S; Jeanteur, D; Munson, R S

1994-01-01

The genes for outer membrane protein P2 of four nontypeable Haemophilus influenzae strains were cloned and sequenced. The derived amino acid sequences were compared with the outer membrane protein P2 sequence from H. influenzae type b MinnA and the sequences of P2 from three additional nontypeable H. influenzae strains. The sequences were 76 to 94% identical. The sequences had regions with considerable variability separated by regions which were highly conserved. The variable regions mapped to putative surface-exposed loops of the protein. PMID:8188390
Diversity of Pneumolysin and Pneumococcal Histidine Triad Protein D of Streptococcus pneumoniae Isolated from Invasive Diseases in Korean Children.

PubMed

Yun, Ki Wook; Lee, Hyunju; Choi, Eun Hwa; Lee, Hoan Jong

2015-01-01

Pneumolysin (Ply) and pneumococcal histidine triad protein D (PhtD) are candidate proteins for a next-generation pneumococcal vaccine. We aimed to analyze the genetic diversity and antigenic heterogeneity of Ply and PhtD for 173 pneumococci isolated from invasive diseases in Korean children. Alleles were designated based on variation in the amino acid sequence. Antigenicity was predicted from the amino acid hydrophobicity of the region. There were seven and 39 allele types for the ply and phtD genes, respectively. The nucleotide sequence identity was 97.2%-99.9% for the ply gene and 91.4%-98.0% for the phtD gene. Only minor variations in hydrophobicity were noted among the antigenicity plots of Ply and PhtD. Overall, the allele types of the ply and phtD genes were remarkably homogeneous, and the antigenic diversity of the corresponding proteins was very limited. Ply and PhtD could be useful antigens for universal pneumococcal vaccines.
Variation in Seed Fatty Acid Composition, and Sequence Divergence in the FAD2 Gene Coding Region between Wild and Cultivated Sesame

USDA-ARS's Scientific Manuscript database

Sesame germplasm harbors genetic diversity which can be useful for sesame improvement in breeding programs. Seven accessions with different levels of oleic acid were selected from the entire USDA sesame germplasm collection (1232 accessions) and planted for morphological observation and re-examinati...
Helicobacter pylori Heat Shock Protein A: Serologic Responses and Genetic Diversity

PubMed Central

Ng, Enders K. W.; Thompson, Stuart A.; Pérez-Pérez, Guillermo I.; Kansau, Imad; van der Ende, Arie; Labigne, Agnès; Sung, Joseph J. Y.; Chung, S. C. Sydney; Blaser, Martin J.

1999-01-01

Helicobacter pylori synthesizes an unusual GroES homolog, heat shock protein A (HspA). The present study was aimed at an assessment of the serological response to HspA in a group of Chinese patients with defined gastroduodenal pathologies and determination of whether diversity is present in the nucleotide sequences encoding HspA in isolates from these patients. Serum samples collected from 154 patients who had an upper gastrointestinal pathology and the presence of H. pylori defined by biopsy were tested for an immunoglobulin G (IgG) serologic response to H. pylori HspA by an enzyme-linked immunosorbent assay. HspA-encoding nucleotide sequences in H. pylori isolates from 14 patients (7 seropositive and 7 seronegative for HspA) were analyzed by PCR and direct sequencing of the PCR products. The sequencing results were compared to those of 48 isolates from other parts of the world. Of the 154 known H. pylori-positive patients, 54 (35.1%) were seropositive for HspA. The A domain (GroES homology) of HspA was highly conserved in the 14 isolates tested. Although the B domain (metal-binding site unique to H. pylori) resembled that in the known major variant, particular amino acid substitutions allowed definition of an HspA variant associated with isolates from East Asia. There were no associations between patient characteristics and HspA seropositivity or amino acid sequences. We confirmed in this study that the clinical outcomes of H. pylori infection are not related to HspA antigenicity or to sequence variation. However, B-domain sequence variation may be a marker for the study of the genetic diversity of H. pylori strains of different geographic origins. PMID:10225839
Next-Generation Sequencing Reveals Significant Bacterial Diversity of Botrytized Wine

PubMed Central

Bokulich, Nicholas A.; Joseph, C. M. Lucy; Allen, Greg; Benson, Andrew K.; Mills, David A.

2012-01-01

While wine fermentation has long been known to involve complex microbial communities, the composition and role of bacteria other than a select set of lactic acid bacteria (LAB) has often been assumed either negligible or detrimental. This study served as a pilot study for using barcoded amplicon next-generation sequencing to profile bacterial community structure in wines and grape musts, comparing the taxonomic depth achieved by sequencing two different domains of prokaryotic 16S rDNA (V4 and V5). This study was designed to serve two goals: 1) to empirically determine the most taxonomically informative 16S rDNA target region for barcoded amplicon sequencing of wine, comparing V4 and V5 domains of bacterial 16S rDNA to terminal restriction fragment length polymorphism (TRFLP) of LAB communities; and 2) to explore the bacterial communities of wine fermentation to better understand the biodiversity of wine at a depth previously unattainable using other techniques. Analysis of amplicons from the V4 and V5 provided similar views of the bacterial communities of botrytized wine fermentations, revealing a broad diversity of low-abundance taxa not traditionally associated with wine, as well as atypical LAB communities initially detected by TRFLP. The V4 domain was determined as the more suitable read for wine ecology studies, as it provided greater taxonomic depth for profiling LAB communities. In addition, targeted enrichment was used to isolate two species of Alphaproteobacteria from a finished fermentation. Significant differences in diversity between inoculated and uninoculated samples suggest that Saccharomyces inoculation exerts selective pressure on bacterial diversity in these fermentations, most notably suppressing abundance of acetic acid bacteria. 
These results determine the bacterial diversity of botrytized wines to be far higher than previously realized, providing further insight into the fermentation dynamics of these wines, and demonstrate the utility of next-generation sequencing for wine ecology studies. PMID:22563494
Theileria parva antigens recognized by CD8+ T cells show varying degrees of diversity in buffalo-derived infected cell lines.

PubMed

Sitt, Tatjana; Pelle, Roger; Chepkwony, Maurine; Morrison, W Ivan; Toye, Philip

2018-05-06

The extent of sequence diversity among the genes encoding 10 antigens (Tp1-10) known to be recognized by CD8+ T lymphocytes from cattle immune to Theileria parva was analysed. The sequences were derived from parasites in 23 buffalo-derived cell lines, three cattle-derived isolates and one cloned cell line obtained from a buffalo-derived stabilate. The results revealed substantial variation among the antigens through sequence diversity. The greatest nucleotide and amino acid diversity were observed in Tp1, Tp2 and Tp9. Tp5 and Tp7 showed the least amount of allelic diversity, and Tp5, Tp6 and Tp7 had the lowest levels of protein diversity. Tp6 was the most conserved protein; only a single non-synonymous substitution was found in all obtained sequences. The ratio of non-synonymous: synonymous substitutions varied from 0.84 (Tp1) to 0.04 (Tp6). Apart from Tp2 and Tp9, we observed no variation in the other defined CD8+ T cell epitopes (Tp4, 5, 7 and 8), indicating that epitope variation is not a universal feature of T. parva antigens. In addition to providing markers that can be used to examine the diversity in T. parva populations, the results highlight the potential for using conserved antigens to develop vaccines that provide broad protection against T. parva.
Penicillin-resistant, ampicillin-susceptible Enterococcus faecalis of hospital origin: pbp4 gene polymorphism and genetic diversity.

PubMed

Conceição, Natália; da Silva, Lucas Emanuel Pinheiro; Darini, Ana Lúcia da Costa; Pitondo-Silva, André; de Oliveira, Adriana Gonçalves

2014-12-01

Despite the spread of penicillin-resistant, ampicillin-susceptible Enterococcus faecalis (PRASEF) isolates in diverse countries, the mechanisms leading to this unusual resistance phenotype have not yet been investigated. The aim of this study was to evaluate whether polymorphism in the pbp4 gene is associated with penicillin resistance in PRASEF isolates and to determine their genetic diversity. E. faecalis isolates were recovered from different clinical specimens of hospitalized patients from February 2006 to June 2010. The β-lactam minimal inhibitory concentrations (MICs) were determined by E-test®. The PCR-amplified pbp4 gene was sequenced with an automated sequencer. The genetic diversity of the isolates was established by PFGE (pulsed-field gel electrophoresis) and MLST (multilocus sequence typing). Seventeen non-β-lactamase-producing PRASEF and 10 penicillin-susceptible, ampicillin-susceptible E. faecalis (PSASEF) strains were analyzed. A single amino acid substitution (Asp-573→Glu) in the penicillin-binding domain was found in all PRASEF isolates by sequencing of the pbp4 gene but not in the penicillin-susceptible isolates. In contrast to the PSASEF isolates, a majority of the PRASEFs had similar PFGE profiles. Six representative PRASEF isolates were resolved by MLST into ST9 and ST524 and belong to the globally dispersed clonal complex 9 (CC9). In conclusion, it appears quite likely that the amino acid alteration (Asp-573→Glu) found in the PBP4 of the Brazilian PRASEF isolates may account for their reduced susceptibility to penicillin, although other resistance mechanisms remain to be investigated. Copyright © 2014 Elsevier B.V. All rights reserved.
Conservation and variability of West Nile virus proteins.

PubMed

Koo, Qi Ying; Khan, Asif M; Jung, Keun-Ok; Ramdas, Shweta; Miotto, Olivo; Tan, Tin Wee; Brusic, Vladimir; Salmon, Jerome; August, J Thomas

2009-01-01

West Nile virus (WNV) has emerged globally as an increasingly important pathogen for humans and domestic animals. Studies of the evolutionary diversity of the virus over its known history will help to elucidate conserved sites, and characterize their correspondence to other pathogens and their relevance to the immune system. We describe a large-scale analysis of the entire WNV proteome, aimed at identifying and characterizing evolutionarily conserved amino acid sequences. This study, which used 2,746 WNV protein sequences collected from the NCBI GenPept database, focused on analysis of peptides of length 9 amino acids or more, which are immunologically relevant as potential T-cell epitopes. Entropy-based analysis of the diversity of WNV sequences revealed the presence of numerous evolutionarily stable nonamer positions across the proteome (entropy value of ≤ 1). The representation (frequency) of nonamers variant to the predominant peptide at these stable positions was generally low (≤ 10% of the WNV sequences analyzed). Eighty-eight fragments of length 9-29 amino acids, representing approximately 34% of the WNV polyprotein length, were identified to be identical and evolutionarily stable in all analyzed WNV sequences. Of the 88 completely conserved sequences, 67 are also present in other flaviviruses, and several have been associated with the functional and structural properties of viral proteins. Immunoinformatic analysis revealed that the majority (78/88) of conserved sequences are potentially immunogenic, while 44 contained experimentally confirmed human T-cell epitopes. This study identified a comprehensive catalogue of completely conserved WNV sequences, many of which are shared by other flaviviruses, and the majority of which are potential epitopes. 
The complete conservation of these immunologically relevant sequences through the entire recorded WNV history suggests they will be valuable as components of peptide-specific vaccines or other therapeutic applications, for sequence-specific diagnosis of a wide-range of Flavivirus infections, and for studies of homologous sequences among other flaviviruses.
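The entropy-based nonamer analysis described in this abstract can be illustrated with plain Shannon entropy. This is a minimal sketch, assuming the input is a list of aligned protein sequences of equal length. The abstract does not give the study's exact procedure (for example, any sample-size correction), so the function names `nonamer_entropy` and `variant_fraction` and the toy data are hypothetical.

```python
import math
from collections import Counter


def _peptides(aligned_seqs, start, k=9):
    """Collect gap-free k-mers beginning at column `start` of each sequence."""
    peptides = [s[start:start + k] for s in aligned_seqs]
    peptides = [p for p in peptides if len(p) == k and "-" not in p]
    if not peptides:
        raise ValueError("no complete, gap-free peptides at this position")
    return peptides


def nonamer_entropy(aligned_seqs, start, k=9):
    """Shannon entropy in bits of the k-mers observed at one position."""
    peptides = _peptides(aligned_seqs, start, k)
    total = len(peptides)
    counts = Counter(peptides)
    return -sum((n / total) * math.log2(n / total) for n in counts.values())


def variant_fraction(aligned_seqs, start, k=9):
    """Fraction of sequences whose k-mer differs from the predominant one."""
    peptides = _peptides(aligned_seqs, start, k)
    top_count = Counter(peptides).most_common(1)[0][1]
    return (len(peptides) - top_count) / len(peptides)


# Toy alignment (invented): nine identical sequences and one variant
toy = ["MSKKPGGPGK"] * 9 + ["MSKKPGGPGR"]
print(nonamer_entropy(toy, 0), variant_fraction(toy, 0))
print(nonamer_entropy(toy, 1), variant_fraction(toy, 1))
```

In the toy data the nonamer starting at column 0 is identical in every sequence, so its entropy is zero, while the nonamer starting at column 1 covers the variant residue and has a variant fraction of 0.1. Thresholds such as those quoted in the abstract (entropy ≤ 1, variants ≤ 10%) would then be applied position by position.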
Whole genome comparison of a large collection of mycobacteriophages reveals a continuum of phage genetic diversity

PubMed Central

Pope, Welkin H; Bowman, Charles A; Russell, Daniel A; Jacobs-Sera, Deborah; Asai, David J; Cresawn, Steven G; Jacobs, William R; Hendrix, Roger W; Lawrence, Jeffrey G; Hatfull, Graham F; Abbazia, Patrick; Ababio, Amma; Adam, Naazneen

2015-01-01

The bacteriophage population is large, dynamic, ancient, and genetically diverse. Limited genomic information shows that phage genomes are mosaic, and the genetic architecture of phage populations remains ill-defined. To understand the population structure of phages infecting a single host strain, we isolated, sequenced, and compared 627 phages of Mycobacterium smegmatis. Their genetic diversity is considerable, and there are 28 distinct genomic types (clusters) with related nucleotide sequences. However, amino acid sequence comparisons show pervasive genomic mosaicism, and quantification of inter-cluster and intra-cluster relatedness reveals a continuum of genetic diversity, albeit with uneven representation of different phages. Furthermore, rarefaction analysis shows that the mycobacteriophage population is not closed, and there is a constant influx of genes from other sources. Phage isolation and analysis was performed by a large consortium of academic institutions, illustrating the substantial benefits of a disseminated, structured program involving large numbers of freshman undergraduates in scientific discovery. DOI: http://dx.doi.org/10.7554/eLife.06416.001 PMID:25919952
Population genetic structure and natural selection of Plasmodium falciparum apical membrane antigen-1 in Myanmar isolates.

PubMed

Kang, Jung-Mi; Lee, Jinyoung; Moe, Mya; Jun, Hojong; Lê, Hương Giang; Kim, Tae Im; Thái, Thị Lam; Sohn, Woon-Mok; Myint, Moe Kyaw; Lin, Khin; Shin, Ho-Joon; Kim, Tong-Soo; Na, Byoung-Kuk

2018-02-07

Plasmodium falciparum apical membrane antigen-1 (PfAMA-1) is one of the leading blood-stage malaria vaccine candidates. However, genetic variation and antigenic diversity identified in global PfAMA-1 are major hurdles in the development of an effective vaccine based on this antigen. In this study, genetic structure and the effect of natural selection of PfAMA-1 among Myanmar P. falciparum isolates were analysed. Blood samples were collected from 58 Myanmar patients with falciparum malaria. The full-length PfAMA-1 gene was amplified by polymerase chain reaction and cloned into a TA cloning vector. The PfAMA-1 gene of each isolate was sequenced. Polymorphic characteristics and the effect of natural selection were analysed using the DNASTAR, MEGA4, and DnaSP programs. Polymorphic nature and natural selection in 459 global PfAMA-1 sequences were also analysed. Thirty-seven different haplotypes of PfAMA-1 were identified in 58 Myanmar P. falciparum isolates. Most amino acid changes identified in Myanmar PfAMA-1 were found in domains I and III. Overall patterns of amino acid changes in Myanmar PfAMA-1 were similar to those in global PfAMA-1. However, frequencies of amino acid changes differed by country. Novel amino acid changes in Myanmar PfAMA-1 were also identified. Evidence for natural selection and recombination events was observed in global PfAMA-1. Among 51 commonly identified amino acid changes in global PfAMA-1 sequences, 43 were found in predicted RBC-binding sites, B-cell epitopes, or IUR regions. Myanmar PfAMA-1 showed similar patterns of nucleotide diversity and amino acid polymorphisms compared to those of global PfAMA-1. Balancing natural selection and intragenic recombination across PfAMA-1 are likely to play major roles in generating genetic diversity in global PfAMA-1. Most common amino acid changes in global PfAMA-1 were located in predicted B-cell epitopes where high levels of nucleotide diversity and balancing natural selection were found. 
These results highlight the strong selective pressure of host immunity on the PfAMA-1 gene. These results have significant implications in understanding the nature of Myanmar PfAMA-1 along with global PfAMA-1. They also provide useful information for the development of effective malaria vaccine based on this antigen.
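The abstract above reports haplotype counts and nucleotide diversity, which the authors obtained with DnaSP and related programs. As a rough illustration only, the two quantities can be sketched in Python as follows, assuming equal-length, gap-free aligned sequences; the function names and the toy sequences are hypothetical and are not taken from the study.

```python
from itertools import combinations


def count_haplotypes(seqs):
    """Count distinct sequences (haplotypes) in an alignment."""
    return len(set(seqs))


def nucleotide_diversity(seqs):
    """Average number of pairwise differences per site (pi).

    Assumes all sequences are aligned, gap-free, and of equal length.
    """
    length = len(seqs[0])
    if any(len(s) != length for s in seqs):
        raise ValueError("sequences must have equal length")
    pairs = list(combinations(seqs, 2))
    diffs = sum(sum(a != b for a, b in zip(s1, s2)) for s1, s2 in pairs)
    return diffs / (len(pairs) * length)


# Hypothetical toy alignment, not data from the study.
toy = ["ATGCATGC", "ATGCATGC", "ATGAATGC", "TTGAATGC"]
print(count_haplotypes(toy))
print(round(nucleotide_diversity(toy), 4))
```

Dedicated population-genetics software also handles gaps, missing data, and sampling corrections, which this sketch deliberately ignores.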

Genetic Diversity of Hepatitis A Virus in China: VP3-VP1-2A Genes and Evidence of Quasispecies Distribution in the Isolates

PubMed Central

Cao, Jingyuan; Zhou, Wenting; Yi, Yao; Jia, Zhiyuan; Bi, Shengli

2013-01-01

Hepatitis A virus (HAV) is the most common cause of infectious hepatitis throughout the world, spread largely by the fecal-oral route. To characterize the genetic diversity of the virus circulating in China, where HAV is endemic, we selected the outbreak cases with identical sequences in the VP1-2A junction region and compiled a panel of 42 isolates. The VP3-VP1-2A regions of the HAV capsid-coding genes were further sequenced and analyzed. The quasispecies distribution was evaluated by cloning the VP3 and VP1-2A genes in three clinical samples. Phylogenetic analysis demonstrated that the same genotyping results could be obtained whether using the complete VP3, VP1, or partial VP1-2A genes for analysis in this study, although some differences did exist. Most isolates clustered in sub-genotype IA, and fewer in sub-genotype IB. No amino acid mutations were found at the published neutralizing epitope sites; however, several unique amino acid substitutions in the VP3 or VP1 region were identified, with two amino acid variants located close to the immunodominant site. Quasispecies analysis showed that the mutation frequencies were in the range of 7.22 × 10^-4 to 2.33 × 10^-3 substitutions per nucleotide for VP3, VP1, or VP1-2A. When compared with the consensus sequences, mutated nucleotide sites represented the minority of all the analyzed sequence sites. HAV replicated as a complex distribution of closely genetically related variants referred to as quasispecies, and was under negative selection. The results indicate that diverse HAV strains and quasispecies inside the viral populations are present in China, with unique amino acid substitutions detected close to the immunodominant site, and that the possibility of antigenic escape mutants cannot be ruled out and needs to be further analyzed. PMID:24069343
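A mutation frequency expressed in substitutions per nucleotide, as quoted in the abstract above, is commonly computed by comparing each clone with the consensus of all clones. The sketch below assumes that definition (the abstract does not spell out the exact procedure) and uses hypothetical function names and toy clone sequences.

```python
from collections import Counter


def consensus(clones):
    """Majority-rule consensus of equal-length aligned clone sequences."""
    return "".join(Counter(col).most_common(1)[0][0] for col in zip(*clones))


def mutation_frequency(clones):
    """Substitutions relative to the consensus, per nucleotide sequenced."""
    ref = consensus(clones)
    mutations = sum(a != b for clone in clones for a, b in zip(clone, ref))
    return mutations / (len(clones) * len(ref))


# Hypothetical toy clones, not data from the study.
clones = ["ACGTACGTAC", "ACGTACGTAC", "ACGTTCGTAC", "ACGTACGTAC"]
print(consensus(clones))
print(mutation_frequency(clones))
```

With real quasispecies data the denominator is the total number of nucleotides sequenced across all clones, so the resulting values are far smaller than in this toy case.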
The microbiology of Bandji, palm wine of Borassus akeassii from Burkina Faso: identification and genotypic diversity of yeasts, lactic acid and acetic acid bacteria.

PubMed

Ouoba, L I I; Kando, C; Parkouda, C; Sawadogo-Lingani, H; Diawara, B; Sutherland, J P

2012-12-01

To investigate physicochemical characteristics and especially genotypic diversity of the main culturable micro-organisms involved in fermentation of sap from Borassus akeassii, a newly identified palm tree from West Africa. Physicochemical characterization was performed using conventional methods. Identification of micro-organisms included phenotyping and sequencing of: 26S rRNA gene for yeasts, 16S rRNA and gyrB genes for lactic acid bacteria (LAB) and acetic acid bacteria (AAB). Interspecies and intraspecies genotypic diversities of the micro-organisms were screened respectively by amplification of the ITS1-5.8S rDNA-ITS2/16S-23S rDNA ITS regions and repetitive sequence-based PCR (rep-PCR). The physicochemical characteristics of samples were: pH: 3.48-4.12, titratable acidity: 1.67-3.50 mg KOH g(-1), acetic acid: 0.16-0.37%, alcohol content: 0.30-2.73%, sugars (degrees Brix): 2.70-8.50. Yeast included mainly Saccharomyces cerevisiae and species of the genera Arthroascus, Issatchenkia, Candida, Trichosporon, Hanseniaspora, Kodamaea, Schizosaccharomyces, Trigonopsis and Galactomyces. Lactobacillus plantarum was the predominant LAB species. Three other species of Lactobacillus were also identified as well as isolates of Leuconostoc mesenteroides, Fructobacillus durionis and Streptococcus mitis. Acetic acid bacteria included nine species of the genus Acetobacter with Acetobacter indonesiensis as predominant species. In addition, isolates of Gluconobacter oxydans and Gluconacetobacter saccharivorans were also identified. Intraspecies diversity was observed for some species of micro-organisms including four genotypes for Acet. indonesiensis, three for Candida tropicalis and Lactobacillus fermentum and two each for S. cerevisiae, Trichosporon asahii, Candida pararugosa and Acetobacter tropicalis. Fermentation of palm sap from B. akeassii involved multi-yeast-LAB-AAB cultures at genus, species and intraspecies level. This is the first study describing microbiological and physicochemical characteristics of palm wine from B. akeassii. Genotypic diversity of palm wine LAB and AAB not reported before is demonstrated and this constitutes valuable information for better understanding of the fermentation which can be used to improve the product quality and develop added value by-products. © 2012 The Society for Applied Microbiology.
Fungal genome sequencing: basic biology to biotechnology.

PubMed

Sharma, Krishna Kant

2016-08-01

The genome sequences provide a first glimpse into the genomic basis of the biological diversity of filamentous fungi and yeast. The genome sequence of the budding yeast, Saccharomyces cerevisiae, with a small genome size, unicellular growth, and rich history of genetic and molecular analyses was a milestone of early genomics in the 1990s. The subsequent completion of fission yeast, Schizosaccharomyces pombe and genetic model, Neurospora crassa initiated a revolution in the genomics of the fungal kingdom. In due course of time, a substantial number of fungal genomes have been sequenced and publicly released, representing the widest sampling of genomes from any eukaryotic kingdom. An ambitious genome-sequencing program provides a wealth of data on metabolic diversity within the fungal kingdom, thereby enhancing research into medical science, agriculture science, ecology, bioremediation, bioenergy, and the biotechnology industry. Fungal genomics have higher potential to positively affect human health, environmental health, and the planet's stored energy. With a significant increase in sequenced fungal genomes, the known diversity of genes encoding organic acids, antibiotics, enzymes, and their pathways has increased exponentially. Currently, over a hundred fungal genome sequences are publicly available; however, no inclusive review has been published. This review is an initiative to address the significance of the fungal genome-sequencing program and provides the road map for basic and applied research.
Genetic diversity of the DBLalpha region in Plasmodium falciparum var genes among Asia-Pacific isolates.

PubMed

Fowler, Elizabeth V; Peters, Jennifer M; Gatton, Michelle L; Chen, Nanhua; Cheng, Qin

2002-03-01

In Plasmodium falciparum a highly polymorphic multi-copy gene family, var, encodes the variant surface antigen P. falciparum erythrocyte membrane protein 1 (PfEMP1), which has an important role in cytoadherence and immune evasion. Using previously described universal PCR primers for the first Duffy binding-like domain (DBLalpha) of var we analysed the DBLalpha repertoires of Dd2 (originally from Thailand) and eight isolates from the Solomon Islands (n=4), Philippines (n=2), Papua New Guinea (n=1) and Africa (n=1). We found 15-32 unique DBLalpha sequence types among these isolates and estimated detectable DBLalpha repertoire sizes ranging from 33-38 to 52-57 copies per genome. Our data suggest that var gene repertoires generally consist of 40-50 copies per genome. Eighteen DBLalpha sequences appeared in more than one Asia-Pacific isolate with the number of sequences shared between any two isolates ranging from 0 to 6 (mean=2.0 +/-1.6). At the amino acid level DBLalpha sequence similarity within isolates ranged from 45.2 +/- 7.1 to 50.2 +/- 6.9%, and was not significantly different from the DBLalpha amino acid sequence similarity among isolates (P>0.1). Comparisons with published sequences also revealed little overlap among DBLalpha sequences from different regions. High DBLalpha sequence diversity and minimal overlap among these isolates suggest that the global var gene repertoire is immense, and may potentially be selected for by the host's protective immune response to the var gene products, PfEMP1.
A diverse family of serine proteinase genes expressed in cotton boll weevil (Anthonomus grandis): implications for the design of pest-resistant transgenic cotton plants.

PubMed

Oliveira-Neto, Osmundo B; Batista, João A N; Rigden, Daniel J; Fragoso, Rodrigo R; Silva, Rodrigo O; Gomes, Eliane A; Franco, Octávio L; Dias, Simoni C; Cordeiro, Célia M T; Monnerat, Rose G; Grossi-De-Sá, Maria F

2004-09-01

Fourteen different cDNA fragments encoding serine proteinases were isolated by reverse transcription-PCR from cotton boll weevil (Anthonomus grandis) larvae. A large diversity between the sequences was observed, with a mean pairwise identity of 22% in the amino acid sequence. The cDNAs encompassed 11 trypsin-like sequences classifiable into three families and three chymotrypsin-like sequences belonging to a single family. Using a combination of 5' and 3' RACE, the full-length sequence was obtained for five of the cDNAs, named Agser2, Agser5, Agser6, Agser10 and Agser21. The encoded proteins included amino acid sequence motifs of serine proteinase active sites, conserved cysteine residues, and both zymogen activation and signal peptides. Southern blotting analysis suggested that one or two copies of these serine proteinase genes exist in the A. grandis genome. Northern blotting analysis of Agser2 and Agser5 showed that for both genes, expression is induced upon feeding and is concentrated in the gut of larvae and adult insects. Reverse northern analysis of the 14 cDNA fragments showed that only two trypsin-like and two chymotrypsin-like were expressed at detectable levels. Under the effect of the serine proteinase inhibitors soybean Kunitz trypsin inhibitor and black-eyed pea trypsin/chymotrypsin inhibitor, expression of one of the trypsin-like sequences was upregulated while expression of the two chymotrypsin-like sequences was downregulated. Copyright 2004 Elsevier Ltd.
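The "mean pairwise identity" statistic quoted in the abstract above can be illustrated with a short Python sketch. It assumes the sequences are already aligned (gaps written as "-") and scores identity over columns where neither sequence has a gap; this is one of several conventions in use, and the abstract does not say which one the authors applied. The function names and toy sequences are hypothetical.

```python
from itertools import combinations


def pairwise_identity(s1, s2, gap="-"):
    """Fraction of identical residues over columns with no gap in either sequence."""
    columns = [(a, b) for a, b in zip(s1, s2) if a != gap and b != gap]
    if not columns:
        return 0.0
    return sum(a == b for a, b in columns) / len(columns)


def mean_pairwise_identity(aligned):
    """Average identity over all unordered pairs of aligned sequences."""
    pairs = list(combinations(aligned, 2))
    return sum(pairwise_identity(a, b) for a, b in pairs) / len(pairs)


# Hypothetical toy alignment, not sequences from the study.
toy_alignment = ["MKV-LA", "MKVALA", "MRV-LG"]
print(round(mean_pairwise_identity(toy_alignment), 3))
```

A low mean value, such as the 22% reported above, indicates that most pairs in the family share only a small fraction of aligned residues.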
Diversity of Ligninolytic Enzymes and Their Genes in Strains of the Genus Ganoderma: Applicable for Biodegradation of Xenobiotic Compounds?

PubMed Central

Torres-Farradá, Giselle; Manzano León, Ana M.; Rineau, François; Ledo Alonso, Lucía L.; Sánchez-López, María I.; Thijs, Sofie; Colpaert, Jan; Ramos-Leal, Miguel; Guerra, Gilda; Vangronsveld, Jaco

2017-01-01

White-rot fungi (WRF) and their ligninolytic enzymes (laccases and peroxidases) are considered promising biotechnological tools to remove lignin related Persistent Organic Pollutants from industrial wastewaters and contaminated ecosystems. A high diversity of the genus Ganoderma has been reported in Cuba; in spite of this, the diversity of ligninolytic enzymes and their genes remained unexplored. In this study, 13 native WRF strains were isolated from decayed wood in urban ecosystems in Havana (Cuba). All strains were identified as Ganoderma sp. using a multiplex polymerase chain reaction (PCR)-method based on ITS sequences. All Ganoderma sp. strains produced laccase enzymes at higher levels than non-specific peroxidases. Native-PAGE of extracellular enzymatic extracts revealed a high diversity of laccase isozymes patterns between the strains, suggesting the presence of different amino acid sequences in the laccase enzymes produced by these Ganoderma strains. We determined the diversity of genes encoding laccases and peroxidases using a PCR and cloning approach with basidiomycete-specific primers. Between two and five laccase genes were detected in each strain. In contrast, only one gene encoding manganese peroxidase or versatile peroxidase was detected in each strain. The translated laccases and peroxidases amino acid sequences have not been described before. Extracellular crude enzymatic extracts produced by the Ganoderma UH strains, were able to degrade model chromophoric compounds such as anthraquinone and azo dyes. These findings hold promises for the development of a practical application for the treatment of textile industry wastewaters and also for bioremediation of polluted ecosystems by well-adapted native WRF strains. PMID:28588565
Diversity of lactic acid bacteria in suan-tsai and fu-tsai, traditional fermented mustard products of Taiwan.

PubMed

Chao, Shiou-Huei; Wu, Ruei-Jie; Watanabe, Koichi; Tsai, Ying-Chieh

2009-11-15

Fu-tsai and suan-tsai are spontaneously fermented mustard products traditionally prepared by the Hakka tribe of Taiwan. We chose 5 different processing stages of these products for analysis of the microbial community of lactic acid bacteria (LAB) by 16S rRNA gene sequencing. From 500 LAB isolates we identified 119 representative strains belonging to 5 genera and 18 species, including Enterococcus (1 species), Lactobacillus (11 species), Leuconostoc (3 species), Pediococcus (1 species), and Weissella (2 species). The LAB composition of mustard fermented for 3 days, known as the Mu sample, was the most diverse, with 11 different LAB species being isolated. We used sequence analysis of the 16S rRNA gene to identify the LAB strains and analysis of the dnaA, pheS, and rpoA genes to identify 13 LAB strains for which identification by 16S rRNA gene sequences was not possible. These 13 strains were found to belong to 5 validated known species: Lactobacillus farciminis, Leuconostoc mesenteroides, Leuconostoc pseudomesenteroides, Weissella cibaria, and Weissella paramesenteroides, and 5 possibly novel Lactobacillus species. These results revealed that there is a high level of diversity in LAB at the different stages of fermentation in the production of suan-tsai and fu-tsai.
Influence of Geographical Origin and Flour Type on Diversity of Lactic Acid Bacteria in Traditional Belgian Sourdoughs

PubMed Central

Scheirlinck, Ilse; Van der Meulen, Roel; Van Schoor, Ann; Vancanneyt, Marc; De Vuyst, Luc; Vandamme, Peter; Huys, Geert

2007-01-01

A culture-based approach was used to investigate the diversity of lactic acid bacteria (LAB) in Belgian traditional sourdoughs and to assess the influence of flour type, bakery environment, geographical origin, and technological characteristics on the taxonomic composition of these LAB communities. For this purpose, a total of 714 LAB from 21 sourdoughs sampled at 11 artisan bakeries throughout Belgium were subjected to a polyphasic identification approach. The microbial composition of the traditional sourdoughs was characterized by bacteriological culture in combination with genotypic identification methods, including repetitive element sequence-based PCR fingerprinting and phenylalanyl-tRNA synthase (pheS) gene sequence analysis. LAB from Belgian sourdoughs belonged to the genera Lactobacillus, Pediococcus, Leuconostoc, Weissella, and Enterococcus, with the heterofermentative species Lactobacillus paralimentarius, Lactobacillus sanfranciscensis, Lactobacillus plantarum, and Lactobacillus pontis as the most frequently isolated taxa. Statistical analysis of the identification data indicated that the microbial composition of the sourdoughs is mainly affected by the bakery environment rather than the flour type (wheat, rye, spelt, or a mixture of these) used. In conclusion, the polyphasic approach, based on rapid genotypic screening and high-resolution, sequence-dependent identification, proved to be a powerful tool for studying the LAB diversity in traditional fermented foods such as sourdough. PMID:17675431
Influence of geographical origin and flour type on diversity of lactic acid bacteria in traditional Belgian sourdoughs.

PubMed

Scheirlinck, Ilse; Van der Meulen, Roel; Van Schoor, Ann; Vancanneyt, Marc; De Vuyst, Luc; Vandamme, Peter; Huys, Geert

2007-10-01

A culture-based approach was used to investigate the diversity of lactic acid bacteria (LAB) in Belgian traditional sourdoughs and to assess the influence of flour type, bakery environment, geographical origin, and technological characteristics on the taxonomic composition of these LAB communities. For this purpose, a total of 714 LAB from 21 sourdoughs sampled at 11 artisan bakeries throughout Belgium were subjected to a polyphasic identification approach. The microbial composition of the traditional sourdoughs was characterized by bacteriological culture in combination with genotypic identification methods, including repetitive element sequence-based PCR fingerprinting and phenylalanyl-tRNA synthase (pheS) gene sequence analysis. LAB from Belgian sourdoughs belonged to the genera Lactobacillus, Pediococcus, Leuconostoc, Weissella, and Enterococcus, with the heterofermentative species Lactobacillus paralimentarius, Lactobacillus sanfranciscensis, Lactobacillus plantarum, and Lactobacillus pontis as the most frequently isolated taxa. Statistical analysis of the identification data indicated that the microbial composition of the sourdoughs is mainly affected by the bakery environment rather than the flour type (wheat, rye, spelt, or a mixture of these) used. In conclusion, the polyphasic approach, based on rapid genotypic screening and high-resolution, sequence-dependent identification, proved to be a powerful tool for studying the LAB diversity in traditional fermented foods such as sourdough.
Sequence diversity among badnavirus isolates infecting yam (Dioscorea spp.) in Ghana, Togo, Benin and Nigeria.

PubMed

Eni, A O; Hughes, J d'A; Asiedu, R; Rey, M E C

2008-01-01

We analysed the sequence diversity in the reverse transcriptase (RT)/ribonuclease H (RNaseH) coding region of 19 badnavirus isolates infecting yam (Dioscorea spp.) in Ghana, Togo, Benin, and Nigeria. Phylogenetic analysis of the deduced amino acid sequences revealed that the isolates are broadly divided into two distinct species, each clustering with Dioscorea alata bacilliform virus (DaBV) and Dioscorea sansibarensis bacilliform virus (DsBV). Fourteen isolates had 90-96% amino acid identity with DaBV, while four isolates had 83-84% amino acid identity with DsBV. One isolate from Benin, BN4Dr, was distinct and had 77 and 75% amino acid identity with DaBV and DsBV, respectively, and may be a member of a new badnavirus species infecting yam in West Africa. Viruses of the two main species were present in Ghana, Togo and Benin and were observed to infect both D. alata and D. rotundata indiscriminately. This is the first confirmed report of DsBV infection in yam in Ghana and Togo. The results of this study demonstrate that members of two distinct species of badnaviruses infect yam in the West African yam zone and suggest a putative new species, BN4Dr. We also conclude that these species are not confined to limited geographic regions or specific for yam host species. However, the three badnavirus species are serologically related. The sequence information obtained from this study can be used to develop PCR-based diagnostics to detect members of the various species and/or strains of badnaviruses infecting yam in West Africa.
A-to-I RNA Editing Contributes to Proteomic Diversity in Cancer. | Office of Cancer Genomics

Cancer.gov

Adenosine (A) to inosine (I) RNA editing introduces many nucleotide changes in cancer transcriptomes. However, due to the complexity of post-transcriptional regulation, the contribution of RNA editing to proteomic diversity in human cancers remains unclear. Here, we performed an integrated analysis of TCGA genomic data and CPTAC proteomic data. Despite limited site diversity, we demonstrate that A-to-I RNA editing contributes to proteomic diversity in breast cancer through changes in amino acid sequences. We validate the presence of editing events at both RNA and protein levels.
Neotropical Bats from Costa Rica harbour Diverse Coronaviruses.

PubMed

Moreira-Soto, A; Taylor-Castillo, L; Vargas-Vargas, N; Rodríguez-Herrera, B; Jiménez, C; Corrales-Aguilar, E

2015-11-01

Bats are hosts of diverse coronaviruses (CoVs) known to potentially cross the host-species barrier. For analysing coronavirus diversity in a bat species-rich country, a total of 421 anal swabs/faecal samples from Costa Rican bats were screened for CoV RNA-dependent RNA polymerase (RdRp) gene sequences by a pancoronavirus PCR. Six families, 24 genera and 41 species of bats were analysed. The detection rate for CoV was 1%. Individuals (n = 4) from four different species of frugivorous (Artibeus jamaicensis, Carollia perspicillata and Carollia castanea) and nectivorous (Glossophaga soricina) bats were positive for coronavirus-derived nucleic acids. Analysis of 440 nt. RdRp sequences allocated all Costa Rican bat CoVs to the α-CoV group. Several CoV sequences clustered near previously described CoVs from the same species of bat, but were phylogenetically distant from the human CoV sequences identified to date, suggesting no recent spillover events. The Glossophaga soricina CoV sequence is sufficiently dissimilar (26% homology to the closest known bat CoVs) to represent a unique coronavirus not clustering near other CoVs found in the same bat species so far, implying an even higher CoV diversity than previously suspected. © 2015 Blackwell Verlag GmbH.
Analysis of diversity of diazotrophic bacteria associated with the rhizosphere of a tropical Arbor, Melastoma malabathricum L.

PubMed

Sato, Atsuya; Watanabe, Toshihiro; Unno, Yusuke; Purnomo, Erry; Osaki, Mitsuru; Shinano, Takuro

2009-01-01

The diversity of diazotrophic bacteria in the rhizosphere of Melastoma malabathricum L. was investigated by cloning-sequencing of the nifH gene directly amplified from DNA extracted from soil. Samples were obtained from the rhizosphere and bulk soil of M. malabathricum growing in three different soil types (acid sulfate, peat and sandy clay soils) located very close to each other in south Kalimantan, Indonesia. Six clone libraries were constructed, generated from bulk and rhizosphere soil samples, and 300 nifH clones were produced, then assembled into 29 operational taxonomic units (OTUs) based on percent identity values. Our results suggested that nifH gene diversity is mainly dependent on soil properties, and did not differ remarkably between the rhizosphere and bulk soil of M. malabathricum except in acid sulfate soil. In acid sulfate soil, as the Shannon diversity index was lower in rhizosphere than in bulk soil, it is suggested that particular bacterial species might accumulate in the rhizosphere.
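The Shannon diversity index mentioned in the abstract above is H = -Σ p_i ln p_i, where p_i is the proportion of clones assigned to OTU i. A minimal Python sketch follows; the function name and the clone counts are hypothetical and are not the study's data.

```python
import math


def shannon_index(otu_counts):
    """Shannon diversity H = -sum(p_i * ln(p_i)) over OTUs with non-zero counts."""
    total = sum(otu_counts)
    return -sum((n / total) * math.log(n / total) for n in otu_counts if n > 0)


# Hypothetical clone counts per OTU for two libraries of 40 clones each.
even_library = [10, 10, 10, 10]   # clones spread evenly over four OTUs
skewed_library = [37, 1, 1, 1]    # one OTU dominates

print(round(shannon_index(even_library), 3))
print(round(shannon_index(skewed_library), 3))
```

The skewed library yields the lower index, which is the pattern the abstract interprets as particular bacterial groups accumulating in the rhizosphere of acid sulfate soil.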
Genome sequence analysis of five Canadian isolates of strawberry mottle virus reveals extensive intra-species diversity and a longer RNA2 with increased coding capacity compared to a previously characterized European isolate.

PubMed

Bhagwat, Basdeo; Dickison, Virginia; Ding, Xinlun; Walker, Melanie; Bernardy, Michael; Bouthillier, Michel; Creelman, Alexa; DeYoung, Robyn; Li, Yinzi; Nie, Xianzhou; Wang, Aiming; Xiang, Yu; Sanfaçon, Hélène

2016-06-01

In this study, we report the genome sequence of five isolates of strawberry mottle virus (family Secoviridae, order Picornavirales) from strawberry field samples with decline symptoms collected in Eastern Canada. The Canadian isolates differed from the previously characterized European isolate 1134 in that they had a longer RNA2, resulting in a 239-amino-acid extension of the C-terminal region of the polyprotein. Sequence analysis suggests that reassortment and recombination occurred among the isolates. Phylogenetic analysis revealed that the Canadian isolates are diverse, grouping in two separate branches along with isolates from Europe and the Americas.
Microbial Ecology and Evolution in the Acid Mine Drainage Model System.

PubMed

Huang, Li-Nan; Kuang, Jia-Liang; Shu, Wen-Sheng

2016-07-01

Acid mine drainage (AMD) is a unique ecological niche for acid- and toxic-metals-adapted microorganisms. These low-complexity systems offer a special opportunity for the ecological and evolutionary analyses of natural microbial assemblages. The last decade has witnessed an unprecedented interest in the study of AMD communities using 16S rRNA high-throughput sequencing and community genomic and postgenomic methodologies, significantly advancing our understanding of microbial diversity, community function, and evolution in acidic environments. This review describes new data on AMD microbial ecology and evolution, especially dynamics of microbial diversity, community functions, and population genomes, and further identifies gaps in our current knowledge that future research, with integrated applications of meta-omics technologies, will fill. Copyright © 2016 Elsevier Ltd. All rights reserved.
Sex determination: balancing selection in the honey bee.

PubMed

Charlesworth, Deborah

2004-07-27

Sequences of alleles of the honey bee's primary sex-determining gene have extremely high diversity, with many amino acid variants, suggesting that different alleles of this gene have been maintained in populations for very long evolutionary times.
Genome Sequence of “Candidatus Walczuchella monophlebidarum” the Flavobacterial Endosymbiont of Llaveia axin axin (Hemiptera: Coccoidea: Monophlebidae)

PubMed Central

Rosas-Pérez, Tania; Rosenblueth, Mónica; Rincón-Rosales, Reiner; Mora, Jaime; Martínez-Romero, Esperanza

2014-01-01

Scale insects (Hemiptera: Coccoidea) constitute a very diverse group of sap-feeding insects with a large diversity of symbiotic associations with bacteria. Here, we present the complete genome sequence, metabolic reconstruction, and comparative genomics of the flavobacterial endosymbiont of the giant scale insect Llaveia axin axin. The gene repertoire of its 309,299 bp genome was similar to that of other flavobacterial insect endosymbionts though not syntenic. According to its genetic content, essential amino acid biosynthesis is likely to be the flavobacterial endosymbiont's principal contribution to the symbiotic association with its insect host. We also report the presence of a γ-proteobacterial symbiont that may be involved in waste nitrogen recycling and also has amino acid biosynthetic capabilities that may provide metabolic precursors to the flavobacterial endosymbiont. We propose “Candidatus Walczuchella monophlebidarum” as the name of the flavobacterial endosymbiont of insects from the Monophlebidae family. PMID:24610838
Combining Rosetta with molecular dynamics (MD): A benchmark of the MD-based ensemble protein design.

PubMed

Ludwiczak, Jan; Jarmula, Adam; Dunin-Horkawicz, Stanislaw

2018-07-01

Computational protein design is a set of procedures for computing amino acid sequences that will fold into a specified structure. Rosetta Design, a commonly used software for protein design, allows for the effective identification of sequences compatible with a given backbone structure, while molecular dynamics (MD) simulations can thoroughly sample near-native conformations. We benchmarked a procedure in which Rosetta design is started on MD-derived structural ensembles and showed that such a combined approach generates 20-30% more diverse sequences than currently available methods with only a slight increase in computation time. Importantly, the increase in diversity is achieved without a loss in the quality of the designed sequences assessed by their resemblance to natural sequences. We demonstrate that the MD-based procedure is also applicable to de novo design tasks started from backbone structures without any sequence information. In addition, we implemented a protocol that can be used to assess the stability of designed models and to select the best candidates for experimental validation. In sum our results demonstrate that the MD ensemble-based flexible backbone design can be a viable method for protein design, especially for tasks that require a large pool of diverse sequences. Copyright © 2018 Elsevier Inc. All rights reserved.
Algorithms for optimizing cross-overs in DNA shuffling.

PubMed

He, Lu; Friedman, Alan M; Bailey-Kellogg, Chris

2012-03-21

DNA shuffling generates combinatorial libraries of chimeric genes by stochastically recombining parent genes. The resulting libraries are subjected to large-scale genetic selection or screening to identify those chimeras with favorable properties (e.g., enhanced stability or enzymatic activity). While DNA shuffling has been applied quite successfully, it is limited by its homology-dependent, stochastic nature. Consequently, it is used only with parents of sufficient overall sequence identity, and provides no control over the resulting chimeric library. This paper presents efficient methods to extend the scope of DNA shuffling to handle significantly more diverse parents and to generate more predictable, optimized libraries. Our CODNS (cross-over optimization for DNA shuffling) approach employs polynomial-time dynamic programming algorithms to select codons for the parental amino acids, allowing for zero or a fixed number of conservative substitutions. We first present efficient algorithms to optimize the local sequence identity or the nearest-neighbor approximation of the change in free energy upon annealing, objectives that were previously optimized by computationally-expensive integer programming methods. We then present efficient algorithms for more powerful objectives that seek to localize and enhance the frequency of recombination by producing "runs" of common nucleotides either overall or according to the sequence diversity of the resulting chimeras. We demonstrate the effectiveness of CODNS in choosing codons and allocating substitutions to promote recombination between parents targeted in earlier studies: two GAR transformylases (41% amino acid sequence identity), two very distantly related DNA polymerases, Pol X and β (15%), and beta-lactamases of varying identity (26-47%). Our methods provide the protein engineer with a new approach to DNA shuffling that supports substantially more diverse parents, is more deterministic, and generates more predictable and more diverse chimeric libraries.
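The core idea of choosing synonymous codons so that two parental genes share more nucleotides can be illustrated with a much simpler procedure than the one the abstract describes. The sketch below is not the CODNS dynamic program: it is a greedy, position-by-position simplification that assumes two gap-free, equal-length parental protein sequences and the standard genetic code, and it ignores runs of common nucleotides, annealing free energy, and conservative substitutions. All names and the toy peptides are hypothetical.

```python
from itertools import product

# Standard genetic code, codons enumerated with bases in TCAG order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODONS = {}
for index, triplet in enumerate(product(BASES, repeat=3)):
    CODONS.setdefault(AMINO[index], []).append("".join(triplet))


def best_codon_pair(aa1, aa2):
    """Pick one codon per amino acid so the two codons share the most nucleotides."""
    return max(
        product(CODONS[aa1], CODONS[aa2]),
        key=lambda pair: sum(x == y for x, y in zip(*pair)),
    )


def choose_codons(parent1, parent2):
    """Greedy per-position codon choice for two aligned parental proteins."""
    pairs = [best_codon_pair(a, b) for a, b in zip(parent1, parent2)]
    dna1 = "".join(c1 for c1, _ in pairs)
    dna2 = "".join(c2 for _, c2 in pairs)
    identity = sum(x == y for x, y in zip(dna1, dna2)) / len(dna1)
    return dna1, dna2, identity


# Hypothetical toy peptides, not the enzymes from the study.
dna1, dna2, identity = choose_codons("MLS", "MIR")
print(dna1, dna2, round(identity, 3))
```

Because each position is optimized independently, this sketch maximizes overall nucleotide identity only; producing contiguous runs of shared nucleotides requires considering neighbouring positions jointly, which is where dynamic programming becomes useful.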
Diverse novel astroviruses identified in wild Himalayan marmots.

PubMed

Ao, Yuan-Yun; Yu, Jie-Mei; Li, Li-Li; Cao, Jing-Yuan; Deng, Hong-Yan; Xin, Yun-Yun; Liu, Meng-Meng; Lin, Lin; Lu, Shan; Xu, Jian-Guo; Duan, Zhao-Jun

2017-04-01

With advances in viral surveillance and next-generation sequencing, highly diverse novel astroviruses (AstVs) and different animal hosts have been discovered in recent years. However, the existence of AstVs in marmots had yet to be shown. Here, we identified two highly divergent strains of AstVs (tentatively named Qinghai Himalayan marmot AstVs, HHMAstV1 and HHMAstV2), by viral metagenomic analysis in liver tissues isolated from wild Marmota himalayana in China. Overall, 12 of 99 (12.1 %) M. himalayana faecal samples were positive for the presence of genetically diverse AstVs, while only HHMAstV1 and HHMAstV2 were identified in 300 liver samples. The complete genomic sequences of HHMAstV1 and HHMAstV2 were 6681 and 6610 nt in length, respectively, with the typical genomic organization of AstVs. Analysis of the complete ORF 2 sequence showed that these novel AstVs are most closely related to the rabbit AstV, mamastrovirus 23 (with 31.0 and 48.0 % shared amino acid identity, respectively). Phylogenetic analysis of the amino acid sequences of ORF1a, ORF1b and ORF2 indicated that HHMAstV1 and HHMAstV2 form two distinct clusters among the mamastroviruses, and may share a common ancestor with the rabbit-specific mamastrovirus 23. These results suggest that HHMAstV1 and HHMAstV2 are two novel species of the genus Mamastrovirus in the Astroviridae. The remarkable diversity of these novel AstVs will contribute to a greater understanding of the evolution and ecology of AstVs, although additional studies will be needed to understand the clinical significance of these novel AstVs in marmots, as well as in humans.
Analysis of microbial community variation during the mixed culture fermentation of agricultural peel wastes to produce lactic acid.

PubMed

Liang, Shaobo; Gliniewicz, Karol; Gerritsen, Alida T; McDonald, Armando G

2016-05-01

Mixed culture fermentation can be used to convert organic wastes into various chemicals and fuels. This study examined the fermentation performance of four batch reactors fed with different agricultural peel wastes (orange, banana, and potato (mechanical and steam)) using mixed cultures, and monitored the variation of the reactor microbial communities at intervals by Illumina sequencing of 16S rRNA genes. All four reactors produced a similar chemical profile with lactic acid (LA) as the dominant compound. Acetic acid and ethanol were also observed in small fractions. The Illumina sequencing results revealed that the diversity of the microbial community decreased during fermentation and that a community of largely lactic acid-producing bacteria dominated by species of Lactobacillus developed. Copyright © 2016 Elsevier Ltd. All rights reserved.
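The abstract above reports that community diversity decreased during fermentation but does not say which diversity measure was used. As a hedged illustration only, the sketch below computes the Shannon index from taxon read counts; the function name `shannon_index` and the toy counts are assumptions made for this example and are not data from the study.

```python
import math


def shannon_index(counts):
    """Shannon diversity H' = -sum(p_i * ln p_i) over taxa with non-zero counts."""
    total = sum(counts)
    if total <= 0:
        raise ValueError("counts must sum to a positive number")
    # Taxa with zero reads contribute nothing to the index.
    return -sum((c / total) * math.log(c / total) for c in counts if c > 0)


# Hypothetical read counts per taxon (not from the paper):
# an even community at the start, a dominated community at the end.
early_counts = [25, 25, 25, 25]
late_counts = [97, 1, 1, 1]

print(f"early H' = {shannon_index(early_counts):.3f}")
print(f"late  H' = {shannon_index(late_counts):.3f}")
```

A community dominated by one group, as described for the Lactobacillus-rich reactors, gives a lower index than an even community with the same number of taxa.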
Constancy and diversity in the flavivirus fusion peptide.

PubMed

Seligman, Stephen J

2008-02-14

Flaviviruses include the mosquito-borne dengue, Japanese encephalitis, yellow fever and West Nile and the tick-borne encephalitis viruses. They are responsible for considerable world-wide morbidity and mortality. Viral entry is mediated by a conserved fusion peptide containing 16 amino acids located in domain II of the envelope protein E. Highly orchestrated conformational changes initiated by exposure to acidic pH accompany the fusion process and are important factors limiting amino acid changes in the fusion peptide that still permit fusion with host cell membranes in both arthropod and vertebrate hosts. The cell-fusing related agents, growing only in mosquitoes or insect cell lines, possess a different homologous peptide. Analysis of 46 named flaviviruses deposited in the Entrez Nucleotides database extended the constancy in the canonical fusion peptide sequences of mosquito-borne, tick-borne and viruses with no known vector to include more recently-sequenced viruses. The mosquito-borne signature amino acid, G104, was also found in flaviviruses with no known vector and with the cell-fusion related viruses. Despite the constancy in the canonical sequences in pathogenic flaviviruses, mutations were surprisingly frequent with a 27% prevalence of nonsynonymous mutations in yellow fever virus fusion peptide sequences, and 0 to 7.4% prevalence in the others. Six of seven yellow fever patients whose virus had fusion peptide mutations died. In the cell-fusing related agents, not enough sequences have been deposited to estimate reliably the prevalence of fusion peptide mutations. However, the canonical sequences homologous to the fusion peptide and the pattern of disulfide linkages in protein E differed significantly from the other flaviviruses. The constancy of the canonical fusion peptide sequences in the arthropod-borne flaviviruses contrasts with the high prevalence of mutations in most individual viruses. 
The discrepancy may be the result of a survival advantage accompanying sequence diversity (quasispecies) involving the fusion peptide. Limited clinical data with yellow fever virus suggest that the presence of fusion peptide mutants is not associated with a decreased case fatality rate. The cell-fusing related agents may have substantial differences from other flaviviruses in their mechanism of viral entry into the host cell.
Microbial Diversity of Acidic Hot Spring (Kawah Hujan B) in Geothermal Field of Kamojang Area, West Java-Indonesia

PubMed Central

Aditiawati, Pingkan; Yohandini, Heni; Madayanti, Fida; Akhmaloka

2009-01-01

Microbial communities in an acidic hot spring, namely Kawah Hujan B, at Kamojang geothermal field, West Java-Indonesia were examined using culture-dependent and culture-independent strategies. Chemical analysis of the hot spring water showed characteristics of acidic-sulfate geothermal activity, with high sulfate concentrations and low pH values (pH 1.8 to 1.9). The microbial community present in the spring was characterized by 16S rRNA gene analysis combined with denaturing gradient gel electrophoresis (DGGE). The majority of the sequences recovered by the culture-independent method were closely related to the Crenarchaeota and Proteobacteria phyla. However, detailed comparison among the members of Crenarchaeota showed some sequence variation compared with the published data, especially in the hypervariable and variable regions. In addition, the sequences did not belong to any particular genus. Meanwhile, the 16S rDNA sequences from culture-dependent samples were mostly closely related to Firmicutes and gamma Proteobacteria. PMID:19440252
Microbial diversity of acidic hot spring (kawah hujan B) in geothermal field of kamojang area, west java-indonesia.

PubMed

Aditiawati, Pingkan; Yohandini, Heni; Madayanti, Fida; Akhmaloka

2009-01-01

Microbial communities in an acidic hot spring, namely Kawah Hujan B, at Kamojang geothermal field, West Java-Indonesia were examined using culture-dependent and culture-independent strategies. Chemical analysis of the hot spring water showed characteristics of acidic-sulfate geothermal activity, with high sulfate concentrations and low pH values (pH 1.8 to 1.9). The microbial community present in the spring was characterized by 16S rRNA gene analysis combined with denaturing gradient gel electrophoresis (DGGE). The majority of the sequences recovered by the culture-independent method were closely related to the Crenarchaeota and Proteobacteria phyla. However, detailed comparison among the members of Crenarchaeota showed some sequence variation compared with the published data, especially in the hypervariable and variable regions. In addition, the sequences did not belong to any particular genus. Meanwhile, the 16S rDNA sequences from culture-dependent samples were mostly closely related to Firmicutes and gamma Proteobacteria.
Genotypic diversity of stress response in Lactobacillus plantarum, Lactobacillus paraplantarum and Lactobacillus pentosus.

PubMed

Ricciardi, Annamaria; Parente, Eugenio; Guidone, Angela; Ianniello, Rocco Gerardo; Zotta, Teresa; Abu Sayem, S M; Varcamonti, Mario

2012-07-02

Lactobacillus plantarum, Lactobacillus pentosus and Lactobacillus paraplantarum are three closely related species which are widespread in food and non-food environments, and are important as starter bacteria or probiotics. In order to evaluate the phenotypic diversity of stress tolerance in the L. plantarum group and the ability to mount an adaptive heat shock response, the survival of exponential and stationary phase and of heat adapted exponential phase cells of six L. plantarum subsp. plantarum, one L. plantarum subsp. argentoratensis, one L. pentosus and two L. paraplantarum strains selected in a previous work upon exposure to oxidative, heat, detergent, starvation and acid stresses was compared to that of the L. plantarum WCFS1 strain. Furthermore, to evaluate the genotypic diversity in stress response genes, ten genes (encoding for chaperones DnaK, GroES and GroEL, regulators CtsR, HrcA and CcpA, ATPases/proteases ClpL, ClpP, ClpX and protease FtsH) were amplified using primers derived from the WCFS1 genome sequence and submitted to restriction with one or two endonucleases. The results were compared by univariate and multivariate statistical methods. In addition, the amplicons for hrcA and ctsR were sequenced and compared by multiple sequence alignment and polymorphism analysis. Although there was evidence of a generalized stress response in the stationary phase, with increase of oxidative, heat, and, to a lesser extent, starvation stress tolerance, and for adaptive heat stress response, with increased tolerance to heat, acid and detergent, different growth phases and adaptation patterns were found. Principal component analysis showed that while heat, acid and detergent stresses respond similarly to growth phase and adaptation, tolerance to oxidative and starvation stresses implies completely unrelated mechanisms. A dendrogram obtained using the data from multilocus restriction typing (MLRT) of stress response genes clearly separated two groups of L. plantarum strains from the other species but there was no correlation between genotypic grouping and grouping obtained on the basis of the stress response pattern, nor with the phylograms obtained from hrcA and ctsR sequences. Differences in sequence in L. plantarum strains were mostly due to single nucleotide polymorphisms with a high frequency of synonymous nucleotide changes and, while hrcA was characterized by an excess of low frequency polymorphism, very low diversity was found in ctsR sequences. Sequence alignment of hrcA allowed a correct discrimination of the strains at the species level, thus confirming the relevance of stress response genes for taxonomy. Copyright © 2012 Elsevier B.V. All rights reserved.
Epstein-Barr Virus Latent Membrane Protein 1 Genetic Variability in Peripheral Blood B Cells and Oropharyngeal Fluids

PubMed Central

Renzette, Nicholas; Somasundaran, Mohan; Brewster, Frank; Coderre, James; Weiss, Eric R.; McManus, Margaret; Greenough, Thomas; Tabak, Barbara; Garber, Manuel; Kowalik, Timothy F.

2014-01-01

ABSTRACT We report the diversity of latent membrane protein 1 (LMP1) gene founder sequences and the level of Epstein-Barr virus (EBV) genome variability over time and across anatomic compartments by using virus genomes amplified directly from oropharyngeal wash specimens and peripheral blood B cells during acute infection and convalescence. The intrahost nucleotide variability of the founder virus was 0.02% across the region sequences, and diversity increased significantly over time in the oropharyngeal compartment (P = 0.004). The LMP1 region showing the greatest level of variability in both compartments, and over time, was concentrated within the functional carboxyl-terminal activating regions 2 and 3 (CTAR2 and CTAR3). Interestingly, a deletion in a proline-rich repeat region (amino acids 274 to 289) of EBV commonly reported in EBV sequenced from cancer specimens was not observed in acute infectious mononucleosis (AIM) patients. Taken together, these data highlight the diversity in circulating EBV genomes and its potential importance in disease pathogenesis and vaccine design. IMPORTANCE This study is among the first to leverage an improved high-throughput deep-sequencing methodology to investigate directly from patient samples the degree of diversity in Epstein-Barr virus (EBV) populations and the extent to which viral genome diversity develops over time in the infected host. Significant variability of circulating EBV latent membrane protein 1 (LMP1) gene sequences was observed between cellular and oral wash samples, and this variability increased over time in oral wash samples. The significance of EBV genetic diversity in transmission and disease pathogenesis is discussed. PMID:24429365
Epstein-Barr virus latent membrane protein 1 genetic variability in peripheral blood B cells and oropharyngeal fluids.

PubMed

Renzette, Nicholas; Somasundaran, Mohan; Brewster, Frank; Coderre, James; Weiss, Eric R; McManus, Margaret; Greenough, Thomas; Tabak, Barbara; Garber, Manuel; Kowalik, Timothy F; Luzuriaga, Katherine

2014-04-01

We report the diversity of latent membrane protein 1 (LMP1) gene founder sequences and the level of Epstein-Barr virus (EBV) genome variability over time and across anatomic compartments by using virus genomes amplified directly from oropharyngeal wash specimens and peripheral blood B cells during acute infection and convalescence. The intrahost nucleotide variability of the founder virus was 0.02% across the region sequences, and diversity increased significantly over time in the oropharyngeal compartment (P = 0.004). The LMP1 region showing the greatest level of variability in both compartments, and over time, was concentrated within the functional carboxyl-terminal activating regions 2 and 3 (CTAR2 and CTAR3). Interestingly, a deletion in a proline-rich repeat region (amino acids 274 to 289) of EBV commonly reported in EBV sequenced from cancer specimens was not observed in acute infectious mononucleosis (AIM) patients. Taken together, these data highlight the diversity in circulating EBV genomes and its potential importance in disease pathogenesis and vaccine design. This study is among the first to leverage an improved high-throughput deep-sequencing methodology to investigate directly from patient samples the degree of diversity in Epstein-Barr virus (EBV) populations and the extent to which viral genome diversity develops over time in the infected host. Significant variability of circulating EBV latent membrane protein 1 (LMP1) gene sequences was observed between cellular and oral wash samples, and this variability increased over time in oral wash samples. The significance of EBV genetic diversity in transmission and disease pathogenesis is discussed.
Diversity of Babesia bovis merozoite surface antigen genes in the Philippines.

PubMed

Tattiyapong, Muncharee; Sivakumar, Thillaiampalam; Ybanez, Adrian Patalinghug; Ybanez, Rochelle Haidee Daclan; Perez, Zandro Obligado; Guswanto, Azirwan; Igarashi, Ikuo; Yokoyama, Naoaki

2014-02-01

Babesia bovis is the causative agent of fatal babesiosis in cattle. In the present study, we investigated the genetic diversity of B. bovis among Philippine cattle, based on the genes that encode merozoite surface antigens (MSAs). Forty-one B. bovis-positive blood DNA samples from cattle were used to amplify the msa-1, msa-2b, and msa-2c genes. In phylogenetic analyses, the msa-1, msa-2b, and msa-2c gene sequences generated from Philippine B. bovis-positive DNA samples were found in six, three, and four different clades, respectively. All of the msa-1 and most of the msa-2b sequences were found in clades that were formed only by Philippine msa sequences in the respective phylograms. While all the msa-1 sequences from the Philippines showed similarity to those formed by Australian msa-1 sequences, the msa-2b sequences showed similarity to either Australian or Mexican msa-2b sequences. In contrast, msa-2c sequences from the Philippines were distributed across all the clades of the phylogram, although one clade was formed exclusively by Philippine msa-2c sequences. Similarities among the deduced amino acid sequences of MSA-1, MSA-2b, and MSA-2c from the Philippines were 62.2-100, 73.1-100, and 67.3-100%, respectively. The present findings demonstrate that B. bovis populations are genetically diverse in the Philippines. This information will provide a good foundation for the future design and implementation of improved immunological preventive methodologies against bovine babesiosis in the Philippines. The study has also generated a set of data that will be useful for further understanding of the global genetic diversity of this important parasite. © 2013.
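The similarity ranges quoted above (for example 62.2-100% for MSA-1) are the kind of summary obtained by comparing every pair of aligned amino acid sequences. The abstract does not describe the software or the exact similarity definition used, so the following is only a minimal sketch assuming simple percent identity over pre-aligned sequences of equal length; the function names and the toy sequences are hypothetical and are not data from the study.

```python
from itertools import combinations


def percent_identity(a: str, b: str) -> float:
    """Percent identity of two pre-aligned sequences ('-' marks a gap).

    Columns where both sequences have a gap are ignored; a residue
    opposite a gap counts as a mismatch (an assumption of this sketch).
    """
    if len(a) != len(b):
        raise ValueError("sequences must be aligned to the same length")
    compared = 0
    matches = 0
    for x, y in zip(a, b):
        if x == "-" and y == "-":
            continue
        compared += 1
        if x == y:
            matches += 1
    if compared == 0:
        raise ValueError("no comparable alignment columns")
    return 100.0 * matches / compared


def identity_range(aligned: dict) -> tuple:
    """Minimum and maximum pairwise percent identity across all sequence pairs."""
    values = [percent_identity(a, b) for a, b in combinations(aligned.values(), 2)]
    return min(values), max(values)


# Hypothetical aligned fragments, for illustration only.
toy_alignment = {
    "isolate_1": "MKTAYIAK",
    "isolate_2": "MKTAYIAK",
    "isolate_3": "MKSAY-AR",
}

low, high = identity_range(toy_alignment)
print(f"pairwise identity: {low:.1f}-{high:.1f}%")
```

Published similarity values may instead use substitution-matrix scoring or a different gap treatment, so figures from this sketch are not directly comparable with those in the abstract.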
Microbial diversity at the moderate acidic stage in three different sulfidic mine tailings dumps generating acid mine drainage.

PubMed

Korehi, Hananeh; Blöthe, Marco; Schippers, Axel

2014-11-01

In freshly deposited sulfidic mine tailings the pH is alkaline or circumneutral. Due to pyrite or pyrrhotite oxidation the pH is dropping over time to pH values <3 at which acidophilic iron- and sulfur-oxidizing prokaryotes prevail and accelerate the oxidation processes, well described for several mine waste sites. The microbial communities at the moderate acidic stage in mine tailings are only scarcely studied. Here we investigated the microbial diversity via 16S rRNA gene sequence analysis in eight samples (pH range 3.2-6.5) from three different sulfidic mine tailings dumps in Botswana, Germany and Sweden. In total 701 partial 16S rRNA gene sequences revealed a divergent microbial community between the three sites and at different tailings depths. Proteobacteria and Firmicutes were overall the most abundant phyla in the clone libraries. Acidobacteria, Actinobacteria, Bacteroidetes, and Nitrospira occurred less frequently. The found microbial communities were completely different to microbial communities in tailings at
Biosynthetic multitasking facilitates thalassospiramide structural diversity in marine bacteria.

PubMed

Ross, Avena C; Xu, Ying; Lu, Liang; Kersten, Roland D; Shao, Zongze; Al-Suwailem, Abdulaziz M; Dorrestein, Pieter C; Qian, Pei-Yuan; Moore, Bradley S

2013-01-23

Thalassospiramides A and B are immunosuppressant cyclic lipopeptides first reported from the marine α-proteobacterium Thalassospira sp. CNJ-328. We describe here the discovery and characterization of an extended family of 14 new analogues from four Tistrella and Thalassospira isolates. These potent calpain 1 protease inhibitors belong to six structure classes in which the length and composition of the acylpeptide side chain varies extensively. Genomic sequence analysis of the thalassospiramide-producing microbes revealed related, genus-specific biosynthetic loci encoding hybrid nonribosomal peptide synthetase/polyketide synthases consistent with thalassospiramide assembly. The bioinformatics analysis of the gene clusters suggests that structural diversity, which ranges from the 803.4 Da thalassospiramide C to the 1291.7 Da thalassospiramide F, results from a complex sequence of reactions involving amino acid substrate channeling and enzymatic multimodule skipping and iteration. Preliminary biochemical analysis of the N-terminal nonribosomal peptide synthetase module from the Thalassospira TtcA megasynthase supports a biosynthetic model in which in cis amino acid activation competes with in trans activation to increase the range of amino acid substrates incorporated at the N terminus.
Biosynthetic Multitasking Facilitates Thalassospiramide Structural Diversity in Marine Bacteria

PubMed Central

Ross, Avena C.; Xu, Ying; Lu, Liang; Kersten, Roland D.; Shao, Zongze; Al-Suwailem, Abdulaziz M.; Dorrestein, Pieter C.; Qian, Pei-Yuan; Moore, Bradley S.

2013-01-01

Thalassospiramides A and B are immunosuppressant cyclic lipopeptides first reported from the marine α-proteobacterium Thalassospira sp. CNJ-328. We describe here the discovery and characterization of an extended family of 14 new analogues from four Tistrella and Thalassospira isolates. These potent calpain 1 protease inhibitors belong to six structure classes in which the length and composition of the acylpeptide side chain varies extensively. Genomic sequence analysis of the thalassospiramide-producing microbes revealed related, genus-specific biosynthetic loci encoding hybrid nonribosomal peptide synthetase/polyketide synthases consistent with thalassospiramide assembly. The bioinformatics analysis of the gene clusters suggests that structural diversity, which ranges from the 803.4 Da thalassospiramide C to the 1291.7 Da thalassospiramide F, results from a complex sequence of reactions involving amino acid substrate channeling and enzymatic multi-module skipping and iteration. Preliminary biochemical analysis of the N-terminal NRPS module from the Thalassospira TtcA megasynthase supports a biosynthetic model in which in cis amino acid activation competes with in trans activation to increase the range of amino acid substrates incorporated at the N-terminus. PMID:23270364
Evolution of sequence-defined highly functionalized nucleic acid polymers

NASA Astrophysics Data System (ADS)

Chen, Zhen; Lichtor, Phillip A.; Berliner, Adrian P.; Chen, Jonathan C.; Liu, David R.

2018-03-01

The evolution of sequence-defined synthetic polymers made of building blocks beyond those compatible with polymerase enzymes or the ribosome has the potential to generate new classes of receptors, catalysts and materials. Here we describe a ligase-mediated DNA-templated polymerization and in vitro selection system to evolve highly functionalized nucleic acid polymers (HFNAPs) made from 32 building blocks that contain eight chemically diverse side chains on a DNA backbone. Through iterated cycles of polymer translation, selection and reverse translation, we discovered HFNAPs that bind proprotein convertase subtilisin/kexin type 9 (PCSK9) and interleukin-6, two protein targets implicated in human diseases. Mutation and reselection of an active PCSK9-binding polymer yielded evolved polymers with high affinity (KD = 3 nM). This evolved polymer potently inhibited the binding between PCSK9 and the low-density lipoprotein receptor. Structure-activity relationship studies revealed that specific side chains at defined positions in the polymers are required for binding to their respective targets. Our findings expand the chemical space of evolvable polymers to include densely functionalized nucleic acids with diverse, researcher-defined chemical repertoires.
Diversity and Activity of Alternative Nitrogenases in Sequenced Genomes and Coastal Environments

PubMed Central

McRose, Darcy L.; Zhang, Xinning; Kraepiel, Anne M. L.; Morel, François M. M.

2017-01-01

The nitrogenase enzyme, which catalyzes the reduction of N2 gas to NH4+, occurs as three separate isozymes that use Mo, Fe-only, or V. The majority of global nitrogen fixation is attributed to the more efficient ‘canonical’ Mo-nitrogenase, whereas Fe-only and V-(‘alternative’) nitrogenases are often considered ‘backup’ enzymes, used when Mo is limiting. Yet, the environmental distribution and diversity of alternative nitrogenases remain largely unknown. We searched for alternative nitrogenase genes in sequenced genomes and used PacBio sequencing to explore the diversity of canonical (nifD) and alternative (anfD and vnfD) nitrogenase amplicons in two coastal environments: the Florida Everglades and Sippewissett Marsh (MA). Genome-based searches identified an additional 25 species and 10 genera not previously known to encode alternative nitrogenases. Alternative nitrogenase amplicons were found in both Sippewissett Marsh and the Florida Everglades and their activity was further confirmed using newly developed isotopic techniques. Conserved amino acid sequences corresponding to cofactor ligands were also analyzed in anfD and vnfD amplicons, offering insight into environmental variants of these motifs. This study increases the number of available anfD and vnfD sequences ∼20-fold and allows for the first comparisons of environmental Mo-, Fe-only, and V-nitrogenase diversity. Our results suggest that alternative nitrogenases are maintained across a range of organisms and environments and that they can make important contributions to nitrogenase diversity and nitrogen fixation. PMID:28293220
Diversity and Activity of Alternative Nitrogenases in Sequenced Genomes and Coastal Environments.

PubMed

McRose, Darcy L; Zhang, Xinning; Kraepiel, Anne M L; Morel, François M M

2017-01-01

The nitrogenase enzyme, which catalyzes the reduction of N2 gas to NH4+, occurs as three separate isozymes that use Mo, Fe-only, or V. The majority of global nitrogen fixation is attributed to the more efficient 'canonical' Mo-nitrogenase, whereas Fe-only and V-('alternative') nitrogenases are often considered 'backup' enzymes, used when Mo is limiting. Yet, the environmental distribution and diversity of alternative nitrogenases remain largely unknown. We searched for alternative nitrogenase genes in sequenced genomes and used PacBio sequencing to explore the diversity of canonical (nifD) and alternative (anfD and vnfD) nitrogenase amplicons in two coastal environments: the Florida Everglades and Sippewissett Marsh (MA). Genome-based searches identified an additional 25 species and 10 genera not previously known to encode alternative nitrogenases. Alternative nitrogenase amplicons were found in both Sippewissett Marsh and the Florida Everglades and their activity was further confirmed using newly developed isotopic techniques. Conserved amino acid sequences corresponding to cofactor ligands were also analyzed in anfD and vnfD amplicons, offering insight into environmental variants of these motifs. This study increases the number of available anfD and vnfD sequences ∼20-fold and allows for the first comparisons of environmental Mo-, Fe-only, and V-nitrogenase diversity. Our results suggest that alternative nitrogenases are maintained across a range of organisms and environments and that they can make important contributions to nitrogenase diversity and nitrogen fixation.
Draft Genome Sequence of Streptomyces clavuligerus NRRL 3585, a Producer of Diverse Secondary Metabolites

PubMed Central

Song, Ju Yeon; Jeong, Haeyoung; Yu, Dong Su; Fischbach, Michael A.; Park, Hong-Seog; Kim, Jae Jong; Seo, Jeong-Sun; Jensen, Susan E.; Oh, Tae Kwang; Lee, Kye Joon; Kim, Jihyun F.

2010-01-01

Streptomyces clavuligerus is an important industrial strain that produces a number of antibiotics, including clavulanic acid and cephamycin C. A high-quality draft genome sequence of the S. clavuligerus NRRL 3585 strain was produced by employing a hybrid approach that involved Sanger sequencing, Roche/454 pyrosequencing, optical mapping, and partial finishing. Its genome, comprising four linear replicons, one chromosome, and four plasmids, carries numerous sets of genes involved in the biosynthesis of secondary metabolites, including a variety of antibiotics. PMID:20889745
A Therapeutic Uricase with Reduced Immunogenicity Risk and Improved Development Properties.

PubMed

Nyborg, Andrew C; Ward, Chris; Zacco, Anna; Chacko, Benoy; Grinberg, Luba; Geoghegan, James C; Bean, Ryan; Wendeler, Michaela; Bartnik, Frank; O'Connor, Ellen; Gruia, Flaviu; Iyer, Vidyashankara; Feng, Hui; Roy, Varnika; Berge, Mark; Miner, Jeffrey N; Wilson, David M; Zhou, Dongmei; Nicholson, Simone; Wilker, Clynn; Wu, Chi Y; Wilson, Susan; Jermutus, Lutz; Wu, Herren; Owen, David A; Osbourn, Jane; Coats, Steven; Baca, Manuel

2016-01-01

Humans and higher primates are unique in that they lack uricase, the enzyme capable of oxidizing uric acid. As a consequence of this enzyme deficiency, humans have high serum uric acid levels. In some people, uric acid levels rise above the solubility limit, resulting in crystallization in joints; acute inflammation in response to those crystals causes severe pain, a condition known as gout. Treatment for severe gout includes injection of non-human uricase to reduce serum uric acid levels. Krystexxa® is a hyper-PEGylated pig-baboon chimeric uricase indicated for chronic refractory gout that induces an immunogenic response in 91% of treated patients, including infusion reactions (26%) and anaphylaxis (6.5%). These properties limit its use and effectiveness. An innovative approach has been used to develop a therapeutic uricase with improved properties such as soluble expression, neutral pH solubility, high E. coli expression level, thermal stability, and excellent activity. More than 200 diverse uricase sequences were aligned to guide protein engineering and reduce putative sequence liabilities. A single uricase lead candidate was identified, which showed low potential for immunogenicity in >200 human donor samples selected to represent diverse HLA haplotypes. Cysteines were engineered into the lead sequence for site-specific PEGylation, and studies demonstrated >95% PEGylation efficiency. PEGylated uricase retains enzymatic activity in vitro at neutral pH, in human serum and in vivo (rats and canines) and has an extended half-life. In canines, an 85% reduction in serum uric acid levels was observed with a single subcutaneous injection. This PEGylated, non-immunogenic uricase has the potential to provide meaningful benefits to patients with gout.
Marked Genomic Diversity of Norovirus Genogroup I Strains in a Waterborne Outbreak

PubMed Central

Hannoun, Charles; Larsson, Charlotte U.; Bergström, Tomas

2012-01-01

Marked norovirus (NoV) diversity was detected in patient samples from a large community outbreak of gastroenteritis with waterborne epidemiology affecting approximately 2,400 people. NoV was detected in 33 of 50 patient samples examined by group-specific real-time reverse transcription-PCR. NoV genogroup I (GI) strains predominated in 31 patients, with mixed GI infections occurring in 5 of these patients. Sequence analysis of RNA-dependent polymerase-N/S capsid-coding regions (∼900 nucleotides in length) confirmed the dominance of the GI strains (n = 36). Strains of NoV GI.4 (n = 21) and GI.7 (n = 9) were identified, but six strains required full capsid amino acid analyses (530 to 550 amino acids) based on control sequencing of cloned amplicons before the virus genotype could be determined. Three strains were assigned to a new NoV GI genotype, proposed as GI.9, based on capsid amino acid analyses showing 26% dissimilarity from the established genotypes GI.1 to GI.8. Three other strains grouped in a sub-branch of GI.3 with 13 to 15% amino acid dissimilarity to GI.3 GenBank reference strains. Phylogenetic analysis (2.1 kb) of 10 representative strains confirmed these genotype clusters. Strains of NoV GII.4 (n = 1), NoV GII.6 (n = 2), sapovirus GII.2 (n = 1), rotavirus (n = 3), adenovirus (n = 1), and Campylobacter spp. (n = 2) were detected as single infections or as mixtures with NoV GI. Marked NoV GI diversity detected in patients was consistent with epidemiologic evidence of waterborne NoV infections, suggesting human fecal contamination of the water supply. Recognition of NoV diversity in a cluster of patients provided a useful warning marker of waterborne contamination in the Lilla Edet outbreak. PMID:22247153
Diversity of formyltetrahydrofolate synthetase genes in the rumens of roe deer (Capreolus pygargus) and sika deer (Cervus nippon) fed different diets.

PubMed

Li, Zhipeng; Henderson, Gemma; Yang, Yahan; Li, Guangyu

2017-01-01

Reductive acetogenesis by homoacetogens represents an alternative pathway to methanogenesis to remove metabolic hydrogen during rumen fermentation. In this study, we investigated the occurrence of homoacetogens in the rumens of pasture-fed roe deer (Capreolus pygargus) and sika deer (Cervus nippon) fed either oak-leaf-based (tannin-rich, 100 mg/kg dried matter), corn-stover-based, or corn-silage-based diets, by using formyltetrahydrofolate synthetase (FTHFS) gene sequences as a marker. The diversity and richness of FTHFS sequences were lowest in animals fed oak leaf, indicating that tannin-containing plants may affect rumen homoacetogen diversity. FTHFS amino acid sequences in the rumen of roe deer significantly differed from those of sika deer. The phylogenetic analyses showed that 44.8% of sequences in pasture-fed roe deer, and 72.1%, 81.1%, and 37.5% of sequences in sika deer fed oak-leaf-, corn-stover-, and corn-silage-based diets, respectively, may represent novel bacteria that have not yet been cultured. These results demonstrate that the rumens of roe deer and sika deer harbor potentially novel homoacetogens and that diet may influence homoacetogen community structure.
Selection dynamic of Escherichia coli host in M13 combinatorial peptide phage display libraries.

PubMed

Zanconato, Stefano; Minervini, Giovanni; Poli, Irene; De Lucrezia, Davide

2011-01-01

Phage display relies on an iterative cycle of selection and amplification of random combinatorial libraries to enrich the initial population of those peptides that satisfy a priori chosen criteria. The effectiveness of any phage display protocol depends directly on library amino acid sequence diversity and the strength of the selection procedure. In this study we monitored the dynamics of the selective pressure exerted by the host organism on a random peptide library in the absence of any additional selection pressure. The results indicate that sequence censorship exerted by Escherichia coli dramatically reduces library diversity and can significantly impair phage display effectiveness.
Nucleic Acid Extraction from Synthetic Mars Analog Soils for in situ Life Detection.

PubMed

Mojarro, Angel; Ruvkun, Gary; Zuber, Maria T; Carr, Christopher E

2017-08-01

Biological informational polymers such as nucleic acids have the potential to provide unambiguous evidence of life beyond Earth. To this end, we are developing an automated in situ life-detection instrument that integrates nucleic acid extraction and nanopore sequencing: the Search for Extra-Terrestrial Genomes (SETG) instrument. Our goal is to isolate and determine the sequence of nucleic acids from extant or preserved life on Mars, if, for example, there is common ancestry to life on Mars and Earth. As is true of metagenomic analysis of terrestrial environmental samples, the SETG instrument must isolate nucleic acids from crude samples and then determine the DNA sequence of the unknown nucleic acids. Our initial DNA extraction experiments resulted in low to undetectable amounts of DNA due to soil chemistry-dependent soil-DNA interactions, namely adsorption to mineral surfaces, binding to divalent/trivalent cations, destruction by iron redox cycling, and acidic conditions. Subsequently, we developed soil-specific extraction protocols that increase DNA yields through a combination of desalting, utilization of competitive binders, and promotion of anaerobic conditions. Our results suggest that a combination of desalting and utilizing competitive binders may establish a "universal" nucleic acid extraction protocol suitable for analyzing samples from diverse soils on Mars. Key Words: Life-detection instruments; Nucleic acids; Mars; Panspermia. Astrobiology 17, 747-760.

Relevance and Diversity of Nitrospira Populations in Biofilters of Brackish RAS

PubMed Central

Kruse, Myriam; Keuter, Sabine; Bakker, Evert; Spieck, Eva; Eggers, Till; Lipski, André

2013-01-01

Lithoautotrophic nitrite-oxidizing bacterial populations from moving-bed biofilters of brackish recirculation aquaculture systems (RAS; shrimp and barramundi) were tested for their metabolic activity and phylogenetic diversity. Samples from the biofilters were labeled with 13C-bicarbonate and supplemented with nitrite at concentrations of 0.3, 3 and 10 mM, and incubated at 17 and 28°C, respectively. The biofilm material was analyzed by fatty acid methyl ester - stable isotope probing (FAME-SIP). High portions of up to 45% of Nitrospira-related labeled lipid markers were found confirming that Nitrospira is the major autotrophic nitrite oxidizer in these brackish systems with high nitrogen loads. Other nitrite-oxidizing bacteria such as Nitrobacter or Nitrotoga were functionally not relevant in the investigated biofilters. Nitrospira-related 16S rRNA gene sequences were obtained from the samples with 10 mM nitrite and analyzed by a cloning approach. Sequence studies revealed four different phylogenetic clusters within the marine sublineage IV of Nitrospira, though most sequences clustered with the type strain of Nitrospira marina and with a strain isolated from a marine RAS. Three lipids dominated the whole fatty acid profiles of nitrite-oxidizing marine and brackish enrichments of Nitrospira sublineage IV organisms. The membranes included two marker lipids (16∶1 cis7 and 16∶1 cis11) combined with the non-specific acid 16∶0 as major compounds and confirmed these marker lipids as characteristic for sublineage IV species. The predominant labeling of these characteristic fatty acids and the phylogenetic sequence analyses of the marine Nitrospira sublineage IV identified organisms of this sublineage as main autotrophic nitrite-oxidizers in the investigated brackish biofilter systems. PMID:23705006
Genome-Wide Analysis of Oleosin Gene Family in 22 Tree Species: An Accelerator for Metabolic Engineering of BioFuel Crops and Agrigenomics Industrial Applications?

PubMed Central

2015-01-01

Trees contribute to enormous plant oil reserves because many trees contain 50%–80% of oil (triacylglycerols, TAGs) in the fruits and kernels. TAGs accumulate in subcellular structures called oil bodies/droplets, in which TAGs are covered by low-molecular-mass hydrophobic proteins called oleosins (OLEs). The OLEs/TAGs ratio determines the size and shape of intracellular oil bodies. There is a lack of comprehensive sequence analysis and structural information of OLEs among diverse trees. The objectives of this study were to identify OLEs from 22 tree species (e.g., tung tree, tea-oil tree, castor bean), perform genome-wide analysis of OLEs, classify OLEs, identify conserved sequence motifs and amino acid residues, and predict secondary and three-dimensional structures in tree OLEs and OLE subfamilies. Data mining identified 65 OLEs with perfect conservation of the “proline knot” motif (PX5SPX3P) from 19 trees. These OLEs contained >40% hydrophobic amino acid residues. They displayed similar properties and amino acid composition. Genome-wide phylogenetic analysis and multiple sequence alignment demonstrated that these proteins could be classified into five OLE subfamilies. There were distinct patterns of sequence conservation among the OLE subfamilies and within individual tree species. Computational modeling indicated that OLEs were composed of at least three α-helixes connected with short coils without any β-strand and that they exhibited distinct 3D structures and ligand binding sites. These analyses provide fundamental information in the similarity and specificity of diverse OLE isoforms within the same subfamily and among the different species, which should facilitate studying the structure-function relationship and identify critical amino acid residues in OLEs for metabolic engineering of tree TAGs. PMID:26258573
Genome-Wide Analysis of Oleosin Gene Family in 22 Tree Species: An Accelerator for Metabolic Engineering of BioFuel Crops and Agrigenomics Industrial Applications?

PubMed

Cao, Heping

2015-09-01

Trees contribute to enormous plant oil reserves because many trees contain 50%-80% of oil (triacylglycerols, TAGs) in the fruits and kernels. TAGs accumulate in subcellular structures called oil bodies/droplets, in which TAGs are covered by low-molecular-mass hydrophobic proteins called oleosins (OLEs). The OLEs/TAGs ratio determines the size and shape of intracellular oil bodies. There is a lack of comprehensive sequence analysis and structural information of OLEs among diverse trees. The objectives of this study were to identify OLEs from 22 tree species (e.g., tung tree, tea-oil tree, castor bean), perform genome-wide analysis of OLEs, classify OLEs, identify conserved sequence motifs and amino acid residues, and predict secondary and three-dimensional structures in tree OLEs and OLE subfamilies. Data mining identified 65 OLEs with perfect conservation of the "proline knot" motif (PX5SPX3P) from 19 trees. These OLEs contained >40% hydrophobic amino acid residues. They displayed similar properties and amino acid composition. Genome-wide phylogenetic analysis and multiple sequence alignment demonstrated that these proteins could be classified into five OLE subfamilies. There were distinct patterns of sequence conservation among the OLE subfamilies and within individual tree species. Computational modeling indicated that OLEs were composed of at least three α-helixes connected with short coils without any β-strand and that they exhibited distinct 3D structures and ligand binding sites. These analyses provide fundamental information in the similarity and specificity of diverse OLE isoforms within the same subfamily and among the different species, which should facilitate studying the structure-function relationship and identify critical amino acid residues in OLEs for metabolic engineering of tree TAGs.
The Microbial Genomes Atlas (MiGA) webserver: taxonomic and gene diversity analysis of Archaea and Bacteria at the whole genome level.

PubMed

Rodriguez-R, Luis M; Gunturu, Santosh; Harvey, William T; Rosselló-Mora, Ramon; Tiedje, James M; Cole, James R; Konstantinidis, Konstantinos T

2018-06-14

The small subunit ribosomal RNA gene (16S rRNA) has been successfully used to catalogue and study the diversity of prokaryotic species and communities but it offers limited resolution at the species and finer levels, and cannot represent the whole-genome diversity and fluidity. To overcome these limitations, we introduced the Microbial Genomes Atlas (MiGA), a webserver that allows the classification of an unknown query genomic sequence, complete or partial, against all taxonomically classified taxa with available genome sequences, as well as comparisons to other related genomes including uncultivated ones, based on the genome-aggregate Average Nucleotide and Amino Acid Identity (ANI/AAI) concepts. MiGA integrates best practices in sequence quality trimming and assembly and allows input to be raw reads or assemblies from isolate genomes, single-cell sequences, and metagenome-assembled genomes (MAGs). Further, MiGA can take as input hundreds of closely related genomes of the same or closely related species (a so-called 'Clade Project') to assess their gene content diversity and evolutionary relationships, and calculate important clade properties such as the pangenome and core gene sets. Therefore, MiGA is expected to facilitate a range of genome-based taxonomic and diversity studies, and quality assessment across environmental and clinical settings. MiGA is available at http://microbial-genomes.org/.
Phylogeny of North American Powassan virus.

PubMed

Ebel, G D; Spielman, A; Telford, S R

2001-07-01

To determine whether Powassan virus (POW) and deer tick virus (DTV) constitute distinct flaviviral populations transmitted by ixodid ticks in North America, we analysed diverse nucleotide sequences from 16 strains of these viruses. Two distinct genetic lineages are evident, which may be defined by geographical and host associations. The nucleotide and amino acid sequences of lineage one (comprising New York and Canadian POW isolates) are highly conserved across time and space, but those of lineage two (comprising isolates from deer ticks and a fox) are more variable. The divergence between lineages is much greater than the variation within either lineage, and lineage two appears to be more diverse genetically than is lineage one. Application of McDonald-Kreitman tests to the sequences of these strains indicates that adaptive evolution of the envelope protein separates lineage one from lineage two. The two POW lineages circulating in North America possess a pattern of genetic diversity suggesting that they comprise distinct subtypes that may perpetuate in separate enzootic cycles.
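The McDonald-Kreitman test mentioned in the abstract above compares the ratio of nonsynonymous to synonymous changes fixed between lineages with the same ratio among polymorphisms within lineages. The following is a minimal Python sketch of that comparison, not the authors' analysis: the function names are hypothetical, the counts in the usage note are illustrative and are not taken from the Powassan virus study, and significance is assessed here with a two-sided Fisher exact test computed from the hypergeometric distribution.

```python
from math import comb


def fisher_exact_two_sided(a, b, c, d):
    """Two-sided Fisher exact p-value for the 2x2 table [[a, b], [c, d]]."""
    row1, row2, col1 = a + b, c + d, a + c
    total = row1 + row2

    def prob(x):
        # Hypergeometric probability of seeing x in the top-left cell.
        return comb(row1, x) * comb(row2, col1 - x) / comb(total, col1)

    p_observed = prob(a)
    lowest = max(0, col1 - row2)
    highest = min(row1, col1)
    # Sum every table that is no more probable than the observed one.
    return sum(
        prob(x)
        for x in range(lowest, highest + 1)
        if prob(x) <= p_observed * (1 + 1e-9)
    )


def mcdonald_kreitman(dn, ds, pn, ps):
    """Return (neutrality index, alpha, p-value).

    dn, ds: nonsynonymous / synonymous differences fixed between lineages.
    pn, ps: nonsynonymous / synonymous polymorphisms within lineages.
    All four counts must be non-zero for the ratios to be defined.
    """
    neutrality_index = (pn / ps) / (dn / ds)
    alpha = 1.0 - neutrality_index  # rough share of adaptive fixed changes
    p_value = fisher_exact_two_sided(dn, ds, pn, ps)
    return neutrality_index, alpha, p_value
```

For example, `mcdonald_kreitman(dn=7, ds=17, pn=2, ps=42)` returns a neutrality index well below 1, the pattern usually read as an excess of fixed nonsynonymous differences, which is the kind of signal the abstract interprets as adaptive evolution.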
Ancient diversity and geographical sub-structuring in African buffalo Theileria parva populations revealed through metagenetic analysis of antigen-encoding loci.

PubMed

Hemmink, Johanneke D; Sitt, Tatjana; Pelle, Roger; de Klerk-Lorist, Lin-Mari; Shiels, Brian; Toye, Philip G; Morrison, W Ivan; Weir, William

2018-03-01

An infection and treatment protocol involving infection with a mixture of three parasite isolates and simultaneous treatment with oxytetracycline is currently used to vaccinate cattle against Theileria parva. While vaccination results in high levels of protection in some regions, little or no protection is observed in areas where animals are challenged predominantly by parasites of buffalo origin. A previous study involving sequencing of two antigen-encoding genes from a series of parasite isolates indicated that this is associated with greater antigenic diversity in buffalo-derived T. parva. The current study set out to extend these analyses by applying high-throughput sequencing to ex vivo samples from naturally infected buffalo to determine the extent of diversity in a set of antigen-encoding genes. Samples from two populations of buffalo, one in Kenya and the other in South Africa, were examined to investigate the effect of geographical distance on the nature of sequence diversity. The results revealed a number of significant findings. First, there was a variable degree of nucleotide sequence diversity in all gene segments examined, with the percentage of polymorphic nucleotides ranging from 10% to 69%. Second, large numbers of allelic variants of each gene were found in individual animals, indicating multiple infection events. Third, despite the observed diversity in nucleotide sequences, several of the gene products had highly conserved amino acid sequences, and thus represent potential candidates for vaccine development. Fourth, although compelling evidence for population differentiation between the Kenyan and South African T. parva parasites was identified, analysis of molecular variance for each gene revealed that the majority of the underlying nucleotide sequence polymorphism was common to both areas, indicating that much of this aspect of genetic variation in the parasite population arose prior to geographic separation. Copyright © 2018 The Authors. Published by Elsevier Ltd. All rights reserved.
In-Gel Determination of L-Amino Acid Oxidase Activity Based on the Visualization of Prussian Blue-Forming Reaction

PubMed Central

Zhou, Ning; Zhao, Chuntian

2013-01-01

L-amino acid oxidase (LAAO) is attracting increasing attention due to its important functions. Diverse detection methods with their own properties have been developed for characterization of LAAO. In the present study, a simple, rapid, sensitive, cost-effective and reproducible method for quantitative in-gel determination of LAAO activity based on the visualization of Prussian blue-forming reaction is described. Coupled with SDS-PAGE, this Prussian blue agar assay can be directly used to determine the numbers and approximate molecular weights of LAAO in one step, allowing straightforward application for purification and sequence identification of LAAO from diverse samples. PMID:23383337
Nucleic Acid Extraction from Synthetic Mars Analog Soils for in situ Life Detection

NASA Astrophysics Data System (ADS)

Mojarro, Angel; Ruvkun, Gary; Zuber, Maria T.; Carr, Christopher E.

2017-08-01

Biological informational polymers such as nucleic acids have the potential to provide unambiguous evidence of life beyond Earth. To this end, we are developing an automated in situ life-detection instrument that integrates nucleic acid extraction and nanopore sequencing: the Search for Extra-Terrestrial Genomes (SETG) instrument. Our goal is to isolate and determine the sequence of nucleic acids from extant or preserved life on Mars, if, for example, there is common ancestry to life on Mars and Earth. As is true of metagenomic analysis of terrestrial environmental samples, the SETG instrument must isolate nucleic acids from crude samples and then determine the DNA sequence of the unknown nucleic acids. Our initial DNA extraction experiments resulted in low to undetectable amounts of DNA due to soil chemistry-dependent soil-DNA interactions, namely adsorption to mineral surfaces, binding to divalent/trivalent cations, destruction by iron redox cycling, and acidic conditions. Subsequently, we developed soil-specific extraction protocols that increase DNA yields through a combination of desalting, utilization of competitive binders, and promotion of anaerobic conditions. Our results suggest that a combination of desalting and utilizing competitive binders may establish a "universal" nucleic acid extraction protocol suitable for analyzing samples from diverse soils on Mars.
Characterization of relative abundance of lactic acid bacteria species in French organic sourdough by cultural, qPCR and MiSeq high-throughput sequencing methods.

PubMed

Michel, Elisa; Monfort, Clarisse; Deffrasnes, Marion; Guezenec, Stéphane; Lhomme, Emilie; Barret, Matthieu; Sicard, Delphine; Dousset, Xavier; Onno, Bernard

2016-12-19

In order to contribute to the description of sourdough LAB composition, MiSeq sequencing and qPCR methods were performed in association with cultural methods. A panel of 16 French organic bakers and farmer-bakers were selected for this work. The lactic acid bacteria (LAB) diversity of their organic sourdoughs was investigated quantitatively and qualitatively combining (i) Lactobacillus sanfranciscensis-specific qPCR, (ii) global sequencing with MiSeq Illumina technology and (iii) molecular isolates identification. In addition, LAB and yeast enumeration, pH, Total Titratable Acidity, organic acids and bread specific volume were analyzed. Microbial and physico-chemical data were statistically treated by Principal Component Analysis (PCA) and Hierarchical Ascendant Classification (HAC). Total yeast counts were 6 log10 to 7.6 log10 CFU/g while LAB counts varied from 7.2 log10 to 9.6 log10 CFU/g. Values obtained by L. sanfranciscensis-specific qPCR were estimated between 7.2 and 10.3 log10 CFU/g, except for one sample at 4.4 log10 CFU/g. HAC and PCA clustered the sixteen sourdoughs into three classes described by their variables but without links to bakers' practices. L. sanfranciscensis was the dominant species in 13 of the 16 sourdoughs analyzed by Next Generation Sequencing (NGS), whereas by the culture-dependent method this species was dominant in only 10 samples. Based on isolates identification, LAB diversity was higher for 7 sourdoughs with the recovery of L. curvatus, L. brevis, L. heilongjiangensis, L. xiangfangensis, L. koreensis, L. pontis, Weissella sp. and Pediococcus pentosaceus, as the most representative species. L. koreensis, L. heilongjiangensis and L. xiangfangensis were identified in traditional Asian food and here for the first time as dominant in organic sourdough. This study highlighted that L. sanfranciscensis was not the major species in 6/16 sourdough samples and that a relatively high LAB diversity can be observed in French organic sourdough. Copyright © 2016. Published by Elsevier B.V.
ScaffoldSeq: Software for characterization of directed evolution populations.

PubMed

Woldring, Daniel R; Holec, Patrick V; Hackel, Benjamin J

2016-07-01

ScaffoldSeq is software designed for the numerous applications, including directed evolution analysis, in which a user generates a population of DNA sequences encoding for partially diverse proteins with related functions and would like to characterize the single site and pairwise amino acid frequencies across the population. A common scenario for enzyme maturation, antibody screening, and alternative scaffold engineering involves naïve and evolved populations that contain diversified regions, varying in both sequence and length, within a conserved framework. Analyzing the diversified regions of such populations is facilitated by high-throughput sequencing platforms; however, length variability within these regions (e.g., antibody CDRs) encumbers the alignment process. To overcome this challenge, the ScaffoldSeq algorithm takes advantage of conserved framework sequences to quickly identify diverse regions. Beyond this, unintended biases in sequence frequency are generated throughout the experimental workflow required to evolve and isolate clones of interest prior to DNA sequencing. ScaffoldSeq software uniquely handles this issue by providing tools to quantify and remove background sequences, cluster similar protein families, and dampen the impact of dominant clones. The software produces graphical and tabular summaries for each region of interest, allowing users to evaluate diversity in a site-specific manner as well as identify epistatic pairwise interactions. The code and detailed information are freely available at http://research.cems.umn.edu/hackel. Proteins 2016; 84:869-874. © 2016 Wiley Periodicals, Inc.
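The idea of site-wise amino acid frequencies with dominant clones dampened can be illustrated with a short Python sketch. This is not the ScaffoldSeq code or its actual algorithm; the function name `site_frequencies`, the `damp` parameter, and the choice to weight each unique sequence by its read count raised to a power are assumptions made only for illustration, and the sketch handles equal-length regions only.

```python
from collections import Counter, defaultdict


def site_frequencies(sequences, damp=1.0):
    """Per-position amino acid frequencies for equal-length sequences.

    Each unique sequence is weighted by (its read count) ** damp
    (an illustrative assumption): damp=1 counts every read, while
    damp=0 counts each unique clone exactly once.
    """
    clone_counts = Counter(sequences)
    lengths = {len(seq) for seq in clone_counts}
    if len(lengths) != 1:
        raise ValueError("expected a non-empty set of equal-length sequences")
    length = lengths.pop()

    totals = [defaultdict(float) for _ in range(length)]
    weight_sum = 0.0
    for seq, count in clone_counts.items():
        weight = count ** damp
        weight_sum += weight
        for position, residue in enumerate(seq):
            totals[position][residue] += weight

    # Normalise the accumulated weights into frequencies at each site.
    return [
        {residue: w / weight_sum for residue, w in site.items()}
        for site in totals
    ]
```

With reads such as `["AK"] * 8 + ["GK", "SR"]`, the undamped call reports alanine at 80% of the first position, whereas `damp=0` treats the three unique clones equally, so one over-amplified clone no longer dominates the site profile.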
Cultivable Anaerobic Microbiota of Severe Early Childhood Caries

PubMed Central

Tanner, A. C. R.; Mathney, J. M. J.; Kent, R. L.; Chalmers, N. I.; Hughes, C. V.; Loo, C. Y.; Pradhan, N.; Kanasi, E.; Hwang, J.; Dahlan, M. A.; Papadopolou, E.; Dewhirst, F. E.

2011-01-01

Severe early childhood caries (ECC), while strongly associated with Streptococcus mutans using selective detection (culture, PCR), has also been associated with a widely diverse microbiota using molecular cloning approaches. The aim of this study was to evaluate the microbiota of severe ECC using anaerobic culture. The microbial composition of dental plaque from 42 severe ECC children was compared with that of 40 caries-free children. Bacterial samples were cultured anaerobically on blood and acid (pH 5) agars. Isolates were purified, and partial sequences for the 16S rRNA gene were obtained from 5,608 isolates. Sequence-based analysis of the 16S rRNA isolate libraries from blood and acid agars of severe ECC and caries-free children had >90% population coverage, with greater diversity occurring in the blood isolate library. Isolate sequences were compared with taxon sequences in the Human Oral Microbiome Database (HOMD), and 198 HOMD taxa were identified, including 45 previously uncultivated taxa, 29 extended HOMD taxa, and 45 potential novel groups. The major species associated with severe ECC included Streptococcus mutans, Scardovia wiggsiae, Veillonella parvula, Streptococcus cristatus, and Actinomyces gerencseriae. S. wiggsiae was significantly associated with severe ECC children in the presence and absence of S. mutans detection. We conclude that anaerobic culture detected as wide a diversity of species in ECC as that observed using cloning approaches. Culture coupled with 16S rRNA identification identified over 74 isolates for human oral taxa without previously cultivated representatives. The major caries-associated species were S. mutans and S. wiggsiae, the latter of which is a candidate as a newly recognized caries pathogen. PMID:21289150
Viral morphogenesis is the dominant source of sequence censorship in M13 combinatorial peptide phage display.

DOE Office of Scientific and Technical Information (OSTI.GOV)

Rodi, D. J.; Soares, A. S.; Makowski, L.

Novel statistical methods have been developed and used to quantitate and annotate the sequence diversity within combinatorial peptide libraries on the basis of small numbers (1-200) of sequences selected at random from commercially available M13 p3-based phage display libraries. These libraries behave statistically as though they correspond to populations containing roughly 4.0 ± 1.6% of the random dodecapeptides and 7.9 ± 2.6% of the random constrained heptapeptides that are theoretically possible within the phage populations. Analysis of amino acid residue occurrence patterns shows no demonstrable influence on sequence censorship by Escherichia coli tRNA isoacceptor profiles or either overall codon or Class II codon usage patterns, suggesting no metabolic constraints on recombinant p3 synthesis. There is an overall depression in the occurrence of cysteine, arginine and glycine residues and an overabundance of proline, threonine and histidine residues. The majority of position-dependent amino acid sequence bias is clustered at three positions within the inserted peptides of the dodecapeptide library, +1, +3 and +12 downstream from the signal peptidase cleavage site. Conformational tendency measures of the peptides indicate a significant preference for inserts favoring a β-turn conformation. The observed protein sequence limitations can primarily be attributed to genetic codon degeneracy and signal peptidase cleavage preferences. These data suggest that for applications in which maximal sequence diversity is essential, such as epitope mapping or novel receptor identification, combinatorial peptide libraries should be constructed using codon-corrected trinucleotide cassettes within vector-host systems designed to minimize morphogenesis-related censorship.
Genetic variation in potential Giardia vaccine candidates cyst wall protein 2 and α1-giardin.

PubMed

Radunovic, Matej; Klotz, Christian; Saghaug, Christina Skår; Brattbakk, Hans-Richard; Aebischer, Toni; Langeland, Nina; Hanevik, Kurt

2017-08-01

Giardia is a prevalent intestinal parasitic infection. The trophozoite structural protein α1-giardin (α1-g) and the cyst protein cyst wall protein 2 (CWP2) have shown promise as Giardia vaccine antigen candidates in murine models. The present study assesses the genetic diversity of α1-g and CWP2 between and within assemblages A and B in human clinical isolates. α1-g and CWP2 sequences were acquired from 15 Norwegian isolates by PCR amplification and 20 sequences from German cultured isolates by whole genome sequencing. Sequences were aligned to reference genomes from assemblage A2 and B to identify genetic variance. Genetic diversity was found between assemblage A and B reference sequences for both α1-g (90.8% nucleotide identity) and CWP2 (82.5% nucleotide identity). However, for α1-g, this translated into only 3 amino acid (aa) substitutions, while for CWP2 there were 41 aa substitutions, and also one aa deletion. Genetic diversity was larger within assemblage B (nucleotide identity 92.0% for α1-g and 94.3% for CWP2) than within assemblage A (nucleotide identity 99.0% for α1-g and 99.7% for CWP2). For CWP2, the diversity on both nucleotide and protein level was higher in the C-terminal end. Predicted antigenic epitopes were not affected for α1-g, but partially for CWP2. Despite genetic diversity in α1-g, we found aa sequence, characteristics, and antigenicity to be well preserved. CWP2 showed more aa variance and potential antigenic differences. Several CWP2 antigens might be necessary in a future Giardia vaccine to provide cross protection against both Giardia assemblages infecting humans.
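The abstract above contrasts nucleotide identity with the much smaller number of resulting amino acid substitutions. A minimal Python sketch of that comparison follows, assuming two already aligned, in-frame coding sequences; the function names and the toy sequences are hypothetical and are not Giardia data. Translation uses the standard genetic code.

```python
from itertools import product

# Standard genetic code, codons enumerated in T, C, A, G order.
_BASES = "TCAG"
_AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(_BASES, repeat=3), _AMINO)
}


def percent_identity(seq_a, seq_b, gap="-"):
    """Percent identity over aligned columns where neither sequence has a gap."""
    if len(seq_a) != len(seq_b):
        raise ValueError("aligned sequences must have equal length")
    compared = matches = 0
    for x, y in zip(seq_a, seq_b):
        if x == gap or y == gap:
            continue
        compared += 1
        matches += x == y
    if compared == 0:
        raise ValueError("no comparable columns")
    return 100.0 * matches / compared


def translate(dna):
    """Translate an ungapped, in-frame DNA string; unknown codons become 'X'."""
    dna = dna.upper()
    return "".join(
        CODON_TABLE.get(dna[i:i + 3], "X") for i in range(0, len(dna) - 2, 3)
    )


def aa_substitutions(dna_a, dna_b):
    """Count positions where the translated proteins differ."""
    return sum(x != y for x, y in zip(translate(dna_a), translate(dna_b)))
```

For the toy pair `"ATGGCTAAA"` and `"ATGGCCAAG"`, two of nine nucleotides differ, yet both translate to the same peptide because the changes are synonymous, which is how a gene can show clear nucleotide divergence with few amino acid substitutions.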
Carrot Juice Fermentations as Man-Made Microbial Ecosystems Dominated by Lactic Acid Bacteria.

PubMed

Wuyts, Sander; Van Beeck, Wannes; Oerlemans, Eline F M; Wittouck, Stijn; Claes, Ingmar J J; De Boeck, Ilke; Weckx, Stefan; Lievens, Bart; De Vuyst, Luc; Lebeer, Sarah

2018-06-15

Spontaneous vegetable fermentations, with their rich flavors and postulated health benefits, are regaining popularity. However, their microbiology is still poorly understood, therefore raising concerns about food safety. In addition, such spontaneous fermentations form interesting cases of man-made microbial ecosystems. Here, samples from 38 carrot juice fermentations were collected through a citizen science initiative, in addition to three laboratory fermentations. Culturing showed that Enterobacteriaceae were outcompeted by lactic acid bacteria (LAB) between 3 and 13 days of fermentation. Metabolite-target analysis showed that lactic acid and mannitol were highly produced, as well as the biogenic amine cadaverine. High-throughput 16S rRNA gene sequencing revealed that mainly species of Leuconostoc and Lactobacillus (as identified by 8 and 20 amplicon sequence variants [ASVs], respectively) mediated the fermentations in subsequent order. The analyses at the DNA level still detected a high number of Enterobacteriaceae, but their relative abundance was low when RNA-based sequencing was performed to detect presumptive metabolically active bacterial cells. In addition, this method greatly reduced host read contamination. Phylogenetic placement indicated a high LAB diversity, with ASVs from nine different phylogenetic groups of the Lactobacillus genus complex. However, fermentation experiments with isolates showed that only strains belonging to the most prevalent phylogenetic groups preserved the fermentation dynamics. The carrot juice fermentation thus forms a robust man-made microbial ecosystem suitable for studies on LAB diversity and niche specificity. IMPORTANCE The usage of fermented food products by professional chefs is steadily growing worldwide. Meanwhile, this interest has also increased at the household level. However, many of these artisanal food products remain understudied. Here, an extensive microbial analysis was performed of spontaneously fermented carrot juices which are used as nonalcoholic alternatives for wine in a Belgian Michelin star restaurant. Samples were collected through an active citizen science approach with 38 participants, in addition to three laboratory fermentations. Identification of the main microbial players revealed that mainly species of Leuconostoc and Lactobacillus mediated the fermentations in subsequent order. In addition, a high diversity of lactic acid bacteria was found; however, fermentation experiments with isolates showed that only strains belonging to the most prevalent lactic acid bacteria preserved the fermentation dynamics. Finally, this study showed that the usage of RNA-based 16S rRNA amplicon sequencing greatly reduces host read contamination. Copyright © 2018 American Society for Microbiology.
Guiding principles for peptide nanotechnology through directed discovery.

PubMed

Lampel, A; Ulijn, R V; Tuttle, T

2018-05-21

Life's diverse molecular functions are largely based on only a small number of highly conserved building blocks - the twenty canonical amino acids. These building blocks are chemically simple, but when they are organized in three-dimensional structures of tremendous complexity, new properties emerge. This review explores recent efforts in the directed discovery of functional nanoscale systems and materials based on these same amino acids, but that are not guided by copying or editing biological systems. The review summarises insights obtained using three complementary approaches of searching the sequence space to explore sequence-structure relationships for assembly, reactivity and complexation, namely: (i) strategic editing of short peptide sequences; (ii) computational approaches to predicting and comparing assembly behaviours; (iii) dynamic peptide libraries that explore the free energy landscape. These approaches give rise to guiding principles on controlling order/disorder, complexation and reactivity by peptide sequence design.
Ultrasmall Peptides Self-Assemble into Diverse Nanostructures: Morphological Evaluation and Potential Implications

PubMed Central

Lakshmanan, Anupama; Hauser, Charlotte A.E.

2011-01-01

In this study, we perform a morphological evaluation of the diverse nanostructures formed by varying concentration and amino acid sequence of a unique class of ultrasmall self-assembling peptides. We modified these peptides by replacing the aliphatic amino acid at the C-aliphatic terminus with different aromatic amino acids. We tracked the effect of introducing aromatic residues on self-assembly and morphology of resulting nanostructures. Whereas aliphatic peptides formed long, helical fibers that entangle into meshes and entrap >99.9% water, the modified peptides contrastingly formed short, straight fibers with a flat morphology. No helical fibers were observed for the modified peptides. For the aliphatic peptides at low concentrations, different supramolecular assemblies such as hollow nanospheres and membrane blebs were found. Since the ultrasmall peptides are made of simple, aliphatic amino acids, considered to have existed in the primordial soup, study of these supramolecular assemblies could be relevant to understanding chemical evolution leading to the origin of life on Earth. In particular, we propose a variety of potential applications in bioengineering and nanotechnology for the diverse self-assembled nanostructures. PMID:22016623
Unraveling Haplotype Diversity of the Apical Membrane Antigen-1 Gene in Plasmodium falciparum Populations in Thailand

PubMed Central

Lumkul, Lalita; Sawaswong, Vorthon; Simpalipan, Phumin; Kaewthamasorn, Morakot; Harnyuttanakorn, Pongchai; Pattaradilokrat, Sittiporn

2018-01-01

Development of an effective vaccine is critically needed for the prevention of malaria. One of the key antigens for malaria vaccines is the apical membrane antigen 1 (AMA-1) of the human malaria parasite Plasmodium falciparum, the surface protein for erythrocyte invasion of the parasite. The gene encoding AMA-1 has been sequenced from populations of P. falciparum worldwide, but the haplotype diversity of the gene in P. falciparum populations in the Greater Mekong Subregion (GMS), including Thailand, remains to be characterized. In the present study, the AMA-1 gene was PCR amplified and sequenced from the genomic DNA of 65 P. falciparum isolates from 5 endemic areas in Thailand. The nearly full-length 1,848 nucleotide sequence of AMA-1 was subjected to molecular analyses, including nucleotide sequence diversity, haplotype diversity and deduced amino acid sequence diversity and neutrality tests. Phylogenetic analysis and pairwise population differentiation (Fst indices) were performed to infer the population structure. The analyses identified 60 single nucleotide polymorphic loci, predominately located in domain I of AMA-1. A total of 31 unique AMA-1 haplotypes were identified, which included 11 novel ones. The phylogenetic tree of the AMA-1 haplotypes revealed multiple clades of AMA-1, each of which contained parasites of multiple geographical origins, consistent with the Fst indices indicating genetic homogeneity or gene flow among geographically distinct populations of P. falciparum in Thailand’s borders with Myanmar, Laos and Cambodia. In summary, the study revealed novel haplotypes and population structure needed for the further advancement of AMA-1-based malaria vaccines in the GMS. PMID:29742870
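Haplotype diversity, one of the measures named in the abstract above, is commonly estimated with Nei's formula H = n/(n-1) * (1 - sum of squared haplotype frequencies). The Python sketch below shows that calculation under the assumption that each isolate has already been assigned a haplotype label; the function name and the labels in the usage note are hypothetical and are not the AMA-1 data.

```python
from collections import Counter


def haplotype_diversity(haplotypes):
    """Nei's unbiased haplotype (gene) diversity for a list of haplotype labels."""
    n = len(haplotypes)
    if n < 2:
        raise ValueError("need at least two sampled sequences")
    counts = Counter(haplotypes)
    # Sum of squared relative frequencies of each distinct haplotype.
    sum_squared = sum((count / n) ** 2 for count in counts.values())
    return n / (n - 1) * (1.0 - sum_squared)
```

Calling `haplotype_diversity(["h1", "h1", "h2", "h3"])` gives a value below 1 because one haplotype is sampled twice, while a sample in which every isolate carries a different haplotype approaches 1.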
Variability and transmission by Aphis glycines of North American and Asian Soybean mosaic virus isolates.

PubMed

Domier, L L; Latorre, I J; Steinlage, T A; McCoppin, N; Hartman, G L

2003-10-01

The variability of North American and Asian strains and isolates of Soybean mosaic virus was investigated. First, polymerase chain reaction (PCR) products representing the coat protein (CP)-coding regions of 38 SMVs were analyzed for restriction fragment length polymorphisms (RFLP). Second, the nucleotide and predicted amino acid sequence variability of the P1-coding region of 18 SMVs and the helper component/protease (HC/Pro) and CP-coding regions of 25 SMVs were assessed. The CP nucleotide and predicted amino acid sequences were the most similar and predicted phylogenetic relationships similar to those obtained from RFLP analysis. Neither RFLP nor sequence analyses of the CP-coding regions grouped the SMVs by geographical origin. The P1 and HC/Pro sequences were more variable and separated the North American and Asian SMV isolates into two groups similar to previously reported differences in pathogenic diversity of the two sets of SMV isolates. The P1 region was the most informative of the three regions analyzed. To assess the biological relevance of the sequence differences in the HC/Pro and CP coding regions, the transmissibility of 14 SMV isolates by Aphis glycines was tested. All field isolates of SMV were transmitted efficiently by A. glycines, but the laboratory isolates analyzed were transmitted poorly. The amino acid sequences from most, but not all, of the poorly transmitted isolates contained mutations in the aphid transmission-associated DAG and/or KLSC amino acid sequence motifs of CP and HC/Pro, respectively.
Optimizing the specificity of nucleic acid hybridization.

PubMed

Zhang, David Yu; Chen, Sherry Xi; Yin, Peng

2012-01-22

The specific hybridization of complementary sequences is an essential property of nucleic acids, enabling diverse biological and biotechnological reactions and functions. However, the specificity of nucleic acid hybridization is compromised for long strands, except near the melting temperature. Here, we analytically derived the thermodynamic properties of a hybridization probe that would enable near-optimal single-base discrimination and perform robustly across diverse temperature, salt and concentration conditions. We rationally designed 'toehold exchange' probes that approximate these properties, and comprehensively tested them against five different DNA targets and 55 spurious analogues with energetically representative single-base changes (replacements, deletions and insertions). These probes produced discrimination factors between 3 and 100+ (median, 26). Without retuning, our probes function robustly from 10 °C to 37 °C, from 1 mM Mg²⁺ to 47 mM Mg²⁺, and with nucleic acid concentrations from 1 nM to 5 µM. Experiments with RNA also showed effective single-base change discrimination.
A Therapeutic Uricase with Reduced Immunogenicity Risk and Improved Development Properties

PubMed Central

Nyborg, Andrew C.; Ward, Chris; Zacco, Anna; Grinberg, Luba; Geoghegan, James C.; Bean, Ryan; Wendeler, Michaela; Bartnik, Frank; O’Connor, Ellen; Gruia, Flaviu; Iyer, Vidyashankara; Feng, Hui; Roy, Varnika; Berge, Mark; Miner, Jeffrey N.; Wilson, David M.; Zhou, Dongmei; Nicholson, Simone; Wilker, Clynn; Wu, Chi Y.; Wilson, Susan; Jermutus, Lutz; Wu, Herren; Owen, David A.; Osbourn, Jane; Coats, Steven; Baca, Manuel

2016-01-01

Humans and higher primates are unique in that they lack uricase, the enzyme capable of oxidizing uric acid. As a consequence of this enzyme deficiency, humans have high serum uric acid levels. In some people, uric acid levels rise above the solubility limit, resulting in crystallization in joints; acute inflammation in response to those crystals causes severe pain, a condition known as gout. Treatment for severe gout includes injection of non-human uricase to reduce serum uric acid levels. Krystexxa® is a hyper-PEGylated pig-baboon chimeric uricase indicated for chronic refractory gout that induces an immunogenic response in 91% of treated patients, including infusion reactions (26%) and anaphylaxis (6.5%). These properties limit its use and effectiveness. An innovative approach has been used to develop a therapeutic uricase with improved properties such as soluble expression, neutral pH solubility, high E. coli expression level, thermal stability, and excellent activity. More than 200 diverse uricase sequences were aligned to guide protein engineering and reduce putative sequence liabilities. A single uricase lead candidate was identified, which showed low potential for immunogenicity in >200 human donor samples selected to represent diverse HLA haplotypes. Cysteines were engineered into the lead sequence for site-specific PEGylation, and studies demonstrated >95% PEGylation efficiency. PEGylated uricase retains enzymatic activity in vitro at neutral pH, in human serum and in vivo (rats and canines) and has an extended half-life. In canines, an 85% reduction in serum uric acid levels was observed with a single subcutaneous injection. This PEGylated, non-immunogenic uricase has the potential to provide meaningful benefits to patients with gout. PMID:28002433

RECOVIR Software for Identifying Viruses

NASA Technical Reports Server (NTRS)

Chakravarty, Sugoto; Fox, George E.; Zhu, Dianhui

2013-01-01

Most single-stranded RNA (ssRNA) viruses mutate rapidly to generate a large number of strains with highly divergent capsid sequences. Determining the capsid residues or nucleotides that uniquely characterize these strains is critical in understanding the strain diversity of these viruses. RECOVIR (an acronym for "recognize viruses") software predicts the strains of some ssRNA viruses from their limited sequence data. Novel phylogenetic-tree-based databases of protein or nucleic acid residues that uniquely characterize these virus strains are created. Strains of input virus sequences (partial or complete) are predicted through residue-wise comparisons with the databases. RECOVIR uses unique characterizing residues to automatically identify strains from partial or complete capsid sequences of picornaviruses and caliciviruses, two of the most highly diverse ssRNA virus families. Partition-wise comparisons of the database residues with the corresponding residues of more than 300 complete and partial sequences of these viruses resulted in correct strain identification for all of these sequences. This study shows the feasibility of creating databases of hitherto unknown residues uniquely characterizing the capsid sequences of two of the most highly divergent ssRNA virus families. These databases enable automated strain identification from partial or complete capsid sequences of these human and animal pathogens.
Diversity surveys and evolutionary relationships of aoxB genes in aerobic arsenite-oxidizing bacteria.

PubMed

Quéméneur, Marianne; Heinrich-Salmeron, Audrey; Muller, Daniel; Lièvremont, Didier; Jauzein, Michel; Bertin, Philippe N; Garrido, Francis; Joulian, Catherine

2008-07-01

A new primer set was designed to specifically amplify ca. 1,100 bp of aoxB genes encoding the As(III) oxidase catalytic subunit from taxonomically diverse aerobic As(III)-oxidizing bacteria. Comparative analysis of AoxB protein sequences showed variable conservation levels and highlighted the conservation of essential amino acids and structural motifs. AoxB phylogeny of pure strains showed well-discriminated taxonomic groups and was similar to 16S rRNA phylogeny. Alphaproteobacteria-, Betaproteobacteria-, and Gammaproteobacteria-related sequences were retrieved from environmental surveys, demonstrating their prevalence in mesophilic As-contaminated soils. Our study underlines the usefulness of the aoxB gene as a functional marker of aerobic As(III) oxidizers.
Phylogenetic analysis of human influenza A/H3N2 viruses isolated in 2015 in Germany indicates significant genetic divergence from vaccine strains.

PubMed

Mostafa, Ahmed; Abdelwhab, El-Sayed M; Slanina, Heiko; Hussein, Mohamed A; Kuznetsova, Irina; Schüttler, Christian G; Ziebuhr, John; Pleschka, Stephan

2016-06-01

Infections by H3N2-type influenza A viruses (IAV) resulted in significant numbers of hospitalizations in several countries in 2014-2015, causing disease also in vaccinated individuals and, in some cases, fatal outcomes. In this study, sequence analysis of H3N2 viruses isolated in Germany from 1998 to 2015, including eleven H3N2 isolates collected early in 2015, was performed. Compared to the vaccine strain A/Texas/50/2012 (H3N2), the 2015 strains from Germany showed up to 4.5% sequence diversity in their HA1 protein, indicating substantial genetic drift. The data further suggest that two distinct phylogroups, 3C.2 and 3C.3, with 1.6-2.3% and 0.3-2.4% HA1 nucleotide and amino acid sequence diversity, respectively, co-circulated in Germany in the 2014/2015 season. Distinct glycosylation patterns and amino acid substitutions in the hemagglutinin and neuraminidase proteins were identified, possibly contributing to the unusually high number of H3N2 infections in this season and providing important information for developing vaccines that are effective against both genotypes.
Sequence-Specific Recognition of DNA by Proteins: Binding Motifs Discovered Using a Novel Statistical/Computational Analysis

PubMed Central

Jakubec, David; Laskowski, Roman A.; Vondrasek, Jiri

2016-01-01

Decades of intensive experimental studies of the recognition of DNA sequences by proteins have provided us with a view of a diverse and complicated world in which few to no features are shared between individual DNA-binding protein families. The originally conceived direct readout of DNA residue sequences by amino acid side chains offers very limited capacity for sequence recognition, while the effects of the dynamic properties of the interacting partners remain difficult to quantify and almost impossible to generalise. In this work we investigated the energetic characteristics of all DNA residue–amino acid side chain combinations in the conformations found at the interaction interface in a very large set of protein–DNA complexes by means of empirical potential-based calculations. General specificity-defining criteria were derived and utilised to look beyond the binding motifs considered in previous studies. Linking energetic favourability to the observed geometrical preferences, our approach reveals several additional amino acid motifs which can distinguish between individual DNA bases. Our results remained valid in environments with various dielectric properties. PMID:27384774
Microbial population Diversity of indigenous acidophilic bacteria for recovering the valuable resources

NASA Astrophysics Data System (ADS)

Kim, B.; Cho, K.; Lee, D.; Choi, N.; Park, C.

2011-12-01

A taxon- or group-specific PCR primer serves as a valuable tool for studying the bioleaching mechanisms of a particular group of microorganisms. Especially for a group of microorganisms that is uncultured (or very difficult to isolate from its environment), a group-specific PCR primer is essential for investigating distribution patterns and estimating the genetic diversity of the target microorganisms. This study investigated biodiversity using molecular biology methods on three different indigenous acidophilic bacteria collected from acid mine drainage in Go-seong and Yeon-hwa, Korea, and from an acidic hot spring in Hatchnobaru, Japan. We performed optical analysis (phase-contrast microscopy and SEM) and base sequencing. In the phase-contrast microscopy (×4,000) and SEM analyses, rod-shaped bacteria 1 μm in length were observed. Base sequencing results analyzed with EzTaxon server data revealed Acidithiobacillus ferrooxidans (Go-seong, 97.79%; Yeon-hwa, 97.90%; Hatchnobaru, 97.97%).
Automated design evolution of stereochemically randomized protein foldamers

NASA Astrophysics Data System (ADS)

Ranbhor, Ranjit; Kumar, Anil; Patel, Kirti; Ramakrishnan, Vibin; Durani, Susheel

2018-05-01

Diversification of chain stereochemistry opens up the possibilities of an ‘in principle’ increase in the design space of proteins. This huge increase in the sequence and consequent structural variation is aimed at the generation of smart materials. To diversify protein structure stereochemically, we introduced L- and D-α-amino acids as the design alphabet. With a sequence design algorithm, we explored the usage of specific variables such as chirality and the sequence of this alphabet in independent steps. With molecular dynamics, we folded stereochemically diverse homopolypeptides and evaluated their ‘fitness’ for possible design as protein-like foldamers. We propose a fitness function to prune the most optimal fold among 1000 structures simulated with an automated repetitive simulated annealing molecular dynamics (AR-SAMD) approach. The highly scored poly-leucine folds with sequence lengths of 24 and 30 amino acids were later sequence-optimized using a Dead End Elimination cum Monte Carlo based optimization tool. This paper demonstrates a novel approach for the de novo design of protein-like foldamers.
Environmental isolation explains Iberian genetic diversity in the highly homozygous model grass Brachypodium distachyon.

PubMed

Marques, Isabel; Shiposha, Valeriia; López-Alvarez, Diana; Manzaneda, Antonio J; Hernandez, Pilar; Olonova, Marina; Catalán, Pilar

2017-06-15

Brachypodium distachyon (Poaceae), an annual Mediterranean aluminum (Al)-sensitive grass, is currently being used as a model species to provide new information on cereals and biofuel crops. The plant has a short life cycle and one of the smallest genomes among the grasses, making it well suited to experimental manipulation. Its genome has been fully sequenced and several genomic resources are being developed to elucidate key traits and gene functions. A reliable germplasm collection that reflects the natural diversity of this species is therefore needed for all these genomic resources. However, despite being a model plant, we still know very little about its genetic diversity. As a first step to overcome this gap, we used nuclear Simple Sequence Repeats (nSSR) to study the patterns of genetic diversity and population structure of B. distachyon in 14 populations sampled across the Iberian Peninsula (Spain), one of its best known areas. We found very low levels of genetic diversity, allelic number and heterozygosity in B. distachyon, congruent with a highly selfing system. Our results indicate the existence of at least three genetic clusters, providing additional evidence for the existence of a significant genetic structure in the Iberian Peninsula and supporting this geographical area as an important genetic reservoir. Several hotspots of genetic diversity were detected, and populations growing on basic soils were significantly more diverse than those growing in acidic soils. A partial Mantel test confirmed a statistically significant Isolation-By-Distance (IBD) among all studied populations, as well as a statistically significant Isolation-By-Environment (IBE), revealing the presence of environmental-driven isolation as one explanation for the genetic patterns found in the Iberian Peninsula.
The finding of higher genetic diversity in eastern Iberian populations occurring in basic soils suggests that these populations can be better adapted than those occurring in western areas of the Iberian Peninsula where the soils are more acidic and accumulate toxic Al ions. This suggests that the western Iberian acidic soils might prevent the establishment of Al-sensitive B. distachyon populations, potentially causing the existence of more genetically depauperated individuals.
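As an illustration of the kind of test mentioned in the abstract above, the following is a minimal sketch of a simple (not partial) Mantel permutation test, which correlates two distance matrices and assesses significance by jointly permuting the rows and columns of one of them. The study reports a partial Mantel test, which additionally controls for a third matrix; that extension is not shown here. The function names, the one-sided p-value convention, and the toy distances are assumptions for illustration only.

```python
import random
from math import sqrt


def upper_triangle(m):
    """Flatten the strict upper triangle of a square matrix."""
    n = len(m)
    return [m[i][j] for i in range(n) for j in range(i + 1, n)]


def pearson(x, y):
    """Pearson correlation coefficient of two equal-length lists."""
    mx, my = sum(x) / len(x), sum(y) / len(y)
    cov = sum((a - mx) * (b - my) for a, b in zip(x, y))
    sx = sqrt(sum((a - mx) ** 2 for a in x))
    sy = sqrt(sum((b - my) ** 2 for b in y))
    return cov / (sx * sy)


def mantel(d1, d2, permutations=999, seed=0):
    """Return (r, one-sided p) for the correlation between two distance matrices."""
    rng = random.Random(seed)
    n = len(d1)
    flat1 = upper_triangle(d1)
    observed = pearson(flat1, upper_triangle(d2))
    order = list(range(n))
    hits = 0
    for _ in range(permutations):
        rng.shuffle(order)  # relabel populations in the second matrix
        permuted = [[d2[order[i]][order[j]] for j in range(n)] for i in range(n)]
        if pearson(flat1, upper_triangle(permuted)) >= observed:
            hits += 1
    return observed, (hits + 1) / (permutations + 1)


# Toy data (illustrative only): five populations on a line, with genetic
# distance exactly proportional to geographic distance.
coords = [0, 1, 3, 7, 12]
geo = [[abs(a - b) for b in coords] for a in coords]
gen = [[0.01 * d for d in row] for row in geo]
r, p = mantel(geo, gen)
print(r, p)
```

With so few populations the number of distinct permutations is small, so the p-value from this toy example has coarse resolution; it is meant only to show the mechanics.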
Diversity and duplication of DQB and DRB-like genes of the MHC in baleen whales (suborder: Mysticeti).

PubMed

Baker, C S; Vant, M D; Dalebout, M L; Lento, G M; O'Brien, S J; Yuhki, N

2006-05-01

The molecular diversity and phylogenetic relationships of two class II genes of the baleen whale major histocompatibility complex were investigated and compared to toothed whales and out-groups. Amplification of the DQB exon 2 provided sequences showing high within-species and between-species nucleotide diversity and uninterrupted reading frames consistent with functional class II loci found in related mammals (e.g., ruminants). Cloning of amplified products indicated gene duplication in the humpback whale and triplication in the southern right whale, with average nucleotide diversity of 5.9 and 6.3%, respectively, for alleles of each species. Significantly higher nonsynonymous divergence at sites coding for peptide binding (32% for humpback and 40% for southern right) suggested that these loci were subject to positive (overdominant) selection. A population survey of humpback whales detected 23 alleles, differing by up to 21% of their inferred amino acid sequences. Amplification of the DRB exon 2 resulted in two groups of sequences. One was most similar to the DRB3 of the cow and present in all whales screened to date, including toothed whales. The second was most similar to the DRB2 of the cow and was found only in the bowhead and right whales. Both loci showed low diversity among species and apparent loss of function or altered function including interruption of reading frames. Finally, comparison of inferred protein sequence of the DRB3-like locus suggested convergence with the DQB, perhaps resulting from intergenic conversion or recombination.
Depletion of Unwanted Nucleic Acid Templates by Selective Cleavage: LNAzymes, Catalytically Active Oligonucleotides Containing Locked Nucleic Acids, Open a New Window for Detecting Rare Microbial Community Members

PubMed Central

Dolinšek, Jan; Dorninger, Christiane; Lagkouvardos, Ilias; Wagner, Michael

2013-01-01

Many studies of molecular microbial ecology rely on the characterization of microbial communities by PCR amplification, cloning, sequencing, and phylogenetic analysis of genes encoding rRNAs or functional marker enzymes. However, if the established clone libraries are dominated by one or a few sequence types, the cloned diversity is difficult to analyze by random clone sequencing. Here we present a novel approach to deplete unwanted sequence types from complex nucleic acid mixtures prior to cloning and downstream analyses. It employs catalytically active oligonucleotides containing locked nucleic acids (LNAzymes) for the specific cleavage of selected RNA targets. When combined with in vitro transcription and reverse transcriptase PCR, this LNAzyme-based technique can be used with DNA or RNA extracts from microbial communities. The simultaneous application of more than one specific LNAzyme allows the concurrent depletion of different sequence types from the same nucleic acid preparation. This new method was evaluated with defined mixtures of cloned 16S rRNA genes and then used to identify accompanying bacteria in an enrichment culture dominated by the nitrite oxidizer “Candidatus Nitrospira defluvii.” In silico analysis revealed that the majority of publicly deposited rRNA-targeted oligonucleotide probes may be used as specific LNAzymes with no or only minor sequence modifications. This efficient and cost-effective approach will greatly facilitate tasks such as the identification of microbial symbionts in nucleic acid preparations dominated by plastid or mitochondrial rRNA genes from eukaryotic hosts, the detection of contaminants in microbial cultures, and the analysis of rare organisms in microbial communities of highly uneven composition. PMID:23263968
The diversity of H3 loops determines the antigen-binding tendencies of antibody CDR loops.

PubMed

Tsuchiya, Yuko; Mizuguchi, Kenji

2016-04-01

Of the complementarity-determining regions (CDRs) of antibodies, H3 loops, with varying amino acid sequences and loop lengths, adopt particularly diverse loop conformations. The diversity of H3 conformations produces an array of antigen recognition patterns involving all the CDRs, in which the residue positions actually in contact with the antigen vary considerably. Therefore, for a deeper understanding of antigen recognition, it is necessary to relate the sequence and structural properties of each residue position in each CDR loop to its ability to bind antigens. In this study, we proposed a new method for characterizing the structural features of the CDR loops and obtained the antigen-binding ability of each residue position in each CDR loop. This analysis led to a simple set of rules for identifying probable antigen-binding residues. We also found that the diversity of H3 loop lengths and conformations affects the antigen-binding tendencies of all the CDR loops. © 2016 The Protein Society.
Capturing the genetic makeup of the active microbiome in situ.

PubMed

Singer, Esther; Wagner, Michael; Woyke, Tanja

2017-09-01

More than any other technology, nucleic acid sequencing has enabled microbial ecology studies to be complemented with the data volumes necessary to capture the extent of microbial diversity and dynamics in a wide range of environments. In order to truly understand and predict environmental processes, however, the distinction between active, inactive and dead microbial cells is critical. Also, experimental designs need to be sensitive toward varying population complexity and activity, and temporal as well as spatial scales of process rates. There are a number of approaches, including single-cell techniques, which were designed to study in situ microbial activity and that have been successively coupled to nucleic acid sequencing. The exciting new discoveries regarding in situ microbial activity provide evidence that future microbial ecology studies will indispensably rely on techniques that specifically capture members of the microbiome active in the environment. Herein, we review those currently used activity-based approaches that can be directly linked to shotgun nucleic acid sequencing, evaluate their relevance to ecology studies, and discuss future directions.
Characterization of the hepcidin gene in eight species of bats.

PubMed

Stasiak, Iga M; Smith, Dale A; Crawshaw, Graham J; Hammermueller, Jutta D; Bienzle, Dorothee; Lillie, Brandon N

2014-02-01

Hemochromatosis, or iron storage disease, has been associated with significant liver disease and mortality in captive Egyptian fruit bats (Rousettus aegyptiacus). The physiologic basis for this susceptibility has not been established. In humans, a deficiency of, or resistance to, the iron regulatory hormone hepcidin has been implicated in the development of hereditary hemochromatosis. In the present study, we compared the coding sequence of the hepcidin gene in eight species of bats representing three distinct taxonomic families with diverse life histories and dietary preferences. Bat hepcidin mRNA encoded a 23 amino acid signal peptide, a 34 or 35 amino acid pro-region, and a 25 amino acid mature peptide, similar to other mammalian species. Differences in the sequence of the portion of the hepcidin gene that encodes the mature peptide that might account for the increased susceptibility of the Egyptian fruit bat to iron storage disease were not identified. Variability in gene sequence corresponded to the taxonomic relationship amongst species. Copyright © 2013 Elsevier Ltd. All rights reserved.
Capturing the genetic makeup of the active microbiome in situ

PubMed Central

Singer, Esther; Wagner, Michael; Woyke, Tanja

2017-01-01

More than any other technology, nucleic acid sequencing has enabled microbial ecology studies to be complemented with the data volumes necessary to capture the extent of microbial diversity and dynamics in a wide range of environments. In order to truly understand and predict environmental processes, however, the distinction between active, inactive and dead microbial cells is critical. Also, experimental designs need to be sensitive toward varying population complexity and activity, and temporal as well as spatial scales of process rates. There are a number of approaches, including single-cell techniques, which were designed to study in situ microbial activity and that have been successively coupled to nucleic acid sequencing. The exciting new discoveries regarding in situ microbial activity provide evidence that future microbial ecology studies will indispensably rely on techniques that specifically capture members of the microbiome active in the environment. Herein, we review those currently used activity-based approaches that can be directly linked to shotgun nucleic acid sequencing, evaluate their relevance to ecology studies, and discuss future directions. PMID:28574490
Marine protist diversity in European coastal waters and sediments as revealed by high-throughput sequencing.

PubMed

Massana, Ramon; Gobet, Angélique; Audic, Stéphane; Bass, David; Bittner, Lucie; Boutte, Christophe; Chambouvet, Aurélie; Christen, Richard; Claverie, Jean-Michel; Decelle, Johan; Dolan, John R; Dunthorn, Micah; Edvardsen, Bente; Forn, Irene; Forster, Dominik; Guillou, Laure; Jaillon, Olivier; Kooistra, Wiebe H C F; Logares, Ramiro; Mahé, Frédéric; Not, Fabrice; Ogata, Hiroyuki; Pawlowski, Jan; Pernice, Massimo C; Probert, Ian; Romac, Sarah; Richards, Thomas; Santini, Sébastien; Shalchian-Tabrizi, Kamran; Siano, Raffaele; Simon, Nathalie; Stoeck, Thorsten; Vaulot, Daniel; Zingone, Adriana; de Vargas, Colomban

2015-10-01

Although protists are critical components of marine ecosystems, they are still poorly characterized. Here we analysed the taxonomic diversity of planktonic and benthic protist communities collected in six distant European coastal sites. Environmental deoxyribonucleic acid (DNA) and ribonucleic acid (RNA) from three size fractions (pico-, nano- and micro/mesoplankton), as well as from dissolved DNA and surface sediments, were used as templates for tag pyrosequencing of the V4 region of the 18S ribosomal DNA. Beta-diversity analyses split the protist community structure into three main clusters: picoplankton-nanoplankton-dissolved DNA, micro/mesoplankton and sediments. Within each cluster, protist communities from the same site and time clustered together, while communities from the same site but different seasons were unrelated. Both DNA and RNA-based surveys provided similar relative abundances for most class-level taxonomic groups. Yet, particular groups were overrepresented in one of the two templates, such as marine alveolates (MALV)-I and MALV-II, which were much more abundant in DNA surveys. Overall, the groups displaying the highest relative contribution were Dinophyceae, Diatomea, Ciliophora and Acantharia. Also well represented were Mamiellophyceae, Cryptomonadales, marine alveolates and marine stramenopiles in the picoplankton, and Monadofilosa and basal Fungi in sediments. Our extensive and systematic sequencing of geographically separated sites provides the most comprehensive molecular description of coastal marine protist diversity to date. © 2015 Society for Applied Microbiology and John Wiley & Sons Ltd.
Elucidating the substrate specificities of acyl-lipid thioesterases from diverse plant taxa.

PubMed

Kalinger, Rebecca S; Pulsifer, Ian P; Rowland, Owen

2018-06-01

Acyl-ACP thioesterase enzymes, which cleave fatty acyl thioester bonds to release free fatty acids, contribute to much of the fatty acid diversity in plants. In Arabidopsis thaliana, a family of four single hot-dog fold domain, plastid-localized acyl-lipid thioesterases (AtALT1-4) generate medium-chain (C6-C14) fatty and β-keto fatty acids as secondary metabolites. These volatile products may serve to attract insect pollinators or deter predatory insects. Homologs of AtALT1-4 are present in all plant taxa, but are nearly all uncharacterized. Despite high sequence identity, AtALT1-4 generate different lipid products, suggesting that ALT homologs in other plants also have highly varied activities. We investigated the catalytic diversity of ALT-like thioesterases by screening the substrate specificities of 15 ALT homologs from monocots, eudicots, a lycophyte, a green microalga, and the ancient gymnosperm Ginkgo biloba, via expression in Escherichia coli. Overall, these enzymes had highly varied substrate preferences compared to one another and to AtALT1-4, and could be classified into four catalytic groups comprising members from diverse taxa. Group 1 ALTs primarily generated 14:1 β-keto fatty acids, Group 2 ALTs produced 6-10 carbon fatty/β-keto fatty acids, Group 3 ALTs predominantly produced 12-14 carbon fatty acids, and Group 4 ALTs mainly generated 16 carbon fatty acids. Enzymes in each group differed significantly in the quantities of lipids and types of minor products they generated in E. coli. Medium-chain fatty acids are used to manufacture insecticides, pharmaceuticals, and biofuels, and ALT-like proteins are ideal candidates for metabolic engineering to produce specific fatty acids in significant quantities. Copyright © 2018 Elsevier Masson SAS. All rights reserved.
Nucleic Acid Extraction from Synthetic Mars Analog Soils for in situ Life Detection

PubMed Central

Mojarro, Angel; Ruvkun, Gary; Zuber, Maria T.

2017-01-01

Biological informational polymers such as nucleic acids have the potential to provide unambiguous evidence of life beyond Earth. To this end, we are developing an automated in situ life-detection instrument that integrates nucleic acid extraction and nanopore sequencing: the Search for Extra-Terrestrial Genomes (SETG) instrument. Our goal is to isolate and determine the sequence of nucleic acids from extant or preserved life on Mars, if, for example, there is common ancestry to life on Mars and Earth. As is true of metagenomic analysis of terrestrial environmental samples, the SETG instrument must isolate nucleic acids from crude samples and then determine the DNA sequence of the unknown nucleic acids. Our initial DNA extraction experiments resulted in low to undetectable amounts of DNA due to soil chemistry–dependent soil-DNA interactions, namely adsorption to mineral surfaces, binding to divalent/trivalent cations, destruction by iron redox cycling, and acidic conditions. Subsequently, we developed soil-specific extraction protocols that increase DNA yields through a combination of desalting, utilization of competitive binders, and promotion of anaerobic conditions. Our results suggest that a combination of desalting and utilizing competitive binders may establish a “universal” nucleic acid extraction protocol suitable for analyzing samples from diverse soils on Mars. Key Words: Life-detection instruments; Nucleic acids; Mars; Panspermia. Astrobiology 17, 747–760. PMID:28704064
Genetic diversity and antigenicity variation of Babesia bovis merozoite surface antigen-1 (MSA-1) in Thailand.

PubMed

Tattiyapong, Muncharee; Sivakumar, Thillaiampalam; Takemae, Hitoshi; Simking, Pacharathon; Jittapalapong, Sathaporn; Igarashi, Ikuo; Yokoyama, Naoaki

2016-07-01

Babesia bovis, an intraerythrocytic protozoan parasite, causes severe clinical disease in cattle worldwide. The genetic diversity of parasite antigens often results in different immune profiles in infected animals, hindering efforts to develop immune control methodologies against the B. bovis infection. In this study, we analyzed the genetic diversity of the merozoite surface antigen-1 (msa-1) gene using 162 B. bovis-positive blood DNA samples sourced from cattle populations reared in different geographical regions of Thailand. The identity scores shared among 93 msa-1 gene sequences isolated by PCR amplification were 43.5-100%, and the similarity values among the translated amino acid sequences were 42.8-100%. Of 23 total clades detected in our phylogenetic analysis, Thai msa-1 gene sequences occurred in 18 clades; seven among them were composed of sequences exclusively from Thailand. To investigate differential antigenicity of isolated MSA-1 proteins, we expressed and purified eight recombinant MSA-1 (rMSA-1) proteins, including an rMSA-1 from B. bovis Texas (T2Bo) strain and seven rMSA-1 proteins based on the Thai msa-1 sequences. When these antigens were analyzed in a western blot assay, anti-T2Bo cattle serum strongly reacted with the rMSA-1 from T2Bo, as well as with three other rMSA-1 proteins that shared 54.9-68.4% sequence similarity with T2Bo MSA-1. In contrast, no or weak reactivity was observed for the remaining rMSA-1 proteins, which shared low sequence similarity (35.0-39.7%) with T2Bo MSA-1. While demonstrating the high genetic diversity of the B. bovis msa-1 gene in Thailand, the present findings suggest that the genetic diversity results in antigenicity variations among the MSA-1 antigens of B. bovis in Thailand. Copyright © 2016 Elsevier B.V. All rights reserved.
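The identity ranges quoted in the abstract above are pairwise statistics over an alignment. The following minimal sketch shows one common way to compute such a range, assuming the sequences are already aligned to equal length and that columns containing a gap in either sequence are ignored. That gap convention, the function names, and the toy sequences are assumptions for illustration; the abstract does not say how the authors computed identity or similarity.

```python
from itertools import combinations


def percent_identity(a, b, gap="-"):
    """Percent of identical positions among aligned columns with no gap in either sequence."""
    if len(a) != len(b):
        raise ValueError("sequences must be aligned to the same length")
    compared = [(x, y) for x, y in zip(a, b) if x != gap and y != gap]
    if not compared:
        return 0.0
    return 100.0 * sum(x == y for x, y in compared) / len(compared)


def identity_range(seqs):
    """Minimum and maximum pairwise percent identity across all sequence pairs."""
    scores = [percent_identity(a, b) for a, b in combinations(seqs, 2)]
    return min(scores), max(scores)


# Toy alignment (illustrative only).
aligned = ["ATGCATGC", "ATGCATGC", "ATGAAT-C", "TTGAATGG"]
print(identity_range(aligned))  # (62.5, 100.0)
```

Amino acid similarity, as opposed to identity, would additionally need a substitution matrix or residue grouping to decide which non-identical residues count as similar.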
Simian immunodeficiency viruses from African green monkeys display unusual genetic diversity.

PubMed Central

Johnson, P R; Fomsgaard, A; Allan, J; Gravell, M; London, W T; Olmsted, R A; Hirsch, V M

1990-01-01

African green monkeys are asymptomatic carriers of simian immunodeficiency viruses (SIV), commonly called SIVagm. As many as 50% of African green monkeys in the wild may be SIV seropositive. This high seroprevalence rate and the potential for genetic variation of lentiviruses suggested to us that African green monkeys may harbor widely differing genotypes of SIVagm. To investigate this hypothesis, we determined the entire nucleotide sequence of an infectious proviral molecular clone of SIVagm (155-4) and partial sequences (long terminal repeat and Gag) of three other distinct SIVagm isolates (90, gri-1, and ver-1). Comparisons among the SIVagm isolates revealed extreme diversity at the nucleotide and amino acid levels. Long terminal repeat nucleotide sequences varied up to 35% and Gag protein sequences varied up to 30%. The variability among SIVagm isolates exceeded the variability among any other group of primate lentiviruses. Our data suggest that SIVagm has been in the African green monkey population for a long time and may be the oldest primate lentivirus group in existence. PMID:2304139
Functional identification and regulatory analysis of Δ6-fatty acid desaturase from the oleaginous fungus Mucor sp. EIM-10.

PubMed

Jiang, Xianzhang; Liu, Hongjiao; Niu, Yongchao; Qi, Feng; Zhang, Mingliang; Huang, Jianzhong

2017-03-01

To enlarge the diversity of the desaturases associated with PUFA biosynthesis and to better understand the transcriptional regulation of desaturases, a Δ6-desaturase gene (Md6) from Mucor sp. and its 5'-upstream sequence were functionally identified in Saccharomyces cerevisiae. Expression of the Δ6-fatty acid desaturase (Md6) in S. cerevisiae showed that Md6 could convert linolenic acid to γ-linolenic acid. Computational analysis of the promoter of Md6 suggested it contains several eukaryotic fundamental transcription regulatory elements. In vivo functional analysis of the promoter showed the 5'-upstream sequence of Md6 could initiate expression of GFP and Md6 itself in S. cerevisiae. A serial deletion analysis of the promoter suggested that the sequence between -919 and -784 bp (relative to the start site), named eMd6, is the key factor for high activity of Δ6-desaturase. The activity of Δ6-desaturase was increased by 2.8-fold and 2.5-fold when the eMd6 sequence was placed upstream of -434 in the forward or reverse orientation, respectively. To the best of our knowledge, the native promoter of Md6 from Mucor is the strongest promoter for Δ6-desaturase reported so far, and the sequence between -919 and -784 bp is an enhancer for Δ6-desaturase activity.
Ranalexin. A novel antimicrobial peptide from bullfrog (Rana catesbeiana) skin, structurally related to the bacterial antibiotic, polymyxin.

PubMed

Clark, D P; Durell, S; Maloy, W L; Zasloff, M

1994-04-08

Antimicrobial peptides comprise a diverse class of molecules used in host defense by plants, insects, and animals. In this study we have isolated a novel antimicrobial peptide from the skin of the bullfrog, Rana catesbeiana. This 20 amino acid peptide, which we have termed Ranalexin, has the amino acid sequence: NH2-Phe-Leu-Gly-Gly-Leu-Ile-Lys-Ile-Val-Pro-Ala-Met-Ile-Cys-Ala-Val-Thr-Lys-Lys-Cys-COOH, and it contains a single intramolecular disulfide bond which forms a heptapeptide ring within the molecule. Structurally, Ranalexin resembles the bacterial antibiotic, polymyxin, which contains a similar heptapeptide ring. We have also cloned the cDNA for Ranalexin from a metamorphic R. catesbeiana tadpole cDNA library. Based on the cDNA sequence, it appears that Ranalexin is initially synthesized as a propeptide with a putative signal sequence and an acidic amino acid-rich region at its amino-terminal end. Interestingly, the putative signal sequence of the Ranalexin cDNA is strikingly similar to the signal sequence of opioid peptide precursors isolated from the skin of the South American frogs Phyllomedusa sauvagei and Phyllomedusa bicolor. Northern blot analysis and in situ hybridization experiments demonstrated that Ranalexin mRNA is first expressed in R. catesbeiana skin at metamorphosis and continues to be expressed into adulthood.
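The length and ring size stated in the abstract above can be checked directly from the quoted sequence. The short Python sketch below is only an illustration: the helper names (to_one_letter, disulfide_ring_size) are hypothetical, and it assumes that the single disulfide bond joins the only two cysteines in the peptide and that the ring is counted inclusive of both.

```python
# Three-letter to one-letter codes for the residues that occur in the peptide.
THREE_TO_ONE = {
    "Phe": "F", "Leu": "L", "Gly": "G", "Ile": "I", "Lys": "K",
    "Val": "V", "Pro": "P", "Ala": "A", "Met": "M", "Cys": "C", "Thr": "T",
}

# Sequence as quoted in the abstract, without the NH2/COOH termini.
RANALEXIN = ("Phe-Leu-Gly-Gly-Leu-Ile-Lys-Ile-Val-Pro-"
             "Ala-Met-Ile-Cys-Ala-Val-Thr-Lys-Lys-Cys")


def to_one_letter(three_letter_seq):
    """Convert a hyphen-separated three-letter sequence to one-letter code."""
    return "".join(THREE_TO_ONE[res] for res in three_letter_seq.split("-"))


def disulfide_ring_size(seq):
    """Number of residues in the ring closed by a single Cys-Cys bond.

    Assumption: the peptide has exactly two cysteines and they are the
    bonded pair; both cysteines are counted as part of the ring.
    """
    cys_positions = [i for i, aa in enumerate(seq) if aa == "C"]
    if len(cys_positions) != 2:
        raise ValueError("expected exactly two cysteines")
    return cys_positions[1] - cys_positions[0] + 1


one_letter = to_one_letter(RANALEXIN)
ring = disulfide_ring_size(one_letter)
print(one_letter, len(one_letter), ring)
```

Under these assumptions the peptide has 20 residues and the two cysteines enclose a seven-residue ring, consistent with the heptapeptide ring described in the abstract.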

Conservation of a pH-sensitive structure in the C-terminal region of spider silk extends across the entire silk gene family.

PubMed

Strickland, Michelle; Tudorica, Victor; Řezáč, Milan; Thomas, Neil R; Goodacre, Sara L

2018-06-01

Spiders produce multiple silks with different physical properties that allow them to occupy a diverse range of ecological niches, including the underwater environment. Despite this functional diversity, past molecular analyses show a high degree of amino acid sequence similarity between C-terminal regions of silk genes that appear to be independent of the physical properties of the resulting silks; instead, this domain is crucial to the formation of silk fibers. Here, we present an analysis of the C-terminal domain of all known types of spider silk and include silk sequences from the spider Argyroneta aquatica, which spins the majority of its silk underwater. Our work indicates that spiders have retained a highly conserved mechanism of silk assembly, despite the extraordinary diversification of species, silk types and applications of silk over 350 million years. Sequence analysis of the silk C-terminal domain across the entire gene family shows the conservation of two uncommon amino acids that are implicated in the formation of a salt bridge, a functional bond essential to protein assembly. This conservation extends to the novel sequences isolated from A. aquatica. This finding is relevant to research regarding the artificial synthesis of spider silk, suggesting that synthesis of all silk types will be possible using a single process.
Selecting Fully-Modified XNA Aptamers Using Synthetic Genetics.

PubMed

Taylor, Alexander I; Holliger, Philipp

2018-06-01

This unit describes the application of "synthetic genetics," i.e., the replication of xeno nucleic acids (XNAs), artificial analogs of DNA and RNA bearing alternative backbone or sugar congeners, to the directed evolution of synthetic oligonucleotide ligands (XNA aptamers) specific for target proteins or nucleic acid motifs, using a cross-chemistry selective exponential enrichment (X-SELEX) approach. Protocols are described for synthesis of diverse-sequence XNA repertoires (typically 10^14 molecules) using DNA templates, isolation and panning for functional XNA sequences using targets immobilized on solid phase or gel shift induced by target binding in solution, and XNA reverse transcription to allow cDNA amplification or sequencing. The method may be generally applied to select fully-modified XNA aptamers specific for a wide range of target molecules. © 2018 by John Wiley & Sons, Inc.
Quantifying selection and diversity in viruses by entropy methods, with application to the haemagglutinin of H3N2 influenza

PubMed Central

Pan, Keyao; Deem, Michael W.

2011-01-01

Many viruses evolve rapidly. For example, haemagglutinin (HA) of the H3N2 influenza A virus evolves to escape antibody binding. This evolution of the H3N2 virus means that people who have previously been exposed to an influenza strain may be infected by a newly emerged virus. In this paper, we use Shannon entropy and relative entropy to measure the diversity and selection pressure by an antibody in each amino acid site of H3 HA between the 1992–1993 season and the 2009–2010 season. Shannon entropy and relative entropy are two independent state variables that we use to characterize H3N2 evolution. The entropy method estimates future H3N2 evolution and migration using currently available H3 HA sequences. First, we show that the rate of evolution increases with the virus diversity in the current season. The Shannon entropy of the sequence in the current season predicts relative entropy between sequences in the current season and those in the next season. Second, a global migration pattern of H3N2 is assembled by comparing the relative entropy flows of sequences sampled in China, Japan, the USA and Europe. We verify this entropy method by describing two aspects of historical H3N2 evolution. First, we identify 54 amino acid sites in HA that have evolved in the past to evade the immune system. Second, the entropy method shows that epitopes A and B on the top of HA evolve most vigorously to escape antibody binding. Our work provides a novel entropy-based method to predict and quantify future H3N2 evolution and to describe the evolutionary history of H3N2. PMID:21543352
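The two per-site quantities named in the abstract above can be illustrated with a minimal Python sketch. This is not the authors' implementation: the function names, the use of natural logarithms, the direction of the relative entropy (next season relative to current season), and the small pseudocount floor for residues unseen in the current season are all assumptions made for illustration.

```python
import math
from collections import Counter


def site_frequencies(column):
    """Relative frequency of each residue in one alignment column."""
    counts = Counter(column)
    total = sum(counts.values())
    return {aa: n / total for aa, n in counts.items()}


def shannon_entropy(column):
    """Shannon entropy (in nats) of the residues observed at one site."""
    return -sum(p * math.log(p) for p in site_frequencies(column).values())


def relative_entropy(next_column, current_column, floor=1e-6):
    """Relative entropy D(next || current) at one site, in nats.

    Assumption: residues absent from the current season are given a small
    floor probability so the logarithm stays finite.
    """
    p = site_frequencies(next_column)
    q = site_frequencies(current_column)
    return sum(
        p_aa * math.log(p_aa / max(q.get(aa, 0.0), floor))
        for aa, p_aa in p.items()
    )


# Toy columns: one character per sampled sequence at a single HA site.
current_season = "AACC"
next_season = "AAAA"
print(shannon_entropy(current_season))
print(relative_entropy(next_season, current_season))
```

In this toy case the current season is evenly split between two residues, so its entropy is ln 2, and a next season fixed on one of them has relative entropy ln 2 against it.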
The Vaginal Eukaryotic DNA Virome and Preterm Birth.

PubMed

Wylie, Kristine M; Wylie, Todd N; Cahill, Alison G; Macones, George A; Tuuli, Methodius G; Stout, Molly J

2018-05-05

Despite decades of attempts to link infectious agents to preterm birth, an exact causative microbe or community of microbes remains elusive. Culture-independent sequencing of vaginal bacterial communities demonstrates community characteristics are associated with preterm birth, although none are specific enough to apply clinically. Viruses are important components of the vaginal microbiome and have dynamic relationships with vaginal bacterial communities. We hypothesized that vaginal eukaryotic DNA viral communities (the "vaginal virome") either alone or in the context of bacterial communities are associated with preterm birth. The objective of this study was to use high-throughput sequencing to examine the vaginal eukaryotic DNA virome in a cohort of pregnant women and examine associations between vaginal community characteristics and preterm birth. This is a nested case-control study within a prospective cohort study of women with singleton pregnancies, not on supplemental progesterone, and without cervical cerclage in situ. Serial mid-vaginal swabs were obtained at routine prenatal visits. DNA was extracted, bacterial communities were characterized by 16S rRNA gene sequencing, and eukaryotic viral communities were characterized by enrichment of viral nucleic acid with the ViroCap targeted sequence capture panel followed by nucleic acid sequencing. Viral communities were analyzed according to presence/absence of viruses, diversity, dynamics over time, and association with bacterial community data obtained from the same specimens. Sixty subjects contributed 128 vaginal swabs longitudinally across pregnancy. Twenty-four patients delivered preterm. Participants were predominantly African-American (65%). Six families of eukaryotic DNA viruses were detected in the vaginal samples. At least 1 virus was detected in 80% of women. No specific virus or group of viruses was associated with preterm delivery. 
Higher viral richness was significantly associated with preterm delivery in the full group and in the African American subgroup (P=0.0005 and P=0.0003, respectively). Having both high bacterial diversity and high viral diversity in the first trimester was associated with the highest risk for preterm birth. Higher vaginal viral diversity is associated with preterm birth. Changes in vaginal virome diversity appear similar to changes in the vaginal bacterial microbiome over pregnancy, suggesting that underlying physiology of pregnancy may regulate both bacterial and viral communities. Copyright © 2018 Elsevier Inc. All rights reserved.
Large diversity of the piggyBac-like elements in the genome of Tribolium castaneum

PubMed Central

Wang, Jianjun; Du, Yuzhou; Wang, Suzhi; Brown, Sue; Park, Yoonseong

2011-01-01

The piggyBac transposable element, originally discovered in the cabbage looper, Trichoplusia ni, has been widely used in insect transgenesis including the red flour beetle Tribolium castaneum. We surveyed piggyBac-like (PLE) sequences in the genome of Tribolium castaneum by homology searches using as queries the diverse PLE sequences that have been described previously. The search yielded a total of 32 piggyBac-like elements (TcPLEs) which were classified into 14 distinct groups. Most of the TcPLEs contain defective functional motifs in that they are lacking inverted terminal repeats or have disrupted open reading frames. Only one single copy of TcPLE1 appears to be intact with imperfect 16 bp inverted terminal repeats flanking an open reading frame encoding a transposase of 571 amino acid residues. Many copies of TcPLEs were found to be inserted into or close to other transposon-like sequences. This large diversity of TcPLEs with generally low copy numbers suggests multiple invasions of the TcPLEs over a long evolutionary time without extensive multiplications or occurrence of rapid loss of TcPLEs copies. PMID:18342253
Phylogenetic distribution of phenotypic traits in bacillus thuringiensis analyzed by multilocus sequence typing

USDA-ARS's Scientific Manuscript database

Strains from a collection of 3,639 diverse Bacillus thuringiensis isolates were classified based on phenotypic profiles resulting from six biochemical tests, including production of amylase (T), lecithinase (L), urease (U), acid from sucrose (S) and salicin (A), and the hydrolysis of esculin (E). St...
Computational analysis of sequence selection mechanisms.

PubMed

Meyerguz, Leonid; Grasso, Catherine; Kleinberg, Jon; Elber, Ron

2004-04-01

Mechanisms leading to gene variations are responsible for the diversity of species and are important components of the theory of evolution. One constraint on gene evolution is that of protein foldability; the three-dimensional shapes of proteins must be thermodynamically stable. We explore the impact of this constraint and calculate properties of foldable sequences using 3660 structures from the Protein Data Bank. We seek a selection function that receives sequences as input, and outputs survival probability based on sequence fitness to structure. We compute the number of sequences that match a particular protein structure with energy lower than the native sequence, the density of the number of sequences, the entropy, and the "selection" temperature. The mechanism of structure selection for sequences longer than 200 amino acids is approximately universal. For shorter sequences, it is not. We speculate on concrete evolutionary mechanisms that show this behavior.
Low level of sequence diversity at merozoite surface protein-1 locus of Plasmodium ovale curtisi and P. ovale wallikeri from Thai isolates.

PubMed

Putaporntip, Chaturong; Hughes, Austin L; Jongwutiwes, Somchai

2013-01-01

The merozoite surface protein-1 (MSP-1) is a candidate target for the development of blood stage vaccines against malaria. Polymorphism in MSP-1 can be useful as a genetic marker for strain differentiation in malarial parasites. Although sequence diversity in the MSP-1 locus has been extensively analyzed in field isolates of Plasmodium falciparum and P. vivax, the extent of variation in its homologues in P. ovale curtisi and P. ovale wallikeri, remains unknown. Analysis of the mitochondrial cytochrome b sequences of 10 P. ovale isolates from symptomatic malaria patients from diverse endemic areas of Thailand revealed co-existence of P. ovale curtisi (n = 5) and P. ovale wallikeri (n = 5). Direct sequencing of the PCR-amplified products encompassing the entire coding region of MSP-1 of P. ovale curtisi (PocMSP-1) and P. ovale wallikeri (PowMSP-1) has identified 3 imperfect repeated segments in the former and one in the latter. Most amino acid differences between these proteins were located in the interspecies variable domains of malarial MSP-1. Synonymous nucleotide diversity (πS) exceeded nonsynonymous nucleotide diversity (πN) for both PocMSP-1 and PowMSP-1, albeit at a non-significant level. However, when MSP-1 of both these species was considered together, πS was significantly greater than πN (p<0.0001), suggesting that purifying selection has shaped diversity at this locus prior to speciation. Phylogenetic analysis based on conserved domains has placed PocMSP-1 and PowMSP-1 in a distinct bifurcating branch that probably diverged from each other around 4.5 million years ago. The MSP-1 sequences support that P. ovale curtisi and P. ovale wallikeri are distinct species. Both species are sympatric in Thailand. The low level of sequence diversity in PocMSP-1 and PowMSP-1 among Thai isolates could stem from persistent low prevalence of these species, limiting the chance of outcrossing at this locus.
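Nucleotide diversity (π) is conventionally the average number of nucleotide differences per site between pairs of sequences. The Python sketch below computes only this overall π for a toy alignment; it does not split sites into synonymous and nonsynonymous classes as the πS and πN values in the abstract require, it ignores gaps, and the function names are hypothetical.

```python
from itertools import combinations


def pairwise_differences(a, b):
    """Count mismatched positions between two aligned sequences."""
    if len(a) != len(b):
        raise ValueError("sequences must be aligned to the same length")
    return sum(1 for x, y in zip(a, b) if x != y)


def nucleotide_diversity(aligned_seqs):
    """Average pairwise differences per site over all sequence pairs."""
    pairs = list(combinations(aligned_seqs, 2))
    if not pairs:
        raise ValueError("need at least two sequences")
    length = len(aligned_seqs[0])
    total = sum(pairwise_differences(a, b) for a, b in pairs)
    return total / (len(pairs) * length)


# Toy alignment of three 4-nt sequences (not data from the study).
toy = ["ACGT", "ACGA", "ACGT"]
pi = nucleotide_diversity(toy)
print(pi)
```

A πS/πN comparison would apply the same averaging separately to synonymous and nonsynonymous differences, normalized by the corresponding numbers of sites.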
Low Level of Sequence Diversity at Merozoite Surface Protein-1 Locus of Plasmodium ovale curtisi and P. ovale wallikeri from Thai Isolates

PubMed Central

Putaporntip, Chaturong; Hughes, Austin L.; Jongwutiwes, Somchai

2013-01-01

Background: The merozoite surface protein-1 (MSP-1) is a candidate target for the development of blood stage vaccines against malaria. Polymorphism in MSP-1 can be useful as a genetic marker for strain differentiation in malarial parasites. Although sequence diversity in the MSP-1 locus has been extensively analyzed in field isolates of Plasmodium falciparum and P. vivax, the extent of variation in its homologues in P. ovale curtisi and P. ovale wallikeri, remains unknown. Methodology/Principal Findings: Analysis of the mitochondrial cytochrome b sequences of 10 P. ovale isolates from symptomatic malaria patients from diverse endemic areas of Thailand revealed co-existence of P. ovale curtisi (n = 5) and P. ovale wallikeri (n = 5). Direct sequencing of the PCR-amplified products encompassing the entire coding region of MSP-1 of P. ovale curtisi (PocMSP-1) and P. ovale wallikeri (PowMSP-1) has identified 3 imperfect repeated segments in the former and one in the latter. Most amino acid differences between these proteins were located in the interspecies variable domains of malarial MSP-1. Synonymous nucleotide diversity (πS) exceeded nonsynonymous nucleotide diversity (πN) for both PocMSP-1 and PowMSP-1, albeit at a non-significant level. However, when MSP-1 of both these species was considered together, πS was significantly greater than πN (p<0.0001), suggesting that purifying selection has shaped diversity at this locus prior to speciation. Phylogenetic analysis based on conserved domains has placed PocMSP-1 and PowMSP-1 in a distinct bifurcating branch that probably diverged from each other around 4.5 million years ago. Conclusion/Significance: The MSP-1 sequences support that P. ovale curtisi and P. ovale wallikeri are distinct species. Both species are sympatric in Thailand. The low level of sequence diversity in PocMSP-1 and PowMSP-1 among Thai isolates could stem from persistent low prevalence of these species, limiting the chance of outcrossing at this locus. PMID:23536840
Diversity Surveys and Evolutionary Relationships of aoxB Genes in Aerobic Arsenite-Oxidizing Bacteria

PubMed Central

Quéméneur, Marianne; Heinrich-Salmeron, Audrey; Muller, Daniel; Lièvremont, Didier; Jauzein, Michel; Bertin, Philippe N.; Garrido, Francis; Joulian, Catherine

2008-01-01

A new primer set was designed to specifically amplify ca. 1,100 bp of aoxB genes encoding the As(III) oxidase catalytic subunit from taxonomically diverse aerobic As(III)-oxidizing bacteria. Comparative analysis of AoxB protein sequences showed variable conservation levels and highlighted the conservation of essential amino acids and structural motifs. AoxB phylogeny of pure strains showed well-discriminated taxonomic groups and was similar to 16S rRNA phylogeny. Alphaproteobacteria-, Betaproteobacteria-, and Gammaproteobacteria-related sequences were retrieved from environmental surveys, demonstrating their prevalence in mesophilic As-contaminated soils. Our study underlines the usefulness of the aoxB gene as a functional marker of aerobic As(III) oxidizers. PMID:18502920
Molecular Signatures of Microbial Metabolism in an Actively Growing, Silicified, Microbial Structure from Yellowstone National Park

NASA Astrophysics Data System (ADS)

Ferreira, M.; Creveling, J.; Hilburn, I.; Karlsson, E.; Pepe-Ranney, C.; Spear, J.; Dawson, S.; Geobio2008, I.

2008-12-01

Silicified structures that exhibit a putative biologic component in their formation permeate the rock record as stromatolites. We have studied a silicified microbial structure from a hot spring in Yellowstone National Park using phenotypic, phylogenetic, and metagenomic analyses to determine microbial carbon metabolic pathways and the phylogenetic affiliations of microbes present in this unique structure. In this multi-faceted approach, dominant physiologies, specifically with regard to anaerobic and aerobic metabolisms, were inferred from 16S rRNA gene sequences and 454 sequencing data from bulk DNA samples of the structure. Carbon utilization as indicated by ECO Biolog plates showed abundant heterotrophy and heterotrophic diversity throughout the microbial structure. Microbes within the structure are able to utilize all tested sources of carbohydrates, lipids/fatty acids, and protein/amino acids as carbon sources. ECO plate testing of the hot spring water yielded considerably less carbohydrate consumption (only 4 out of 13 tested carbohydrates) and similar lipids/fatty acids and protein/amino acids consumption (2 out of 3 and 5 out of 5 tested sources, respectively). Full-length 16S rRNA gene sequences and metagenomic 454 pyrosequencing of community DNA showed limited diversity among primary producers. From the 16S data, the majority of the autotrophs are inferred to utilize the Calvin cycle for CO2 fixation, followed by 3-hydroxypropionate/4-hydroxybutyrate CO2 fixation. However, an analysis of the metagenomic data compared to the KEGG database does not show genes directly involved with Calvin cycle carbon fixation. Further BLAST searches of our data failed to find significant matches within our 6514 metagenomic sequences to known RuBisCo sequences taken from the NCBI database. This is likely due to a far under-sampled dataset of metagenomic sequences, and the low number (958) that had matches to the KEGG pathways database.
Anaerobic versus aerobic physiology also can be estimated from the 16S clone libraries. Phylogenetic analysis of recovered 16S sequences suggests that 15% of the 16S sequences can be attributed to anaerobic microbes while 42% likely come from aerobes. The remaining 43% of 16S rRNA gene sequences belong to metabolically unassigned phyla both known and novel. This preliminary study demonstrates that the small spatially stratified silicified microbial structure present on the margins of a hot spring contains a rich and complex microbial community with different trophic levels and enzymatic pathways.
Genomic diversity and versatility of Lactobacillus plantarum, a natural metabolic engineer.

PubMed

Siezen, Roland J; van Hylckama Vlieg, Johan E T

2011-08-30

In the past decade it has become clear that the lactic acid bacterium Lactobacillus plantarum occupies a diverse range of environmental niches and has an enormous diversity in phenotypic properties, metabolic capacity and industrial applications. In this review, we describe how genome sequencing, comparative genome hybridization and comparative genomics have provided insight into the underlying genomic diversity and versatility of L. plantarum. One of the main features appears to be genomic life-style islands consisting of numerous functional gene cassettes, in particular for carbohydrate utilization, which can be acquired, shuffled, substituted or deleted in response to niche requirements. In this sense, L. plantarum can be considered a "natural metabolic engineer".
Genomic diversity and versatility of Lactobacillus plantarum, a natural metabolic engineer

PubMed Central

2011-01-01

In the past decade it has become clear that the lactic acid bacterium Lactobacillus plantarum occupies a diverse range of environmental niches and has an enormous diversity in phenotypic properties, metabolic capacity and industrial applications. In this review, we describe how genome sequencing, comparative genome hybridization and comparative genomics have provided insight into the underlying genomic diversity and versatility of L. plantarum. One of the main features appears to be genomic life-style islands consisting of numerous functional gene cassettes, in particular for carbohydrate utilization, which can be acquired, shuffled, substituted or deleted in response to niche requirements. In this sense, L. plantarum can be considered a “natural metabolic engineer”. PMID:21995294
Phylogenetic comparison of the methanogenic communities from an acidic, oligotrophic fen and an anaerobic digester treating municipal wastewater sludge.

PubMed

Steinberg, Lisa M; Regan, John M

2008-11-01

Methanogens play a critical role in the decomposition of organics under anaerobic conditions. The methanogenic consortia in saturated wetland soils are often subjected to large temperature fluctuations and acidic conditions, imposing a selective pressure for psychro- and acidotolerant community members; however, methanogenic communities in engineered digesters are frequently maintained within a narrow range of mesophilic and circumneutral conditions to retain system stability. To investigate the hypothesis that these two disparate environments have distinct methanogenic communities, the methanogens in an oligotrophic acidic fen and a mesophilic anaerobic digester treating municipal wastewater sludge were characterized by creating clone libraries for the 16S rRNA and methyl coenzyme M reductase alpha subunit (mcrA) genes. A quantitative framework was developed to assess the differences between these two communities by calculating the average sequence similarity for 16S rRNA genes and mcrA within a genus and family using sequences of isolated and characterized methanogens within the approved methanogen taxonomy. The average sequence similarities for 16S rRNA genes within a genus and family were 96.0 and 93.5%, respectively, and the average sequence similarities for mcrA within a genus and family were 88.9 and 79%, respectively. The clone libraries of the bog and digester environments showed no overlap at the species level and almost no overlap at the family level. Both libraries were dominated by clones related to uncultured methanogen groups within the Methanomicrobiales, although members of the Methanosarcinales and Methanobacteriales were also found in both libraries. Diversity indices for the 16S rRNA gene library of the bog and both mcrA libraries were similar, but these indices indicated much lower diversity in the 16S digester library than in the other three libraries.
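The within-genus and within-family averages described in the abstract above can be sketched as a mean of pairwise similarities restricted to sequences that share a taxonomic label. The Python example below is a simplified illustration under stated assumptions: similarity is taken as plain percent identity over pre-aligned, equal-length sequences, all within-group pairs are pooled before averaging, and the function names and toy records are hypothetical rather than taken from the study.

```python
from collections import defaultdict
from itertools import combinations


def percent_identity(a, b):
    """Percent of identical positions between two aligned sequences."""
    if len(a) != len(b):
        raise ValueError("sequences must be aligned to the same length")
    return 100.0 * sum(x == y for x, y in zip(a, b)) / len(a)


def mean_within_group_similarity(records):
    """Average pairwise identity over pairs that share a group label.

    records: iterable of (group_label, aligned_sequence) tuples, where the
    label could be a genus or a family name.
    """
    groups = defaultdict(list)
    for label, seq in records:
        groups[label].append(seq)
    scores = [
        percent_identity(a, b)
        for seqs in groups.values()
        for a, b in combinations(seqs, 2)
    ]
    if not scores:
        raise ValueError("no group contains two or more sequences")
    return sum(scores) / len(scores)


# Toy records labelled by a made-up genus name.
toy_records = [
    ("GenusA", "ACGT"), ("GenusA", "ACGA"),
    ("GenusB", "TTTT"), ("GenusB", "TTTT"),
]
mean_similarity = mean_within_group_similarity(toy_records)
print(mean_similarity)
```

Running the same function with family labels instead of genus labels would give the family-level average; thresholds derived this way can then be used to judge at which taxonomic level two clone libraries overlap.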
Genetic Diversity and Selective Pressure in Hepatitis C Virus Genotypes 1-6: Significance for Direct-Acting Antiviral Treatment and Drug Resistance.

PubMed

Cuypers, Lize; Li, Guangdi; Libin, Pieter; Piampongsant, Supinya; Vandamme, Anne-Mieke; Theys, Kristof

2015-09-16

Treatment with pan-genotypic direct-acting antivirals, targeting different viral proteins, is the best option for clearing hepatitis C virus (HCV) infection in chronically infected patients. However, the diversity of the HCV genome is a major obstacle for the development of antiviral drugs, vaccines, and genotyping assays. In this large-scale analysis, genome-wide diversity and selective pressure were mapped, focusing on positions important for treatment, drug resistance, and resistance testing. A dataset of 1415 full-genome sequences, including genotypes 1-6 from the Los Alamos database, was analyzed. In 44% of all full-genome positions, the consensus amino acid was different for at least one genotype. Focusing on positions sharing the same consensus amino acid in all genotypes revealed that only 15% was defined as pan-genotypic highly conserved (≥99% amino acid identity) and an additional 24% as pan-genotypic conserved (≥95%). Despite its large genetic diversity, across all genotypes, codon positions were rarely identified to be positively selected (0.23%-0.46%) and predominantly found to be under negative selective pressure, suggesting mainly neutral evolution. For NS3, NS5A, and NS5B, respectively, 40% (6/15), 33% (3/9), and 14% (2/14) of the resistance-related positions harbored as consensus the amino acid variant related to resistance, potentially impeding treatment. For example, the NS3 variant 80K, conferring resistance to simeprevir used for treatment of HCV1 infected patients, was present in 39.3% of the HCV1a strains and 0.25% of HCV1b strains. Both NS5A variants 28M and 30S, known to be associated with resistance to the pan-genotypic drug daclatasvir, were found in a significant proportion of HCV4 strains (10.7%). NS5B variant 556G, known to confer resistance to non-nucleoside inhibitor dasabuvir, was observed in 8.4% of the HCV1b strains.
Given the large HCV genetic diversity, sequencing efforts for resistance testing purposes may need to be genotype-specific or geographically tailored.
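The position-level classification described in the abstract above (a shared consensus amino acid across genotypes, then ≥99% or ≥95% identity) can be sketched as follows. This Python example is an illustration only: the function names are hypothetical, and it assumes that identity is measured on the pooled sequences of all genotypes, which may differ from how the study applied its thresholds.

```python
from collections import Counter


def consensus(column):
    """Return (most common residue, its frequency) for one alignment column."""
    residue, count = Counter(column).most_common(1)[0]
    return residue, count / len(column)


def classify_position(columns_by_genotype):
    """Classify one amino acid position from per-genotype alignment columns.

    columns_by_genotype: dict mapping a genotype label to a string holding
    the residue of every sequence of that genotype at this position.
    """
    consensus_residues = {
        consensus(col)[0] for col in columns_by_genotype.values()
    }
    if len(consensus_residues) > 1:
        return "consensus differs between genotypes"
    # Assumption: identity is computed over all genotypes pooled together.
    pooled = "".join(columns_by_genotype.values())
    _, identity = consensus(pooled)
    if identity >= 0.99:
        return "pan-genotypic highly conserved"
    if identity >= 0.95:
        return "pan-genotypic conserved"
    return "shared consensus, variable"


# Toy columns for two genotypes (not data from the study).
print(classify_position({"1": "K" * 100, "2": "K" * 100}))
print(classify_position({"1": "K" * 96 + "R" * 4, "2": "K" * 100}))
print(classify_position({"1": "K" * 10, "2": "R" * 10}))
```

The three toy calls fall into the highly conserved, conserved, and differing-consensus classes, respectively.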
High Diversity of CTX-M Extended-Spectrum β-Lactamases in Municipal Wastewater and Urban Wetlands

PubMed Central

Borgogna, Timothy R.; Borgogna, Joanna-Lynn; Mielke, Jenna A.; Brown, Celeste J.; Top, Eva M.; Botts, Ryan T.

2016-01-01

The CTX-M-type extended-spectrum β-lactamases (ESBLs) present a serious public health threat as they have become nearly ubiquitous among clinical gram-negative pathogens, particularly the enterobacteria. To aid in the understanding and eventual control of the spread of such resistance genes, we sought to determine the diversity of CTX-M ESBLs not among clinical isolates, but in the environment, where weaker and more diverse selective pressures may allow greater enzyme diversification. This was done by examining the CTX-M diversity in municipal wastewater and urban coastal wetlands in southern California, United States, by Sanger sequencing of polymerase chain reaction amplicons. Of the five known CTX-M phylogroups (1, 2, 8, 9, and 25), only genes from groups 1 and 2 were detected in both wastewater treatment plants (WWTPs), and group 1 genes were also detected in one of the two wetlands after a winter rain. The highest relative abundance of blaCTX-M group 1 genes was in the sludge of one WWTP (2.1 × 10^−4 blaCTX-M copies/16S rRNA gene copy). Gene libraries revealed surprisingly high nucleotide sequence diversity, with 157 new variants not found in GenBank, representing 99 novel amino acid sequences. Our results indicate that the resistomes of WWTPs and urban wetlands contain diverse blaCTX-M ESBLs, which may constitute a mobile reservoir of clinically relevant resistance genes. PMID:26670020
Vba2p, a vacuolar membrane protein involved in basic amino acid transport in Schizosaccharomyces pombe.

PubMed

Sugimoto, Naoko; Iwaki, Tomoko; Chardwiriyapreecha, Soracom; Shimazu, Masamitsu; Sekito, Takayuki; Takegawa, Kaoru; Kakinuma, Yoshimi

2010-01-01

A recent study filling the gap in the genome sequence in the left arm of chromosome 2 of Schizosaccharomyces pombe revealed a homolog of budding yeast Vba2p, a vacuolar transporter of basic amino acids. GFP-tagged Vba2p in fission yeast was localized to the vacuolar membrane. Upon disruption of vba2, the uptake of several amino acids, including lysine, histidine, and arginine, was impaired. A transient increase in lysine uptake under nitrogen starvation was lowered by this mutation. These findings suggest that Vba2p is involved in basic amino acid transport in S. pombe under diverse conditions.
Exploiting genes and functional diversity of chlorogenic acid and luteolin biosyntheses in Lonicera japonica and their substitutes.

PubMed

Yuan, Yuan; Wang, Zhouyong; Jiang, Chao; Wang, Xumin; Huang, Luqi

2014-01-25

Chlorogenic acids (CGAs) and luteolin are active compounds in Lonicera japonica, a plant of high medicinal value in traditional Chinese medicine. This study provides a comprehensive overview of gene families involved in chlorogenic acid and luteolin biosynthesis in L. japonica, as well as its substitutes Lonicera hypoglauca and Lonicera macranthoides. The gene sequence features and gene expression patterns in various tissues and buds of the species were characterized. Bioinformatics analysis identified 14 chlorogenic acid and luteolin biosynthesis-related genes in the L. japonica transcriptome assembly. Phylogenetic analyses suggested that the functions of individual genes may have differentiated, giving rise to active compound diversity. Their orthologous genes were also recognized in L. hypoglauca and L. macranthoides genomic datasets, except for LHCHS1 and LMC4H2. The expression patterns of these genes differ among the tissues of L. japonica, L. hypoglauca and L. macranthoides. Results also showed that CGAs were controlled in the first step of biosynthesis, whereas both steps controlled luteolin in the bud of L. japonica. The expression of LJFNS2 exhibited positive correlation with luteolin levels in L. japonica. This study provides significant information for understanding the functional diversity of gene families involved in chlorogenic acid and luteolin biosynthesis, the active compound diversity of L. japonica and its substitutes, and the different usages of the three species. Copyright © 2012. Published by Elsevier B.V.
Penicillium arizonense, a new, genome sequenced fungal species, reveals a high chemical diversity in secreted metabolites.

PubMed

Grijseels, Sietske; Nielsen, Jens Christian; Randelovic, Milica; Nielsen, Jens; Nielsen, Kristian Fog; Workman, Mhairi; Frisvad, Jens Christian

2016-10-14

A new soil-borne species belonging to the Penicillium section Canescentia is described, Penicillium arizonense sp. nov. (type strain CBS 141311T = IBT 12289T). The genome was sequenced and assembled into 33.7 Mb containing 12,502 predicted genes. A phylogenetic assessment based on marker genes confirmed the grouping of P. arizonense within section Canescentia. Compared to related species, P. arizonense proved to encode a high number of proteins involved in carbohydrate metabolism, in particular hemicellulases. Mining the genome for genes involved in secondary metabolite biosynthesis resulted in the identification of 62 putative biosynthetic gene clusters. Extracts of P. arizonense were analysed for secondary metabolites and austalides, pyripyropenes, tryptoquivalines, fumagillin, pseurotin A, curvulinic acid and xanthoepocin were detected. A comparative analysis against known pathways enabled the proposal of biosynthetic gene clusters in P. arizonense responsible for the synthesis of all detected compounds except curvulinic acid. The capacity to produce biomass degrading enzymes and the identification of a high chemical diversity in secreted bioactive secondary metabolites, offers a broad range of potential industrial applications for the new species P. arizonense. The description and availability of the genome sequence of P. arizonense, further provides the basis for biotechnological exploitation of this species.
Penicillium arizonense, a new, genome sequenced fungal species, reveals a high chemical diversity in secreted metabolites

PubMed Central

Grijseels, Sietske; Nielsen, Jens Christian; Randelovic, Milica; Nielsen, Jens; Nielsen, Kristian Fog; Workman, Mhairi; Frisvad, Jens Christian

2016-01-01

A new soil-borne species belonging to the Penicillium section Canescentia is described, Penicillium arizonense sp. nov. (type strain CBS 141311T = IBT 12289T). The genome was sequenced and assembled into 33.7 Mb containing 12,502 predicted genes. A phylogenetic assessment based on marker genes confirmed the grouping of P. arizonense within section Canescentia. Compared to related species, P. arizonense proved to encode a high number of proteins involved in carbohydrate metabolism, in particular hemicellulases. Mining the genome for genes involved in secondary metabolite biosynthesis resulted in the identification of 62 putative biosynthetic gene clusters. Extracts of P. arizonense were analysed for secondary metabolites and austalides, pyripyropenes, tryptoquivalines, fumagillin, pseurotin A, curvulinic acid and xanthoepocin were detected. A comparative analysis against known pathways enabled the proposal of biosynthetic gene clusters in P. arizonense responsible for the synthesis of all detected compounds except curvulinic acid. The capacity to produce biomass degrading enzymes and the identification of a high chemical diversity in secreted bioactive secondary metabolites, offers a broad range of potential industrial applications for the new species P. arizonense. The description and availability of the genome sequence of P. arizonense, further provides the basis for biotechnological exploitation of this species. PMID:27739446

Characterization of a Defined 2,3,5,6-Tetrachlorobiphenyl-ortho-Dechlorinating Microbial Community by Comparative Sequence Analysis of Genes Coding for 16S rRNA

PubMed Central

Pulliam Holoman, Tracey R.; Elberson, Margaret A.; Cutter, Leah A.; May, Harold D.; Sowers, Kevin R.

1998-01-01

Defined microbial communities were developed by combining selective enrichment with molecular monitoring of total community genes coding for 16S rRNAs (16S rDNAs) to identify potential polychlorinated biphenyl (PCB)-dechlorinating anaerobes that ortho dechlorinate 2,3,5,6-tetrachlorobiphenyl. In enrichment cultures that contained a defined estuarine medium, three fatty acids, and sterile sediment, a Clostridium sp. was predominant in the absence of added PCB, but undescribed species in the δ subgroup of the class Proteobacteria, the low-G+C gram-positive subgroup, the Thermotogales subgroup, and a single species with sequence similarity to the deeply branching species Dehalococcoides ethenogenes were more predominant during active dechlorination of the PCB. Species with high sequence similarities to Methanomicrobiales and Methanosarcinales archaeal subgroups were predominant in both dechlorinating and nondechlorinating enrichment cultures. Deletion of sediment from PCB-dechlorinating enrichment cultures reduced the rate of dechlorination and the diversity of the community. Substitution of sodium acetate for the mixture of three fatty acids increased the rate of dechlorination, further reduced the community diversity, and caused a shift in the predominant species that included restriction fragment length polymorphism patterns not previously detected. Although PCB-dechlorinating cultures were methanogenic, inhibition of methanogenesis and elimination of the archaeal community by addition of bromoethanesulfonic acid only slightly inhibited dechlorination, indicating that the archaea were not required for ortho dechlorination of the congener. Deletion of Clostridium spp. from the community profile by addition of vancomycin only slightly reduced dechlorination. However, addition of sodium molybdate, an inhibitor of sulfate reduction, inhibited dechlorination and deleted selected species from the community profiles of the class Bacteria. 
With the exception of one 16S rDNA sequence that had the highest sequence similarity to the obligate perchloroethylene-dechlorinating Dehalococcoides, the 16S rDNA sequences associated with PCB ortho dechlorination had high sequence similarities to the δ, low-G+C gram-positive, and Thermotogales subgroups, which all include sulfur-, sulfate-, and/or iron(III)-respiring bacterial species. PMID:9726883
The Oral Microbiota in Health and Disease: An Overview of Molecular Findings.

PubMed

Siqueira, José F; Rôças, Isabela N

2017-01-01

Culture-independent nucleic acid technologies have been extensively applied to the analysis of oral bacterial communities associated with healthy and diseased conditions. These methods have confirmed and substantially expanded the findings from culture studies to reveal the oral microbial inhabitants and candidate pathogens associated with the major oral diseases. Over 1000 distinct bacterial species-level taxa have been identified in the oral cavity, and studies using next-generation DNA sequencing approaches indicate that the breadth of bacterial diversity may be much larger still. Nucleic acid technologies have also been helpful in profiling bacterial communities and identifying disease-related patterns. This chapter provides an overview of the diversity and taxonomy of oral bacteria associated with health and disease.
Isolation and characterization of antigen-specific alpaca (Lama pacos) VHH antibodies by biopanning followed by high-throughput sequencing.

PubMed

Miyazaki, Nobuo; Kiyose, Norihiko; Akazawa, Yoko; Takashima, Mizuki; Hagihara, Yosihisa; Inoue, Naokazu; Matsuda, Tomonari; Ogawa, Ryu; Inoue, Seiya; Ito, Yuji

2015-09-01

The antigen-binding domain of camelid dimeric heavy chain antibodies, known as VHH or Nanobody, has much potential in pharmaceutical and industrial applications. To establish the isolation process of antigen-specific VHH, a VHH phage library was constructed with a diversity of 8.4 × 10^7 from cDNA of peripheral blood mononuclear cells of an alpaca (Lama pacos) immunized with a fragment of IZUMO1 (IZUMO1PFF) as a model antigen. By conventional biopanning, 13 antigen-specific VHHs were isolated. The amino acid sequences of these VHHs, designated as N-group VHHs, were very similar to each other (>93% identity). To find more diverse antibodies, we performed high-throughput sequencing (HTS) of VHH genes. By comparing the frequency of each sequence before and after biopanning, we identified the sequences whose frequencies were increased by biopanning. The top 100 of these sequences were subjected to phylogenetic tree analysis. In total, 75% of them belonged to N-group VHHs, but the others were phylogenetically distant from N-group VHHs (non-N-group). Two of three VHHs selected from the non-N-group VHHs showed sufficient antigen-binding ability. These results suggested that biopanning followed by HTS provided a useful method for finding minor and diverse antigen-specific clones that could not be identified by conventional biopanning. © The Authors 2015. Published by Oxford University Press on behalf of the Japanese Biochemical Society. All rights reserved.
Diel fluctuations in the abundance and community diversity of coastal bacterioplankton assemblages over a tidal cycle.

PubMed

Olapade, Ola A

2012-01-01

The diel changes in abundance and community diversity of the bacterioplankton assemblages within the Pacific Ocean at a fixed location in Monterey Bay, California (USA) were examined with several culture-independent (i.e., nucleic acid staining, fluorescence in situ hybridization {FISH}, and 16S ribosomal RNA gene libraries) approaches over a tidal cycle. FISH analyses revealed the quantitative predominance of bacterial members belonging to the Cytophaga-Flavobacterium cluster as well as two Proteobacteria (α- and γ-) subclasses within the bacterioplankton assemblages, more so during high tide (HT) and outgoing tide (OT) than during the other tidal events. Although the clone libraries showed that the majority of the sequences were similar to the 16S rRNA gene sequences of unknown bacteria (32% to 73%), the operational taxonomic units from members of the α-Proteobacteria, Bacteroidetes, Firmicutes, and Cyanobacteria were also well represented during the four tidal events examined. Comparatively, sequence diversity was highest in OT, lowest in low tide, and very similar between HT and incoming tide. The results indicate that the dynamics of bacterial occurrence and diversity appeared to be more pronounced during HT and OT, further indicative of the ecological importance of several environmental variables including temperature, light intensity, and nutrient availability that are also concurrently fluctuating during these tidal events in marine systems.
The wheat cytochrome oxidase subunit II gene has an intron insert and three radical amino acid changes relative to maize

PubMed Central

Bonen, Linda; Boer, Poppo H.; Gray, Michael W.

1984-01-01

We have determined the sequence of the wheat mitochondrial gene for cytochrome oxidase subunit II (COII) and find that its derived protein sequence differs from that of maize at only three amino acid positions. Unexpectedly, all three replacements are non-conservative ones. The wheat COII gene has a highly-conserved intron at the same position as in maize, but the wheat intron is 1.5 times longer because of an insert relative to its maize counterpart. Hybridization analysis of mitochondrial DNA from rye, pea, broad bean and cucumber indicates strong sequence conservation of COII coding sequences among all these higher plants. However, only rye and maize mitochondrial DNA show homology with wheat COII intron sequences and rye alone with intron-insert sequences. We find that a sequence identical to the region of the 5' exon corresponding to the transmembrane domain of the COII protein is present at a second genomic location in wheat mitochondria. These variations in COII gene structure and size, as well as the presence of repeated COII sequences, illustrate at the DNA sequence level, factors which contribute to higher plant mitochondrial DNA diversity and complexity. PMID:16453565
Distribution and diversity of Verrucomicrobia methanotrophs in geothermal and acidic environments.

PubMed

Sharp, Christine E; Smirnova, Angela V; Graham, Jaime M; Stott, Matthew B; Khadka, Roshan; Moore, Tim R; Grasby, Stephen E; Strack, Maria; Dunfield, Peter F

2014-06-01

Recently, methanotrophic members of the phylum Verrucomicrobia have been described, but little is known about their distribution in nature. We surveyed methanotrophic bacteria in geothermal springs and acidic wetlands via pyrosequencing of 16S rRNA gene amplicons. Putative methanotrophic Verrucomicrobia were found in samples covering a broad temperature range (22.5-81.6°C), but only in acidic conditions (pH 1.8-5.0) and only in geothermal environments, not in acidic bogs or fens. Phylogenetically, three 16S rRNA gene sequence clusters of putative methanotrophic Verrucomicrobia were observed. Those detected in high-temperature geothermal samples (44.1-81.6°C) grouped with known thermoacidiphilic 'Methylacidiphilum' isolates. A second group dominated in moderate-temperature geothermal samples (22.5-40.1°C) and a representative mesophilic methanotroph from this group was isolated (strain LP2A). Genome sequencing verified that strain LP2A possessed particulate methane monooxygenase, but its 16S rRNA gene sequence identity to 'Methylacidiphilum infernorum' strain V4 was only 90.6%. A third group clustered distantly with known methanotrophic Verrucomicrobia. Using pmoA-gene targeted quantitative polymerase chain reaction, two geothermal soil profiles showed a dominance of LP2A-like pmoA sequences in the cooler surface layers and 'Methylacidiphilum'-like pmoA sequences in deeper, hotter layers. Based on these results, there appears to be a thermophilic group and a mesophilic group of methanotrophic Verrucomicrobia. However, both were detected only in acidic geothermal environments. © 2014 Society for Applied Microbiology and John Wiley & Sons Ltd.
What can we learn about lyssavirus genomes using 454 sequencing?

PubMed

Höper, Dirk; Finke, Stefan; Freuling, Conrad M; Hoffmann, Bernd; Beer, Martin

2012-01-01

The main task of individual project number four, "Whole genome sequencing, virus-host adaptation, and molecular epidemiological analyses of lyssaviruses", within the network "Lyssaviruses--a potential re-emerging public health threat", is to provide high quality complete genome sequences from lyssaviruses. These sequences are analysed in depth with regard to the diversity of the viral populations as to both quasi-species and so-called defective interfering RNAs. Moreover, the sequence data will facilitate further epidemiological analyses, will provide insight into the evolution of lyssaviruses and will be the basis for the design of novel nucleic acid based diagnostics. The first results presented here indicate that not only can high quality full-length lyssavirus genome sequences be generated, but efficient analysis of the viral population also becomes feasible.
Optimizing the specificity of nucleic acid hybridization

PubMed Central

Zhang, David Yu; Chen, Sherry Xi; Yin, Peng

2014-01-01

The specific hybridization of complementary sequences is an essential property of nucleic acids, enabling diverse biological and biotechnological reactions and functions. However, the specificity of nucleic acid hybridization is compromised for long strands, except near the melting temperature. Here, we analytically derived the thermodynamic properties of a hybridization probe that would enable near-optimal single-base discrimination and perform robustly across diverse temperature, salt and concentration conditions. We rationally designed ‘toehold exchange’ probes that approximate these properties, and comprehensively tested them against five different DNA targets and 55 spurious analogues with energetically representative single-base changes (replacements, deletions and insertions). These probes produced discrimination factors between 3 and 100+ (median, 26). Without retuning, our probes function robustly from 10 °C to 37 °C, from 1 mM Mg2+ to 47 mM Mg2+, and with nucleic acid concentrations from 1 nM to 5 μM. Experiments with RNA also showed effective single-base change discrimination. PMID:22354435
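The trade-off described in this abstract, where strongly favourable hybridization discriminates poorly against single-base changes, can be illustrated with a minimal equilibrium sketch. It is not the authors' derivation: it assumes a generic exchange reaction X + PC <-> XC + P with equal starting concentrations of X and PC, and the free-energy values, the 25 °C default and the function names are illustrative assumptions.

```python
import math

R_KCAL = 1.987e-3  # gas constant in kcal/(mol*K)


def exchange_yield(delta_g, temp_c=25.0):
    """Equilibrium yield of X + PC <-> XC + P (assumed model).

    Assumes equal initial concentrations of X and PC and no products at
    the start, so K = x^2 / (c - x)^2 and the yield is sqrt(K) / (1 + sqrt(K)).
    delta_g is the standard free energy in kcal/mol (hypothetical input).
    """
    rt = R_KCAL * (temp_c + 273.15)
    root_k = math.exp(-delta_g / (2.0 * rt))  # sqrt(K), with K = exp(-dG/RT)
    return root_k / (1.0 + root_k)


def discrimination_factor(delta_g_correct, ddg, temp_c=25.0):
    """Ratio of yields for the correct target and a target destabilized by ddg."""
    correct = exchange_yield(delta_g_correct, temp_c)
    spurious = exchange_yield(delta_g_correct + ddg, temp_c)
    return correct / spurious


# Illustrative values only: a 4 kcal/mol penalty for a single-base change.
for dg in (-8.0, -2.0, 0.0):
    print(dg, round(discrimination_factor(dg, 4.0), 2))
```

In this toy model a reaction tuned to a standard free energy near zero separates the two targets far better than a strongly favourable one, which is the qualitative point of the abstract.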
Multiple DNA and protein sequence alignment on a workstation and a supercomputer.

PubMed

Tajima, K

1988-11-01

This paper describes a multiple alignment method using a workstation and supercomputer. The method is based on the alignment of a set of aligned sequences with the new sequence, and uses a recursive procedure of such alignment. The alignment is executed in a reasonable computation time on diverse levels from a workstation to a supercomputer, from the viewpoint of alignment results and computational speed by parallel processing. The application of the algorithm is illustrated by several examples of multiple alignment of 12 amino acid and DNA sequences of HIV (human immunodeficiency virus) env genes. Colour graphic programs on a workstation and parallel processing on a supercomputer are discussed.
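The abstract does not give the algorithm in detail, so the following is only a sketch of the pairwise building block that such progressive methods typically rest on: a standard global alignment score computed by dynamic programming (Needleman-Wunsch). It is not the paper's profile-to-sequence procedure, and the scoring values and function name are assumptions chosen for illustration.

```python
def global_alignment_score(a, b, match=1, mismatch=-1, gap=-2):
    """Best global alignment score of strings a and b (linear gap penalty).

    Keeps only the previous row of the dynamic-programming table, so
    memory use is proportional to len(b).
    """
    prev = [j * gap for j in range(len(b) + 1)]  # row for the empty prefix of a
    for i in range(1, len(a) + 1):
        curr = [i * gap]  # aligning a[:i] against an empty prefix of b
        for j in range(1, len(b) + 1):
            diag = prev[j - 1] + (match if a[i - 1] == b[j - 1] else mismatch)
            curr.append(max(diag, prev[j] + gap, curr[j - 1] + gap))
        prev = curr
    return prev[-1]


print(global_alignment_score("ACGT", "ACT"))  # three matches and one gap
```

A progressive multiple alignment would apply a step like this repeatedly, replacing one of the two strings with a summary of the sequences already aligned.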
Exploitation of the diverse insertion sequence element content of dairy Lactobacillus helveticus starters as a rapid method to identify different strains.

PubMed

Kaleta, Pawel; Callanan, Michael J; O'Callaghan, John; Fitzgerald, Gerald F; Beresford, Thomas P; Ross, R Paul

2009-10-01

The species Lactobacillus helveticus is a commonly used thermophilic starter and/or adjunct culture for Swiss and Cheddar cheese manufacture. Its use is normally associated with flavour improvement, which is known to be linked to culture traits such as rapid autolysis and high proteolytic activity. The genome of the commercial strain, DPC4571, was recently sequenced and found to be rich in IS sequences in terms of both abundance (213 intact) and diversity (21 types). Given this unique diversity for a lactic acid bacterium, we investigated whether PCR-based IS fingerprinting could be used as a discriminatory tool to distinguish between different strains of Lb. helveticus. A set of ten primers targeting five of the most numerous groups (ISL1201, ISLhe65, ISLhe2, ISLhe15 and ISL2) of IS elements was designed. Multiplex-PCR with all primers resulted in 1-12 discrete amplicons for each strain tested. The resultant fingerprints (in the 0.5 kb-3 kb range) were found to be strain specific and reproducible. This approach thus provides a valuable method to distinguish between Lb. helveticus strains while giving some indication of the relative abundance of IS sequences in each strain.
Evidence for Interspecies Gene Transfer in the Evolution of 2,4-Dichlorophenoxyacetic Acid Degraders

PubMed Central

McGowan, Catherine; Fulthorpe, Roberta; Wright, Alice; Tiedje, J. M.

1998-01-01

Small-subunit ribosomal DNA (SSU rDNA) from 20 phenotypically distinct strains of 2,4-dichlorophenoxyacetic acid (2,4-D)-degrading bacteria was partially sequenced, yielding 18 unique strains belonging to members of the alpha, beta, and gamma subgroups of the class Proteobacteria. To understand the origin of 2,4-D degradation in this diverse collection, the first gene in the 2,4-D pathway, tfdA, was sequenced. The sequences fell into three unique classes found in various members of the beta and gamma subgroups of Proteobacteria. None of the α-Proteobacteria yielded tfdA PCR products. A comparison of the dendrogram of the tfdA genes with that of the SSU rDNA genes demonstrated incongruency in phylogenies, and hence 2,4-D degradation must have originated from gene transfer between species. Only those strains with tfdA sequences highly similar to the tfdA sequence of strain JMP134 (tfdA class I) transferred all the 2,4-D genes and conferred the 2,4-D degradation phenotype to a Burkholderia cepacia recipient. PMID:9758850
Generation and reactivation of T-cell receptor A joining region pseudogenes in primates

DOE Office of Scientific and Technical Information (OSTI.GOV)

Thiel, C.; Lanchbury, J.S.; Otting, N.

1996-06-01

Tandemly duplicated T-cell receptor (Tcr) AJ (Jα) segments contribute significantly to TCRA chain junctional region diversity in mammals. Since only limited data exists on TCRA diversity in nonhuman primates, we examined the TCRAJ regions of 37 chimpanzee and 71 rhesus macaque TCRA cDNA clones derived from inverse polymerase chain reaction on peripheral blood mononuclear cell cDNA of healthy animals. Twenty-five different TCRAJ regions were characterized in the chimpanzee and 36 in the rhesus macaque. Each bears a close structural relationship to an equivalent human TCRAJ region. Conserved amino acid motifs are shared between all three species. There are indications that differences between nonhuman primates and humans exist in the generation of TCRAJ pseudogenes. The nucleotide and amino acid sequences of the various characterized TCRAJ of each species are reported and we compare our results to the available information on human genomic sequences. Although we provide evidence of dynamic processes modifying TCRAJ segments during primate evolution, their repertoire and primary structure appears to be relatively conserved. 21 refs., 2 figs.
Simple-MSSM: a simple and efficient method for simultaneous multi-site saturation mutagenesis.

PubMed

Cheng, Feng; Xu, Jian-Miao; Xiang, Chao; Liu, Zhi-Qiang; Zhao, Li-Qing; Zheng, Yu-Guo

2017-04-01

To develop a practically simple and robust multi-site saturation mutagenesis (MSSM) method that enables simultaneous recombination of amino acid positions for focused mutant library generation. A general restriction enzyme-free and ligase-free MSSM method (Simple-MSSM) was developed, based on prolonged overlap extension PCR (POE-PCR) and Simple Cloning techniques. As a proof of principle of Simple-MSSM, the gene of eGFP (enhanced green fluorescent protein) was used as a template gene for simultaneous mutagenesis of five codons. Forty-eight randomly selected clones were sequenced. Sequencing revealed that all 48 clones showed at least one mutant codon (mutation efficiency = 100%), and 46 of the 48 clones had mutations at all five codons. The obtained diversities at these five codons are 27, 24, 26, 26 and 22, respectively, which correspond to 84, 75, 81, 81 and 69% of the theoretical diversity offered by NNK-degeneration (32 codons; NNK, K = T or G). The enzyme-free Simple-MSSM method can simultaneously and efficiently saturate five codons within one day, and therefore avoids missing interactions between residues in interacting amino acid networks.
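The percentages quoted in this abstract follow directly from the size of the NNK codon set. As a small check, the sketch below enumerates the NNK codons (N = A, C, G or T; K = G or T) and expresses the reported per-position diversities as a share of that set. The function names are illustrative and not taken from the paper.

```python
from itertools import product


def nnk_codons():
    """All codons matching NNK: any base at positions 1 and 2, G or T at position 3."""
    return ["".join(codon) for codon in product("ACGT", "ACGT", "GT")]


def percent_of_nnk(observed):
    """Observed distinct codons as a rounded percentage of the NNK set."""
    return round(100 * observed / len(nnk_codons()))


# Diversities reported for the five mutated eGFP codons.
observed_counts = [27, 24, 26, 26, 22]
coverage = [percent_of_nnk(n) for n in observed_counts]
print(len(nnk_codons()), coverage)  # 32 [84, 75, 81, 81, 69]
```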
Carboxylic acid reductase enzymes (CARs).

PubMed

Winkler, Margit

2018-04-01

Carboxylate reductases (CARs) are emerging as valuable catalysts for the selective one-step reduction of carboxylic acids to their corresponding aldehydes. The substrate scope of CARs is exceptionally broad and offers potential for their application in diverse synthetic processes. Two major fields of application are the preparation of aldehydes as end products for the flavor and fragrance sector and the integration of CARs in cascade reactions with aldehydes as the key intermediates. The latest applications of CARs are dominated by in vivo cascades and chemo-enzymatic reaction sequences. The challenge to fully exploit product selectivity is discussed. Recent developments in the characterization of CARs are summarized, with a focus on aspects related to the domain architecture and protein sequences of CAR enzymes. Copyright © 2017 Elsevier Ltd. All rights reserved.
Mining on scorpion venom biodiversity.

PubMed

Rodríguez de la Vega, Ricardo C; Schwartz, Elisabeth F; Possani, Lourival D

2010-12-15

Scorpion venoms are complex mixtures of dozens or even hundreds of distinct proteins, many of which are inter-genome active elements. Fifty years after the first scorpion toxin sequences were determined, chromatography-assisted purification followed by automated protein sequencing or gene cloning, on a case-by-case basis, accumulated nearly 250 amino acid sequences of scorpion venom components. A vast majority of the available sequences correspond to proteins adopting a common three-dimensional fold, whose ion channel modulating functions have been firmly established or could be confidently inferred. However, the actual molecular diversity contained in scorpion venoms (as revealed by bioassay-driven purification, some unexpected activities of "canonical" neurotoxins and even serendipitous discoveries) is much larger than those "canonical" toxin types. In the last few years, mining the molecular diversity contained in scorpion venoms has been assisted by high-throughput mass spectrometry techniques and large-scale DNA sequencing, collectively accounting for the more than twofold increase in the number of known sequences of scorpion venom components (now reaching 500 unique sequences). This review, from a comparative perspective, deals with recent data obtained by proteomic and transcriptomic studies on scorpion venoms and venom glands. Altogether, these studies reveal a large contribution of non-canonical venom components, which would account for more than half of the total protein diversity of any scorpion venom. Besides aiding a better understanding of scorpion venom biology, whether in the context of venom function or within the venom gland itself, these "novel" venom components certainly are an interesting source of bioactive proteins, whose characterization is worth pursuing. Copyright © 2009 Elsevier Ltd. All rights reserved.
A Structure-Based Classification of Class A β-Lactamases, a Broadly Diverse Family of Enzymes

PubMed Central

Slama, Patrick; Dény, Paul; Labia, Roger

2015-01-01

For medical biologists, sequencing has become a commonplace technique to support diagnosis. Rapid changes in this field have led to the generation of large amounts of data, which are not always correctly listed in databases. This is particularly true for data concerning class A β-lactamases, a group of key antibiotic resistance enzymes produced by bacteria. Many genomes have been reported to contain putative β-lactamase genes, which can be compared with representative types. We analyzed several hundred amino acid sequences of class A β-lactamase enzymes for phylogenetic relationships, the presence of specific residues, and cluster patterns. A clear distinction was first made between DD-peptidases and class A enzymes based on a small number of residues (S70, K73, P107, 130SDN132, G144, E166, 234K/R, 235T/S, and 236G [Ambler numbering]). Other residues clearly separated two main branches, which we named subclasses A1 and A2. Various clusters were identified on the major branch (subclass A1) on the basis of signature residues associated with catalytic properties (e.g., limited-spectrum β-lactamases, extended-spectrum β-lactamases, and carbapenemases). For subclass A2 enzymes (e.g., CfxA, CIA-1, CME-1, PER-1, and VEB-1), 43 conserved residues were characterized, and several significant insertions were detected. This diversity in the amino acid sequences of β-lactamases must be taken into account to ensure that new enzymes are accurately identified. However, with the exception of PER types, this diversity is poorly represented in existing X-ray crystallographic data. PMID:26511485
Comparative genomics reveals high biological diversity and specific adaptations in the industrially and medically important fungal genus Aspergillus.

PubMed

de Vries, Ronald P; Riley, Robert; Wiebenga, Ad; Aguilar-Osorio, Guillermo; Amillis, Sotiris; Uchima, Cristiane Akemi; Anderluh, Gregor; Asadollahi, Mojtaba; Askin, Marion; Barry, Kerrie; Battaglia, Evy; Bayram, Özgür; Benocci, Tiziano; Braus-Stromeyer, Susanna A; Caldana, Camila; Cánovas, David; Cerqueira, Gustavo C; Chen, Fusheng; Chen, Wanping; Choi, Cindy; Clum, Alicia; Dos Santos, Renato Augusto Corrêa; Damásio, André Ricardo de Lima; Diallinas, George; Emri, Tamás; Fekete, Erzsébet; Flipphi, Michel; Freyberg, Susanne; Gallo, Antonia; Gournas, Christos; Habgood, Rob; Hainaut, Matthieu; Harispe, María Laura; Henrissat, Bernard; Hildén, Kristiina S; Hope, Ryan; Hossain, Abeer; Karabika, Eugenia; Karaffa, Levente; Karányi, Zsolt; Kraševec, Nada; Kuo, Alan; Kusch, Harald; LaButti, Kurt; Lagendijk, Ellen L; Lapidus, Alla; Levasseur, Anthony; Lindquist, Erika; Lipzen, Anna; Logrieco, Antonio F; MacCabe, Andrew; Mäkelä, Miia R; Malavazi, Iran; Melin, Petter; Meyer, Vera; Mielnichuk, Natalia; Miskei, Márton; Molnár, Ákos P; Mulé, Giuseppina; Ngan, Chew Yee; Orejas, Margarita; Orosz, Erzsébet; Ouedraogo, Jean Paul; Overkamp, Karin M; Park, Hee-Soo; Perrone, Giancarlo; Piumi, Francois; Punt, Peter J; Ram, Arthur F J; Ramón, Ana; Rauscher, Stefan; Record, Eric; Riaño-Pachón, Diego Mauricio; Robert, Vincent; Röhrig, Julian; Ruller, Roberto; Salamov, Asaf; Salih, Nadhira S; Samson, Rob A; Sándor, Erzsébet; Sanguinetti, Manuel; Schütze, Tabea; Sepčić, Kristina; Shelest, Ekaterina; Sherlock, Gavin; Sophianopoulou, Vicky; Squina, Fabio M; Sun, Hui; Susca, Antonia; Todd, Richard B; Tsang, Adrian; Unkles, Shiela E; van de Wiele, Nathalie; van Rossen-Uffink, Diana; Oliveira, Juliana Velasco de Castro; Vesth, Tammi C; Visser, Jaap; Yu, Jae-Hyuk; Zhou, Miaomiao; Andersen, Mikael R; Archer, David B; Baker, Scott E; Benoit, Isabelle; Brakhage, Axel A; Braus, Gerhard H; Fischer, Reinhard; Frisvad, Jens C; Goldman, Gustavo H; Houbraken, Jos; Oakley, Berl; Pócsi, István; Scazzocchio, Claudio; Seiboth, Bernhard; vanKuyk, Patricia A; Wortman, Jennifer; Dyer, Paul S; Grigoriev, Igor V

2017-02-14

The fungal genus Aspergillus is of critical importance to humankind. Species include those with industrial applications, important pathogens of humans, animals and crops, a source of potent carcinogenic contaminants of food, and an important genetic model. The genome sequences of eight aspergilli have already been explored to investigate aspects of fungal biology, raising questions about evolution and specialization within this genus. We have generated genome sequences for ten novel, highly diverse Aspergillus species and compared these in detail to sister and more distant genera. Comparative studies of key aspects of fungal biology, including primary and secondary metabolism, stress response, biomass degradation, and signal transduction, revealed both conservation and diversity among the species. Observed genomic differences were validated with experimental studies. This revealed several highlights, such as the potential for sex in asexual species, organic acid production genes being a key feature of black aspergilli, alternative approaches for degrading plant biomass, and indications for the genetic basis of stress response. A genome-wide phylogenetic analysis demonstrated in detail the relationship of the newly genome sequenced species with other aspergilli. Many aspects of biological differences between fungal species cannot be explained by current knowledge obtained from genome sequences. The comparative genomics and experimental study, presented here, allows for the first time a genus-wide view of the biological diversity of the aspergilli and in many, but not all, cases linked genome differences to phenotype. Insights gained could be exploited for biotechnological and medical applications of fungi.
Biologically active LIL proteins built with minimal chemical diversity

PubMed Central

Heim, Erin N.; Marston, Jez L.; Federman, Ross S.; Edwards, Anne P. B.; Karabadzhak, Alexander G.; Petti, Lisa M.; Engelman, Donald M.; DiMaio, Daniel

2015-01-01

We have constructed 26-amino acid transmembrane proteins that specifically transform cells but consist of only two different amino acids. Most proteins are long polymers of amino acids with 20 or more chemically distinct side-chains. The artificial transmembrane proteins reported here are the simplest known proteins with specific biological activity, consisting solely of an initiating methionine followed by specific sequences of leucines and isoleucines, two hydrophobic amino acids that differ only by the position of a methyl group. We designate these proteins containing leucine (L) and isoleucine (I) as LIL proteins. These proteins functionally interact with the transmembrane domain of the platelet-derived growth factor β-receptor and specifically activate the receptor to transform cells. Complete mutagenesis of these proteins identified individual amino acids required for activity, and a protein consisting solely of leucines, except for a single isoleucine at a particular position, transformed cells. These surprisingly simple proteins define the minimal chemical diversity sufficient to construct proteins with specific biological activity and change our view of what can constitute an active protein in a cellular context. PMID:26261320
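To give a sense of how small the chemical alphabet of these proteins is, the sketch below counts the sequences that fit the description in the abstract. It assumes that every position after the initiating methionine is either leucine or isoleucine, so a 26-residue protein has 25 variable positions; the function names are hypothetical.

```python
from itertools import product


def lil_library_size(length):
    """Number of sequences of the given length: Met first, then L or I at every position."""
    return 2 ** (length - 1)


def lil_sequences(length):
    """Yield every such sequence in one-letter code (practical only for short lengths)."""
    for tail in product("LI", repeat=length - 1):
        yield "M" + "".join(tail)


print(lil_library_size(26))    # 33554432 possible 26-residue LIL sequences
print(list(lil_sequences(3)))  # ['MLL', 'MLI', 'MIL', 'MII']
```

With all 20 standard amino acids allowed at the same 25 positions the count would be 20**25, which is why a two-letter alphabet makes mutagenesis of the whole sequence tractable.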
Bacterial Profile of Dentine Caries and the Impact of pH on Bacterial Population Diversity

PubMed Central

Kianoush, Nima; Adler, Christina J.; Nguyen, Ky-Anh T.; Browne, Gina V.; Simonian, Mary; Hunter, Neil

2014-01-01

Dental caries is caused by the release of organic acids from fermentative bacteria, which results in the dissolution of hydroxyapatite matrices of enamel and dentine. While low environmental pH is proposed to cause a shift in the consortium of oral bacteria, favouring the development of caries, the impact of this variable has been overlooked in microbial population studies. This study aimed to detail the zonal composition of the microbiota associated with carious dentine lesions with reference to pH. We used 454 sequencing of the 16S rRNA gene (V3–V4 region) to compare microbial communities in layers ranging in pH from 4.5 to 7.8 from 25 teeth with advanced dentine caries. Pyrosequencing of the amplicons yielded 449,762 sequences. Nine phyla, 97 genera and 409 species were identified from the quality-filtered, de-noised and chimera-free sequences. Among the microbiota associated with dentinal caries, the most abundant taxa included Lactobacillus sp., Prevotella sp., Atopobium sp., Olsenella sp. and Actinomyces sp. We found a disparity between microbial communities localised at acidic versus neutral pH strata. Acidic conditions were associated with low diversity microbial populations, with Lactobacillus species, including L. fermentum, L. rhamnosus and L. crispatus, being prominent. In comparison, the distinctive species of a more diverse flora associated with neutral pH regions of carious lesions included Alloprevotella tannerae, Leptothrix sp., Sphingomonas sp. and Streptococcus anginosus. While certain bacteria were affected by the pH gradient, we also found that ∼60% of the taxa associated with caries were present across the investigated pH range, representing a substantial core. We demonstrated that some bacterial species implicated in caries progression show selective clustering with respect to pH gradient, providing a basis for specific therapeutic strategies. PMID:24675997
Identification of nitrogen-fixing genes and gene clusters from metagenomic library of acid mine drainage.

PubMed

Dai, Zhimin; Guo, Xue; Yin, Huaqun; Liang, Yili; Cong, Jing; Liu, Xueduan

2014-01-01

Biological nitrogen fixation is an essential function of acid mine drainage (AMD) microbial communities. However, most acidophiles in AMD environments are uncultured microorganisms and little is known about the diversity of nitrogen-fixing genes and structure of nif gene cluster in AMD microbial communities. In this study, we used metagenomic sequencing to isolate nif genes in the AMD microbial community from Dexing Copper Mine, China. Meanwhile, a metagenome microarray containing 7,776 large-insertion fosmids was constructed to screen novel nif gene clusters. Metagenomic analyses revealed that 742 sequences were identified as nif genes including structural subunit genes nifH, nifD, nifK and various additional genes. The AMD community is massively dominated by the genus Acidithiobacillus. However, the phylogenetic diversity of nitrogen-fixing microorganisms is much higher than previously thought in the AMD community. Furthermore, a 32.5-kb genomic sequence harboring nif, fix and associated genes was screened by metagenome microarray. Comparative genome analysis indicated that most nif genes in this cluster are most similar to those of Herbaspirillum seropedicae, but the organization of the nif gene cluster had significant differences from H. seropedicae. Sequence analysis and reverse transcription PCR also suggested that distinct transcription units of nif genes exist in this gene cluster. nifQ gene falls into the same transcription unit with fixABCX genes, which have not been reported in other diazotrophs before. All of these results indicated that more novel diazotrophs survive in the AMD community. PMID:24498417
Capturing the genetic makeup of the active microbiome in situ

DOE PAGES

Singer, Esther; Wagner, Michael; Woyke, Tanja

2017-06-02

More than any other technology, nucleic acid sequencing has enabled microbial ecology studies to be complemented with the data volumes necessary to capture the extent of microbial diversity and dynamics in a wide range of environments. In order to truly understand and predict environmental processes, however, the distinction between active, inactive and dead microbial cells is critical. Also, experimental designs need to be sensitive toward varying population complexity and activity, and temporal as well as spatial scales of process rates. There are a number of approaches, including single-cell techniques, which were designed to study in situ microbial activity and that have been successively coupled to nucleic acid sequencing. The exciting new discoveries regarding in situ microbial activity provide evidence that future microbial ecology studies will indispensably rely on techniques that specifically capture members of the microbiome active in the environment. Herein, we review those currently used activity-based approaches that can be directly linked to shotgun nucleic acid sequencing, evaluate their relevance to ecology studies, and discuss future directions.
Considerable MHC Diversity Suggests That the Functional Extinction of Baiji Is Not Related to Population Genetic Collapse

PubMed Central

Xu, Shixia; Ju, Jianfeng; Zhou, Xuming; Wang, Lian; Zhou, Kaiya; Yang, Guang

2012-01-01

To further extend our understanding of the mechanism causing the current nearly extinct status of the baiji (Lipotes vexillifer), one of the most critically endangered species in the world, genetic diversity at the major histocompatibility complex (MHC) class II DRB locus was investigated in the baiji. Nine highly divergent DRB alleles were identified in 17 samples, with an average of 28.4 (13.2%) nucleotide differences and 16.7 (23.5%) amino acid differences between alleles. The unexpectedly high levels of DRB allelic diversity in the baiji may partly be attributable to its evolutionary adaptations to the freshwater environment, which is regarded as having a higher parasite diversity than the marine environment. In addition, balancing selection was found to be the main mechanism generating sequence diversity at the baiji DRB gene. Considerable sequence variation at the adaptive MHC genes, despite significant loss of neutral genetic variation in the baiji genome, might suggest that intense selection has overpowered random genetic drift as the main evolutionary force, which further suggests that the critically endangered or nearly extinct status of the baiji is not an outcome of genetic collapse. PMID:22272349
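The average differences between alleles quoted in this abstract are the kind of figure a pairwise comparison produces. The following is a minimal sketch only: it assumes pre-aligned alleles of equal length and plain mismatch counting (the abstract does not state which distance measure was used), and the function names and sequences are hypothetical placeholders, not baiji data.

```python
from itertools import combinations


def pairwise_difference(a, b):
    """Count mismatching positions between two aligned, equal-length sequences."""
    if len(a) != len(b):
        raise ValueError("sequences must be aligned to the same length")
    return sum(x != y for x, y in zip(a, b))


def mean_pairwise_difference(alleles):
    """Return (mean number of differences, mean as a percentage of alignment length)."""
    pairs = list(combinations(alleles, 2))
    if not pairs:
        raise ValueError("need at least two alleles")
    diffs = [pairwise_difference(a, b) for a, b in pairs]
    mean = sum(diffs) / len(diffs)
    return mean, 100.0 * mean / len(alleles[0])


# Made-up aligned alleles, for illustration only.
toy_alleles = ["ACGTAC", "ACGTTC", "TCGTAC"]
mean_diff, mean_pct = mean_pairwise_difference(toy_alleles)
```

The same functions apply unchanged to deduced amino acid sequences, since they only compare characters position by position.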
A comprehensive bioinformatic analysis of hepatitis D virus full-length genomes.

PubMed

Delfino, C M; Cerrudo, C S; Biglione, M; Oubiña, J R; Ghiringhelli, P D; Mathet, V L

2018-02-06

In association with hepatitis B virus (HBV), hepatitis delta virus (HDV) is a subviral agent that may promote severe acute and chronic forms of liver disease. Based on the percentage of nucleotide identity of the genome, HDV was initially classified into three genotypes. However, since 2006, the original classification has been further expanded into eight clades/genotypes. The intergenotype divergence may be as high as 35%-40% over the entire RNA genome, whereas sequence heterogeneity among the isolates of a given genotype is <20%; furthermore, HDV recombinants have been clearly demonstrated. The genetic diversity of HDV is related to the geographic origin of the isolates. This study shows the first comprehensive bioinformatic analysis of the complete available set of HDV sequences, using both nucleotide and protein phylogenies (based on an evolutionary model selection, gamma distribution estimation, tree inference and phylogenetic distance estimation), protein composition analysis and comparison (based on the presence of invariant residues, molecular signatures, amino acid frequencies and mono- and di-amino acid compositional distances), as well as amino acid changes in sequence evolution. Taking into account the congruent and consistent results of both nucleotide and amino acid analyses of GenBank available sequences (recorded as of January, 2017), we propose that the eight hepatitis D virus genotypes may be grouped into three large genogroups fully supported by their shared characteristics. © 2018 John Wiley & Sons Ltd.
Genetic diversity of merozoite surface antigens in Babesia bovis detected from Sri Lankan cattle.

PubMed

Sivakumar, Thillaiampalam; Okubo, Kazuhiro; Igarashi, Ikuo; de Silva, Weligodage Kumarawansa; Kothalawala, Hemal; Silva, Seekkuge Susil Priyantha; Vimalakumar, Singarayar Caniciyas; Meewewa, Asela Sanjeewa; Yokoyama, Naoaki

2013-10-01

Babesia bovis, the causative agent of severe bovine babesiosis, is endemic in Sri Lanka. The live attenuated vaccine (K-strain), which was introduced in the early 1990s, has been used to immunize cattle populations in endemic areas of the country. The present study was undertaken to determine the genetic diversity of merozoite surface antigens (MSAs) in B. bovis isolates from Sri Lankan cattle, and to compare the gene sequences obtained from such isolates against those of the K-strain. Forty-four bovine blood samples isolated from different geographical regions of Sri Lanka and judged to be B. bovis-positive by PCR screening were used to amplify MSAs (MSA-1, MSA-2c, MSA-2a1, MSA-2a2, and MSA-2b), AMA-1, and 12D3 genes from parasite DNA. Although the AMA-1 and 12D3 gene sequences were highly conserved among the Sri Lankan isolates, the MSA gene sequences from the same isolates were highly diverse. Sri Lankan MSA-1, MSA-2c, MSA-2a1, MSA-2a2, and MSA-2b sequences clustered within 5, 2, 4, 1, and 9 different clades in the gene phylograms, respectively, while the minimum similarity values among the deduced amino acid sequences of these genes were 36.8%, 68.7%, 80.3%, 100%, and 68.3%, respectively. In the phylograms, none of the Sri Lankan sequences fell within clades containing the respective K-strain sequences. Additionally, the similarity values for MSA-1 and MSA-2c were 40-61.8% and 90.9-93.2% between the Sri Lankan isolates and the K-strain, respectively, while the K-strain MSA-2a/b sequence shared 64.5-69.8%, 69.3%, and 70.5-80.3% similarities with the Sri Lankan MSA-2a1, MSA-2a2, and MSA-2b sequences, respectively. The present study has shown that genetic diversity among MSAs of Sri Lankan B. bovis isolates is very high, and that the sequences of field isolates diverged genetically from the K-strain. Copyright © 2013 Elsevier B.V. All rights reserved.
Candidate new rotavirus species in Schreiber's bats, Serbia.

PubMed

Bányai, Krisztián; Kemenesi, Gábor; Budinski, Ivana; Földes, Fanni; Zana, Brigitta; Marton, Szilvia; Varga-Kugler, Renáta; Oldal, Miklós; Kurucz, Kornélia; Jakab, Ferenc

2017-03-01

The genus Rotavirus comprises eight species designated A to H and one tentative species, Rotavirus I. In a virus metagenomic analysis of Schreiber's bats sampled in Serbia in 2014, we obtained sequences likely representing a novel rotavirus species. Whole genome sequencing and phylogenetic analysis classified the representative strain into a tentative tenth rotavirus species, which we provisionally called Rotavirus J. The novel virus shared a maximum of 50% amino acid sequence identity within the VP6 gene to currently known members of the genus. This study extends our understanding of the genetic diversity of rotaviruses in bats. Copyright © 2016 Elsevier B.V. All rights reserved.
Positive selection of digestive Cys proteases in herbivorous Coleoptera.

PubMed

Vorster, Juan; Rasoolizadeh, Asieh; Goulet, Marie-Claire; Cloutier, Conrad; Sainsbury, Frank; Michaud, Dominique

2015-10-01

Positive selection is thought to contribute to the functional diversification of insect-inducible protease inhibitors in plants in response to selective pressures exerted by the digestive proteases of their herbivorous enemies. Here we assessed whether a reciprocal evolutionary process takes place on the insect side, and whether ingestion of a positively selected plant inhibitor may translate into a measurable rebalancing of midgut proteases in vivo. Midgut Cys proteases of herbivorous Coleoptera, including the major pest Colorado potato beetle (Leptinotarsa decemlineata), were first compared using a codon-based evolutionary model to look for the occurrence of hypervariable, positively selected amino acid sites among the tested sequences. Hypervariable sites were found, distributed within, or close to, amino acid regions interacting with Cys-type inhibitors of the plant cystatin protein family. A close examination of L. decemlineata sequences indicated a link between their assignment to protease functional families and amino acid identity at positively selected sites. A function-diversifying role for positive selection was further suggested empirically by in vitro protease assays and a shotgun proteomic analysis of L. decemlineata Cys proteases showing a differential rebalancing of protease functional family complements in larvae fed single variants of a model cystatin mutated at positively selected amino acid sites. These data confirm overall the occurrence of hypervariable, positively selected amino acid sites in herbivorous Coleoptera digestive Cys proteases. They also support the idea of an adaptive role for positive selection, useful to generate functionally diverse proteases in insect herbivores ingesting functionally diverse, rapidly evolving dietary cystatins. Copyright © 2015 Elsevier Ltd. All rights reserved.
Assessment of the microbial diversity of Brazilian kefir grains by PCR-DGGE and pyrosequencing analysis.

PubMed

Leite, A M O; Mayo, B; Rachid, C T C C; Peixoto, R S; Silva, J T; Paschoalin, V M F; Delgado, S

2012-09-01

The microbial diversity and community structure of three different kefir grains from different parts of Brazil were examined via the combination of two culture-independent methods: PCR-denaturing gradient gel electrophoresis (PCR-DGGE) and pyrosequencing. PCR-DGGE showed Lactobacillus kefiranofaciens and Lactobacillus kefiri to be the major bacterial populations in all three grains. The yeast community was dominated by Saccharomyces cerevisiae. Pyrosequencing produced a total of 14,314 partial 16S rDNA sequence reads from the three grains. Sequence analysis grouped the reads into three phyla, of which Firmicutes was dominant. Members of the genus Lactobacillus were the most abundant operational taxonomic units (OTUs) in all samples, accounting for up to 96% of the sequences. OTUs belonging to other lactic and acetic acid bacteria genera, such as Lactococcus, Leuconostoc, Streptococcus and Acetobacter, were also identified at low levels. Two of the grains showed identical DGGE profiles and a similar number of OTUs, while the third sample showed the highest diversity by both techniques. Pyrosequencing allowed the identification of bacteria that were present in small numbers and rarely associated with the microbial community of this complex ecosystem. Copyright © 2012 Elsevier Ltd. All rights reserved.
Specificity, Privacy, and Degeneracy in the CD4 T Cell Receptor Repertoire Following Immunization

PubMed Central

Sun, Yuxin; Best, Katharine; Cinelli, Mattia; Heather, James M.; Reich-Zeliger, Shlomit; Shifrut, Eric; Friedman, Nir; Shawe-Taylor, John; Chain, Benny

2017-01-01

T cells recognize antigen using a large and diverse set of antigen-specific receptors created by a complex process of imprecise somatic cell gene rearrangements. In response to antigen/receptor binding, specific T cells then divide to form memory and effector populations. We apply high-throughput sequencing to investigate the global changes in T cell receptor sequences following immunization with ovalbumin (OVA) and adjuvant, to understand how adaptive immunity achieves specificity. Each immunized mouse contained a predominantly private but related set of expanded CDR3β sequences. We used machine learning to identify common patterns which distinguished repertoires from mice immunized with adjuvant with and without OVA. The CDR3β sequences were deconstructed into sets of overlapping contiguous amino acid triplets. The frequencies of these motifs were used to train the linear programming boosting (LPBoost) algorithm to classify between TCR repertoires. LPBoost could distinguish between the two classes of repertoire with accuracies above 80%, using a small subset of triplet sequences present at defined positions along the CDR3. The results suggest a model in which such motifs confer degenerate antigen specificity in the context of a highly diverse and largely private set of T cell receptors. PMID:28450864
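The triplet decomposition described in this abstract can be illustrated with a short sketch. It assumes the overlapping contiguous triplets are taken with a sliding window of step one and tagged with their start position; the function names and the example CDR3 strings are hypothetical, and the LPBoost classification step itself is not shown.

```python
from collections import Counter


def positional_triplets(cdr3):
    """Return (start position, triplet) for every overlapping 3-residue window."""
    return [(i, cdr3[i:i + 3]) for i in range(len(cdr3) - 2)]


def triplet_frequencies(repertoire):
    """Count triplets over all sequences and normalise to relative frequencies."""
    counts = Counter()
    for cdr3 in repertoire:
        counts.update(triplet for _, triplet in positional_triplets(cdr3))
    total = sum(counts.values())
    return {t: n / total for t, n in counts.items()} if total else {}


# Hypothetical CDR3 sequences, for illustration only.
example_repertoire = ["CASSLGQ", "CASSPGQ"]
freqs = triplet_frequencies(example_repertoire)
```

A frequency dictionary of this kind, computed per repertoire, is one plausible form of feature vector for a downstream classifier; keeping the start position alongside each triplet allows position-specific motifs to be examined as well.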
Solving the ecological puzzle of mycorrhizal associations using data from annotated collections and environmental samples - an example of saddle fungi.

PubMed

Hwang, Jonathan; Zhao, Qi; Yang, Zhu L; Wang, Zheng; Townsend, Jeffrey P

2015-08-01

The relation between ecological and genetic divergence of Helvella species (saddle fungi) has been perplexing. While a few species have been clearly demonstrated to be ectomycorrhizal fungi, ecological roles of many other species have been controversial, alternately considered as either saprotrophic or mycorrhizal. We applied SATé to build an inclusive deoxyribonucleic acid sequence alignment for the internal transcribed spacers (ITS) of annotated Helvella species and related environmental sequences. Phylogenetic informativeness of ITS and its regions were assessed using PhyDesign. Mycorrhizal lineages present a diversity of ecology, host type and geographic distribution. In two Helvella clades, no Helvella ITS sequences were recovered from root tips. Inclusion of environmental sequences in the ITS phylogeny from these sequences has the potential to link these data and reveal Helvella ecology. This study can serve as a model for revealing the diversity of relationships between unculturable fungi and their potential plant hosts. How non-mycorrhizal life styles within Helvella evolved will require expanded metagenomic investigation of soil and other environmental samples along with study of Helvella genomes. © 2015 Society for Applied Microbiology and John Wiley & Sons Ltd.
Diversity of thermophilic fungi in Tengchong Rehai National Park revealed by ITS nucleotide sequence analyses.

PubMed

Pan, Wen-Zheng; Huang, Xiao-Wei; Wei, Kang-Bi; Zhang, Chun-Mei; Yang, Dong-Mei; Ding, Jun-Mei; Zhang, Ke-Qin

2010-04-01

The geothermal sites near neutral and alkalescent thermal springs in Tengchong Rehai National Park were examined through a cultivation-dependent approach to determine the diversity of thermophilic fungi in these environments. Here, we collected soil samples in this area, plated them on agar media conducive to fungal growth, obtained pure cultures, and then employed internal transcribed spacer (ITS) sequencing combined with morphological analysis to identify thermophilic fungi to the species level. In total, 102 strains were isolated and identified as Rhizomucor miehei, Chaetomium sp., Talaromyces thermophilus, Talaromyces byssochlamydoides, Thermoascus aurantiacus Miehe var. levisporus, Thermomyces lanuginosus, Scytalidium thermophilum, Malbranchea flava, Myceliophthora sp. 1, Myceliophthora sp. 2, Myceliophthora sp. 3, and Coprinopsis sp. Two species, T. lanuginosus and S. thermophilum, were the dominant species, representing 34.78% and 28.26% of the sample, respectively. Our results indicated a greater diversity of thermophilic fungi in neutral and alkaline geothermal sites than in the acidic sites around hot springs reported in previous studies. Most of our strains thrived under alkaline growth conditions.
Molecular diversity of α-gliadin expressed genes in genetically contrasted spelt (Triticum aestivum ssp. spelta) accessions and comparison with bread wheat (T. aestivum ssp. aestivum) and related diploid Triticum and Aegilops species.

PubMed

Dubois, Benjamin; Bertin, Pierre; Mingeot, Dominique

2016-01-01

The gluten proteins of cereals such as bread wheat (Triticum aestivum ssp. aestivum) and spelt (T. aestivum ssp. spelta) are responsible for celiac disease (CD). The α-gliadins constitute the most immunogenic class of gluten proteins as they include four main T-cell stimulatory epitopes that affect CD patients. Spelt has been less studied than bread wheat and could constitute a source of valuable diversity. The objective of this work was to study the genetic diversity of spelt α-gliadin transcripts and to compare it with those of bread wheat. Genotyping data from 85 spelt accessions obtained with 19 simple sequence repeat (SSR) markers were used to select 11 contrasted accessions, from which 446 full open reading frame α-gliadin genes were cloned and sequenced, which revealed a high allelic diversity. High variations among the accessions were highlighted, in terms of the proportion of α-gliadin sequences from each of the three genomes (A, B and D), and their composition in the four T-cell stimulatory epitopes. An accession from Tajikistan stood out, having a particularly high proportion of α-gliadins from the B genome and a low immunogenic content. Even if no clear separation between spelt and bread wheat sequences was shown, spelt α-gliadins displayed specific features concerning e.g. the frequencies of some amino acid substitutions. Given this observation and the variations in toxicity revealed in the spelt accessions in this study, the high genetic diversity held in spelt germplasm collections could be a valuable resource in the development of safer varieties for CD patients.
Sequence diversity of hepatitis C virus 6a within the extended interferon sensitivity-determining region correlates with interferon-alpha/ribavirin treatment outcomes.

PubMed

Zhou, Daniel X M; Chan, Paul K S; Zhang, Tiejun; Tully, Damien C; Tam, John S

2010-10-01

Studies on the association between sequence variability of the interferon sensitivity-determining region (ISDR) of hepatitis C virus and the outcome of treatment have reached conflicting results. In this study, 25 patients infected with HCV 6a who had received interferon-alpha/ribavirin combination treatment were analyzed for sequence variations. Fourteen of them had full genome sequences obtained from a previous study, whereas the other 11 samples were sequenced for the extended ISDR (eISDR). This eISDR fragment covers 192 bp (64 amino acids) upstream and 201 bp (67 amino acids) downstream from the ISDR previously defined for HCV 1b. The comparison between interferon-alpha resistance and response groups for the amino acid mutations located in the full genome (6 and 8 patients respectively) as well as the mutations located in the eISDR (10 and 15 patients respectively) showed that the mutations I2160V, I2256V and V2292I (P<0.05) within the eISDR were significantly associated with resistance to treatment. However, the extent of amino acid variations within the previously defined ISDR was not associated with resistance to treatment as previously reported. Four amino acid variations located outside the eISDR, namely I248V (P=0.03-0.06) within E1, R445K (P=0.02-0.05) and S747T (P=0.03) within E2, and I861V (P=0.01) within NS2, may also be associated with treatment outcome, as identified by a prescreening of variations within 14 HCV 6a full genomes. (c) 2010 Elsevier B.V. All rights reserved.
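The flanking lengths quoted for the eISDR follow from three nucleotides per codon. The check below is a small sketch with a hypothetical helper name; it only restates the arithmetic given in the abstract.

```python
def residues_from_bp(n_bp):
    """Convert a coding-region length in base pairs to amino acid residues."""
    if n_bp % 3 != 0:
        raise ValueError("length is not a whole number of codons")
    return n_bp // 3


upstream_residues = residues_from_bp(192)    # 64
downstream_residues = residues_from_bp(201)  # 67
```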
Properties of the intracellular transient receptor potential (TRP) channel in yeast, Yvc1.

PubMed

Chang, Yiming; Schlenstedt, Gabriel; Flockerzi, Veit; Beck, Andreas

2010-05-17

Transient receptor potential (TRP) channels are found among mammals, flies, worms, ciliates, Chlamydomonas, and yeast but are absent in plants. These channels are believed to be tetramers of proteins containing six transmembrane domains (TMs). Their primary structures are diverse with sequence similarities only in some short amino acid sequence motifs mainly within sequences covering TM5, TM6, and adjacent domains. In the yeast genome, there is one gene encoding a TRP-like sequence. This protein forms an ion channel in the vacuolar membrane and is therefore called Yvc1 for yeast vacuolar conductance 1. In the following we summarize its prominent features. Copyright 2009 Federation of European Biochemical Societies. Published by Elsevier B.V. All rights reserved.
Soil Parameters Drive the Structure, Diversity and Metabolic Potentials of the Bacterial Communities Across Temperate Beech Forest Soil Sequences.

PubMed

Jeanbille, M; Buée, M; Bach, C; Cébron, A; Frey-Klett, P; Turpault, M P; Uroz, S

2016-02-01

Soil and climatic conditions as well as land cover and land management have been shown to strongly impact the structure and diversity of the soil bacterial communities. Here, we addressed, under the same land cover, the potential effect of the edaphic parameters on the soil bacterial communities, excluding potential confounding factors such as climate. To do this, we characterized two natural soil sequences occurring in the Montiers experimental site. Spatially distant soil samples were collected below Fagus sylvatica tree stands to assess the effect of soil sequences on the edaphic parameters, as well as the structure and diversity of the bacterial communities. Soil analyses revealed that the two soil sequences were characterized by higher pH and calcium and magnesium contents in the lower plots. Metabolic assays based on Biolog Ecoplates highlighted higher intensity and richness in usable carbon substrates in the lower plots than in the middle and upper plots, although no significant differences occurred in the abundance of bacterial and fungal communities along the soil sequences as assessed using quantitative PCR. Pyrosequencing analysis of 16S ribosomal RNA (rRNA) gene amplicons revealed that Proteobacteria, Acidobacteria and Bacteroidetes were the most abundantly represented phyla. Acidobacteria, Proteobacteria and Chlamydiae were significantly enriched in the most acidic and nutrient-poor soils compared to the Bacteroidetes, which were significantly enriched in the soils presenting the higher pH and nutrient contents. Interestingly, aluminium, nitrogen, calcium, nutrient availability and pH appeared to be the best predictors of the bacterial community structures along the soil sequences.
Transferring the Characteristics of Naturally Occurring and Biased Antibody Repertoires to Human Antibody Libraries by Trapping CDRH3 Sequences

PubMed Central

Venet, Sophie; Ravn, Ulla; Buatois, Vanessa; Gueneau, Franck; Calloud, Sébastien; Kosco-Vilbois, Marie; Fischer, Nicolas

2012-01-01

Antibody repertoires are characterized by diversity as they vary not only amongst individuals and post antigen exposure but also differ significantly between vertebrate species. Such plasticity can be exploited to generate human antibody libraries featuring hallmarks of these diverse repertoires. In this study, the focus was to capture CDRH3 sequences, as this region generally accounts for most of the interaction energy with antigen. Sequences from human as well as non-human sources were successfully integrated into human antibody libraries. Next generation sequencing of these libraries proved that the CDRH3 lengths and amino acid composition corresponded to the species of origin. Specific CDRH3 sequences, biased towards the recognition of a model antigen either by immunizing mice or by selecting with phage display, were then integrated into another set of libraries. From these antigen-biased libraries, highly potent antibodies were more frequently isolated, indicating that the characteristics of an immune repertoire are transferable via CDRH3 sequences into a human antibody library. Taken together, these data demonstrate that the properties of naturally or experimentally biased repertoires can be effectively harnessed for the generation of targeted human antibody libraries, substantially increasing the probability of isolating antibodies suitable for therapeutic and diagnostic applications. PMID:22937053
"Multiple partial recognitions in dynamic equilibrium" in the binding sites of proteins form the molecular basis of promiscuous recognition of structurally diverse ligands.

PubMed

Kohda, Daisuke

2018-04-01

Promiscuous recognition of ligands by proteins is as important as strict recognition in numerous biological processes. In living cells, many short, linear amino acid motifs function as targeting signals in proteins to specify the final destination of protein transport. In general, the targeting signal is defined by a consensus sequence containing wildcard characters, and hence is represented by diverse amino acid sequences. The classical lock-and-key or induced-fit/conformational selection mechanism may not cover all aspects of promiscuous recognition. On the basis of our crystallographic and NMR studies on the mitochondrial Tom20 protein-presequence interaction, we proposed a new hypothetical mechanism based on "a rapid equilibrium of multiple states with partial recognitions". This dynamic, multiple recognition mode enables the Tom20 receptor to recognize diverse mitochondrial presequences with nearly equal affinities. The plant Tom20 is evolutionarily unrelated to the animal Tom20 in our study, but is a functional homolog of the animal/fungal Tom20. NMR studies by another research group revealed that the presequence binding by the plant Tom20 was not fully explained by simple interaction modes, suggesting the presence of a similar dynamic, multiple recognition mode. Circumstantial evidence also suggested that similar dynamic mechanisms may be applicable to other promiscuous recognitions of signal peptides by the SRP54/Ffh and SecA proteins.
Isolation and Selection of Microalgal Strains from Natural Water Sources in Viet Nam with Potential for Edible Oil Production.

PubMed

Thao, Tran Yen; Linh, Dinh Thi Nhat; Si, Vo Chi; Carter, Taylor W; Hill, Russell T

2017-06-23

Industrial vegetable oil production in Viet Nam depends on oil seeds and crude plant oils that are currently more than 90% imported. As the first step in investigating the feasibility of using microalgae to provide Viet Nam with a domestic source of oil for food and edible oil industries, fifty lipid-producing microalgae were isolated and characterized. The microalgae were isolated from water sources ranging from freshwater to brackish and marine waters from a wide geographic distribution in Viet Nam. Initial analyses showed that 20 of the 50 strains had good growth rates, produced high biomass and had high lipid content, ranging up to 50% of dry weight biomass. 18S rRNA gene sequence analyses of the 50 strains showed a great diversity in this assemblage of microalgae, comprising at least 38 species and representatives of 25 genera: Chlamydomonas, Poterioochromonas, Scenedesmus, Desmodesmus, Chlorella, Bracteacoccus, Monoraphidium, Selenastrum, Acutodesmus, Mychonastes, Ankistrodesmus, Kirchneriella, Raphidocelis, Dictyosphaerium, Coelastrella, Schizochlamydella, Oocystidium, Nannochloris, Auxenochlorella, Chlorosarcinopsis, Stichococcus, Picochlorum, Prasinoderma, Chlorococcum, and Marvania. Some of the species are closely related to well-known lipid producers such as Chlorella sorokiniana, but some other strains are not closely related to the strains found in public sequence databases and likely represent new species. Analysis of oil quality showed that fatty acid profiles of the microalgal strains were very diverse and strain-dependent. Fatty acids in the microalgal oils comprised saturated fatty acids (SFAs), poly-unsaturated fatty acids (PUFAs), and mono-unsaturated fatty acids (MUFAs). The main SFA was palmitic acid. MUFAs and PUFAs were dominated by oleic acid, and linoleic and linolenic acids, respectively.
Some strains were especially rich in the essential fatty acid α-linolenic acid (ALA), which comprised more than 20% of the fatty acids in these strains. Other strains had fatty acid compositions similar to that of palm oil. Several strains have been selected on the basis of their suitable fatty acid profiles and high lipid content for further chemical and physical characterization, toxicity and organoleptic tests of their oils, and for scale-up.
Isolation and Selection of Microalgal Strains from Natural Water Sources in Viet Nam with Potential for Edible Oil Production

PubMed Central

Thao, Tran Yen; Linh, Dinh Thi Nhat; Si, Vo Chi; Carter, Taylor W.; Hill, Russell T.

2017-01-01

Industrial vegetable oil production in Viet Nam depends on oil seeds and crude plant oils that are currently more than 90% imported. As the first step in investigating the feasibility of using microalgae to provide Viet Nam with a domestic source of oil for food and edible oil industries, fifty lipid-producing microalgae were isolated and characterized. The microalgae were isolated from water sources ranging from freshwater to brackish and marine waters from a wide geographic distribution in Viet Nam. Initial analyses showed that 20 of the 50 strains had good growth rates, produced high biomass and had high lipid content, ranging up to 50% of dry weight biomass. 18S rRNA gene sequence analyses of the 50 strains showed a great diversity in this assemblage of microalgae, comprising at least 38 species and representatives of 25 genera: Chlamydomonas, Poterioochromonas, Scenedesmus, Desmodesmus, Chlorella, Bracteacoccus, Monoraphidium, Selenastrum, Acutodesmus, Mychonastes, Ankistrodesmus, Kirchneriella, Raphidocelis, Dictyosphaerium, Coelastrella, Schizochlamydella, Oocystidium, Nannochloris, Auxenochlorella, Chlorosarcinopsis, Stichococcus, Picochlorum, Prasinoderma, Chlorococcum, and Marvania. Some of the species are closely related to well-known lipid producers such as Chlorella sorokiniana, but some other strains are not closely related to the strains found in public sequence databases and likely represent new species. Analysis of oil quality showed that fatty acid profiles of the microalgal strains were very diverse and strain-dependent. Fatty acids in the microalgal oils comprised saturated fatty acids (SFAs), poly-unsaturated fatty acids (PUFAs), and mono-unsaturated fatty acids (MUFAs). The main SFA was palmitic acid. MUFAs and PUFAs were dominated by oleic acid, and linoleic and linolenic acids, respectively. Some strains were especially rich in the essential fatty acid α-linolenic acid (ALA), which comprised more than 20% of the fatty acids in these strains. 
Other strains had fatty acid compositions similar to that of palm oil. Several strains have been selected on the basis of their suitable fatty acid profiles and high lipid content for further chemical and physical characterization, toxicity and organoleptic tests of their oils, and for scale-up. PMID:28644408

A-to-I RNA Editing Contributes to Proteomic Diversity in Cancer.

PubMed

Peng, Xinxin; Xu, Xiaoyan; Wang, Yumeng; Hawke, David H; Yu, Shuangxing; Han, Leng; Zhou, Zhicheng; Mojumdar, Kamalika; Jeong, Kang Jin; Labrie, Marilyne; Tsang, Yiu Huen; Zhang, Minying; Lu, Yiling; Hwu, Patrick; Scott, Kenneth L; Liang, Han; Mills, Gordon B

2018-05-14

Adenosine (A) to inosine (I) RNA editing introduces many nucleotide changes in cancer transcriptomes. However, due to the complexity of post-transcriptional regulation, the contribution of RNA editing to proteomic diversity in human cancers remains unclear. Here, we performed an integrated analysis of TCGA genomic data and CPTAC proteomic data. Despite limited site diversity, we demonstrate that A-to-I RNA editing contributes to proteomic diversity in breast cancer through changes in amino acid sequences. We validate the presence of editing events at both RNA and protein levels. The edited COPA protein increases proliferation, migration, and invasion of cancer cells in vitro. Our study suggests an important contribution of A-to-I RNA editing to protein diversity in cancer and highlights its translational potential. Copyright © 2018 Elsevier Inc. All rights reserved.
Comparative genomics of citric-acid producing Aspergillus niger ATCC 1015 versus enzyme-producing CBS 513.88

DOE Office of Scientific and Technical Information (OSTI.GOV)

Grigoriev, Igor V.; Baker, Scott E.; Andersen, Mikael R.

2011-04-28

The filamentous fungus Aspergillus niger exhibits great diversity in its phenotype. It is found globally, both as marine and terrestrial strains, produces both organic acids and hydrolytic enzymes in high amounts, and some isolates exhibit pathogenicity. Although the genome of an industrial enzyme-producing A. niger strain (CBS 513.88) has already been sequenced, the versatility and diversity of this species compel additional exploration. We therefore undertook whole genome sequencing of the acidogenic A. niger wild type strain (ATCC 1015), and produced a genome sequence of very high quality. Only 15 gaps are present in the sequence and half the telomeric regions have been elucidated. Moreover, sequence information from ATCC 1015 was utilized to improve the genome sequence of CBS 513.88. Chromosome-level comparisons uncovered several genome rearrangements, deletions, a clear case of strain-specific horizontal gene transfer, and identification of 0.8 megabase of novel sequence. Single nucleotide polymorphisms per kilobase (SNPs/kb) between the two strains were found to be exceptionally high (average: 7.8, maximum: 160 SNPs/kb). High variation within the species was confirmed with exo-metabolite profiling and phylogenetics. Detailed lists of alleles were generated, and genotypic differences were observed to accumulate in metabolic pathways essential to acid production and protein synthesis. A transcriptome analysis revealed up-regulation of the electron transport chain, specifically the alternative oxidative pathway in ATCC 1015, while CBS 513.88 showed significant up-regulation of genes relevant to glucoamylase A production, such as tRNA-synthases and protein transporters.
Our results and datasets from this integrative systems biology analysis resulted in a snapshot of fungal evolution and will support further optimization of cell factories based on filamentous fungi. [Supplemental materials (10 figures, three text documents and 16 tables) have been made available. The whole genome sequence for A. niger ATCC 1015 is available from NCBI under acc. no ACJE00000000. The updated sequence for A. niger CBS 513.88 is available from EMBL under acc. no AM269948-AM270415. The sequence data from the phylogeny study has been submitted to NCBI (GU296686-296739). Microarray data from this study is submitted to GEO as series GSE10983. Accession for reviewers is possible through: http://www.ncbi.nlm.nih.gov/geo/query/acc.cgi token GSE10983] The dsmM_ANIGERa_coll511030F library and platform information is deposited at GEO under number GPL6758.
A novel rhabdovirus, related to Merida virus, in field-collected mosquitoes from Anatolia and Thrace.

PubMed

Ergünay, Koray; Brinkmann, Annika; Litzba, Nadine; Günay, Filiz; Kar, Sırrı; Öter, Kerem; Örsten, Serra; Sarıkaya, Yasemen; Alten, Bülent; Nitsche, Andreas; Linton, Yvonne-Marie

2017-07-01

Next-generation sequencing technologies have significantly facilitated the discovery of novel viruses, and metagenomic surveillance of arthropods has enabled exploration of the diversity of novel or known viral agents. We have identified a novel rhabdovirus that is genetically related to the recently described Merida virus via next-generation sequencing in a mosquito pool from Thrace. The complete viral genome contains 11,798 nucleotides with 83% genome-wide nucleotide sequence similarity to Merida virus. Five major putative open reading frames that follow the canonical rhabdovirus genome organization were identified. A total of 1380 mosquitoes comprising 13 species, collected from Thrace and the Mediterranean and Aegean regions of Anatolia were screened for the novel virus using primers based on the N and L genes of the prototype genome. Eight positive pools (6.2%) exclusively comprised Culex pipiens sensu lato specimens originating from all study regions. Infections were observed in pools with female as well as male or mixed-sex individuals. The overall and Cx. pipiens-specific minimal infection rates were calculated to be 5.7 and 14.8, respectively. Sequencing of the PCR products revealed marked diversity within a portion of the N gene, with up to 4% divergence and distinct amino acid substitutions that were unrelated to the collection site. Phylogenetic analysis of the complete and partial viral polymerase (L gene) amino acid sequences placed the novel virus and Merida virus in a distinct group, indicating that these strains are closely related. The strain is tentatively named "Merida-like virus Turkey". Studies are underway to isolate and further explore the host range and distribution of this new strain.
Feature selection using a one dimensional naïve Bayes' classifier increases the accuracy of support vector machine classification of CDR3 repertoires.

PubMed

Cinelli, Mattia; Sun, Yuxin; Best, Katharine; Heather, James M; Reich-Zeliger, Shlomit; Shifrut, Eric; Friedman, Nir; Shawe-Taylor, John; Chain, Benny

2017-04-01

Somatic DNA recombination, the hallmark of vertebrate adaptive immunity, has the potential to generate a vast diversity of antigen receptor sequences. How this diversity captures antigen specificity remains incompletely understood. In this study we use high throughput sequencing to compare the global changes in T cell receptor β chain complementarity determining region 3 (CDR3β) sequences following immunization with ovalbumin administered with complete Freund's adjuvant (CFA) or CFA alone. The CDR3β sequences were deconstructed into short stretches of overlapping contiguous amino acids. The motifs were ranked according to a one-dimensional Bayesian classifier score comparing their frequency in the repertoires of the two immunization classes. The top ranking motifs were selected and used to create feature vectors which were used to train a support vector machine. The support vector machine achieved high classification scores in a leave-one-out validation test reaching >90% in some cases. The study describes a novel two-stage classification strategy combining a one-dimensional Bayesian classifier with a support vector machine. Using this approach we demonstrate that the frequency of a small number of linear motifs three amino acids in length can accurately identify a CD4 T cell response to ovalbumin against a background response to the complex mixture of antigens which characterize Complete Freund's Adjuvant. The sequence data is available at www.ncbi.nlm.nih.gov/sra/?term=SRP075893. The Decombinator package is available at github.com/innate2adaptive/Decombinator. The R package e1071 is available at the CRAN repository https://cran.r-project.org/web/packages/e1071/index.html. Contact: b.chain@ucl.ac.uk. Supplementary data are available at Bioinformatics online. © The Author 2017. Published by Oxford University Press.
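The motif decomposition and ranking step described in this abstract can be illustrated with a minimal sketch. It is not the authors' implementation: the function names, the pseudocount, and the use of a smoothed log frequency ratio as the per-motif score are all assumptions made for illustration.

```python
from collections import Counter
import math

def kmers(cdr3, k=3):
    """Return all overlapping contiguous k-mers of a CDR3 amino acid string."""
    return [cdr3[i:i + k] for i in range(len(cdr3) - k + 1)]

def motif_counts(repertoire, k=3):
    """Count every k-mer across all CDR3 sequences in one repertoire."""
    counts = Counter()
    for seq in repertoire:
        counts.update(kmers(seq, k))
    return counts

def log_ratio_scores(rep_a, rep_b, k=3, pseudocount=1.0):
    """Score each motif by the log ratio of its smoothed frequency in class A vs class B.

    Assumption: a simple per-motif log likelihood ratio stands in for the
    one-dimensional Bayesian classifier score mentioned in the abstract.
    """
    ca, cb = motif_counts(rep_a, k), motif_counts(rep_b, k)
    motifs = set(ca) | set(cb)
    total_a = sum(ca.values()) + pseudocount * len(motifs)
    total_b = sum(cb.values()) + pseudocount * len(motifs)
    return {
        m: math.log((ca[m] + pseudocount) / total_a)
           - math.log((cb[m] + pseudocount) / total_b)
        for m in motifs
    }
```

Motifs with the largest absolute scores would then be the candidates for building feature vectors for a downstream classifier such as a support vector machine.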
Apparent founder effect during the early years of the San Francisco HIV type 1 epidemic (1978-1979).

PubMed

Foley, B; Pan, H; Buchbinder, S; Delwart, E L

2000-10-10

HIV-1 envelope sequence variants were RT-PCR amplified from serum samples cryopreserved in San Francisco in 1978-1979. The HIV-1 subtype B env V3-V5 sequences from four homosexual men clustered phylogenetically, with a median nucleotide distance of 2.8%, reflecting a recent common origin. These early U.S. HIV-1 env variants mapped close to the phylogenetic root of the subtype B tree while env variants collected in the United States throughout the 1980s and 1990s showed, on average, increasing genetic diversity and divergence from the subtype B consensus sequence. These results indicate that the majority of HIV-1 currently circulating in the United States may be descended from an initial introduction and rapid spread during the mid- to late 1970s of subtype B viruses with limited variability (i.e., a founder effect). As expected from the starburst-shaped phylogeny of HIV-1 subtype B, contemporary U.S. strains were, on average, more closely related at the nucleic acid and amino acid levels to the earlier 1978-1979 env variants than to each other. The growing levels of HIV-1 genetic diversity, one of multiple obstacles in designing a protective vaccine, may therefore be mitigated by using epidemic founding variants as antigenic strains for protection against contemporary strains.
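As an illustration of how a median pairwise nucleotide distance can be obtained from aligned sequences, the sketch below computes the uncorrected proportion of differing sites (p-distance). This is an assumption for illustration only: the abstract does not state which distance measure or substitution model was used, and the function names are hypothetical.

```python
from itertools import combinations
from statistics import median

def p_distance(a, b):
    """Proportion of differing sites between two aligned sequences, ignoring gap columns."""
    if len(a) != len(b):
        raise ValueError("sequences must be aligned to the same length")
    pairs = [(x, y) for x, y in zip(a, b) if x != "-" and y != "-"]
    if not pairs:
        raise ValueError("no comparable sites")
    return sum(x != y for x, y in pairs) / len(pairs)

def median_pairwise_distance(seqs):
    """Median p-distance over all unordered pairs of aligned sequences."""
    return median(p_distance(a, b) for a, b in combinations(seqs, 2))
```

A value of 0.028 from such a calculation would correspond to a 2.8% distance.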
Diverse bacterial PKS sequences derived from okadaic acid-producing dinoflagellates.

PubMed

Perez, Roberto; Liu, Li; Lopez, Jose; An, Tianying; Rein, Kathleen S

2008-05-22

Okadaic acid (OA) and the related dinophysistoxins are isolated from dinoflagellates of the genera Prorocentrum and Dinophysis. Bacteria of the Roseobacter group have been associated with okadaic acid-producing dinoflagellates and have been previously implicated in OA production. Analysis of 16S rRNA libraries reveals that Roseobacter are the most abundant bacteria associated with OA-producing dinoflagellates of the genus Prorocentrum and are not found in association with non-toxic dinoflagellates. While some polyketide synthase (PKS) genes form a highly supported Prorocentrum clade, most appear to be bacterial, but unrelated to Roseobacter or Alpha-Proteobacterial PKSs or those derived from the other alveolates Karenia brevis or Cryptosporidium parvum.
IDM-PhyChm-Ens: intelligent decision-making ensemble methodology for classification of human breast cancer using physicochemical properties of amino acids.

PubMed

Ali, Safdar; Majid, Abdul; Khan, Asifullah

2014-04-01

Development of an accurate and reliable intelligent decision-making method for the construction of cancer diagnosis systems is one of the fast-growing research areas of health sciences. Such a decision-making system can provide adequate information for cancer diagnosis and drug discovery. Descriptors derived from physicochemical properties of protein sequences are very useful for classifying cancerous proteins. Recently, several interesting research studies have been reported on breast cancer classification. To this end, we propose the exploitation of the physicochemical properties of amino acids in protein primary sequences such as hydrophobicity (Hd) and hydrophilicity (Hb) for breast cancer classification. Hd and Hb properties of amino acids, in recent literature, are reported to be quite effective in characterizing the constituent amino acids and are used to study protein folding, interactions, structures, and sequence-order effects. In particular, using these physicochemical properties, we observed that proline, serine, tyrosine, cysteine, arginine, and asparagine amino acids offer high discrimination between cancerous and healthy proteins. In addition, unlike traditional ensemble classification approaches, the proposed 'IDM-PhyChm-Ens' method was developed by combining the decision spaces of a specific classifier trained on different feature spaces. The different feature spaces used were amino acid composition, split amino acid composition, and pseudo amino acid composition. Consequently, we have exploited different feature spaces using Hd and Hb properties of amino acids to develop an accurate method for classification of cancerous protein sequences. We developed ensemble classifiers using diverse learning algorithms such as random forest (RF), support vector machines (SVM), and K-nearest neighbor (KNN) trained on different feature spaces. We observed that ensemble-RF, in case of cancer classification, performed better than ensemble-SVM and ensemble-KNN.
Our analysis demonstrates that ensemble-RF, ensemble-SVM and ensemble-KNN are more effective than their individual counterparts. The proposed 'IDM-PhyChm-Ens' method has shown improved performance compared to existing techniques.
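Of the feature spaces named in this abstract, amino acid composition is the simplest to illustrate. The sketch below is a generic version of that descriptor, not the authors' code: the function name and the choice to ignore non-standard residues are assumptions.

```python
# The 20 standard amino acids in one-letter code, in a fixed order.
AMINO_ACIDS = "ACDEFGHIKLMNPQRSTVWY"

def amino_acid_composition(seq):
    """Return a 20-element vector of residue fractions for a protein sequence.

    Assumption: characters outside the 20 standard residues (e.g. X, B, Z)
    are ignored when computing the fractions.
    """
    seq = seq.upper()
    counts = [seq.count(a) for a in AMINO_ACIDS]
    total = sum(counts)
    if total == 0:
        raise ValueError("sequence contains no standard amino acids")
    return [c / total for c in counts]
```

Each protein becomes a fixed-length vector regardless of its length, which is what allows classifiers such as RF, SVM, or KNN to be trained on it.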
Binning of shallowly sampled metagenomic sequence fragments reveals that low abundance bacteria play important roles in sulfur cycling and degradation of complex organic polymers in an acid mine drainage community

NASA Astrophysics Data System (ADS)

Dick, G. J.; Andersson, A.; Banfield, J. F.

2007-12-01

Our understanding of environmental microbiology has been greatly enhanced by community genome sequencing of DNA recovered directly from the environment. Community genomics provides insights into the diversity, community structure, metabolic function, and evolution of natural populations of uncultivated microbes, thereby revealing dynamics of how microorganisms interact with each other and their environment. Recent studies have demonstrated the potential for reconstructing near-complete genomes from natural environments while highlighting the challenges of analyzing community genomic sequence, especially from diverse environments. A major challenge of shotgun community genome sequencing is identification of DNA fragments from minor community members for which only low coverage of genomic sequence is present. We analyzed community genome sequence retrieved from biofilms in an acid mine drainage (AMD) system in the Richmond Mine at Iron Mountain, CA, with an emphasis on identification and assembly of DNA fragments from low-abundance community members. The Richmond Mine hosts an extensive, relatively low diversity subterranean chemolithoautotrophic community that is sustained entirely by oxidative dissolution of pyrite. The activity of these microorganisms greatly accelerates the generation of AMD. Previous and ongoing work in our laboratory has focused on reconstructing genomes of dominant community members, including several bacteria and archaea. We binned contigs from several samples (including one new sample and two that had been previously analyzed) by tetranucleotide frequency with clustering by Self-Organizing Maps (SOM). The binning, evaluated by comparison with information from the manually curated assembly of the dominant organisms, was found to be very effective: fragments were correctly assigned with 95% accuracy. Improperly assigned fragments often contained sequences that are either evolutionarily constrained (e.g. 16S rRNA genes) or mobile elements that are not expected to reflect the tetranucleotide frequency signature of the host genome. Four unknown tetranucleotide frequency clusters with significant sequence (6 Mb total) were noted and analyzed further. Based on phylogenetic markers and BLAST results, these clusters represent low-abundance bacteria including Actinobacteria, Firmicutes, and Proteobacteria. Functional analysis of these clusters revealed that the low-abundance bacteria harbor genes that could potentially encode important ecosystem functions such as sulfur utilization (e.g. polysulfide reductase) and polymer degradation (e.g. chitinase and glycoside hydrolase). We conclude that ESOM clustering of tetranucleotide frequency patterns is an effective method for rapidly binning shotgun community genomic sequences and a valuable tool for analyzing minor community members, which despite their low abundance may play crucial ecological roles.
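The tetranucleotide frequency signature on which this binning relies can be sketched as follows. This is a simplified illustration rather than the authors' pipeline: the names are hypothetical, windows containing ambiguous bases are skipped, and reverse-complement tetramers are not merged, which real binning workflows often do.

```python
from itertools import product

# All 256 possible tetranucleotides in a fixed order, with a lookup index.
TETRAMERS = ["".join(p) for p in product("ACGT", repeat=4)]
INDEX = {t: i for i, t in enumerate(TETRAMERS)}

def tetranucleotide_frequencies(contig):
    """Return a 256-element vector of tetranucleotide frequencies for one contig.

    Windows that contain characters other than A, C, G, T (e.g. N) are skipped.
    """
    contig = contig.upper()
    counts = [0] * len(TETRAMERS)
    total = 0
    for i in range(len(contig) - 3):
        j = INDEX.get(contig[i:i + 4])
        if j is not None:
            counts[j] += 1
            total += 1
    if total == 0:
        return [0.0] * len(TETRAMERS)
    return [c / total for c in counts]
```

Vectors of this kind, one per contig, are the input that a clustering method such as a self-organizing map would group into bins.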
Diversity within Italian Cheesemaking Brine-Associated Bacterial Communities Evidenced by Massive Parallel 16S rRNA Gene Tag Sequencing

PubMed Central

Marino, Marilena; Innocente, Nadia; Maifreni, Michela; Mounier, Jérôme; Cobo-Díaz, José F.; Coton, Emmanuel; Carraro, Lisa; Cardazzo, Barbara

2017-01-01

This study explored the bacterial diversity of brines used for cheesemaking in Italy, as well as their physicochemical characteristics. In this context, 19 brines used to salt soft, semi-hard, and hard Italian cheeses were collected in 14 commercial cheese plants and analyzed using a culture-independent amplicon sequencing approach in order to describe their bacterial microbiota. Large NaCl concentration variations were observed among the selected brines, with hard cheese brines exhibiting the highest values. Acidity values showed a great variability too, probably in relation to the brine use prior to sampling. Despite their high salt content, brine microbial loads ranged from 2.11 to 6.51 log CFU/mL for the total mesophilic count. Microbial community profiling assessed by 16S rRNA gene sequencing showed that these ecosystems were dominated by Firmicutes and Proteobacteria, followed by Actinobacteria and Bacteroidetes. Cheese type and brine salinity seem to be the main parameters accountable for brine microbial diversity. On the contrary, brine pH, acidity and protein concentration, correlated to cheese brine age, did not have any selective effect on the microbiota composition. Nine major genera were present in all analyzed brines, indicating that they might compose the core microbiome of cheese brines. Staphylococcus aureus was occasionally detected in brines using selective culture media. Interestingly, bacterial genera associated with a functional and technological use were frequently detected. Indeed Bifidobacteriaceae, which might be valuable probiotic candidates, and specific microbial genera such as Tetragenococcus, Corynebacterium and non-pathogenic Staphylococcus, which can contribute to sensorial properties of ripened cheeses, were widespread within brines. PMID:29163411
Phylogenetic Diversity of Lactic Acid Bacteria Associated with Paddy Rice Silage as Determined by 16S Ribosomal DNA Analysis

PubMed Central

Ennahar, Saïd; Cai, Yimin; Fujita, Yasuhito

2003-01-01

A total of 161 low-G+C-content gram-positive bacteria isolated from whole-crop paddy rice silage were classified and subjected to phenotypic and genetic analyses. Based on morphological and biochemical characters, these presumptive lactic acid bacterium (LAB) isolates were divided into 10 groups that included members of the genera Enterococcus, Lactobacillus, Lactococcus, Leuconostoc, Pediococcus, and Weissella. Analysis of the 16S ribosomal DNA (rDNA) was used to confirm the presence of the predominant groups indicated by phenotypic analysis and to determine the phylogenetic affiliation of representative strains. The virtually complete 16S rRNA gene was PCR amplified and sequenced. The sequences from the various LAB isolates showed high degrees of similarity to those of the GenBank reference strains (between 98.7 and 99.8%). Phylogenetic trees based on the 16S rDNA sequence displayed high consistency, with nodes supported by high bootstrap values. With the exception of one species, the genetic data was in agreement with the phenotypic identification. The prevalent LAB, predominantly homofermentative (66%), consisted of Lactobacillus plantarum (24%), Lactococcus lactis (22%), Leuconostoc pseudomesenteroides (20%), Pediococcus acidilactici (11%), Lactobacillus brevis (11%), Enterococcus faecalis (7%), Weissella kimchii (3%), and Pediococcus pentosaceus (2%). The present study, the first to fully document rice-associated LAB, showed a very diverse community of LAB with a relatively high number of species involved in the fermentation process of paddy rice silage. The comprehensive 16S rDNA-based approach to describing LAB community structure was valuable in revealing the large diversity of bacteria inhabiting paddy rice silage and enabling the future design of appropriate inoculants aimed at improving its fermentation quality. PMID:12514026
Phylogenetic diversity of lactic acid bacteria associated with paddy rice silage as determined by 16S ribosomal DNA analysis.

PubMed

Ennahar, Saïd; Cai, Yimin; Fujita, Yasuhito

2003-01-01

A total of 161 low-G+C-content gram-positive bacteria isolated from whole-crop paddy rice silage were classified and subjected to phenotypic and genetic analyses. Based on morphological and biochemical characters, these presumptive lactic acid bacterium (LAB) isolates were divided into 10 groups that included members of the genera Enterococcus, Lactobacillus, Lactococcus, Leuconostoc, Pediococcus, and Weissella. Analysis of the 16S ribosomal DNA (rDNA) was used to confirm the presence of the predominant groups indicated by phenotypic analysis and to determine the phylogenetic affiliation of representative strains. The virtually complete 16S rRNA gene was PCR amplified and sequenced. The sequences from the various LAB isolates showed high degrees of similarity to those of the GenBank reference strains (between 98.7 and 99.8%). Phylogenetic trees based on the 16S rDNA sequence displayed high consistency, with nodes supported by high bootstrap values. With the exception of one species, the genetic data was in agreement with the phenotypic identification. The prevalent LAB, predominantly homofermentative (66%), consisted of Lactobacillus plantarum (24%), Lactococcus lactis (22%), Leuconostoc pseudomesenteroides (20%), Pediococcus acidilactici (11%), Lactobacillus brevis (11%), Enterococcus faecalis (7%), Weissella kimchii (3%), and Pediococcus pentosaceus (2%). The present study, the first to fully document rice-associated LAB, showed a very diverse community of LAB with a relatively high number of species involved in the fermentation process of paddy rice silage. The comprehensive 16S rDNA-based approach to describing LAB community structure was valuable in revealing the large diversity of bacteria inhabiting paddy rice silage and enabling the future design of appropriate inoculants aimed at improving its fermentation quality.
Epitope selection from an uncensored peptide library displayed on avian leukosis virus.

PubMed

Khare, Pranay D; Rosales, Ana G; Bailey, Kent R; Russell, Stephen J; Federspiel, Mark J

2003-10-25

Phage display libraries have provided an extraordinarily versatile technology to facilitate the isolation of peptides, growth factors, single chain antibodies, and enzymes with desired binding specificities or enzymatic activities. The overall diversity of peptides in phage display libraries can be significantly limited by Escherichia coli protein folding and processing machinery, which results in sequence censorship. To achieve an optimal diversity of displayed eukaryotic peptides, the library should be produced in the endoplasmic reticulum of eukaryotic cells using a eukaryotic display platform. In the accompanying article, we presented experiments that demonstrate that polypeptides of various sizes could be efficiently displayed on the envelope glycoproteins of a eukaryotic virus, avian leukosis virus (ALV), and the displayed polypeptides could efficiently attach to cognate receptors without interfering with viral attachment and entry into susceptible cells. In this study, methods were developed to construct a model library of randomized eight amino acid peptides using the ALV eukaryotic display platform and screen the library for specific epitopes using immobilized antibodies. A virus library with approximately 2 × 10^6 different members was generated from a plasmid library of approximately 5 × 10^6 diversity. The sequences of the randomized 24 nucleotide/eight amino acid regions of representatives of the plasmid and virus libraries were analyzed. No significant sequence censorship was observed in producing the virus display library from the plasmid library. Different populations of peptide epitopes were selected from the virus library when different monoclonal antibodies were used as the target. The results of these two studies clearly demonstrate the potential of ALV as a eukaryotic platform for the display and selection of eukaryotic polypeptide libraries.
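To put the reported library sizes in context, the short calculation below compares them with the theoretical number of distinct eight-residue peptides. It is an illustrative sketch with hypothetical function names, and it assumes all 20 standard amino acids are allowed at every position, which the abstract does not state.

```python
def peptide_space(length, alphabet=20):
    """Number of distinct peptides of a given length over an amino acid alphabet."""
    return alphabet ** length

def coverage(library_size, length, alphabet=20):
    """Upper bound on the fraction of peptide space a library of that size can sample."""
    return library_size / peptide_space(length, alphabet)

# Theoretical space for eight fully randomized positions: 20**8 = 25,600,000,000.
space = peptide_space(8)
# A library of about 2 x 10^6 members can cover at most this fraction of it.
virus_fraction = coverage(2_000_000, 8)
```

Under that assumption, a library of about 2 × 10^6 members samples less than 0.01% of the possible octapeptides.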
DNA-binding proteins from marine bacteria expand the known sequence diversity of TALE-like repeats.

PubMed

de Lange, Orlando; Wolf, Christina; Thiel, Philipp; Krüger, Jens; Kleusch, Christian; Kohlbacher, Oliver; Lahaye, Thomas

2015-11-16

Transcription Activator-Like Effectors (TALEs) of Xanthomonas bacteria are programmable DNA binding proteins with unprecedented target specificity. Comparative studies into TALE repeat structure and function are hindered by the limited sequence variation among TALE repeats. More sequence-diverse TALE-like proteins are known from Ralstonia solanacearum (RipTALs) and Burkholderia rhizoxinica (Bats), but RipTAL and Bat repeats are conserved with those of TALEs around the DNA-binding residue. We study two novel marine-organism TALE-like proteins (MOrTL1 and MOrTL2), the first to date of non-terrestrial origin. We have assessed their DNA-binding properties and modelled repeat structures. We found that repeats from these proteins mediate sequence specific DNA binding conforming to the TALE code, despite low sequence similarity to TALE repeats, and with novel residues around the BSR. However, MOrTL1 repeats show greater sequence discriminating power than MOrTL2 repeats. Sequence alignments show that there are only three residues conserved between repeats of all TALE-like proteins including the two new additions. This conserved motif could prove useful as an identifier for future TALE-likes. Additionally, comparing MOrTL repeats with those of other TALE-likes suggests a common evolutionary origin for the TALEs, RipTALs and Bats. © The Author(s) 2015. Published by Oxford University Press on behalf of Nucleic Acids Research.
Draft Genome Sequence of the Nicotinate-Metabolizing Soil Bacterium Bacillus niacini DSM 2923

PubMed Central

Harvey, Zachary H.

2014-01-01

Bacillus niacini is a member of a small yet diverse group of bacteria able to catabolize nicotinic acid. We report here the availability of a draft genome for B. niacini, which we will use to understand the evolution of its namesake phenotype, which appears to be unique among the species in its phylogenetic neighborhood. PMID:25477409
Fine tangled pili expressed by Haemophilus ducreyi are a novel class of pili.

PubMed Central

Brentjens, R J; Ketterer, M; Apicella, M A; Spinola, S M

1996-01-01

Haemophilus ducreyi synthesizes fine, tangled pili composed predominantly of a protein whose apparent molecular weight is 24,000 (24K). A hybridoma, 2D8, produced a monoclonal antibody (MAb) that bound to a 24K protein in H. ducreyi strains isolated from diverse geographic locations. A lambda gt11 H. ducreyi library was screened with MAb 2D8. A 3.5-kb chromosomal insert from one reactive plaque was amplified and ligated into the pCRII vector. The recombinant plasmid, designated pHD24, expressed a 24K protein in Escherichia coli INV alpha F that bound MAb 2D8. The coding sequence of the 24K gene was localized by exonuclease III digestion. The insert contained a 570-bp open reading frame, designated ftpA (fine, tangled pili). Translation of ftpA predicted a polypeptide with a molecular weight of 21.1K. The predicted N-terminal amino acid sequence of the polypeptide encoded by ftpA was identical to the N-terminal amino acid sequence of purified pilin and lacked a cleavable signal sequence. Primer extension analysis of ftpA confirmed the lack of a leader peptide. The predicted amino acid sequence lacked homology to known pilin sequences but shared homology with the sequences of E. coli Dps and Treponema pallidum antigen TpF1 or 4D, proteins which associate to form ordered rings. An isogenic pilin mutant, H. ducreyi 35000ftpA::mTn3(Cm), was constructed by shuttle mutagenesis and did not contain pili when examined by electron microscopy. We conclude that H. ducreyi synthesizes fine, tangled pili that are composed of a unique major subunit, which may be exported by a signal sequence independent mechanism. PMID:8550517
Characterization of Microbial Communities in Chinese Rice Wine Collected at Yichang City and Suzhou City in China.

PubMed

Lü, Yucai; Gong, Yanli; Li, Yajie; Pan, Zejiang; Yao, Yi; Li, Ning; Guo, Jinling; Gong, Dachun; Tian, Yihong; Peng, Caiyun

2017-08-28

Two typical microbial communities from Chinese rice wine fermentation collected in Yichang city and Suzhou city in China were investigated. Both communities could ferment glutinous rice to rice wine in 2 days. The sugar and ethanol contents were 198.67 and 14.47 mg/g, respectively, for rice wine from Yichang city, and 292.50 and 12.31 mg/g, respectively, for rice wine from Suzhou city. Acetic acid and lactic acid were the most abundant organic acids. Abundant fungi and bacteria were detected in both communities by high-throughput sequencing. Saccharomycopsis fibuligera and Rhizopus oryzae were the dominant fungi in rice wine from Suzhou city, compared with R. oryzae, Wickerhamomyces anomalus, Saccharomyces cerevisiae, Mucor indicus, and Rhizopus microsporus in rice wine from Yichang city. Bacterial diversity was greater than fungal diversity in both communities. Citrobacter was the most abundant genus. Furthermore, Exiguobacterium, Aeromonas, Acinetobacter, Pseudomonas, Enterobacter, Bacillus, and Lactococcus were highly abundant in both communities.
Two Perspectives on the Origin of the Standard Genetic Code

NASA Astrophysics Data System (ADS)

Sengupta, Supratim; Aggarwal, Neha; Bandhu, Ashutosh Vishwa

2014-12-01

The origin of a genetic code made it possible to create ordered sequences of amino acids. In this article we provide two perspectives on code origin by carrying out simulations of code-sequence coevolution in finite populations with the aim of examining how the standard genetic code may have evolved from more primitive code(s) encoding a small number of amino acids. We determine the efficacy of the physico-chemical hypothesis of code origin in the absence and presence of horizontal gene transfer (HGT) by allowing a diverse collection of code-sequence sets to compete with each other. We find that in the absence of horizontal gene transfer, natural selection between competing codes distinguished by differences in the degree of physico-chemical optimization is unable to explain the structure of the standard genetic code. However, for certain probabilities of the horizontal transfer events, a universal code emerges having a structure that is consistent with the standard genetic code.
The bean α-amylase inhibitor is encoded by a lectin gene

DOE Office of Scientific and Technical Information (OSTI.GOV)

Moreno, J.; Altabella, T.; Chrispeels, M.J.

The common bean, Phaseolus vulgaris, contains an inhibitor of insect and mammalian α-amylases that does not inhibit plant α-amylase. This inhibitor functions as an anti-feedant or seed-defense protein. We purified this inhibitor by affinity chromatography and found that it consists of a series of glycoforms of two polypeptides (Mr 14,000-19,000). Partial amino acid sequencing was carried out, and the sequences obtained are identical with portions of the derived amino acid sequence of a lectin-like gene. This lectin gene encodes a polypeptide of MW 28,000, and the primary in vitro translation product identified by antibodies to the α-amylase inhibitor has the same size. Co- and posttranslational processing of this polypeptide results in glycosylated polypeptides of 14-19 kDa. Our interpretation of these results is that the bean lectins constitute a gene family that encodes diverse plant defense proteins, including phytohemagglutinin, arcelin and α-amylase inhibitor.
A gene variation of 14-3-3 zeta isoform in rat hippocampus.

PubMed

Murakami, K; Situ, S Y; Eshete, F

1996-11-14

A variant form of 14-3-3 zeta was isolated from a rat hippocampal cDNA library. The cloned cDNA is 1687 bp in length and contains an entire ORF (nt = 63-797) encoding 245 amino acids, which is characteristic of the 14-3-3 zeta subtype. By comparison with reported sequences of 14-3-3 zeta, we found three nucleotide substitutions within the coding sequence of our clone: a C<-->T transition at nt = 325 and G<-->C transversions at nt = 387 and 388. These are missense mutations, converting ACG (Thr) to ATG (Met) and CGT (Arg) to GCT (Ala) at residues 88 and 109, respectively. Our results show that at least three different genetic variants of 14-3-3 zeta are present in the rat, resulting in protein variants. Such variation in the amino acid sequence is an important indication of the diverse functions of this protein and may also contribute to the recent contradictory observations regarding the role of the 14-3-3 zeta subtype.
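The residue numbers quoted in this abstract follow from the ORF coordinates: counting from nt 63, nucleotide 325 is the second base of codon 88, and nucleotides 387 and 388 are the first two bases of codon 109. The following is a minimal Python sketch of that arithmetic using the standard genetic code. The helper names (`codon_position`, `translate_codon`) are illustrative and do not come from the paper.

```python
from itertools import product

# Standard genetic code in the conventional TCAG ordering
# (first base varies slowest, third base fastest).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def codon_position(nt, orf_start):
    """Map a 1-based cDNA coordinate to (residue number, position within codon)."""
    offset = nt - orf_start
    return offset // 3 + 1, offset % 3 + 1


def translate_codon(codon):
    """Translate one DNA codon to its one-letter amino acid code."""
    return CODON_TABLE[codon.upper()]


ORF_START, ORF_END = 63, 797  # coordinates quoted in the abstract

# Number of codons spanned by the quoted ORF coordinates.
print("codons in ORF:", (ORF_END - ORF_START + 1) // 3)

# Where each substituted nucleotide falls.
for nt in (325, 387, 388):
    residue, pos = codon_position(nt, ORF_START)
    print(f"nt {nt}: residue {residue}, codon position {pos}")

# Effect of the reported codon changes.
for ref, alt in (("ACG", "ATG"), ("CGT", "GCT")):
    print(f"{ref} ({translate_codon(ref)}) -> {alt} ({translate_codon(alt)})")
```

One-letter codes are used for brevity: T, M, R and A correspond to the Thr, Met, Arg and Ala named in the abstract.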
Molecular Cloning and Expression of Three Polygalacturonase cDNAs from the Tarnished Plant Bug, Lygus lineolaris

PubMed Central

Allen, Margaret L.; Mertens, Jeffrey A.

2008-01-01

Three unique cDNAs encoding putative polygalacturonase enzymes were isolated from the tarnished plant bug, Lygus lineolaris (Palisot de Beauvois) (Hemiptera: Miridae). The three nucleotide sequences were dissimilar to one another, but the deduced amino acid sequences were similar to each other and to other polygalacturonases from insects, fungi, plants, and bacteria. Four conserved segments characteristic of polygalacturonases were present, but with some notable semiconservative substitutions. Two of four expected disulfide bridge-forming cysteine pairs were present. All three inferred protein translations included predicted signal sequences of 17 to 20 amino acids. Amplification of genomic DNA identified an intron in one of the genes, Llpg1, in the 5′ untranslated region. Semiquantitative RT-PCR revealed expression in all stages of the insect except the eggs. Expression in adults, male and female, was highly variable, indicating a family of highly inducible and diverse enzymes adapted to the generalist polyphagous nature of this important pest. PMID:20233096

Genetic diversity of the movement and coat protein genes of South American isolates of Prunus necrotic ringspot virus.

PubMed

Fiore, Nicola; Fajardo, Thor V M; Prodan, Simona; Herranz, María Carmen; Aparicio, Frederic; Montealegre, Jaime; Elena, Santiago F; Pallás, Vicente; Sánchez-Navarro, Jesús

2008-01-01

Prunus necrotic ringspot virus (PNRSV) is distributed worldwide, but no molecular data have been previously reported from South American isolates. The nucleotide sequences corresponding to the movement (MP) and coat (CP) proteins of 23 isolates of PNRSV from Chile, Brazil, and Uruguay, and from different Prunus species, have been obtained. Phylogenetic analysis performed with full-length MP and CP sequences from all the PNRSV isolates confirmed the clustering of the isolates into the previously reported PV32-I, PV96-II and PE5-III phylogroups. No association was found between specific sequences and host, geographic origin or symptomatology. Comparative analysis showed that both MP and CP have phylogroup-specific amino acids and all of the motifs previously characterized for both proteins. The study of the distribution of synonymous and nonsynonymous changes along both open reading frames revealed that most amino acid sites are under the effect of negative purifying selection.
Sequence Based Structural Characterization and Genetic Diversity Analysis of Full Length TLR4 CDS in Crossbred and Indigenous Cattle.

PubMed

Mishra, Chinmoy; Kumar, Subodh; Sonwane, Arvind Asaram; Yathish, H M; Chaudhary, Rajni

2017-01-02

The exploration of candidate genes for immune response in cattle may be vital for improving our understanding of the species-specific response to pathogens. Toll-like receptor 4 (TLR4) is mostly involved in protection against the deleterious effects of Gram-negative pathogens. An approximately 2.6 kb long cDNA sequence of the TLR4 gene covering the entire coding region was characterized in two Indian milk cattle (Vrindavani and Tharparkar). Phylogenetic analysis indicated that bovine TLR4 apparently evolved from an ancestral form that predated the appearance of vertebrates and that it groups with buffalo, yak, and mithun TLR4s. Sequence analysis revealed a 2526-nucleotide long open reading frame (ORF) encoding 841 amino acids, similar to other cattle breeds. The calculated molecular weight of the translated ORF was 96144 and 96040.9 Da, and the isoelectric point was 6.35 and 6.42, in Vrindavani and Tharparkar cattle, respectively. The Simple Modular Architecture Research Tool (SMART) analysis identified 14 leucine-rich repeat (LRR) motifs in the bovine TLR4 protein. The deduced TLR4 amino acid sequence of Tharparkar had 4 different substitutions as compared to Bos taurus, Sahiwal, and Vrindavani. The signal peptide cleavage site was predicted to lie between the 16th and 17th amino acids of the mature peptide. The transmembrane helix was identified between amino acids 635 and 657 of the mature peptide.
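The ORF and protein lengths reported in this abstract are consistent if the 2526-nucleotide ORF is taken to include the stop codon: 2526 / 3 = 842 codons, of which 841 encode amino acids. The following is a small Python check of this arithmetic. The function names are illustrative, and whether the authors counted the stop codon in the ORF length is an assumption.

```python
def protein_length(orf_nt, includes_stop=True):
    """Number of amino acids encoded by an ORF of `orf_nt` nucleotides.

    Assumes the ORF is in frame (length divisible by 3). If `includes_stop`
    is True, the final codon is a stop codon and is not counted.
    """
    if orf_nt % 3 != 0:
        raise ValueError("ORF length is not a multiple of 3")
    codons = orf_nt // 3
    return codons - 1 if includes_stop else codons


def span_length(first, last):
    """Number of residues in an inclusive range such as 635-657."""
    return last - first + 1


# Values quoted in the abstract.
print("amino acids encoded by 2526-nt ORF:", protein_length(2526))
print("residues in transmembrane helix 635-657:", span_length(635, 657))
```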
Compartmentalization of HIV-1 within the female genital tract is due to monotypic and low-diversity variants not distinct viral populations.

PubMed

Bull, Marta; Learn, Gerald; Genowati, Indira; McKernan, Jennifer; Hitti, Jane; Lockhart, David; Tapia, Kenneth; Holte, Sarah; Dragavon, Joan; Coombs, Robert; Mullins, James; Frenkel, Lisa

2009-09-22

Compartmentalization of HIV-1 between the genital tract and blood was noted in half of 57 women included in 12 studies primarily using cell-free virus. To further understand differences between genital tract and blood viruses of women with chronic HIV-1 infection, cell-free and cell-associated virus populations were sequenced from these tissues, reasoning that integrated viral DNA includes variants archived from earlier in infection, and provides a greater array of genotypes for comparisons. Multiple sequences from single-genome-amplification of HIV-1 RNA and DNA from the genital tract and blood of each woman were compared in a cross-sectional study. Maximum likelihood phylogenies were evaluated for evidence of compartmentalization using four statistical tests. Genital tract and blood HIV-1 appears compartmentalized in 7/13 women by ≥2 statistical analyses. These subjects' phylograms were characterized by low diversity genital-specific viral clades interspersed between clades containing both genital and blood sequences. Many of the genital-specific clades contained monotypic HIV-1 sequences. In 2/7 women, HIV-1 populations were significantly compartmentalized across all four statistical tests; both had low diversity genital tract-only clades. Collapsing monotypic variants into a single sequence diminished the prevalence and extent of compartmentalization. Viral sequences did not demonstrate tissue-specific signature amino acid residues, differential immune selection, or co-receptor usage. In women with chronic HIV-1 infection, multiple identical sequences suggest proliferation of HIV-1-infected cells, and low diversity tissue-specific phylogenetic clades are consistent with bursts of viral replication. These monotypic and tissue-specific viruses provide statistical support for compartmentalization of HIV-1 between the female genital tract and blood. However, the intermingling of these clades with clades comprised of both genital and blood sequences and the absence of tissue-specific genetic features suggests compartmentalization between blood and genital tract may be due to viral replication and proliferation of infected cells, and questions whether HIV-1 in the female genital tract is distinct from blood.
Lipoxygenase in Caragana jubata responds to low temperature, abscisic acid, methyl jasmonate and salicylic acid.

PubMed

Bhardwaj, Pardeep Kumar; Kaur, Jagdeep; Sobti, Ranbir Chander; Ahuja, Paramvir Singh; Kumar, Sanjay

2011-09-01

Lipoxygenase (LOX) catalyses oxygenation of free polyunsaturated fatty acids into oxylipins, and is a critical enzyme of the jasmonate signaling pathway. LOX has been shown to be associated with biotic and abiotic stress responses in diverse plant species, though limited data are available with respect to low temperature and the associated cues. Using rapid amplification of cDNA ends, a full-length cDNA (CjLOX) encoding lipoxygenase was cloned from apical buds of Caragana jubata, a temperate plant species that grows under extreme cold. The cDNA obtained was 2952 bp long and contained an open reading frame of 2610 bp encoding a protein of 869 amino acids. Multiple alignment of the deduced amino acid sequence with those of other plants demonstrated a putative LH2/PLAT domain, a lipoxygenase iron-binding catalytic domain and lipoxygenase_2 signature sequences. CjLOX gene expression was up- and down-regulated in response to low temperature (LT), abscisic acid (ABA), methyl jasmonate (MJ) and salicylic acid (SA). Among all the treatments, a strong up-regulation was observed in response to MJ. The data suggest an important role of the jasmonate signaling pathway in the response to LT in C. jubata. Copyright © 2011 Elsevier B.V. All rights reserved.
Modulating Effects of Dicaffeoylquinic Acids from Ilex kudingcha on Intestinal Microecology in Vitro.

PubMed

Xie, Minhao; Chen, Guijie; Wan, Peng; Dai, Zhuqing; Hu, Bing; Chen, Ligen; Ou, Shiyi; Zeng, Xiaoxiong; Sun, Yi

2017-11-29

Dietary polyphenols have been considered as novel prebiotics, and polyphenols could exert their functions through modulating intestinal microbiota. The diverse bioactivities of kudingcha could derive from its phenolic compounds, but the effects of dicaffeoylquinic acids (diCQAs) from Ilex kudingcha on intestinal microbiota have not been investigated. In the present study, high-throughput sequencing and anaerobic fermentation in vitro were utilized to investigate the microecology-modulating function of I. kudingcha diCQAs. DiCQAs raised the microbial diversity and exhibited a more considerable impact than a carbon source on the microbial profile. DiCQAs increased the relative abundances of Alistipes, Bacteroides, Bifidobacterium, Butyricimonas, Clostridium sensu stricto, Escherichia/Shigella, Parasutterella, Romboutsia, Oscillibacter, Veillonella, Phascolarctobacterium, Lachnospiracea incertae sedis, Gemmiger, Streptococcus, and Haemophilus and decreased the relative abundances of Ruminococcus, Anaerostipes, Dialister, Megasphaera, Megamonas, and Prevotella. DiCQAs also affected the generation of short-chain fatty acids through microbiota. The contents of acetic and lactic acids were raised, while the production of propionic and butyric acids was reduced. In conclusion, diCQAs from I. kudingcha had significant modulating effects on intestinal microbiota in vitro, which might underlie the bioactivities of diCQAs.
Dolichol phosphate mannose synthase: a Glycosyltransferase with Unity in molecular diversities.

PubMed

Banerjee, Dipak K; Zhang, Zhenbo; Baksi, Krishna; Serrano-Negrón, Jesús E

2017-08-01

N-glycans provide structural and functional stability to asparagine-linked (N-linked) glycoproteins, and add flexibility. Glycan biosynthesis is elaborate, multi-compartmental and involves many glycosyltransferases. Failure to assemble N-glycans leads to phenotypic changes that manifest as infection, cancer, and congenital disorders of glycosylation (CDGs), among others. Biosynthesis of N-glycans begins at the endoplasmic reticulum (ER) with the assembly of dolichol-linked tetradecasaccharide (Glc3Man9GlcNAc2-PP-Dol), where dolichol phosphate mannose synthase (DPMS) plays a central role. DPMS is also essential for GPI anchor biosynthesis as well as for O- and C-mannosylation of proteins in yeast and in mammalian cells. DPMS has been purified from several sources and its gene has been cloned from 39 species (e.g., from protozoan parasite to human). It is an inverting GT-A folded enzyme and classified as GT2 by CAZy (carbohydrate active enZyme; http://www.cazy.org). The sequence alignment detects the presence of a metal-binding DAD signature in DPMS from all 39 species but finds a cAMP-dependent protein phosphorylation motif (PKA motif) in only 38 species. DPMS also has hydrophobic region(s). Hydropathy analysis of amino acid sequences from bovine, human, S. cerevisiae and A. thaliana DPMS shows that the PKA motif is present between the hydrophobic domains. The location of the PKA motif as well as the hydrophobic domain(s) in the DPMS sequence varies from species to species. For example, the domain(s) could be located at the center or more towards the C-terminus. Irrespective of their catalytic similarity, the DNA sequence, the amino acid identity, and the lack of a stretch of hydrophobic amino acid residues at the C-terminus, DPMS is still classified as Type I and Type II enzymes. Because of an apparent bio-sensing ability, extracellular signaling and microenvironment regulate DPMS catalytic activity. In this review, we highlight some important features and the molecular diversities of DPMS.
The Evolution of Vp1 Gene in Enterovirus C Species Sub-Group That Contains Types CVA-21, CVA-24, EV-C95, EV-C96 and EV-C99

PubMed Central

Smura, Teemu; Blomqvist, Soile; Vuorinen, Tytti; Ivanova, Olga; Samoilovich, Elena; Al-Hello, Haider; Savolainen-Kopra, Carita; Hovi, Tapani; Roivainen, Merja

2014-01-01

Genus Enterovirus (family Picornaviridae) consists of twelve species divided into genetically diverse types by their capsid protein VP1 coding sequences. Each enterovirus type can further be divided into intra-typic sub-clusters (genotypes). The aim of this study was to elucidate what leads to the emergence of novel enterovirus clades (types and genotypes). An evolutionary analysis was conducted for a sub-group of Enterovirus C species that contains types Coxsackievirus A21 (CVA-21), CVA-24, Enterovirus C95 (EV-C95), EV-C96 and EV-C99. VP1 gene datasets were collected and analysed to infer the phylogeny, rate of evolution, nucleotide and amino acid substitution patterns and signs of selection. In the VP1 coding gene, high intra-typic sequence diversities and robust grouping into distinct genotypes within each type were detected. Within each type the majority of nucleotide substitutions were synonymous and the non-synonymous substitutions tended to cluster in distinct highly polymorphic sites. Signs of positive selection were detected in some of these highly polymorphic sites, while strong negative selection was indicated in most of the codons. Despite robust clustering into intra-typic genotypes, only a few genotype-specific ‘signature’ amino acids were detected. In contrast, when different enterovirus types were compared, there was a clear tendency towards fixation of type-specific ‘signature’ amino acids. The results suggest that permanent fixation of type-specific amino acids is a hallmark associated with evolution of different enterovirus types, whereas neutral evolution and/or (frequency-dependent) positive selection in a few highly polymorphic amino acid sites are the dominant forms of evolution when strains within an enterovirus type are compared. PMID:24695547
Analysis of expressed sequence tags from Actinidia: applications of a cross species EST database for gene discovery in the areas of flavor, health, color and ripening

PubMed Central

Crowhurst, Ross N; Gleave, Andrew P; MacRae, Elspeth A; Ampomah-Dwamena, Charles; Atkinson, Ross G; Beuning, Lesley L; Bulley, Sean M; Chagne, David; Marsh, Ken B; Matich, Adam J; Montefiori, Mirco; Newcomb, Richard D; Schaffer, Robert J; Usadel, Björn; Allan, Andrew C; Boldingh, Helen L; Bowen, Judith H; Davy, Marcus W; Eckloff, Rheinhart; Ferguson, A Ross; Fraser, Lena G; Gera, Emma; Hellens, Roger P; Janssen, Bart J; Klages, Karin; Lo, Kim R; MacDiarmid, Robin M; Nain, Bhawana; McNeilage, Mark A; Rassam, Maysoon; Richardson, Annette C; Rikkerink, Erik HA; Ross, Gavin S; Schröder, Roswitha; Snowden, Kimberley C; Souleyre, Edwige JF; Templeton, Matt D; Walton, Eric F; Wang, Daisy; Wang, Mindy Y; Wang, Yanming Y; Wood, Marion; Wu, Rongmei; Yauk, Yar-Khing; Laing, William A

2008-01-01

Background: Kiwifruit (Actinidia spp.) are a relatively new, but economically important crop grown in many different parts of the world. Commercial success is driven by the development of new cultivars with novel consumer traits including flavor, appearance, healthful components and convenience. To increase our understanding of the genetic diversity and gene-based control of these key traits in Actinidia, we have produced a collection of 132,577 expressed sequence tags (ESTs). Results: The ESTs were derived mainly from four Actinidia species (A. chinensis, A. deliciosa, A. arguta and A. eriantha) and fell into 41,858 non-redundant clusters (18,070 tentative consensus sequences and 23,788 EST singletons). Analysis of flavor and fragrance-related gene families (acyltransferases and carboxylesterases) and pathways (terpenoid biosynthesis) is presented in comparison with a chemical analysis of the compounds present in Actinidia including esters, acids, alcohols and terpenes. ESTs are identified for most genes in color pathways controlling chlorophyll degradation and carotenoid biosynthesis. In the health area, data are presented on the ESTs involved in ascorbic acid and quinic acid biosynthesis showing not only that genes for many of the steps in these pathways are represented in the database, but that genes encoding some critical steps are absent. In the convenience area, genes related to different stages of fruit softening are identified. Conclusion: This large EST resource will allow researchers to undertake the tremendous challenge of understanding the molecular basis of genetic diversity in the Actinidia genus as well as provide an EST resource for comparative fruit genomics. The various bioinformatics analyses we have undertaken demonstrate the extent of coverage of ESTs for genes encoding different biochemical pathways in Actinidia. PMID:18655731
Prevalence, antimicrobial resistance and genetic diversity of Campylobacter coli and Campylobacter jejuni in Ecuadorian broilers at slaughter age

PubMed Central

Vinueza-Burgos, Christian; Wautier, Magali; Martiny, Delphine; Cisneros, Marco; Van Damme, Inge; De Zutter, Lieven

2017-01-01

Thermotolerant Campylobacter spp. are a major cause of foodborne gastrointestinal infections worldwide. The linkage of human campylobacteriosis and poultry has been widely described. In this study we aimed to investigate the prevalence, antimicrobial resistance and genetic diversity of C. coli and C. jejuni in broilers from Ecuador. Caecal content from 379 randomly selected broiler batches originating from 115 farms was collected from 6 slaughterhouses located in the province of Pichincha during 1 year. Microbiological isolation was performed by direct plating on mCCDA agar. Identification of Campylobacter species was done by PCR. Minimum inhibitory concentration (MIC) values for gentamicin, ciprofloxacin, nalidixic acid, tetracycline, streptomycin, and erythromycin were obtained. Genetic variation was assessed by RFLP-flaA typing and Multilocus Sequence Typing (MLST) of selected isolates. Prevalence at batch level was 64.1%. Of the positive batches 68.7% were positive for C. coli, 18.9% for C. jejuni, and 12.4% for C. coli and C. jejuni. Resistance rates above 67% were shown for tetracycline, ciprofloxacin, and nalidixic acid. The resistance pattern tetracycline, ciprofloxacin, and nalidixic acid was the dominant one in both Campylobacter species. RFLP-flaA typing analysis showed that C. coli and C. jejuni strains belonged to 38 and 26 profiles respectively. On the other hand, MLST typing revealed that C. coli except one strain belonged to CC-828, while C. jejuni except 2 strains belonged to 12 assigned clonal complexes (CCs). Furthermore, 4 new sequence types (STs) for both species were described, whereby 2 new STs for C. coli were based on new allele sequences. Further research is necessary to estimate the impact of the slaughter of Campylobacter positive broiler batches on the contamination level of carcasses in slaughterhouses and at retail in Ecuador. PMID:28339716
Comparative genomics of the lactic acid bacteria

DOE Office of Scientific and Technical Information (OSTI.GOV)

Makarova, K.; Slesarev, A.; Wolf, Y.

Lactic acid-producing bacteria are associated with various plant and animal niches and play a key role in the production of fermented foods and beverages. We report nine genome sequences representing the phylogenetic and functional diversity of these bacteria. The small genomes of lactic acid bacteria encode a broad repertoire of transporters for efficient carbon and nitrogen acquisition from the nutritionally rich environments they inhabit and reflect a limited range of biosynthetic capabilities that indicate both prototrophic and auxotrophic strains. Phylogenetic analyses, comparison of gene content across the group, and reconstruction of ancestral gene sets indicate a combination of extensive gene loss and key gene acquisitions via horizontal gene transfer during the coevolution of lactic acid bacteria with their habitats.
Genetic diversity of three surface protein genes in Plasmodium malariae from three Asian countries.

PubMed

Srisutham, Suttipat; Saralamba, Naowarat; Sriprawat, Kanlaya; Mayxay, Mayfong; Smithuis, Frank; Nosten, Francois; Pukrittayakamee, Sasithon; Day, Nicholas P J; Dondorp, Arjen M; Imwong, Mallika

2018-01-11

Genetic diversity of the three important antigenic proteins, namely thrombospondin-related anonymous protein (TRAP), apical membrane antigen 1 (AMA1), and 6-cysteine protein (P48/45), all of which are found in various developmental stages of Plasmodium parasites, is crucial for targeted vaccine development. While studies related to the genetic diversity of these proteins are available for Plasmodium falciparum and Plasmodium vivax, little information exists regarding Plasmodium malariae. The present study aims to demonstrate the genetic variations existing among these three genes in P. malariae by analysing their diversity at nucleotide and protein levels. Three surface protein genes were isolated from 45 samples collected in Thailand (N = 33), Myanmar (N = 8), and Lao PDR (N = 4), using conventional polymerase chain reaction (PCR) assay. Then, the PCR products were sequenced and analysed using BioEdit, MEGA6, and DnaSP programs. The average pairwise nucleotide diversities (π) of P. malariae trap, ama1, and p48/45 were 0.00169, 0.00413, and 0.00029, respectively. The haplotype diversities (Hd) of P. malariae trap, ama1, and p48/45 were 0.919, 0.946, and 0.130, respectively. Most of the nucleotide substitutions were non-synonymous, which indicated that the genetic variations of these genes were maintained by positive diversifying selection, thus suggesting their role as a potential target of protective immune response. Amino acid substitutions of P. malariae TRAP, AMA1, and P48/45 could be categorized into 17, 20, and 2 unique amino-acid variants, respectively. For further vaccine development, the carboxyl terminus of P48/45 would be a good candidate according to conserved amino acids at low genetic diversity (π = 0.2-0.3). High mutational diversity was observed in P. malariae trap and ama1 as compared to p48/45 in P. malariae samples isolated from Thailand, Myanmar, and Lao PDR. Taken together, these results suggest that P48/45 might be a good vaccine candidate against P. malariae infection because of its sufficiently low genetic diversity and highly conserved amino acids, especially at the carboxyl end.
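For readers unfamiliar with the two summary statistics reported in this abstract, the sketch below shows how nucleotide diversity (π, the average number of pairwise differences per site) and haplotype diversity (Hd, Nei's estimator n/(n−1)·(1 − Σp_i²)) are commonly computed from aligned sequences. The toy alignment is invented for illustration and is not data from the study, which used DnaSP; the function names are hypothetical.

```python
from collections import Counter
from itertools import combinations


def nucleotide_diversity(seqs):
    """Average pairwise differences per site across all pairs of aligned sequences."""
    length = len(seqs[0])
    if any(len(s) != length for s in seqs):
        raise ValueError("sequences must be aligned to the same length")
    pairs = list(combinations(seqs, 2))
    total_diffs = sum(
        sum(a != b for a, b in zip(s1, s2)) for s1, s2 in pairs
    )
    return total_diffs / len(pairs) / length


def haplotype_diversity(seqs):
    """Nei's haplotype diversity: n/(n-1) * (1 - sum of squared haplotype frequencies)."""
    n = len(seqs)
    freqs = [count / n for count in Counter(seqs).values()]
    return n / (n - 1) * (1 - sum(p * p for p in freqs))


# Toy alignment (illustrative only): four sequences, three distinct haplotypes.
toy = ["AAAA", "AAAT", "AATT", "AAAA"]
pi = nucleotide_diversity(toy)
hd = haplotype_diversity(toy)
print(f"pi = {pi:.4f}, Hd = {hd:.4f}")
```

A low Hd, such as the 0.130 reported for p48/45, means that most sampled sequences share the same haplotype, whereas values near 1 indicate that nearly every sample carries a distinct haplotype.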
In silico analysis of β-1,3-glucanase from a psychrophilic yeast, Glaciozyma antarctica PI12

NASA Astrophysics Data System (ADS)

Mohammadi, Salimeh; Bakar, Farah Diba Abu; Rabu, Amir; Murad, Abdul Munir Abdul

2014-09-01

1,3-beta-glucanase is an industrially important enzyme with a wide range of applications, especially in the food industry. It is crucial to gain an understanding of the structure and functional aspects of the various beta-1,3-glucanases produced from diverse sources. In this study, a cDNA encoding β-1,3-glucanase (GaExg55) was isolated from a psychrophilic yeast, Glaciozyma antarctica PI12. The cDNA sequence has been submitted to GenBank under accession number KJ436377. Subsequently, the predicted protein was analyzed using various bioinformatics tools to explore its properties. GaExg55 consists of 1,440 bp encoding 480 amino acid residues. Alignment of the deduced amino acid sequence of GaExg55 with other exo-β-1,3-glucanases available in the NCBI database indicates that the deduced amino acids share a consensus motif, NEP, which is the signature pattern of GH5 hydrolases. The predicted molecular weight of GaExg55 is 53.66 kDa. The GaExg55 sequence possesses a signal peptide sequence and is highly conserved with other fungal exo-beta-1,3-glucanases.
The Thiamin Pyrophosphate-Motif

NASA Technical Reports Server (NTRS)

Dominiak, Paulina M.; Ciszak, Ewa M.

2003-01-01

Using databases, the authors have identified a common thiamin pyrophosphate (TPP)-motif in the family of functionally diverse TPP-dependent enzymes. This common motif consists of multimeric organization of subunits, two catalytic centers, a common amino acid sequence, and specific contacts to provide a flip-flop, or alternate site, mechanism of action. Each catalytic center [PP:PYR] is formed at the interface of the PP-domain binding the magnesium ion, pyrophosphate and aminopyrimidine ring of TPP, and the PYR-domain binding the aminopyrimidine ring of that cofactor. A pair of these catalytic centers constitutes the catalytic core [PP:PYR]* within these enzymes. Analysis of the structural elements of this catalytic core reveals a novel definition of the common amino acid sequences, which are GX@&(G)@XXGQ, and GDGX25-30 within the PP-domain, and the E&(G)@XXG@ within the PYR-domain, where Q, corresponds to a hydrophobic amino acid. This TPP-motif provides a novel tool for annotation of TPP-dependent enzymes useful in advancing functional proteomics.
Genetic characterization and phylogenetic analysis of porcine circovirus type 2 (PCV2) in Serbia.

PubMed

Savic, Bozidar; Milicevic, Vesna; Jakic-Dimic, Dobrila; Bojkovski, Jovan; Prodanovic, Radisa; Kureljusic, Branislav; Potkonjak, Aleksandar; Savic, Borivoje

2012-01-01

Porcine circovirus type 2 (PCV2) is the main causative agent of postweaning multisystemic wasting syndrome (PMWS). To characterize and determine the genetic diversity of PCV2 in the porcine population of Serbia, nucleotide and deduced amino acid sequences of the open reading frame 2 (ORF2) of PCV2 collected from the tissues of pigs that either had died as a result of PMWS or did not exhibit disease symptoms were analyzed. Sequencing and phylogenetic analysis showed considerable diversity among PCV2 ORF2 sequences and the existence of two main PCV2 genotypes, PCV2b and PCV2a, with at least three clusters, 1A/B, 1C and 2D. In order to provide further proof that the 1C strain is circulating in the porcine population, the whole viral genome of one PCV2 isolate was sequenced. Genotyping and phylogenetic analysis using the entire viral genome sequences confirmed that there was a PMWS-associated 1C strain emerging in Serbia. Our analysis also showed that PCV2b is dominant in the porcine population, and that it is exclusively associated with PMWS occurrences in the country. These data constitute a useful basis for further epidemiological studies regarding the heterogeneity of PCV2 strains on the European continent.
A New Primer to Amplify pmoA Gene From NC10 Bacteria in the Sediments of Dongchang Lake and Dongping Lake.

PubMed

Wang, Shenghui; Liu, Yanjun; Liu, Guofu; Huang, Yaru; Zhou, Yu

2017-08-01

Nitrite-dependent anaerobic methane oxidation (n-damo) is catalyzed by the NC10 phylum bacterium "Candidatus Methylomirabilis oxyfera" (M. oxyfera). Generally, the pmoA gene is applied as a functional marker to test and identify NC10-like bacteria. However, it is difficult to detect the NC10 bacteria in sediments of freshwater lakes (Dongchang Lake and Dongping Lake) with the previous pmoA gene primer sets. In this work, a new primer, cmo208, was designed and used to amplify the pmoA gene of NC10-like bacteria. A new nested PCR approach was performed using the new primer cmo208 and the previous primers cmo182, cmo682, and cmo568 to detect the NC10 bacteria. The obtained pmoA gene sequences exhibited 85-92% nucleotide identity and 95-97% amino acid sequence identity to the pmoA gene of M. oxyfera. The obtained diversity of pmoA gene sequences coincided well with the diversity of 16S rRNA sequences. These results indicated that the newly designed pmoA primer cmo208 provides one more option for detecting NC10 bacteria in different environmental samples.
Analysis of heterogeneity of Copia-like retrotransposons in the genome of cassava (Manihot esculenta Crantz).

PubMed

Gbadegesin, Micheal A; Beeching, John R

2011-12-20

Retrotransposons are ubiquitous in eukaryotic genomes and are now proving to be useful genetic tools for genetic diversity and phylogenetic analyses, especially in plants. In order to assess the diversity of Ty1/Copia-like retrotransposons of cassava, we used PCR primers anchored on the conserved domains of reverse transcriptases (RTs) to amplify cassava Ty1/Copia-like RT. The PCR product was cloned and sequenced. Sequence analysis of the clones revealed the presence of 69 families of Ty1/Copia-like retrotransposons in the genome of cassava. Comparative analyses of the predicted amino acid sequences of these clones with those of other plants showed that retroelements of this class are very heterogeneous in cassava. Cassava is widely grown for its edible roots in the tropical and subtropical regions of the world. Cassava roots, though poor in protein, are rich in starch (which makes up about 80% of the dry matter), vitamin C, carotenes, calcium and potassium. It has great commercial importance as a source of starch and starch-based products. Given this importance, cassava stands out as a crop that could benefit from biotechnology development. The heterogeneity of Mecops (Manihot esculenta copia-like Retrotransposons) showed that they may be useful for genetic diversity and phylogenetic analyses of cassava germplasm.
Adaptive microclimatic evolution of the dehydrin 6 gene in wild barley at "Evolution Canyon", Israel.

PubMed

Yang, Zujun; Zhang, Tao; Li, Guangrong; Nevo, Eviatar

2011-12-01

Dehydrins are one of the major stress-induced gene families, and the expression of dehydrin 6 (Dhn6) is strictly related to drought in barley. In order to investigate how the evolution of the Dhn6 gene is associated with adaptation to environmental changes, we examined 48 genotypes of wild barley, Hordeum spontaneum, from "Evolution Canyon" at Mount Carmel, Israel. The Dhn6 sequences of the 48 genotypes were identified, and a recent insertion of 342 bp in the 5'UTR was found in the sequences of 11 genotypes. Both nucleotide and haplotype diversity of single nucleotide polymorphisms in Dhn6 coding regions were higher on the AS ("African" slope or dry slope) than on the ES ("European" slope or humid slope), and the applied Tajima's D and Fu-Li tests rejected neutrality of SNP diversity. Expression analysis indicated that the 342 bp insertion in the 5'UTR was associated with earlier up-regulation of Dhn6 after dehydration. The genetic divergence of amino acid sequences indicated significant positive selection of Dhn6 among the wild barley populations. The diversity of Dhn6 on the microclimatically divergent slopes suggested that Dhn6 has been subjected to natural selection and is adaptively associated with drought resistance of wild barley at "Evolution Canyon".
Bacterial and archaeal diversity in two hot spring microbial mats from the geothermal region of Tengchong, China.

PubMed

Pagaling, Eulyn; Grant, William D; Cowan, Don A; Jones, Brian E; Ma, Yanhe; Ventosa, Antonio; Heaphy, Shaun

2012-07-01

We investigated the bacterial and archaeal diversity in two hot spring microbial mats from the geothermal region of Tengchong in the Yunnan Province, China, using direct molecular analyses. The Langpu (LP) laminated mat was found by the side of a boiling pool with temperature of 60-65 °C and a pH of 8.5, while the Tengchong (TC) streamer mat consisted of white streamers in a slightly acidic (pH 6.5) hot pool outflow with a temperature of 72 °C. Four 16S rRNA gene clone libraries were constructed and restriction enzyme analysis of the inserts was used to identify unique sequences and clone frequencies. From almost 200 clones screened, 55 unique sequences were retrieved. Phylogenetic analysis showed that the LP mat consisted of a diverse bacterial population [Cyanobacteria, Chloroflexi, Chlorobia, Nitrospirae, 'Deinococcus-Thermus', Proteobacteria (alpha, beta and delta subdivisions), Firmicutes, Bacteroidetes and Actinobacteria], while the archaeal population was dominated by methanogenic Euryarchaeota and Crenarchaeota. In contrast, the TC streamer mat consisted of a bacterial population dominated by Aquificae, while the archaeal population also contained Korarchaeota as well as Crenarchaeota and methanogenic Euryarchaeota. These mats harboured clone sequences affiliated to unidentified lineages, suggesting that they are a potential source for discovering novel bacteria and archaea.
Nonsynonymous substitution rate heterogeneity in the peptide-binding region among different HLA-DRB1 lineages in humans.

PubMed

Yasukochi, Yoshiki; Satta, Yoko

2014-05-02

An extraordinary diversity of amino acid sequences in the peptide-binding region (PBR) of human leukocyte antigen [HLA; human major histocompatibility complex (MHC)] molecules has been maintained by balancing selection. The process of accumulation of amino acid diversity in the PBR for six HLA genes (HLA-A, B, C, DRB1, DQB1, and DPB1) shows that the number of amino acid substitutions in the PBR among alleles does not linearly correlate with the divergence time of alleles at the six HLA loci. At these loci, some pairs of alleles show significantly fewer nonsynonymous substitutions at the PBR than expected from the divergence time. The same phenomenon was observed not only in the HLA but also in the rat MHC. To identify the cause for this, DRB1 sequences, a representative case of a typical nonlinear pattern of substitutions, were examined. When the amino acid substitutions in the PBR were placed with maximum parsimony on a maximum likelihood tree based on the non-PBR substitutions, heterogeneous rates of nonsynonymous substitutions in the PBR were observed on several branches. A computer simulation supported the hypothesis that allelic pairs with low PBR substitution rates were responsible for the stagnation of accumulation of PBR nonsynonymous substitutions. From these observations, we conclude that the nonsynonymous substitution rate at the PBR sites is not constant among the allelic lineages. The deceleration of the rate may be caused by the coexistence of certain pathogens for a substantially long time during HLA evolution. Copyright © 2014 Yasukochi and Satta.

Diversity and Characterization of Sulfate-Reducing Bacteria in Groundwater at a Uranium Mill Tailings Site

PubMed Central

Chang, Yun-Juan; Peacock, Aaron D.; Long, Philip E.; Stephen, John R.; McKinley, James P.; Macnaughton, Sarah J.; Hussain, A. K. M. Anwar; Saxton, Arnold M.; White, David C.

2001-01-01

Microbially mediated reduction and immobilization of U(VI) to U(IV) plays a role in both natural attenuation and accelerated bioremediation of uranium-contaminated sites. To realize bioremediation potential and accurately predict natural attenuation, it is important to first understand the microbial diversity of such sites. In this paper, the distribution of sulfate-reducing bacteria (SRB) in contaminated groundwater associated with a uranium mill tailings disposal site at Shiprock, N.Mex., was investigated. Two culture-independent analyses were employed: sequencing of clone libraries of PCR-amplified dissimilatory sulfite reductase (DSR) gene fragments and phospholipid fatty acid (PLFA) biomarker analysis. A remarkable diversity among the DSR sequences was revealed, including sequences from δ-Proteobacteria, gram-positive organisms, and the Nitrospira division. PLFA analysis detected at least 52 different mid-chain-branched saturate PLFA and included a high proportion of 10me16:0. Desulfotomaculum and Desulfotomaculum-like sequences were the most dominant DSR genes detected. Those belonging to SRB within δ-Proteobacteria were mainly recovered from low-uranium (≤302 ppb) samples. One Desulfotomaculum-like sequence cluster overwhelmingly dominated high-U (>1,500 ppb) sites. Logistic regression showed a significant influence of uranium concentration over the dominance of this cluster of sequences (P = 0.0001). This strong association indicates that Desulfotomaculum has remarkable tolerance and adaptation to high levels of uranium and suggests the organism's possible involvement in natural attenuation of uranium. The in situ activity level of Desulfotomaculum in uranium-contaminated environments and its comparison to the activities of other SRB and other functional groups should be an important area for future research. PMID:11425735
Lack of Microbial Diversity in an Extreme Mars Analog Setting: Poás Volcano, Costa Rica.

PubMed

Hynek, Brian M; Rogers, Karyn L; Antunovich, Monique; Avard, Geoffroy; Alvarado, Guillermo E

2018-04-24

The Poás volcano in Costa Rica has been studied as a Mars geochemical analog environment, since both the style of hydrothermal alteration present and the alteration mineralogy are consistent with Mars' relict hydrothermal systems. The site hosts an active volcano, with high-temperature fumaroles (up to 980°C) and an ultra-acidic lake. This lake, Laguna Caliente, is one of the most dynamic environments on Earth, with frequent phreatic eruptions, temperatures ranging from near-ambient to almost boiling, a pH range of -1 to 1.5, and a wide range of chemistries and redox potential. Martian acid-sulfate hydrothermal systems were likely similarly dynamic and equally challenging to life. The microbiology existing within Laguna Caliente was characterized for the first time, with sampling taking place in November, 2013. The diversity of the microbial community was surveyed via extraction of environmental DNA from fluid and sediment samples followed by Illumina sequencing of the 16S rRNA gene. The microbial diversity was limited to a single species of the bacterial genus Acidiphilium. This organism likely gets its energy from oxidation of reduced sulfur in the lake, including elemental sulfur. Given Mars' propensity for sulfur and acid-sulfate environments, this type of organism is of significant interest to the search for past or present life on the Red Planet. Key Words: Mars astrobiology-Acid-sulfate hydrothermal systems-Extremophiles-Acidic-High temperature-Acidiphilium bacteria. Astrobiology 18, xxx-xxx.
Raw Sewage Harbors Diverse Viral Populations

PubMed Central

Cantalupo, Paul G.; Calgua, Byron; Zhao, Guoyan; Hundesa, Ayalkibet; Wier, Adam D.; Katz, Josh P.; Grabe, Michael; Hendrix, Roger W.; Girones, Rosina; Wang, David; Pipas, James M.

2011-01-01

Abstract: At this time, about 3,000 different viruses are recognized, but metagenomic studies suggest that these viruses are a small fraction of the viruses that exist in nature. We have explored viral diversity by deep sequencing nucleic acids obtained from virion populations enriched from raw sewage. We identified 234 known viruses, including 17 that infect humans. Plant, insect, and algal viruses as well as bacteriophages were also present. These viruses represented 26 taxonomic families and included viruses with single-stranded DNA (ssDNA), double-stranded DNA (dsDNA), positive-sense ssRNA [ssRNA(+)], and dsRNA genomes. Novel viruses that could be placed in specific taxa represented 51 different families, making untreated wastewater the most diverse viral metagenome (genetic material recovered directly from environmental samples) examined thus far. However, the vast majority of sequence reads bore little or no sequence relation to known viruses and thus could not be placed into specific taxa. These results show that the vast majority of the viruses on Earth have not yet been characterized. Untreated wastewater provides a rich matrix for identifying novel viruses and for studying virus diversity. Importance: At this time, virology is focused on the study of a relatively small number of viral species. Specific viruses are studied either because they are easily propagated in the laboratory or because they are associated with disease. The lack of knowledge of the size and characteristics of the viral universe and the diversity of viral genomes is a roadblock to understanding important issues, such as the origin of emerging pathogens and the extent of gene exchange among viruses. Untreated wastewater is an ideal system for assessing viral diversity because virion populations from large numbers of individuals are deposited and because raw sewage itself provides a rich environment for the growth of diverse host species and thus their viruses. These studies suggest that the viral universe is far more vast and diverse than previously suspected. PMID:21972239
A newly constructed primer pair for the PCR amplification, cloning and sequencing of the flagellin (flaA) gene from isolates of urease-negative Campylobacter lari.

PubMed

Sekizuka, Tsuyoshi; Yokoi, Taeko; Murayama, Ohoshi; Millar, B Cherie; Moore, Johne; Matsuda, Motoo

2005-08-01

A newly constructed primer pair (lari-Af/lari-Ar) designed to generate a product of the flagellin (flaA) gene for urease-negative Campylobacter lari produced a PCR amplicon of about 1700 bp for 16 isolates from 7 seagulls, 5 humans, 3 food animals and one mussel in Japan and Northern Ireland. Nucleotide sequencing and alignments of the flaA amplicons from these isolates demonstrated that the deduced amino acid sequences of the possible open reading frame were 564-572 amino acid residues in length with calculated molecular weights of 58,804 to 59,463. The deduced amino acid sequence similarity analysis strongly suggested that the ORF of the flaA from the 16 isolates showed 70-75% sequence similarities to those of Campylobacter jejuni isolates. The approximate Mr of the flagellin purified from some of the isolates of urease-negative C. lari was estimated to range from 59.6 to 61.8 kDa. Thus, flagellin from the isolates of urease-negative C. lari was shown for the first time to have a molecular size similar to those of C. jejuni and Campylobacter coli isolates, but to be different from the shorter flaA and smaller flagellin of urease-positive thermophilic Campylobacter (UPTC) isolates. Flagellins from C. lari spp., consisting of the two representative taxa of urease-negative C. lari and UPTC, thus show genotypic and phenotypic diversity.
Identification of Clinical Coryneform Bacterial Isolates: Comparison of Biochemical Methods and Sequence Analysis of 16S rRNA and rpoB Genes

PubMed Central

Adderson, Elisabeth E.; Boudreaux, Jan W.; Cummings, Jessica R.; Pounds, Stanley; Wilson, Deborah A.; Procop, Gary W.; Hayden, Randall T.

2008-01-01

We compared the relative levels of effectiveness of three commercial identification kits and three nucleic acid amplification tests for the identification of coryneform bacteria by testing 50 diverse isolates, including 12 well-characterized control strains and 38 organisms obtained from pediatric oncology patients at our institution. Between 33.3 and 75.0% of control strains were correctly identified to the species level by phenotypic systems or nucleic acid amplification assays. The most sensitive tests were the API Coryne system and amplification and sequencing of the 16S rRNA gene using primers optimized for coryneform bacteria, which correctly identified 9 of 12 control isolates to the species level, and all strains with a high-confidence call were correctly identified. Organisms not correctly identified were species not included in the test kit databases or not producing a pattern of reactions included in kit databases or which could not be differentiated among several genospecies based on reaction patterns. Nucleic acid amplification assays had limited abilities to identify some bacteria to the species level, and comparison of sequence homologies was complicated by the inclusion of allele sequences obtained from uncultivated and uncharacterized strains in databases. The utility of rpoB genotyping was limited by the small number of representative gene sequences that are currently available for comparison. The correlation between identifications produced by different classification systems was poor, particularly for clinical isolates. PMID:18160450
Fatty Acid Profile and Unigene-Derived Simple Sequence Repeat Markers in Tung Tree (Vernicia fordii)

PubMed Central

Zhang, Lin; Jia, Baoguang; Tan, Xiaofeng; Thammina, Chandra S.; Long, Hongxu; Liu, Min; Wen, Shanna; Song, Xianliang; Cao, Heping

2014-01-01

Tung tree (Vernicia fordii) provides the sole source of tung oil, which is widely used in industry. A lack of fatty acid composition data and molecular markers hinders biochemical, genetic and breeding research. The objectives of this study were to determine fatty acid profiles and develop unigene-derived simple sequence repeat (SSR) markers in tung tree. Fatty acid profiles of 41 accessions showed that the ratio of α-eleostearic acid increased continuously, in parallel with the amount of tung oil accumulated, while the ratios of other fatty acids decreased across the different stages of seed development, and that α-eleostearic acid (18:3) constituted 77% of the total fatty acids in tung oil. Transcriptome sequencing identified 81,805 unigenes from a tung cDNA library constructed using seed mRNA and discovered 6,366 SSRs in 5,404 unigenes. The di- and tri-nucleotide microsatellites accounted for 92% of the SSRs, with AG/CT and AAG/CTT being the most abundant SSR motifs. Fifteen polymorphic genic-SSR markers were developed from 98 unigene loci tested in 41 cultivated tung accessions by agarose gel and capillary electrophoresis. A GenBank database search identified 10 of them as putatively coding for functional proteins. Quantitative PCR demonstrated that all 15 polymorphic SSR-associated unigenes were expressed in tung seeds and that some of them were highly correlated with oil composition in the seeds. A dendrogram revealed that most of the 41 accessions clustered according to geographic region. These new polymorphic genic-SSR markers will facilitate future studies on genetic diversity, molecular fingerprinting, comparative genomics and genetic mapping in tung tree. The lipid profiles in the seeds of 41 tung accessions will be valuable for biochemical and breeding studies. PMID:25167054
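As a rough illustration of how di- and tri-nucleotide SSRs such as AG or AAG repeats can be located in a sequence, the sketch below uses a regular expression with a back-reference. The abstract does not state which tool or repeat thresholds were used, so the function name `find_ssrs` and the minimum repeat counts (6 for dinucleotides, 5 for trinucleotides) are assumptions chosen only for illustration.

```python
import re


def find_ssrs(seq, min_repeats=None):
    """Return sorted (start, motif, repeat_count) tuples for perfect SSRs.

    min_repeats maps motif length to the minimum number of tandem copies.
    The defaults are assumed thresholds, not values from the study.
    """
    if min_repeats is None:
        min_repeats = {2: 6, 3: 5}
    seq = seq.upper()
    hits = []
    for unit_len, min_rep in min_repeats.items():
        # A unit of unit_len bases followed by at least min_rep - 1 more copies.
        pattern = re.compile(r"([ACGT]{%d})\1{%d,}" % (unit_len, min_rep - 1))
        for match in pattern.finditer(seq):
            motif = match.group(1)
            if len(set(motif)) == 1:
                continue  # skip homopolymer runs such as AAAAAA
            hits.append((match.start(), motif, len(match.group(0)) // unit_len))
    return sorted(hits)


# Hypothetical sequence containing (AG)7 and (AAG)5.
example = "TTT" + "AG" * 7 + "CCGT" + "AAG" * 5 + "TC"
print(find_ssrs(example))
```

Only perfect repeats are reported; compound or interrupted microsatellites, and the reverse-complement grouping implied by notation such as AG/CT, would need additional handling.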
Regional variations in the diversity and predicted metabolic potential of benthic prokaryotes in coastal northern Zhejiang, East China Sea

PubMed Central

Wang, Kai; Ye, Xiansen; Zhang, Huajun; Chen, Heping; Zhang, Demin; Liu, Lian

2016-01-01

Knowledge about the drivers of benthic prokaryotic diversity and metabolic potential in interconnected coastal sediments at regional scales is limited. We collected surface sediments across six zones covering ~200 km in coastal northern Zhejiang, East China Sea and combined 16S rRNA gene sequencing, community-level metabolic prediction, and sediment physicochemical measurements to investigate variations in prokaryotic diversity and metabolic gene composition with geographic distance and under local environmental conditions. Geographic distance was the most influential factor in prokaryotic β-diversity compared with major environmental drivers, including temperature, sediment texture, acid-volatile sulfide, and water depth, but a large unexplained variation in community composition suggested the potential effects of unmeasured abiotic/biotic factors and stochastic processes. Moreover, prokaryotic assemblages showed a biogeographic provincialism across the zones. The predicted metabolic gene composition shifted in a similar way to the taxonomic composition. Acid-volatile sulfide was strongly correlated with variation in metabolic gene composition. Enrichments in the relative abundance of sulfate-reducing bacteria and of genes relevant to dissimilatory sulfate reduction were observed and predicted, respectively, in the Yushan area. These results provide insights into the relative importance of geographic distance and environmental condition in driving benthic prokaryotic diversity in coastal areas and predict specific biogeochemically relevant genes for future studies. PMID:27917954
Genetic diversity in the 3'-terminal region of papaya ringspot virus (PRSV-W) isolates from watermelon in Oklahoma.

PubMed

Abdalla, Osama A; Ali, Akhtar

2012-03-01

The 3'-terminal region (1191 nt), containing part of the NIb gene, the complete coat protein (CP) gene and the poly-A tail, of 64 papaya ringspot virus (PRSV-W) isolates collected during 2008-2009 from watermelon in commercial fields in four different counties of Oklahoma was cloned and sequenced. Nucleotide and amino acid sequence identities ranged from 95.2-100% and 97.1-100%, respectively, among the Oklahoman PRSV-W isolates. Phylogenetic analysis showed that PRSV-W isolates clustered according to the locations where they were collected within Oklahoma, and each cluster contained two subgroups. All subgroups of Oklahoman PRSV-W isolates were on separate branches when compared to 35 known isolates originating from other parts of the world, including the one reported previously from the USA. This study improves our understanding of the genetic diversity of PRSV-W isolates infecting cucurbits in Oklahoma.
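Identity ranges like those quoted above are typically obtained by comparing every pair of sequences in an alignment. The sketch below is one minimal way to do that, assuming pre-aligned sequences of equal length and skipping any column where either sequence has a gap. The function names and toy alignment are hypothetical, and other tools may define identity differently (for example, by counting gapped columns in the denominator).

```python
from itertools import combinations


def percent_identity(a, b, gap="-"):
    """Percent identical positions between two aligned sequences.

    Columns where either sequence has a gap are ignored (an assumed convention).
    """
    if len(a) != len(b):
        raise ValueError("sequences must be aligned to the same length")
    compared = matches = 0
    for x, y in zip(a, b):
        if x == gap or y == gap:
            continue
        compared += 1
        matches += x == y
    if compared == 0:
        raise ValueError("no comparable positions")
    return 100.0 * matches / compared


def identity_range(aligned):
    """Minimum and maximum pairwise percent identity across an alignment."""
    values = [percent_identity(a, b) for a, b in combinations(aligned, 2)]
    return min(values), max(values)


# Toy alignment (hypothetical data, not PRSV-W sequences).
aligned = ["ATGGCTAAAC", "ATGGCTAAAC", "ATGGCAAAAC", "ATG-CTAAAT"]
low, high = identity_range(aligned)
print(f"{low:.1f}-{high:.1f}%")
```

The same functions work for amino acid alignments, since the comparison is purely character-based.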
Allelic diversity of the MHC class II DRB genes in brown bears (Ursus arctos) and a comparison of DRB sequences within the family Ursidae.

PubMed

Goda, N; Mano, T; Kosintsev, P; Vorobiev, A; Masuda, R

2010-11-01

The allelic diversity of the DRB locus in major histocompatibility complex (MHC) genes was analyzed in the brown bear (Ursus arctos) from the Hokkaido Island of Japan, Siberia, and Kodiak of Alaska. Nineteen alleles of the DRB exon 2 were identified from a total of 38 individuals of U. arctos and were highly polymorphic. Comparisons of non-synonymous and synonymous substitutions in the antigen-binding sites of deduced amino acid sequences indicated evidence for balancing selection on the bear DRB locus. The phylogenetic analysis of the DRB alleles among three genera (Ursus, Tremarctos, and Ailuropoda) in the family Ursidae revealed that DRB allelic lineages were not separated according to species. This strongly shows trans-species persistence of DRB alleles within the Ursidae. © 2010 John Wiley & Sons A/S.
Mining for Nonribosomal Peptide Synthetase and Polyketide Synthase Genes Revealed a High Level of Diversity in the Sphagnum Bog Metagenome

PubMed Central

Müller, Christina A.; Oberauner-Wappis, Lisa; Peyman, Armin; Amos, Gregory C. A.; Wellington, Elizabeth M. H.

2015-01-01

Sphagnum bog ecosystems are among the oldest vegetation forms harboring a specific microbial community and are known to produce an exceptionally wide variety of bioactive substances. Although the Sphagnum metagenome shows a rich secondary metabolism, the genes have not yet been explored. To analyze nonribosomal peptide synthetases (NRPSs) and polyketide synthases (PKSs), the diversity of NRPS and PKS genes in Sphagnum-associated metagenomes was investigated by in silico data mining and sequence-based screening (PCR amplification of 9,500 fosmid clones). The in silico Illumina-based metagenomic approach resulted in the identification of 279 NRPSs and 346 PKSs, as well as 40 PKS-NRPS hybrid gene sequences. The occurrence of NRPS sequences was strongly dominated by the members of the Proteobacteria phylum, especially by species of the Burkholderia genus, while PKS sequences were mainly affiliated with Actinobacteria. Thirteen novel NRPS-related sequences were identified by PCR amplification screening, displaying amino acid identities of 48% to 91% to annotated sequences of members of the phyla Proteobacteria, Actinobacteria, and Cyanobacteria. Some of the identified metagenomic clones showed the closest similarity to peptide synthases from Burkholderia or Lysobacter, which are emerging bacterial sources of as-yet-undescribed bioactive metabolites. This report highlights the role of extreme natural ecosystems as a promising source for detection of secondary compounds and enzymes, serving as a source for biotechnological applications. PMID:26002894
Diversity and evolutionary patterns of immune genes in free-ranging Namibian leopards (Panthera pardus pardus).

PubMed

Castro-Prieto, Aines; Wachter, Bettina; Melzheimer, Joerg; Thalwitzer, Susanne; Sommer, Simone

2011-01-01

The genes of the major histocompatibility complex (MHC) are a key component of the mammalian immune system and have become important molecular markers for fitness-related genetic variation in wildlife populations. Currently, no information about the MHC sequence variation and constitution in African leopards exists. In this study, we isolated and characterized genetic variation at the adaptively most important region of MHC class I and MHC class II-DRB genes in 25 free-ranging African leopards from Namibia and investigated the mechanisms that generate and maintain MHC polymorphism in the species. Using single-stranded conformation polymorphism analysis and direct sequencing, we detected 6 MHC class I and 6 MHC class II-DRB sequences, which likely correspond to at least 3 MHC class I and 3 MHC class II-DRB loci. Amino acid sequence variation in both MHC classes was higher or similar in comparison to other reported felids. We found signatures of positive selection shaping the diversity of MHC class I and MHC class II-DRB loci during the evolutionary history of the species. A comparison of MHC class I and MHC class II-DRB sequences of the leopard to those of other felids revealed a trans-species mode of evolution. In addition, the evolutionary relationships of MHC class II-DRB sequences between African and Asian leopard subspecies are discussed.
Phylogenetic Diversity of Koala Retrovirus within a Wild Koala Population.

PubMed

Chappell, K J; Brealey, J C; Amarilla, A A; Watterson, D; Hulse, L; Palmieri, C; Johnston, S D; Holmes, E C; Meers, J; Young, P R

2017-02-01

Koala populations are in serious decline across many areas of mainland Australia, with infectious disease a contributing factor. Koala retrovirus (KoRV) is a gammaretrovirus present in most wild koala populations and captive colonies. Five subtypes of KoRV (A to E) have been identified based on amino acid sequence divergence in a hypervariable region of the receptor binding domain of the envelope protein. However, analysis of viral genetic diversity has been conducted primarily on KoRV in captive koalas housed in zoos in Japan, the United States, and Germany. Wild koalas within Australia have not been comparably assessed. Here we report a detailed analysis of KoRV genetic diversity in samples collected from 18 wild koalas from southeast Queensland. By employing deep sequencing we identified 108 novel KoRV envelope sequences and determined their phylogenetic diversity. Genetic diversity in KoRV was abundant and fell into three major groups; two comprised the previously identified subtypes A and B, while the third contained the remaining hypervariable region subtypes (C, D, and E) as well as four hypervariable region subtypes that we newly define here (F, G, H, and I). In addition to the ubiquitous presence of KoRV-A, which may represent an exclusively endogenous variant, subtypes B, D, and F were found to be at high prevalence, while subtypes G, H, and I were present in a smaller number of animals. Koala retrovirus (KoRV) is thought to be a significant contributor to koala disease and population decline across mainland Australia. This study is the first to determine KoRV subtype prevalence among a wild koala population, and it significantly expands the total number of KoRV sequences available, providing a more precise picture of genetic diversity. This understanding of KoRV subtype prevalence and genetic diversity will be important for conservation efforts attempting to limit the spread of KoRV. 
Furthermore, KoRV is one of the only retroviruses shown to exist in both endogenous (transmitted vertically to offspring in the germ line DNA) and exogenous (horizontally transmitted between infected individuals) forms, a division of fundamental evolutionary importance. Copyright © 2017 American Society for Microbiology.
Genome-wide diversity and selective pressure in the human rhinovirus

PubMed Central

Kistler, Amy L; Webster, Dale R; Rouskin, Silvi; Magrini, Vince; Credle, Joel J; Schnurr, David P; Boushey, Homer A; Mardis, Elaine R; Li, Hao; DeRisi, Joseph L

2007-01-01

Background The human rhinoviruses (HRV) are one of the most common and diverse respiratory pathogens of humans. Over 100 distinct HRV serotypes are known, yet only 6 genomes are available. Due to the paucity of HRV genome sequence, little is known about the genetic diversity within HRV or the forces driving this diversity. Previous comparative genome sequence analyses indicate that recombination drives diversification in multiple genera of the picornavirus family, yet it remains unclear if this holds for HRV. Results To resolve this and gain insight into the forces driving diversification in HRV, we generated a representative set of 34 fully sequenced HRVs. Analysis of these genomes shows consistent phylogenies across the genome, conserved non-coding elements, and only limited recombination. However, spikes of genetic diversity at both the nucleotide and amino acid level are detectable within every locus of the genome. Despite this, the HRV genome as a whole is under purifying selective pressure, with islands of diversifying pressure in the VP1, VP2, and VP3 structural genes and two non-structural genes, the 3C protease and 3D polymerase. Mapping diversifying residues in these factors onto available 3-dimensional structures revealed the diversifying capsid residues partition to the external surface of the viral particle in statistically significant proximity to antigenic sites. Diversifying pressure in the pleconaril binding site is confined to a single residue known to confer drug resistance (VP1 191). In contrast, diversifying pressure in the non-structural genes is less clear, mapping both nearby and beyond characterized functional domains of these factors. Conclusion This work provides a foundation for understanding HRV genetic diversity and insight into the underlying biology driving evolution in HRV. 
It expands our knowledge of the genome sequence space that HRV reference serotypes occupy and how the pattern of genetic diversity across HRV genomes differs from other picornaviruses. It also reveals evidence of diversifying selective pressure in both structural genes known to interact with the host immune system and in domains of unassigned function in the non-structural 3C and 3D genes, raising the possibility that diversification of undiscovered functions in these essential factors may influence HRV fitness and evolution. PMID:17477878
Aminoacyl-tRNA synthetases database Y2K

PubMed Central

Szymanski, Maciej; Barciszewski, Jan

2000-01-01

The aminoacyl-tRNA synthetases (AARS) are a diverse group of enzymes that ensure the fidelity of transfer of genetic information from DNA into protein. They catalyse the attachment of amino acids to transfer RNAs and thereby establish the rules of the genetic code by virtue of matching the nucleotide triplet of the anticodon with its cognate amino acid. Currently, 818 AARS primary structures have been reported from archaebacteria, eubacteria, mitochondria, chloroplasts and eukaryotic cells. The database is a compilation of the amino acid sequences of all AARSs, known to date, which are available as separate entries or alignments of related proteins via the WWW at http://rose.man.poznan.pl/aars/index.html PMID:10592262
Aminoacyl-tRNA synthetases database Y2K.

PubMed

Szymanski, M; Barciszewski, J

2000-01-01

The aminoacyl-tRNA synthetases (AARS) are a diverse group of enzymes that ensure the fidelity of transfer of genetic information from DNA into protein. They catalyse the attachment of amino acids to transfer RNAs and thereby establish the rules of the genetic code by virtue of matching the nucleotide triplet of the anticodon with its cognate amino acid. Currently, 818 AARS primary structures have been reported from archaebacteria, eubacteria, mitochondria, chloroplasts and eukaryotic cells. The database is a compilation of the amino acid sequences of all AARSs, known to date, which are available as separate entries or alignments of related proteins via the WWW at http://rose.man.poznan.pl/aars/index.html
Diverse Bacterial PKS Sequences Derived From Okadaic Acid-Producing Dinoflagellates

PubMed Central

Perez, Roberto; Liu, Li; Lopez, Jose; An, Tianying; Rein, Kathleen S.

2008-01-01

Okadaic acid (OA) and the related dinophysistoxins are isolated from dinoflagellates of the genera Prorocentrum and Dinophysis. Bacteria of the Roseobacter group have been associated with okadaic acid-producing dinoflagellates and have been previously implicated in OA production. Analysis of 16S rRNA libraries reveals that Roseobacter are the most abundant bacteria associated with OA-producing dinoflagellates of the genus Prorocentrum and are not found in association with non-toxic dinoflagellates. While some polyketide synthase (PKS) genes form a highly supported Prorocentrum clade, most appear to be bacterial, but unrelated to Roseobacter or Alpha-Proteobacterial PKSs or those derived from other Alveolates Karenia brevis or Cryptosporidium parvum. PMID:18728765
TALEN-mediated targeted mutagenesis of fatty acid desaturase 2 (FAD2) in peanut (Arachis hypogaea L.) promotes the accumulation of oleic acid.

PubMed

Wen, Shijie; Liu, Hao; Li, Xingyu; Chen, Xiaoping; Hong, Yanbin; Li, Haifen; Lu, Qing; Liang, Xuanqiang

2018-05-01

This study reports the first creation of high oleic acid peanut varieties using transcription activator-like effector nuclease (TALEN)-mediated targeted mutagenesis of Fatty Acid Desaturase 2 (FAD2). Transcription activator-like effector nucleases (TALENs), which allow the precise editing of DNA, have already been developed and applied for genome engineering in diverse organisms. However, they are scarcely used in higher plant studies and crop improvement, especially in allopolyploid plants. In the present study, we aimed to create targeted mutagenesis by TALENs in peanut. Targeted mutations in the conserved coding sequence of Arachis hypogaea fatty acid desaturase 2 (AhFAD2) were created by TALENs. Genetically stable AhFAD2 mutations were identified by DNA sequencing in up to 9.52% and 4.11% of the regenerated plants at the two targeted sites, respectively. Mutation frequencies among AhFAD2 mutant lines were significantly correlated with oleic acid accumulation. Genetically stable individuals of positive mutant lines displayed a 0.5-2 fold increase in oleic acid content compared with non-transgenic controls. This finding suggests that TALEN-mediated targeted mutagenesis can increase the oleic acid content of edible peanut oil. Furthermore, this is the first report of a peanut genome editing event, and the high oleic mutants obtained could serve peanut breeding projects.
NASBA: A detection and amplification system uniquely suited for RNA

DOE Office of Scientific and Technical Information (OSTI.GOV)

Sooknanan, R.; Malek, L.T.

1995-06-01

The invention of PCR (polymerase chain reaction) has revolutionized our ability to amplify and manipulate a nucleic acid sequence in vitro. The commercial rewards of this revolution have driven the development of other nucleic acid amplification and detection methodologies. This has created an alphabet soup of technologies that use different amplification methods, including NASBA (nucleic acid sequence-based amplification), LCR (ligase chain reaction), SDA (strand displacement amplification), QBR (Q-beta replicase), CPR (cycling probe reaction), and bDNA (branched DNA). Despite the differences in their processes, these amplification systems can be separated into two broad categories based on how they achieve their goal. Sequence-based amplification systems, such as PCR, NASBA, and SDA, amplify a target nucleic acid sequence, whereas signal-based amplification systems, such as LCR, QBR, CPR and bDNA, amplify or alter a signal from a detection reaction that is target-dependent. While the various methods have relative strengths and weaknesses, only NASBA offers the unique ability to homogeneously amplify an RNA analyte in the presence of homologous genomic DNA under isothermal conditions. Since the detection of RNA sequences almost invariably measures biological activity, it is an excellent prognostic indicator of activities as diverse as virus production, gene expression, and cell viability. The isothermal nature of the reaction makes NASBA especially suitable for large-scale manual screening. These features extend NASBA's application range from research to commercial diagnostic applications. Field test kits are presently under development for human diagnostics as well as the burgeoning fields of food and environmental diagnostic testing. These developments suggest future integration of NASBA into robotic workstations for high-throughput screening as well. 17 refs., 1 tab.
Characterization of gonadotrophin-releasing hormone precursor cDNA in the Old World mole-rat Cryptomys hottentotus pretoriae: high degree of identity with the New World guinea pig sequence.

PubMed

Kalamatianos, T; du Toit, L; Hrabovszky, E; Kalló, I; Marsh, P J; Bennett, N C; Coen, C W

2005-05-01

Regulation of pituitary gonadotrophins by the decapeptide gonadotrophin-releasing hormone 1 (GnRH1) is crucial for the development and maintenance of reproductive functions. A common amino acid sequence for this decapeptide, designated as 'mammalian' GnRH, has been identified in all mammals thus far investigated with the exception of the guinea pig, in which there are two amino acid substitutions. Among hystricognath rodents, the members of the family Bathyergidae regulate reproduction in response to diverse cues. Thus, highveld mole-rats (Cryptomys hottentotus pretoriae) are social bathyergids in which breeding is restricted to a particular season in the dominant female, but continuously suppressed in subordinate colony members. Elucidation of reproductive control in these animals will be facilitated by characterization of their GnRH1 gene. A partial sequence of GnRH1 precursor cDNA was isolated and characterized. Comparative analysis revealed the highest degree of identity (86%) to guinea pig GnRH1 precursor mRNA. Nevertheless, the deduced amino acid sequence of the mole-rat decapeptide is identical to the 'mammalian' sequence rather than that of guinea pigs. Successful detection of GnRH1-synthesizing neurones using either a guinea pig GnRH1 riboprobe or an antibody against the 'mammalian' decapeptide is consistent with the guinea pig-like sequence for the precursor and the classic 'mammalian' form for the decapeptide. The high degree of identity in the GnRH1 precursor sequence between this Old World mole-rat and the New World guinea pig is consistent with the theory that caviomorphs and phiomorphs originated from a common ancestral line in the Palaeocene to mid Eocene, some 63-45 million years ago.
Conifer R2R3-MYB transcription factors: sequence analyses and gene expression in wood-forming tissues of white spruce (Picea glauca)

PubMed Central

Bedon, Frank; Grima-Pettenati, Jacqueline; Mackay, John

2007-01-01

Background Several members of the R2R3-MYB family of transcription factors act as regulators of lignin and phenylpropanoid metabolism during wood formation in angiosperm and gymnosperm plants. The angiosperm Arabidopsis has over one hundred R2R3-MYB genes; however, only a few members of this family have been discovered in gymnosperms. Results We isolated and characterised full-length cDNAs encoding R2R3-MYB genes from the gymnosperms white spruce, Picea glauca (13 sequences), and loblolly pine, Pinus taeda L. (five sequences). Sequence similarities and phylogenetic analyses placed the spruce and pine sequences in diverse subgroups of the large R2R3-MYB family, although several of the sequences clustered closely together. We searched the highly variable C-terminal region of diverse plant MYBs for conserved amino acid sequences and identified 20 motifs in the spruce MYBs, nine of which have not previously been reported and three of which are specific to conifers. The number and length of the introns in spruce MYB genes varied significantly, but their positions were well conserved relative to angiosperm MYB genes. Quantitative RT-PCR of MYB gene transcript abundance in root and stem tissues revealed diverse expression patterns; three MYB genes were preferentially expressed in secondary xylem, whereas others were preferentially expressed in phloem or were ubiquitous. The MYB genes expressed in xylem, and three others, were up-regulated in the compression wood of leaning trees within 76 hours of induction. Conclusion Our survey of 18 conifer R2R3-MYB genes clearly showed a gene family structure similar to that of Arabidopsis. Three of the sequences are likely to play a role in lignin metabolism and/or wood formation in gymnosperm trees, including a close homolog of the loblolly pine PtMYB4, shown to regulate lignin biosynthesis in transgenic tobacco. PMID:17397551

Different Lactobacillus populations dominate in "Chorizo de León" manufacturing performed in different production plants.

PubMed

Quijada, Narciso M; De Filippis, Francesca; Sanz, José Javier; García-Fernández, María Del Camino; Rodríguez-Lázaro, David; Ercolini, Danilo; Hernández, Marta

2018-04-01

"Chorizo de León" is a high-value Spanish dry fermented sausage traditionally manufactured without the use of starter cultures, owing to the activity of a house-specific autochthonous microbiota that naturally contaminates the meat from the environment, the equipment and the raw materials. Lactic acid bacteria (particularly Lactobacillus) and coagulase-negative cocci (mainly Staphylococcus) have been reported as the most important bacterial groups regarding the organoleptic and safety properties of the dry fermented sausages. In this study, samples from raw minced meat to final products were taken from five different producers and the microbial diversity was investigated by high-throughput sequencing of 16S rRNA gene amplicons. The diverse microbial composition observed during the first stages of "Chorizo de León" evolved during ripening to a microbiota mainly composed of Lactobacillus in the final product. Oligotyping performed on 16S rRNA gene sequences of Lactobacillus and Staphylococcus populations revealed sub-genus level diversity within the different manufacturers, likely responsible for the characteristic organoleptic properties of the products from different companies. Copyright © 2017 Elsevier Ltd. All rights reserved.
Draft Genome Sequence of the Nicotinate-Metabolizing Soil Bacterium Bacillus niacini DSM 2923.

PubMed

Harvey, Zachary H; Snider, Mark J

2014-12-04

Bacillus niacini is a member of a small yet diverse group of bacteria able to catabolize nicotinic acid. We report here the availability of a draft genome for B. niacini, which we will use to understand the evolution of its namesake phenotype, which appears to be unique among the species in its phylogenetic neighborhood. Copyright © 2014 Harvey and Snider.
NEP: web server for epitope prediction based on antibody neutralization of viral strains with diverse sequences.

PubMed

Chuang, Gwo-Yu; Liou, David; Kwong, Peter D; Georgiev, Ivelin S

2014-07-01

Delineation of the antigenic site, or epitope, recognized by an antibody can provide clues about functional vulnerabilities and resistance mechanisms, and can therefore guide antibody optimization and epitope-based vaccine design. Previously, we developed an algorithm for antibody-epitope prediction based on antibody neutralization of viral strains with diverse sequences and validated the algorithm on a set of broadly neutralizing HIV-1 antibodies. Here we describe the implementation of this algorithm, NEP (Neutralization-based Epitope Prediction), as a web-based server. The users must supply as input: (i) an alignment of antigen sequences of diverse viral strains; (ii) neutralization data for the antibody of interest against the same set of antigen sequences; and (iii) (optional) a structure of the unbound antigen, for enhanced prediction accuracy. The prediction results can be downloaded or viewed interactively on the antigen structure (if supplied) from the web browser using a JSmol applet. Since neutralization experiments are typically performed as one of the first steps in the characterization of an antibody to determine its breadth and potency, the NEP server can be used to predict antibody-epitope information at no additional experimental costs. NEP can be accessed on the internet at http://exon.niaid.nih.gov/nep. Published by Oxford University Press on behalf of Nucleic Acids Research 2014. This work is written by (a) US Government employee(s) and is in the public domain in the US.
High-throughput sequence-based analysis of the bacterial composition of kefir and an associated kefir grain.

PubMed

Dobson, Alleson; O'Sullivan, Orla; Cotter, Paul D; Ross, Paul; Hill, Colin

2011-07-01

Lacticin 3147 is a two-peptide broad spectrum lantibiotic produced by Lactococcus lactis DPC3147 shown to inhibit a number of clinically relevant Gram-positive pathogens. Initially isolated from an Irish kefir grain, lacticin 3147 is one of the most extensively studied lantibiotics to date. In this study, the bacterial diversity of the Irish kefir grain from which L. lactis DPC3147 was originally isolated was for the first time investigated using a high-throughput parallel sequencing strategy. A total of 17 416 unique V4 variable regions of the 16S rRNA gene were analysed from both the kefir starter grain and its derivative kefir-fermented milk. Firmicutes (which includes the lactic acid bacteria) was the dominant phylum accounting for > 92% of sequences. Within the Firmicutes, dramatic differences in abundance were observed when the starter grain and kefir milk fermentate were compared. The kefir grain-associated bacterial community was largely composed of the Lactobacillaceae family while Streptococcaceae (primarily Lactococcus spp.) was the dominant family within the kefir milk fermentate. Sequencing data confirmed previous findings that the microbiota of kefir milk and the starter grain are quite different while at the same time, establishing that the microbial diversity of the starter grain is not uniform with a greater level of diversity associated with the interior kefir starter grain compared with the exterior. © 2011 Teagasc Food Research Centre, Moorepark. FEMS Microbiology Letters © 2011 Federation of European Microbiological Societies. Published by Blackwell Publishing Ltd.
Deep sequencing in library selection projects: what insight does it bring?

PubMed

Glanville, J; D'Angelo, S; Khan, T A; Reddy, S T; Naranjo, L; Ferrara, F; Bradbury, A R M

2015-08-01

High throughput sequencing is poised to change all aspects of the way antibodies and other binders are discovered and engineered. Millions of available sequence reads provide an unprecedented sampling depth able to guide the design and construction of effective, high quality naïve libraries containing tens of billions of unique molecules. Furthermore, during selections, high throughput sequencing enables quantitative tracing of enriched clones and position-specific guidance to amino acid variation under positive selection during antibody engineering. Successful application of the technologies relies on specific PCR reagent design, correct sequencing platform selection, and effective use of computational tools and statistical measures to remove error, identify antibodies, estimate diversity, and extract signatures of selection from the clone down to individual structural positions. Here we review these considerations and discuss some of the remaining challenges to the widespread adoption of the technology. Copyright © 2015 Elsevier Ltd. All rights reserved.
Deep sequencing in library selection projects: what insight does it bring?

PubMed Central

Glanville, J; D’Angelo, S; Khan, T.A.; Reddy, S. T.; Naranjo, L.; Ferrara, F.; Bradbury, A.R.M.

2015-01-01

High throughput sequencing is poised to change all aspects of the way antibodies and other binders are discovered and engineered. Millions of available sequence reads provide an unprecedented sampling depth able to guide the design and construction of effective, high quality naïve libraries containing tens of billions of unique molecules. Furthermore, during selections, high throughput sequencing enables quantitative tracing of enriched clones and position-specific guidance to amino acid variation under positive selection during antibody engineering. Successful application of the technologies relies on specific PCR reagent design, correct sequencing platform selection, and effective use of computational tools and statistical measures to remove error, identify antibodies, estimate diversity, and extract signatures of selection from the clone down to individual structural positions. Here we review these considerations and discuss some of the remaining challenges to the widespread adoption of the technology. PMID:26451649
Sequence of a second gene encoding bovine submaxillary mucin: implication for mucin heterogeneity and cloning.

PubMed

Jiang, W; Woitach, J T; Gupta, D; Bhavanandan, V P

1998-10-20

Secreted epithelial mucins are extremely large and heterogeneous glycoproteins. We report the 5 kilobase DNA sequence of a second gene, BSM2, which encodes bovine submaxillary mucin. The determined nucleotide and deduced amino acid sequences of BSM2 are 95.2% and 92.2% identical, respectively, to those of the previously described BSM1 gene isolated from the same cow. Further, the five predicted protein domains of the two genes are 100%, 94%, 93%, 77%, and 88% identical. Based on the above results, we propose that expression of multiple homologous core proteins from a single animal is a factor in generating diversity of saccharides in mucins and in providing resistance of the molecules to proteolysis. In addition, this work raises several important issues in mucin cloning such as assembling sequences from seemingly overlapping clones and deducing consensus sequences for nearly identical tandem repeats. Copyright 1998 Academic Press.
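The percent-identity figures quoted above come from comparing aligned sequences position by position. The abstract does not say which alignment tool or identity convention was used, so the following is only a minimal sketch under one common convention (matches divided by the number of columns where neither sequence has a gap). The function name `percent_identity` and the toy sequences are hypothetical and are not taken from the BSM1/BSM2 data.

```python
def percent_identity(aligned_a: str, aligned_b: str, gap: str = "-") -> float:
    """Percent identity over alignment columns where neither sequence has a gap.

    Assumes the two strings are already aligned (equal length, gaps as `gap`).
    Other conventions divide by the full alignment length or the shorter
    sequence length; this sketch uses ungapped columns only.
    """
    if len(aligned_a) != len(aligned_b):
        raise ValueError("aligned sequences must have the same length")

    compared = 0  # columns without a gap in either sequence
    matches = 0   # compared columns with identical residues
    for res_a, res_b in zip(aligned_a, aligned_b):
        if res_a == gap or res_b == gap:
            continue
        compared += 1
        if res_a == res_b:
            matches += 1

    if compared == 0:
        raise ValueError("no ungapped columns to compare")
    return 100.0 * matches / compared


# Toy alignment (hypothetical): 5 ungapped columns, 4 of them identical.
print(percent_identity("ACGT-A", "ACCTTA"))
```

Because the choice of denominator changes the result, identity values from different studies are only comparable when the same convention is used.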
Histidine-lysine peptides as carriers of nucleic acids.

PubMed

Leng, Qixin; Goldgeier, Lisa; Zhu, Jingsong; Cambell, Patricia; Ambulos, Nicholas; Mixson, A James

2007-03-01

With their biodegradability and diversity of permutations, peptides have significant potential as carriers of nucleic acids. This review will focus on the sequence and branching patterns of peptide carriers composed primarily of histidines and lysines. While lysines within peptides are important for binding to the negatively charged phosphates, histidines are critical for endosomal lysis enabling nucleic acids to reach the cytosol. Histidine-lysine (HK) polymers by either covalent or ionic bonds with liposomes augment transfection compared to liposome carriers alone. More recently, we have examined peptides as sole carriers of nucleic acids because of their intrinsic advantages compared to the bipartite HK/liposome carriers. With a protocol change and addition of a histidine-rich tail, HK peptides as sole carriers were more effective than liposomes alone in several cell lines. While four-branched polymers with a primary repeating sequence pattern of -HHK- were more effective as carriers of plasmids, eight-branched polymers with a sequence pattern of -HHHK- were more effective as carriers of siRNA. Compared to polyethylenimine, HK carriers of siRNA and plasmids had reduced toxicity. When injected intravenously, HK polymers in complex with plasmids encoding antiangiogenic proteins significantly decreased tumor growth. Furthermore, modification of HK polymers with polyethylene glycol and vascular-specific ligands increased specificity of the polyplex to the tumor by more than 40-fold. Together with further development and insight on the structure of HK polyplexes, HK peptides may prove to be useful as carriers of different forms of nucleic acids both in vitro and in vivo.
Correlation between fibroin amino acid sequence and physical silk properties.

PubMed

Fedic, Robert; Zurovec, Michal; Sehnal, Frantisek

2003-09-12

The fiber properties of lepidopteran silk depend on the amino acid repeats that interact during H-fibroin polymerization. The aim of our research was to relate repeat composition to insect biology and fiber strength. Representative regions of the H-fibroin genes were sequenced and analyzed in three pyralid species: wax moth (Galleria mellonella), European flour moth (Ephestia kuehniella), and Indian meal moth (Plodia interpunctella). The amino acid repeats are species-specific, evidently a diversification of an ancestral region of 43 residues, and include three types of regularly dispersed motifs: modifications of GSSAASAA sequence, stretches of tripeptides GXZ where X and Z represent bulky residues, and sequences similar to PVIVIEE. No concatenations of GX dipeptide or alanine, which are typical for Bombyx silkworms and Antheraea silk moths, respectively, were found. Despite different repeat structure, the silks of G. mellonella and E. kuehniella exhibit similar tensile strength as the Bombyx and Antheraea silks. We suggest that in these latter two species, variations in the repeat length obstruct repeat alignment, but sufficiently long stretches of iterated residues get superposed to interact. In the pyralid H-fibroins, interactions of the widely separated and diverse motifs depend on the precision of repeat matching; silk is strong in G. mellonella and E. kuehniella, with 2-3 types of long homogeneous repeats, and nearly 10 times weaker in P. interpunctella, with seven types of shorter erratic repeats. The high proportion of large amino acids in the H-fibroin of pyralids has probably evolved in connection with the spinning habit of caterpillars that live in protective silk tubes and spin continuously, enlarging the tubes on one end and partly devouring the other one. The silk serves as a depot of energetically rich and essential amino acids that may be scarce in the diet.
In search of actionable targets for agrigenomics and microalgal biofuel production: sequence-structural diversity studies on algal and higher plants with a focus on GPAT protein.

PubMed

Misra, Namrata; Panda, Prasanna Kumar

2013-04-01

The triacylglycerol (TAG) pathway provides several targets for genetic engineering to optimize microalgal lipid productivity. GPAT (glycerol-3-phosphate acyltransferase) is a crucial enzyme that catalyzes the initial step of TAG biosynthesis. Despite many recent biochemical studies, a comprehensive sequence-structure analysis of GPAT across diverse lipid-yielding organisms is lacking. Hence, we performed a comparative genomic analysis of plastid-located GPAT proteins from 7 microalgae and 3 higher plant species. The close evolutionary relationships observed between red algae/diatoms and green algae/plant lineages in the phylogenetic tree were further corroborated by motif and gene structure analysis. The predicted molecular weight, amino acid composition, Instability Index, and hydropathicity profile gave an overall representation of the biochemical features of GPAT protein across the species under study. Furthermore, homology models of GPAT from Chlamydomonas reinhardtii, Arabidopsis thaliana, and Glycine max provided deep insights into the protein architecture and substrate binding sites. Despite the low sequence identity found between algal and plant GPATs, the developed models exhibited a strikingly conserved topology consisting of 14 α-helices and 9 β-sheets arranged in two domains. However, subtle variations in amino acids of the fatty acyl binding site were identified that might influence the substrate selectivity of GPAT. Together, the results will provide useful resources to understand the functional and evolutionary relationship of GPAT and potentially benefit the development of engineered enzymes for augmenting algal biofuel production.
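One of the sequence-derived descriptors mentioned above, hydropathicity, is often summarized as a grand average of hydropathy (GRAVY): the mean of a per-residue hydropathy value over the whole protein. The abstract does not state which scale or software the authors used, so the sketch below assumes the widely used Kyte-Doolittle scale. The function name `gravy` and the example peptides are hypothetical illustrations, not GPAT data.

```python
# Kyte-Doolittle hydropathy values for the 20 standard amino acids
# (assumed scale; the abstract does not name the one actually used).
KYTE_DOOLITTLE = {
    "A": 1.8, "R": -4.5, "N": -3.5, "D": -3.5, "C": 2.5,
    "Q": -3.5, "E": -3.5, "G": -0.4, "H": -3.2, "I": 4.5,
    "L": 3.8, "K": -3.9, "M": 1.9, "F": 2.8, "P": -1.6,
    "S": -0.8, "T": -0.7, "W": -0.9, "Y": -1.3, "V": 4.2,
}


def gravy(sequence: str) -> float:
    """Grand average of hydropathy: mean Kyte-Doolittle value per residue."""
    seq = sequence.upper()
    if not seq:
        raise ValueError("sequence is empty")
    unknown = set(seq).difference(KYTE_DOOLITTLE)
    if unknown:
        raise ValueError(f"non-standard residues: {sorted(unknown)}")
    return sum(KYTE_DOOLITTLE[residue] for residue in seq) / len(seq)


# Hypothetical peptides: positive values indicate a more hydrophobic
# sequence, negative values a more hydrophilic one.
print(round(gravy("IVLLA"), 2))
print(round(gravy("KDEKR"), 2))
```

A single average hides local structure, which is presumably why the study reports a hydropathicity profile rather than one number. A sliding-window mean over the same scale would be the natural extension.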
Termite hindguts and the ecology of microbial communities in the sequencing age.

PubMed

Tai, Vera; Keeling, Patrick J

2013-01-01

Advances in high-throughput nucleic acid sequencing have improved our understanding of microbial communities in a number of ways. Deeper sequence coverage provides the means to assess diversity at the resolution necessary to recover ecological and biogeographic patterns, and at the same time single-cell genomics provides detailed information about the interactions between members of a microbial community. Given the vastness and complexity of microbial ecosystems, such analyses remain challenging for most environments, so greater insight can also be drawn from analysing less dynamic ecosystems. Here, we outline the advantages of one such environment, the wood-digesting hindgut communities of termites and cockroaches, and how it is a model to examine and compare both protist and bacterial communities. Beyond the analysis of diversity, our understanding of protist community ecology will depend on using statistically sound sampling regimes at biologically relevant scales, transitioning from discovery-based to experimental ecology, incorporating single-cell microbiology and other data sources, and continued development of analytical tools. © 2013 The Author(s) Journal of Eukaryotic Microbiology © 2013 International Society of Protistologists.
RosettaAntibodyDesign (RAbD): A general framework for computational antibody design

PubMed Central

Adolf-Bryfogle, Jared; Kalyuzhniy, Oleks; Kubitz, Michael; Hu, Xiaozhen; Adachi, Yumiko; Schief, William R.

2018-01-01

A structural-bioinformatics-based computational methodology and framework have been developed for the design of antibodies to targets of interest. RosettaAntibodyDesign (RAbD) samples the diverse sequence, structure, and binding space of an antibody to an antigen in highly customizable protocols for the design of antibodies in a broad range of applications. The program samples antibody sequences and structures by grafting structures from a widely accepted set of the canonical clusters of CDRs (North et al., J. Mol. Biol., 406:228–256, 2011). It then performs sequence design according to amino acid sequence profiles of each cluster, and samples CDR backbones using a flexible-backbone design protocol incorporating cluster-based CDR constraints. Starting from an existing experimental or computationally modeled antigen-antibody structure, RAbD can be used to redesign a single CDR or multiple CDRs with loops of different length, conformation, and sequence. We rigorously benchmarked RAbD on a set of 60 diverse antibody–antigen complexes, using two design strategies—optimizing total Rosetta energy and optimizing interface energy alone. We utilized two novel metrics for measuring success in computational protein design. The design risk ratio (DRR) is equal to the frequency of recovery of native CDR lengths and clusters divided by the frequency of sampling of those features during the Monte Carlo design procedure. Ratios greater than 1.0 indicate that the design process is picking out the native more frequently than expected from their sampled rate. We achieved DRRs for the non-H3 CDRs of between 2.4 and 4.0. The antigen risk ratio (ARR) is the ratio of frequencies of the native amino acid types, CDR lengths, and clusters in the output decoys for simulations performed in the presence and absence of the antigen. For CDRs, we achieved cluster ARRs as high as 2.5 for L1 and 1.5 for H2. 
For sequence design simulations without CDR grafting, the overall recovery for the native amino acid types for residues that contact the antigen in the native structures was 72% in simulations performed in the presence of the antigen and 48% in simulations performed without the antigen, for an ARR of 1.5. For the non-contacting residues, the ARR was 1.08. This shows that the sequence profiles are able to maintain the amino acid types of these conserved, buried sites, while recovery of the exposed, contacting residues requires the presence of the antigen-antibody interface. We tested RAbD experimentally on both a lambda and kappa antibody–antigen complex, successfully improving their affinities 10 to 50 fold by replacing individual CDRs of the native antibody with new CDR lengths and clusters. PMID:29702641
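Both metrics defined in this abstract are simple ratios of frequencies, and the reported antigen risk ratio for contacting residues can be reproduced from the two recovery rates given (72% with antigen, 48% without). The sketch below is illustrative only: the function names are hypothetical and are not part of the Rosetta API, and the abstract does not give the recovery rates behind the 1.08 figure, so only the 1.5 case is computed.

```python
def design_risk_ratio(recovered_freq: float, sampled_freq: float) -> float:
    """DRR: frequency with which a native feature (CDR length or cluster) is
    recovered in the designs, divided by the frequency with which that
    feature was sampled during the Monte Carlo procedure."""
    if sampled_freq <= 0:
        raise ValueError("sampled frequency must be positive")
    return recovered_freq / sampled_freq


def antigen_risk_ratio(freq_with_antigen: float, freq_without_antigen: float) -> float:
    """ARR: frequency of a native feature in decoys designed with the antigen
    present, divided by the frequency when the antigen is absent."""
    if freq_without_antigen <= 0:
        raise ValueError("frequency without antigen must be positive")
    return freq_with_antigen / freq_without_antigen


# Values reported in the abstract for antigen-contacting residues:
# 72% native recovery with the antigen, 48% without it.
arr_contacting = antigen_risk_ratio(0.72, 0.48)
print(f"ARR for contacting residues: {arr_contacting:.2f}")
```

A ratio above 1.0 means the feature is recovered more often than the baseline predicts, which is how the abstract interprets the reported DRRs of 2.4 to 4.0.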
RosettaAntibodyDesign (RAbD): A general framework for computational antibody design.

PubMed

Adolf-Bryfogle, Jared; Kalyuzhniy, Oleks; Kubitz, Michael; Weitzner, Brian D; Hu, Xiaozhen; Adachi, Yumiko; Schief, William R; Dunbrack, Roland L

2018-04-01

A structural-bioinformatics-based computational methodology and framework have been developed for the design of antibodies to targets of interest. RosettaAntibodyDesign (RAbD) samples the diverse sequence, structure, and binding space of an antibody to an antigen in highly customizable protocols for the design of antibodies in a broad range of applications. The program samples antibody sequences and structures by grafting structures from a widely accepted set of the canonical clusters of CDRs (North et al., J. Mol. Biol., 406:228-256, 2011). It then performs sequence design according to amino acid sequence profiles of each cluster, and samples CDR backbones using a flexible-backbone design protocol incorporating cluster-based CDR constraints. Starting from an existing experimental or computationally modeled antigen-antibody structure, RAbD can be used to redesign a single CDR or multiple CDRs with loops of different length, conformation, and sequence. We rigorously benchmarked RAbD on a set of 60 diverse antibody-antigen complexes, using two design strategies: optimizing total Rosetta energy and optimizing interface energy alone. We utilized two novel metrics for measuring success in computational protein design. The design risk ratio (DRR) is equal to the frequency of recovery of native CDR lengths and clusters divided by the frequency of sampling of those features during the Monte Carlo design procedure. Ratios greater than 1.0 indicate that the design process is picking out the native more frequently than expected from their sampled rate. We achieved DRRs for the non-H3 CDRs of between 2.4 and 4.0. The antigen risk ratio (ARR) is the ratio of frequencies of the native amino acid types, CDR lengths, and clusters in the output decoys for simulations performed in the presence and absence of the antigen. For CDRs, we achieved cluster ARRs as high as 2.5 for L1 and 1.5 for H2.
For sequence design simulations without CDR grafting, the overall recovery for the native amino acid types for residues that contact the antigen in the native structures was 72% in simulations performed in the presence of the antigen and 48% in simulations performed without the antigen, for an ARR of 1.5. For the non-contacting residues, the ARR was 1.08. This shows that the sequence profiles are able to maintain the amino acid types of these conserved, buried sites, while recovery of the exposed, contacting residues requires the presence of the antigen-antibody interface. We tested RAbD experimentally on both a lambda and kappa antibody-antigen complex, successfully improving their affinities 10 to 50 fold by replacing individual CDRs of the native antibody with new CDR lengths and clusters.
Sperm Bindin Divergence under Sexual Selection and Concerted Evolution in Sea Stars.

PubMed

Patiño, Susana; Keever, Carson C; Sunday, Jennifer M; Popovic, Iva; Byrne, Maria; Hart, Michael W

2016-08-01

Selection associated with competition among males or sexual conflict between mates can create positive selection for high rates of molecular evolution of gamete recognition genes and lead to reproductive isolation between species. We analyzed coding sequence and repetitive domain variation in the gene encoding the sperm acrosomal protein bindin in 13 diverse sea star species. We found that bindin has a conserved coding sequence domain structure in all 13 species, with several repeated motifs in a large central region that is similar among all sea stars in organization but highly divergent among genera in nucleotide and predicted amino acid sequence. More bindin codons and lineages showed positive selection for high relative rates of amino acid substitution in genera with gonochoric outcrossing adults (and greater expected strength of sexual selection) than in selfing hermaphrodites. That difference is consistent with the expectation that selfing (a highly derived mating system) may moderate the strength of sexual selection and limit the accumulation of bindin amino acid differences. The results implicate both positive selection on single codons and concerted evolution within the repetitive region in bindin divergence, and suggest that both single amino acid differences and repeat differences may affect sperm-egg binding and reproductive compatibility. © The Author 2016. Published by Oxford University Press on behalf of the Society for Molecular Biology and Evolution. All rights reserved. For permissions, please e-mail: journals.permissions@oup.com.
Reading biological processes from nucleotide sequences

NASA Astrophysics Data System (ADS)

Murugan, Anand

Cellular processes have traditionally been investigated by techniques of imaging and biochemical analysis of the molecules involved. The recent rapid progress in our ability to manipulate and read nucleic acid sequences gives us direct access to the genetic information that directs and constrains biological processes. While sequence data is being used widely to investigate genotype-phenotype relationships and population structure, here we use sequencing to understand biophysical mechanisms. We present work on two different systems. First, in chapter 2, we characterize the stochastic genetic editing mechanism that produces diverse T-cell receptors in the human immune system. We do this by inferring statistical distributions of the underlying biochemical events that generate T-cell receptor coding sequences from the statistics of the observed sequences. This inferred model quantitatively describes the potential repertoire of T-cell receptors that can be produced by an individual, providing insight into its potential diversity and the probability of generation of any specific T-cell receptor. Then in chapter 3, we present work on understanding the functioning of regulatory DNA sequences in both prokaryotes and eukaryotes. Here we use experiments that measure the transcriptional activity of large libraries of mutagenized promoters and enhancers and infer models of the sequence-function relationship from this data. For the bacterial promoter, we infer a physically motivated 'thermodynamic' model of the interaction of DNA-binding proteins and RNA polymerase determining the transcription rate of the downstream gene. For the eukaryotic enhancers, we infer heuristic models of the sequence-function relationship and use these models to find synthetic enhancer sequences that optimize inducibility of expression. 
Both projects demonstrate the utility of sequence information in conjunction with sophisticated statistical inference techniques for dissecting underlying biophysical mechanisms.
Biodiversity of yeasts, lactic acid bacteria and acetic acid bacteria in the fermentation of "Shanxi aged vinegar", a traditional Chinese vinegar.

PubMed

Wu, Jia Jia; Ma, Ying Kun; Zhang, Fen Fen; Chen, Fu Sheng

2012-05-01

Shanxi aged vinegar is a famous traditional Chinese vinegar made from several kinds of cereal by spontaneous solid-state fermentation techniques. In order to get a comprehensive understanding of the diversity of culturable microorganisms present in its fermentation, the indigenous microorganisms including 47 yeast isolates, 28 lactic acid bacteria isolates and 58 acetic acid bacteria isolates were recovered at different fermentation times and characterized based on a combination of phenotypic and genotypic approaches including inter-delta/PCR, PCR-RFLP, ERIC/PCR analysis, as well as 16S rRNA and 26S rRNA partial gene sequencing. In the alcoholic fermentation, the dominant yeast species Saccharomyces (S.) cerevisiae (96%) exhibited low phenotypic and genotypic diversity among the isolates, while Lactobacillus (Lb.) fermentum together with Lb. plantarum, Lb. buchneri, Lb. casei, Pediococcus (P.) acidilactici, P. pentosaceus and Weissella confusa predominated in the bacterial population at the same stage. Acetobacter (A.) pasteurianus, showing great variety in both genotypic and phenotypic tests, was the dominant species (76%) in the acetic acid fermentation stage, while the other acetic acid bacteria species including A. senegalensis, A. indonesiensis, A. malorum and A. orientalis, as well as Gluconobacter (G.) oxydans, were detected at the initial points of the alcoholic and acetic acid fermentation stages, respectively. Copyright © 2011 Elsevier Ltd. All rights reserved.
Phylogenetic relationships and taxonomic position of Chlorella-like isolates from low pH environments (pH < 3.0)

PubMed Central

Huss, Volker AR; Ciniglia, Claudia; Cennamo, Paola; Cozzolino, Salvatore; Pinto, Gabriele; Pollio, Antonino

2002-01-01

Background: Little is known about phytoplankton communities inhabiting low pH environments such as volcanic and geothermal sites or acidic waters. Only specialised organisms are able to tolerate such extreme conditions. There is, thus, low species diversity. We have characterised the previously isolated acid tolerant Chlorella-like microalgae Viridiella fridericiana and Chlorella protothecoides var. acidicola by microscopical and biomolecular methods in order to assess their phylogenetic relationships. Results: Both isolates belong to the trebouxiophycean lineage of chlorophytes. 18S and ITS1 sequence data clearly confirm that Viridiella fridericiana constitutes a new genus apart from the morphologically similar and likewise acid tolerant microalga Chlorella saccharophila. Chlorella protothecoides var. acidicola on the other hand is not a variety of Chlorella protothecoides but falls within a heterogeneous cluster consisting of Nannochloris, "Chlorella" spec. Yanaqocha, and Koliella, and is most closely related to algae which were also isolated from extreme environments. Conclusions: The distribution of acid tolerant strains in the 18S rRNA tree shows that acquisition of acid tolerance was unlikely a monophyletic event in green microalgae. We propose that different strains have independently adapted to extreme environments. Some of them have spread worldwide and were able to colonise other extreme habitats. Considering the problems of successfully isolating acid tolerant strains, acidic soils could represent an unsuspected source of biological diversity with high potential for biotechnological utilisations. PMID:12194702
DNA Translator and Aligner: HyperCard utilities to aid phylogenetic analysis of molecules.

PubMed

Eernisse, D J

1992-04-01

DNA Translator and Aligner are molecular phylogenetics HyperCard stacks for Macintosh computers. They manipulate sequence data to provide graphical gene mapping, conversions, translations and manual multiple-sequence alignment editing. DNA Translator is able to convert documented GenBank or EMBL sequences into linearized, rescalable gene maps whose gene sequences are extractable by clicking on the corresponding map button or by selection from a scrolling list. Provided gene maps, complete with extractable sequences, consist of nine metazoan, one yeast, and one ciliate mitochondrial DNAs and three green plant chloroplast DNAs. Single or multiple sequences can be manipulated to aid in phylogenetic analysis. Sequences can be translated between nucleic acids and proteins in either direction with flexible support of alternate genetic codes and ambiguous nucleotide symbols. Multiple aligned sequence output from diverse sources can be converted to Nexus, Hennig86 or PHYLIP format for subsequent phylogenetic analysis. Input or output alignments can be examined with Aligner, a convenient accessory stack included in the DNA Translator package. Aligner is an editor for the manual alignment of up to 100 sequences that toggles between display of matched characters and normal unmatched sequences. DNA Translator also generates graphic displays of amino acid coding and codon usage frequency relative to all other, or only synonymous, codons for approximately 70 select organism-organelle combinations. Codon usage data is compatible with spreadsheet or UWGCG formats for incorporation of additional molecules of interest. The complete package is available via anonymous ftp and is free for non-commercial uses.
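Translation under alternate genetic codes, as described in this abstract, can be illustrated with a short Python sketch. This is not the HyperCard implementation: the names translate, STANDARD_CODE and VERT_MITO_CODE are hypothetical, and codons containing ambiguous symbols are simply reported as X rather than resolved.

```python
from itertools import product

BASES = "TCAG"
# Standard genetic code, amino acids listed in TCAG codon order
# (TTT, TTC, TTA, TTG, TCT, ...); '*' marks a stop codon.
AMINO_ACIDS = ("FFLLSSSSYY**CC*W" "LLLLPPPPHHQQRRRR"
               "IIIMTTTTNNKKSSRR" "VVVVAAAADDEEGGGG")
STANDARD_CODE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

# The vertebrate mitochondrial code differs from the standard code
# at four codons.
VERT_MITO_CODE = {**STANDARD_CODE,
                  "TGA": "W", "ATA": "M", "AGA": "*", "AGG": "*"}


def translate(dna: str, code: dict = STANDARD_CODE) -> str:
    """Translate a coding sequence in frame 1; incomplete trailing
    codons are dropped and unrecognized codons become 'X'."""
    dna = dna.upper().replace("U", "T")
    usable = len(dna) - len(dna) % 3
    codons = (dna[i:i + 3] for i in range(0, usable, 3))
    return "".join(code.get(codon, "X") for codon in codons)


print(translate("ATGGCCTGA"))                  # standard code
print(translate("ATGGCCTGA", VERT_MITO_CODE))  # mitochondrial code
```

Passing the code table as a parameter keeps one translation routine for every organelle; only the dictionary changes.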
Insect symbionts as valuable grist for the biotechnological mill: an alkaliphilic silkworm gut bacterium for efficient lactic acid production.

PubMed

Liang, Xili; Sun, Chao; Chen, Bosheng; Du, Kaiqian; Yu, Ting; Luang-In, Vijitra; Lu, Xingmeng; Shao, Yongqi

2018-06-01

Insects constitute the most abundant and diverse animal class and act as hosts to an extraordinary variety of symbiotic microorganisms. These microbes living inside the insects play critical roles in host biology and are also valuable bioresources. Enterococcus mundtii EMB156, isolated from the larval gut (gut pH >10) of the model organism Bombyx mori (Lepidoptera: Bombycidae), efficiently produces lactic acid, an important metabolite for industrial production of bioplastic materials. E. mundtii EMB156 grows well under alkaline conditions and stably converts various carbon sources into lactic acid, offering advantages in downstream fermentative processes. High-yield lactic acid production can be achieved by the strain EMB156 from renewable biomass substrates under alkaline pretreatments. Single-molecule real-time (SMRT) sequencing technology revealed its 3.01 Mbp whole genome sequence. A total of 2956 protein-coding sequences, 65 tRNA genes, and 6 rRNA operons were predicted in the EMB156 chromosome. Remarkable genomic features responsible for lactic acid fermentation included key enzymes involved in the pentose phosphate (PP)/glycolytic pathway, and an alpha amylase and xylose isomerase were characterized in EMB156. This genomic information coincides with the phenotype of E. mundtii EMB156, reflecting its metabolic flexibility in efficient lactate fermentation, and established a foundation for future biotechnological application. Interestingly, enzyme activities of amylase were quite stable in high-pH broths, indicating a possible mechanism for strong EMB156 growth in an alkaline environment, thereby facilitating lactic acid production. Together, these findings implied that valuable lactic acid-producing bacteria can be discovered efficiently by screening under the extremely alkaline conditions, as exemplified by gut microbial symbionts of Lepidoptera insects.
FIST: a sensory domain for diverse signal transduction pathways in prokaryotes and ubiquitin signaling in eukaryotes

DOE Office of Scientific and Technical Information (OSTI.GOV)

Borziak, Kirill; Jouline, Igor B

2007-01-01

Motivation: Sensory domains that are conserved among Bacteria, Archaea and Eucarya are important detectors of common signals detected by living cells. Due to their high sequence divergence, sensory domains are difficult to identify. We systematically look for novel sensory domains using sensitive profile-based searches initiated with regions of signal transduction proteins where no known domains can be identified by current domain models. Results: Using profile searches followed by multiple sequence alignment, structure prediction, and domain architecture analysis, we have identified a novel sensory domain termed FIST, which is present in signal transduction proteins from Bacteria, Archaea and Eucarya. Remote similarity to a known ligand-binding fold and chromosomal proximity of FIST-encoding genes to those coding for proteins involved in amino acid metabolism and transport suggest that FIST domains bind small ligands, such as amino acids.

Diversity and evolution of Lactobacillus casei group isolated from fermented dairy products in Tibet.

PubMed

Feng, Jing; Jiang, Yujun; Li, Mingyu; Zhao, Siyu; Zhang, Yanming; Li, Xuesong; Wang, Hui; Lin, Guangen; Wang, Hao; Li, Tiejing; Man, Chaoxin

2018-05-25

Bacteria in Lactobacillus casei group, including Lactobacillus casei (L. casei), Lactobacillus paracasei (L. paracasei), and Lactobacillus rhamnosus (L. rhamnosus) are important lactic acid bacteria in the production of fermented dairy products and are faced with the controversial nomenclatural status due to their close phylogenetic similarity. To probe the evolution and phylogeny of L. casei group, 100 isolates of lactic acid bacteria originated from naturally fermented dairy products in Tibet of China were subjected to multilocus sequence typing (MLST). The MLST scheme, based on analysis of the housekeeping genes fusA, ileS, lepA, leuS, pyrG, recA and recG, revealed that all the isolates belonged to a group containing the L. paracasei reference strains and were clearly different from the strains of L. casei and L. rhamnosus. Although nucleotide diversity (π) was low for the seven genes (ranging from 0.00341 for fusA to 0.01307 for recG), high genetic diversity represented by 83 sequence types (STs) with a discriminatory index of 0.98 was detected. A network-like structure based on split decomposition analysis, and the high values of the relative effect of recombination and mutation in the diversification of the lineages (r/m = 4.76) and the relative frequency of occurrence of recombination and mutation (ρ/θ = 2.62) indicated that intra-species recombination occurred frequently and homologous recombination played a key role in generating genotypic diversity amongst L. paracasei strains in Tibet. The discovery of 51 new STs and the results of STRUCTURE analysis suggested that the L. casei group in Tibet had an individual and particular population structure in comparison to European isolates. Overall, this research might be the first report about genetic diversity and population structure of Lactobacillus populations isolated from naturally fermented dairy products in Tibet based on MLST scheme.
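The abstract does not state how the discriminatory index was computed. A common choice in typing studies is the Hunter-Gaston index, D = 1 - [sum of n_j(n_j - 1)] / [N(N - 1)], where n_j is the number of isolates of sequence type j and N the total number of isolates. The Python sketch below assumes that definition and uses made-up ST labels, not the study's data:

```python
from collections import Counter


def discriminatory_index(sequence_types) -> float:
    """Hunter-Gaston discriminatory index for a list of ST labels.

    Assumption: this is the index meant in the abstract; the abstract
    itself does not name the formula.
    """
    counts = Counter(sequence_types).values()
    n_total = sum(counts)
    if n_total < 2:
        raise ValueError("need at least two isolates")
    same_type_pairs = sum(c * (c - 1) for c in counts)
    return 1.0 - same_type_pairs / (n_total * (n_total - 1))


# Hypothetical example: four isolates, two of which share a type.
print(discriminatory_index(["ST1", "ST1", "ST2", "ST3"]))
```

The index is the probability that two isolates drawn at random without replacement have different sequence types, so it reaches 1.0 only when every isolate has its own ST.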
Microbial diversity in ultra-high-pressure rocks and fluids from the Chinese Continental Scientific Drilling Project in China.

PubMed

Zhang, Gengxin; Dong, Hailiang; Xu, Zhiqin; Zhao, Donggao; Zhang, Chuanlun

2005-06-01

Microbial communities in ultra-high-pressure (UHP) rocks and drilling fluids from the Chinese Continental Scientific Drilling Project were characterized. The rocks had a porosity of 1 to 3.5% and a permeability of approximately 0.5 mDarcy. Abundant fluid and gas inclusions were present in the minerals. The rocks contained significant amounts of Fe2O3, FeO, P2O5, and nitrate (3 to 16 ppm). Acridine orange direct counting and phospholipid fatty acid analysis indicated that the total counts in the rocks and the fluids were 5.2 x 10(3) to 2.4 x 10(4) cells/g and 3.5 x 10(8) to 4.2 x 10(9) cells/g, respectively. Enrichment assays resulted in successful growth of thermophilic and alkaliphilic bacteria from the fluids, and some of these bacteria reduced Fe(III) to magnetite. 16S rRNA gene analyses indicated that the rocks were dominated by sequences similar to sequences of Proteobacteria and that most organisms were related to nitrate reducers from a saline, alkaline, cold habitat; however, some phylotypes were either members of a novel lineage or closely related to uncultured clones. The bacterial communities in the fluids were more diverse and included Proteobacteria, Bacteroidetes, gram-positive bacteria, Planctomycetes, and Candidatus taxa. The archaeal diversity was lower, and most sequences were not related to any known cultivated species. Some archaeal sequences were 90 to 95% similar to sequences recovered from ocean sediments or other subsurface environments. Some archaeal sequences from the drilling fluids were >93% similar to sequences of Sulfolobus solfataricus, and the thermophilic nature was consistent with the in situ temperature. We inferred that the microbes in the UHP rocks reside in fluid and gas inclusions, whereas those in the drilling fluids may be derived from subsurface fluids.
Microbial Diversity in Ultra-High-Pressure Rocks and Fluids from the Chinese Continental Scientific Drilling Project in China

PubMed Central

Zhang, Gengxin; Dong, Hailiang; Xu, Zhiqin; Zhao, Donggao; Zhang, Chuanlun

2005-01-01

Microbial communities in ultra-high-pressure (UHP) rocks and drilling fluids from the Chinese Continental Scientific Drilling Project were characterized. The rocks had a porosity of 1 to 3.5% and a permeability of ∼0.5 mDarcy. Abundant fluid and gas inclusions were present in the minerals. The rocks contained significant amounts of Fe2O3, FeO, P2O5, and nitrate (3 to 16 ppm). Acridine orange direct counting and phospholipid fatty acid analysis indicated that the total counts in the rocks and the fluids were 5.2 × 10^3 to 2.4 × 10^4 cells/g and 3.5 × 10^8 to 4.2 × 10^9 cells/g, respectively. Enrichment assays resulted in successful growth of thermophilic and alkaliphilic bacteria from the fluids, and some of these bacteria reduced Fe(III) to magnetite. 16S rRNA gene analyses indicated that the rocks were dominated by sequences similar to sequences of Proteobacteria and that most organisms were related to nitrate reducers from a saline, alkaline, cold habitat; however, some phylotypes were either members of a novel lineage or closely related to uncultured clones. The bacterial communities in the fluids were more diverse and included Proteobacteria, Bacteroidetes, gram-positive bacteria, Planctomycetes, and Candidatus taxa. The archaeal diversity was lower, and most sequences were not related to any known cultivated species. Some archaeal sequences were 90 to 95% similar to sequences recovered from ocean sediments or other subsurface environments. Some archaeal sequences from the drilling fluids were >93% similar to sequences of Sulfolobus solfataricus, and the thermophilic nature was consistent with the in situ temperature. We inferred that the microbes in the UHP rocks reside in fluid and gas inclusions, whereas those in the drilling fluids may be derived from subsurface fluids. PMID:15933024
Molecular evolution of type 2 porcine reproductive and respiratory syndrome viruses circulating in Vietnam from 2007 to 2015.

PubMed

Do, Hai Quynh; Trinh, Dinh Thau; Nguyen, Thi Lan; Vu, Thi Thu Hang; Than, Duc Duong; Van Lo, Thi; Yeom, Minjoo; Song, Daesub; Choe, SeEun; An, Dong-Jun; Le, Van Phan

2016-11-17

Porcine reproductive and respiratory syndrome (PRRS) virus is one of the most economically significant pathogens in the Vietnamese swine industry. ORF5, which participates in many functional processes, including virion assembly, entry of the virus into the host cell, and viral adaptation to the host immune response, has been widely used in molecular evolution and phylogeny studies. Knowledge of the molecular evolution of PRRSV field strains might contribute to PRRS control in Vietnam. Phylogenetic analysis indicated that all strains belonged to sub-lineages 8.7 and 5.1. The nucleotide and amino acid identities between strains were 84.5-100% and 82-100%, respectively. Furthermore, the results revealed differences in nucleotide and amino acid identities between the 2 sub-lineage groups. N-glycosylation prediction identified 7 potential N-glycosylation sites and 11 glycotypes. Analyses of the GP5 sequences revealed 7 sites under positive selective pressure and 25 under negative selective pressure. Phylogenetic analysis based on the ORF5 sequence indicated the diversity of PRRSV in Vietnam. Furthermore, variation in N-glycosylation sites and in positions under selective pressure was demonstrated. This study expands existing knowledge on the genetic diversity and evolution of PRRSV in Vietnam and supports effective strategies for PRRS vaccine development in Vietnam.
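The abstract does not say which tool produced the N-glycosylation predictions. Potential N-linked sites are conventionally identified by the sequon N-X-S/T, where X is any residue except proline. As a minimal sketch under that assumption, with a hypothetical function name and a made-up peptide rather than a real GP5 sequence:

```python
import re

# N-X-S/T sequon with X != P. The lookahead lets overlapping sequons
# (for example "NNTS") each be reported.
SEQUON = re.compile(r"(?=N[^P][ST])")


def find_sequons(protein: str) -> list:
    """Return 1-based positions of the Asn in each N-X-S/T sequon."""
    return [m.start() + 1 for m in SEQUON.finditer(protein.upper())]


# Made-up peptide for illustration only.
print(find_sequons("MNGSANPTNNTS"))
```

Comparing the position lists across aligned strains is one simple way to tabulate which sites are gained or lost.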
Cloning and sequence analysis of a full-length cDNA of SmPP1cb encoding turbot protein phosphatase 1 beta catalytic subunit

NASA Astrophysics Data System (ADS)

Qi, Fei; Guo, Huarong; Wang, Jian

2008-02-01

Reversible protein phosphorylation, catalyzed by protein kinases and phosphatases, is an important and versatile mechanism by which eukaryotic cells regulate almost all the signaling processes. Protein phosphatase 1 (PP1) is the first and well-characterized member of the protein serine/threonine phosphatase family. In the present study, a full-length cDNA encoding the beta isoform of the catalytic subunit of protein phosphatase 1 (PP1cb) was for the first time isolated and sequenced from the skin tissue of flatfish turbot Scophthalmus maximus, designated SmPP1cb, by the rapid amplification of cDNA ends (RACE) technique. The cDNA sequence of SmPP1cb we obtained contains a 984 bp open reading frame (ORF), flanked by a complete 39 bp 5' untranslated region and 462 bp 3' untranslated region. The ORF encodes a putative 327 amino acid protein, and the N-terminal section of this protein is highly acidic, Met-Ala-Glu-Gly-Glu-Leu-Asp-Val-Asp, a common feature for PP1 catalytic subunit but absent in protein phosphatase 2B (PP2B). Its calculated molecular mass is 37 193 Da and its pI is 5.8. Sequence analysis indicated that SmPP1cb is extremely conserved at both the amino acid and nucleotide levels compared with the PP1cb of other vertebrates and invertebrates. Its Kozak motif, contained in the 5'UTR around the ATG start codon, is GXXAXXGXX ATGG, which differs from the mammalian motif at two positions, A-6 and G-3, indicating the possibility of different initiation of translation in turbot. The 3'UTR of SmPP1cb is also highly diverse in sequence similarity and length compared with other animals, especially zebrafish. The cloning and sequencing of the SmPP1cb gene lays a good foundation for future work on the biological functions of PP1 in the flatfish turbot.
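The lengths reported in this abstract are mutually consistent, which a few lines of Python can confirm. The helper name protein_length is hypothetical, and the summed cDNA length assumes that no poly(A) tail is counted, which the abstract does not specify:

```python
def protein_length(orf_bp: int) -> int:
    """Amino acids encoded by an ORF whose length includes the stop codon."""
    if orf_bp % 3 != 0:
        raise ValueError("ORF length must be a multiple of 3")
    return orf_bp // 3 - 1  # the final codon is the stop codon


utr5_bp, orf_bp, utr3_bp = 39, 984, 462   # values from the abstract
print(protein_length(orf_bp))             # residues in the predicted protein
print(utr5_bp + orf_bp + utr3_bp)         # cDNA length without any poly(A) tail
```

The 984 bp ORF holds 328 codons, one of which is the stop codon, giving the 327 residues reported.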
Genomic diversity of bacteriophages infecting the fish pathogen Flavobacterium psychrophilum.

PubMed

Castillo, Daniel; Middelboe, Mathias

2016-12-01

Bacteriophages infecting the fish pathogen Flavobacterium psychrophilum can potentially be used to prevent and control outbreaks of this bacterium in salmonid aquaculture. However, the application of bacteriophages in disease control requires detailed knowledge on their genetic composition. To explore the diversity of F. psychrophilum bacteriophages, we have analyzed the complete genome sequences of 17 phages isolated from two distant geographic areas (Denmark and Chile), including the previously characterized temperate bacteriophage 6H. Phage genome size ranged from 39 302 to 89 010 bp with a G+C content of 27%-32%. None of the bacteriophages isolated in Denmark contained genes associated with lysogeny, whereas the Chilean isolates were all putative temperate phages and similar to bacteriophage 6H. Comparative genome analysis showed that phages grouped in three different genetic clusters based on genetic composition and gene content, indicating a limited genetic diversity of F. psychrophilum-specific bacteriophages. However, amino acid sequence dissimilarity (25%) was found in putative structural proteins, which could be related to the host specificity determinants. This study represents the first analysis of genomic diversity and composition among bacteriophages infecting the fish pathogen F. psychrophilum and discusses the implications for the application of phages in disease control. © FEMS 2016. All rights reserved. For permissions, please e-mail: journals.permissions@oup.com.
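G+C content, reported above as 27%-32%, is the percentage of G and C among the unambiguous bases of a genome. A minimal Python sketch, with a hypothetical function name and a toy sequence rather than any phage genome:

```python
def gc_content(seq: str) -> float:
    """Percent G+C among unambiguous bases (A, C, G, T) of a sequence."""
    seq = seq.upper()
    unambiguous = sum(seq.count(base) for base in "ACGT")
    if unambiguous == 0:
        raise ValueError("sequence contains no A, C, G or T")
    return 100.0 * (seq.count("G") + seq.count("C")) / unambiguous


# Toy sequence for illustration only.
print(gc_content("ATGCATAT"))
```

Excluding ambiguity symbols such as N from the denominator keeps unresolved bases in an assembly from lowering the estimate.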
Metagenomics analysis of microbial communities associated with a traditional rice wine starter culture (Xaj-pitha) of Assam, India.

PubMed

Bora, Sudipta Sankar; Keot, Jyotshna; Das, Saurav; Sarma, Kishore; Barooah, Madhumita

2016-12-01

This is the first report on the microbial diversity of xaj-pitha, a rice wine fermentation starter culture, through a metagenomics approach involving an Illumina-based whole genome shotgun (WGS) sequencing method. Metagenomic DNA was extracted from rice wine starter culture concocted by the Ahom community of Assam and analyzed using a MiSeq® System. A total of 2,78,231 contigs, with an average read length of 640.13 bp, were obtained. Data obtained from the use of several taxonomic profiling tools were compared with previously reported microbial diversity studies through culture-dependent and culture-independent methods. The microbial community revealed the existence of amylase producers, such as Rhizopus delemar, Mucor circinelloides, and Aspergillus sp. Ethanol producers viz., Meyerozyma guilliermondii, Wickerhamomyces ciferrii, Saccharomyces cerevisiae, Candida glabrata, Debaryomyces hansenii, Ogataea parapolymorpha, and Dekkera bruxellensis, were found associated with the starter culture along with a diverse range of opportunistic contaminants. The bacterial microflora was dominated by lactic acid bacteria (LAB). The most frequently occurring LAB were Lactobacillus plantarum, Lactobacillus brevis, Leuconostoc lactis, Weissella cibaria, Lactococcus lactis, Weissella paramesenteroides, Leuconostoc pseudomesenteroides, etc. Our study provided a comprehensive picture of microbial diversity associated with rice wine fermentation starter and indicated the superiority of metagenomic sequencing over previously used techniques.
Increasing Sequence Diversity with Flexible Backbone Protein Design: The Complete Redesign of a Protein Hydrophobic Core

DOE Office of Scientific and Technical Information (OSTI.GOV)

Murphy, Grant S.; Mills, Jeffrey L.; Miley, Michael J.

2015-10-15

Protein design tests our understanding of protein stability and structure. Successful design methods should allow the exploration of sequence space not found in nature. However, when redesigning naturally occurring protein structures, most fixed backbone design algorithms return amino acid sequences that share strong sequence identity with wild-type sequences, especially in the protein core. This behavior places a restriction on functional space that can be explored and is not consistent with observations from nature, where sequences of low identity have similar structures. Here, we allow backbone flexibility during design to mutate every position in the core (38 residues) of a four-helix bundle protein. Only small perturbations to the backbone, 1-2 Å, were needed to entirely mutate the core. The redesigned protein, DRNN, is exceptionally stable (melting point >140 °C). An NMR and X-ray crystal structure show that the side chains and backbone were accurately modeled (all-atom RMSD = 1.3 Å).
Characterization and Exploitation of CRISPR Loci in Bifidobacterium longum

PubMed Central

Hidalgo-Cantabrana, Claudio; Crawley, Alexandra B.; Sanchez, Borja; Barrangou, Rodolphe

2017-01-01

Diverse CRISPR-Cas systems provide adaptive immunity in many bacteria and most archaea, via a DNA-encoded, RNA-mediated, nucleic-acid targeting mechanism. Over time, CRISPR loci expand via iterative uptake of invasive DNA sequences into the CRISPR array during the adaptation process. These genetic vaccination cards thus provide insights into the exposure of strains to phages and plasmids in space and time, revealing the historical predatory exposure of a strain. These genetic loci thus constitute a unique basis for genotyping of strains, with potential of resolution at the strain-level. Here, we investigate the occurrence and diversity of CRISPR-Cas systems in the genomes of various Bifidobacterium longum strains across three sub-species. Specifically, we analyzed the genomic content of 66 genomes belonging to B. longum subsp. longum, B. longum subsp. infantis and B. longum subsp. suis, and identified 25 strains that carry 29 total CRISPR-Cas systems. We identify various Type I and Type II CRISPR-Cas systems that are widespread in this species, notably I-C, I-E, and II-C. Noteworthy, Type I-C systems showed extended CRISPR arrays, with extensive spacer diversity. We show how these hypervariable loci can be used to gain insights into strain origin, evolution and phylogeny, and can provide discriminatory sequences to distinguish even clonal isolates. By investigating CRISPR spacer sequences, we reveal their origin and implicate phages and prophages as drivers of CRISPR immunity expansion in this species, with redundant targeting of select prophages. Analysis of CRISPR spacer origin also revealed novel PAM sequences. Our results suggest that CRISPR-Cas immune systems are instrumental in mounting diversified viral resistance in B. longum, and show that these sequences are useful for typing across three subspecies. PMID:29033911
Characterization and Exploitation of CRISPR Loci in Bifidobacterium longum.

PubMed

Hidalgo-Cantabrana, Claudio; Crawley, Alexandra B; Sanchez, Borja; Barrangou, Rodolphe

2017-01-01

Diverse CRISPR-Cas systems provide adaptive immunity in many bacteria and most archaea, via a DNA-encoded, RNA-mediated, nucleic-acid targeting mechanism. Over time, CRISPR loci expand via iterative uptake of invasive DNA sequences into the CRISPR array during the adaptation process. These genetic vaccination cards thus provide insights into the exposure of strains to phages and plasmids in space and time, revealing the historical predatory exposure of a strain. These genetic loci thus constitute a unique basis for genotyping of strains, with potential of resolution at the strain-level. Here, we investigate the occurrence and diversity of CRISPR-Cas systems in the genomes of various Bifidobacterium longum strains across three sub-species. Specifically, we analyzed the genomic content of 66 genomes belonging to B. longum subsp. longum, B. longum subsp. infantis and B. longum subsp. suis, and identified 25 strains that carry 29 total CRISPR-Cas systems. We identify various Type I and Type II CRISPR-Cas systems that are widespread in this species, notably I-C, I-E, and II-C. Noteworthy, Type I-C systems showed extended CRISPR arrays, with extensive spacer diversity. We show how these hypervariable loci can be used to gain insights into strain origin, evolution and phylogeny, and can provide discriminatory sequences to distinguish even clonal isolates. By investigating CRISPR spacer sequences, we reveal their origin and implicate phages and prophages as drivers of CRISPR immunity expansion in this species, with redundant targeting of select prophages. Analysis of CRISPR spacer origin also revealed novel PAM sequences. Our results suggest that CRISPR-Cas immune systems are instrumental in mounting diversified viral resistance in B. longum, and show that these sequences are useful for typing across three subspecies.
Diversity Analysis of Dairy and Nondairy Lactococcus lactis Isolates, Using a Novel Multilocus Sequence Analysis Scheme and (GTG)5-PCR Fingerprinting

PubMed Central

Rademaker, Jan L. W.; Herbet, Hélène; Starrenburg, Marjo J. C.; Naser, Sabri M.; Gevers, Dirk; Kelly, William J.; Hugenholtz, Jeroen; Swings, Jean; van Hylckama Vlieg, Johan E. T.

2007-01-01

The diversity of a collection of 102 lactococcus isolates including 91 Lactococcus lactis isolates of dairy and nondairy origin was explored using partial small subunit rRNA gene sequence analysis and limited phenotypic analyses. A subset of 89 strains of L. lactis subsp. cremoris and L. lactis subsp. lactis isolates was further analyzed by (GTG)5-PCR fingerprinting and a novel multilocus sequence analysis (MLSA) scheme. Two major genomic lineages within L. lactis were found. The L. lactis subsp. cremoris type-strain-like genotype lineage included both L. lactis subsp. cremoris and L. lactis subsp. lactis isolates. The other major lineage, with a L. lactis subsp. lactis type-strain-like genotype, comprised L. lactis subsp. lactis isolates only. A novel third genomic lineage represented two L. lactis subsp. lactis isolates of nondairy origin. The genomic lineages deviate from the subspecific classification of L. lactis that is based on a few phenotypic traits only. MLSA of six partial genes (atpA, encoding ATP synthase alpha subunit; pheS, encoding phenylalanine tRNA synthetase; rpoA, encoding RNA polymerase alpha chain; bcaT, encoding branched chain amino acid aminotransferase; pepN, encoding aminopeptidase N; and pepX, encoding X-prolyl dipeptidyl peptidase) revealed 363 polymorphic sites (total length, 1,970 bases) among 89 L. lactis subsp. cremoris and L. lactis subsp. lactis isolates with unique sequence types for most isolates. This allowed high-resolution cluster analysis in which dairy isolates form subclusters of limited diversity within the genomic lineages. The pheS DNA sequence analysis yielded two genetic groups dissimilar to the other genotyping analysis-based lineages, indicating a disparate acquisition route for this gene. PMID:17890345
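The abstract above reports 363 polymorphic sites over a total length of 1,970 bases, i.e. roughly 18.4% of aligned positions vary among the isolates. A minimal sketch of how polymorphic sites can be counted in an alignment follows. The function name and the toy sequences are hypothetical, and gap characters are treated as ordinary symbols, which is a simplifying assumption.

```python
def count_polymorphic_sites(aligned_seqs):
    """Count alignment columns that contain more than one distinct character."""
    lengths = {len(seq) for seq in aligned_seqs}
    if len(lengths) != 1:
        raise ValueError("expected a non-empty set of equal-length aligned sequences")
    # zip(*aligned_seqs) walks the alignment column by column.
    return sum(1 for column in zip(*aligned_seqs) if len(set(column)) > 1)


# Toy alignment (not real L. lactis data): columns 2 and 5 vary.
toy_alignment = ["ACGTAC", "ACGTTC", "ATGTAC"]
print(count_polymorphic_sites(toy_alignment))

# Proportion of variable positions reported in the abstract.
print(round(363 / 1970, 3))
```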
Diversity analysis of dairy and nondairy Lactococcus lactis isolates, using a novel multilocus sequence analysis scheme and (GTG)5-PCR fingerprinting.

PubMed

Rademaker, Jan L W; Herbet, Hélène; Starrenburg, Marjo J C; Naser, Sabri M; Gevers, Dirk; Kelly, William J; Hugenholtz, Jeroen; Swings, Jean; van Hylckama Vlieg, Johan E T

2007-11-01

The diversity of a collection of 102 lactococcus isolates including 91 Lactococcus lactis isolates of dairy and nondairy origin was explored using partial small subunit rRNA gene sequence analysis and limited phenotypic analyses. A subset of 89 strains of L. lactis subsp. cremoris and L. lactis subsp. lactis isolates was further analyzed by (GTG)5-PCR fingerprinting and a novel multilocus sequence analysis (MLSA) scheme. Two major genomic lineages within L. lactis were found. The L. lactis subsp. cremoris type-strain-like genotype lineage included both L. lactis subsp. cremoris and L. lactis subsp. lactis isolates. The other major lineage, with a L. lactis subsp. lactis type-strain-like genotype, comprised L. lactis subsp. lactis isolates only. A novel third genomic lineage represented two L. lactis subsp. lactis isolates of nondairy origin. The genomic lineages deviate from the subspecific classification of L. lactis that is based on a few phenotypic traits only. MLSA of six partial genes (atpA, encoding ATP synthase alpha subunit; pheS, encoding phenylalanine tRNA synthetase; rpoA, encoding RNA polymerase alpha chain; bcaT, encoding branched chain amino acid aminotransferase; pepN, encoding aminopeptidase N; and pepX, encoding X-prolyl dipeptidyl peptidase) revealed 363 polymorphic sites (total length, 1,970 bases) among 89 L. lactis subsp. cremoris and L. lactis subsp. lactis isolates with unique sequence types for most isolates. This allowed high-resolution cluster analysis in which dairy isolates form subclusters of limited diversity within the genomic lineages. The pheS DNA sequence analysis yielded two genetic groups dissimilar to the other genotyping analysis-based lineages, indicating a disparate acquisition route for this gene.
Responses of soil N-fixing bacteria communities to invasive plant species under different types of simulated acid deposition

NASA Astrophysics Data System (ADS)

Wang, Congyan; Zhou, Jiawei; Jiang, Kun; Liu, Jun; Du, Daolin

2017-06-01

Biological invasions have posed serious threats to native ecosystems in China, and soil N-fixing bacteria communities (SNB) may play a vital role in the successful plant invasion. Meanwhile, anthropogenic acid deposition is increasing in China, which may modify or upgrade the effects that invasive plant species can cause on SNB. We analyzed the structure and diversity of SNB by means of new generation sequencing technology in soils with different simulated acid deposition (SAD), i.e., different SO₄²⁻ to NO₃⁻ ratios, and where the invasive (Amaranthus retroflexus L.) and the native species (Amaranthus tricolor L.) grew mixed or isolated for 3 months. A. retroflexus itself did not exert significant effects on the diversity and richness of SNB but did so under certain SO₄²⁻ to NO₃⁻ ratios. Compared to soils where the native species grew isolated, the soils where the invasive A. retroflexus grew isolated showed lower relative abundance of some SNB classes under certain SAD treatments. Some types of SAD can alter soil nutrient content which in turn could affect SNB diversity and abundance. Specifically, greater SO₄²⁻ to NO₃⁻ ratios tended to have more toxic effects on SNB likely due to the higher exchange capacity of hydroxyl groups (OH⁻) between SO₄²⁻ and NO₃⁻. In conclusion, a change in the structure of SNB can be expected after A. retroflexus invasion under acid deposition rich in sulfuric acid. This change may create a plant-soil feedback favoring future A. retroflexus invasions.
Responses of soil N-fixing bacteria communities to invasive plant species under different types of simulated acid deposition.

PubMed

Wang, Congyan; Zhou, Jiawei; Jiang, Kun; Liu, Jun; Du, Daolin

2017-06-01

Biological invasions have posed serious threats to native ecosystems in China, and soil N-fixing bacteria communities (SNB) may play a vital role in the successful plant invasion. Meanwhile, anthropogenic acid deposition is increasing in China, which may modify or upgrade the effects that invasive plant species can cause on SNB. We analyzed the structure and diversity of SNB by means of new generation sequencing technology in soils with different simulated acid deposition (SAD), i.e., different SO₄²⁻ to NO₃⁻ ratios, and where the invasive (Amaranthus retroflexus L.) and the native species (Amaranthus tricolor L.) grew mixed or isolated for 3 months. A. retroflexus itself did not exert significant effects on the diversity and richness of SNB but did so under certain SO₄²⁻ to NO₃⁻ ratios. Compared to soils where the native species grew isolated, the soils where the invasive A. retroflexus grew isolated showed lower relative abundance of some SNB classes under certain SAD treatments. Some types of SAD can alter soil nutrient content which in turn could affect SNB diversity and abundance. Specifically, greater SO₄²⁻ to NO₃⁻ ratios tended to have more toxic effects on SNB likely due to the higher exchange capacity of hydroxyl groups (OH⁻) between SO₄²⁻ and NO₃⁻. In conclusion, a change in the structure of SNB can be expected after A. retroflexus invasion under acid deposition rich in sulfuric acid. This change may create a plant-soil feedback favoring future A. retroflexus invasions.
Metabolic characteristics of dominant microbes and key rare species from an acidic hot spring in Taiwan revealed by metagenomics

DOE PAGES

Lin, Kuei-Han; Liao, Ben-Yang; Chang, Hao-Wei; ...

2015-12-03

Microbial diversity and community structures in acidic hot springs have been characterized by 16S rRNA gene-based diversity surveys. However, our understanding regarding the interactions among microbes, or between microbes and environmental factors, remains limited. In the present study, a metagenomic approach, followed by bioinformatics analyses, was used to predict interactions within the microbial ecosystem in Shi-Huang-Ping (SHP), an acidic hot spring in northern Taiwan. Characterizing environmental parameters and potential metabolic pathways highlighted the importance of carbon assimilatory pathways. Four distinct carbon assimilatory pathways were identified in five dominant genera of bacteria. Of those dominant carbon fixers, Hydrogenobaculum bacteria outcompeted other carbon assimilators and dominated the SHP, presumably due to their ability to metabolize hydrogen and to withstand an anaerobic environment with fluctuating temperatures. Furthermore, most dominant microbes were capable of metabolizing inorganic sulfur-related compounds (abundant in SHP). However, Acidithiobacillus ferrooxidans was the only species among key rare microbes with the capability to fix nitrogen, suggesting a key role in nitrogen cycling. In addition to potential metabolic interactions, based on the 16S rRNA gene sequences of Nanoarchaeum-related archaea and their potential Ignicoccus-related hosts, as well as sequences of viruses and CRISPR arrays, we inferred that there were complex microbe-microbe interactions. In conclusion, our study provided evidence that there were numerous microbe-microbe and microbe-environment interactions within the microbial community in an acidic hot spring. We proposed that Hydrogenobaculum bacteria were the dominant microbial genus, as they were able to metabolize hydrogen, assimilate carbon and live in an anaerobic environment with fluctuating temperatures.
Metabolic characteristics of dominant microbes and key rare species from an acidic hot spring in Taiwan revealed by metagenomics

DOE Office of Scientific and Technical Information (OSTI.GOV)

Lin, Kuei-Han; Liao, Ben-Yang; Chang, Hao-Wei

Microbial diversity and community structures in acidic hot springs have been characterized by 16S rRNA gene-based diversity surveys. However, our understanding regarding the interactions among microbes, or between microbes and environmental factors, remains limited. In the present study, a metagenomic approach, followed by bioinformatics analyses, was used to predict interactions within the microbial ecosystem in Shi-Huang-Ping (SHP), an acidic hot spring in northern Taiwan. Characterizing environmental parameters and potential metabolic pathways highlighted the importance of carbon assimilatory pathways. Four distinct carbon assimilatory pathways were identified in five dominant genera of bacteria. Of those dominant carbon fixers, Hydrogenobaculum bacteria outcompeted other carbon assimilators and dominated the SHP, presumably due to their ability to metabolize hydrogen and to withstand an anaerobic environment with fluctuating temperatures. Furthermore, most dominant microbes were capable of metabolizing inorganic sulfur-related compounds (abundant in SHP). However, Acidithiobacillus ferrooxidans was the only species among key rare microbes with the capability to fix nitrogen, suggesting a key role in nitrogen cycling. In addition to potential metabolic interactions, based on the 16S rRNA gene sequences of Nanoarchaeum-related archaea and their potential Ignicoccus-related hosts, as well as sequences of viruses and CRISPR arrays, we inferred that there were complex microbe-microbe interactions. In conclusion, our study provided evidence that there were numerous microbe-microbe and microbe-environment interactions within the microbial community in an acidic hot spring. We proposed that Hydrogenobaculum bacteria were the dominant microbial genus, as they were able to metabolize hydrogen, assimilate carbon and live in an anaerobic environment with fluctuating temperatures.
Discovery of phosphonic acid natural products by mining the genomes of 10,000 actinomycetes.

PubMed

Ju, Kou-San; Gao, Jiangtao; Doroghazi, James R; Wang, Kwo-Kwang A; Thibodeaux, Christopher J; Li, Steven; Metzger, Emily; Fudala, John; Su, Joleen; Zhang, Jun Kai; Lee, Jaeheon; Cioni, Joel P; Evans, Bradley S; Hirota, Ryuichi; Labeda, David P; van der Donk, Wilfred A; Metcalf, William W

2015-09-29

Although natural products have been a particularly rich source of human medicines, activity-based screening results in a very high rate of rediscovery of known molecules. Based on the large number of natural product biosynthetic genes in microbial genomes, many have proposed "genome mining" as an alternative approach for discovery efforts; however, this idea has yet to be performed experimentally on a large scale. Here, we demonstrate the feasibility of large-scale, high-throughput genome mining by screening a collection of over 10,000 actinomycetes for the genetic potential to make phosphonic acids, a class of natural products with diverse and useful bioactivities. Genome sequencing identified a diverse collection of phosphonate biosynthetic gene clusters within 278 strains. These clusters were classified into 64 distinct groups, of which 55 are likely to direct the synthesis of unknown compounds. Characterization of strains within five of these groups resulted in the discovery of a new archetypical pathway for phosphonate biosynthesis, the first (to our knowledge) dedicated pathway for H-phosphinates, and 11 previously undescribed phosphonic acid natural products. Among these compounds are argolaphos, a broad-spectrum antibacterial phosphonopeptide composed of aminomethylphosphonate in peptide linkage to a rare amino acid N5-hydroxyarginine; valinophos, an N-acetyl l-Val ester of 2,3-dihydroxypropylphosphonate; and phosphonocystoximate, an unusual thiohydroximate-containing molecule representing a new chemotype of sulfur-containing phosphonate natural products. Analysis of the genome sequences from the remaining strains suggests that the majority of the phosphonate biosynthetic repertoire of Actinobacteria has been captured at the gene level. This dereplicated strain collection now provides a reservoir of numerous, as yet undiscovered, phosphonate natural products.
Discovery of Novel Bmy1 Alleles Increasing β-Amylase Activity in Chinese Landraces and Tibetan Wild Barley for Improvement of Malting Quality via MAS

PubMed Central

Gong, Xue; Westcott, Sharon; Zhang, Xiao-Qi; Yan, Guijun; Lance, Reg; Zhang, Guoping; Sun, Dongfa; Li, Chengdao

2013-01-01

China has a large barley germplasm collection which has not been well characterized and is therefore underutilized. The Bmy1 locus encoding the β-amylase enzyme on chromosome 4H has been well characterized in the worldwide barley germplasm collections due to its importance in the malting and brewing industry. The Bmy1 locus was chosen as an indicator to understand genetic potential for improvement of malting quality in Chinese landraces and Tibetan wild barley. The genetic diversity of 91 barley accessions was assessed using allele-specific Multiplex-ready molecular markers. Eight accessions were further sequenced, based on the Multiplex-ready marker diversity for Bmy1 in the germplasm. Six of the eight accessions clustered together in a unique group, and showed similarities to ‘Haruna Nijo’, wild barley accession PI296896 and ‘Ashqelon’. Sequence comparisons with the known Bmy1 alleles identified not only the existing 13 amino acid substitutions, but also a new substitution, A387T, from the Chinese landrace W127, which has the highest β-amylase activity. Two new alleles/haplotypes, namely Bmy1-Sd1c and Bmy1-Sd5, were designated based on different amino acid combinations. We identified a new amino acid combination of C115, D165, V233, S347 and V430 in the germplasm. The broad variation in both β-amylase activity and amino acid composition provides novel alleles for the improvement of malting quality for different brewing styles, which indicates the high potential value of the Chinese landraces and Tibetan wild barley. PMID:24019884
pH as a Driver for Ammonia-Oxidizing Archaea in Forest Soils.

PubMed

Stempfhuber, Barbara; Engel, Marion; Fischer, Doreen; Neskovic-Prit, Ganna; Wubet, Tesfaye; Schöning, Ingo; Gubry-Rangin, Cécile; Kublik, Susanne; Schloter-Hai, Brigitte; Rattei, Thomas; Welzl, Gerhard; Nicol, Graeme W; Schrumpf, Marion; Buscot, Francois; Prosser, James I; Schloter, Michael

2015-05-01

In this study, we investigated the impact of soil pH on the diversity and abundance of archaeal ammonia oxidizers in 27 different forest soils across Germany. DNA was extracted from topsoil samples; the amoA gene, encoding ammonia monooxygenase, was amplified; and the amplicons were sequenced using a 454-based pyrosequencing approach. As expected, the ratio of archaeal (AOA) to bacterial (AOB) ammonia oxidizers' amoA genes increased sharply with decreasing soil pH. The diversity of AOA differed significantly between sites with ultra-acidic soil pH (<3.5) and sites with higher pH values. The major OTUs from soil samples with low pH could be detected at each site with a soil pH <3.5 but not at sites with pH >4.5, regardless of geographic position and vegetation. These OTUs could be related to the Nitrosotalea group 1.1 and the Nitrososphaera subcluster 7.2, respectively, and showed significant similarities to OTUs described from other acidic environments. Conversely, none of the major OTUs typical of sites with a soil pH >4.6 could be found in the ultra- and extreme acidic soils. Based on a comparison with the amoA gene sequence data from a previous study performed on agricultural soils, we could clearly show that the development of AOA communities in soils with ultra-acidic pH (<3.5) is mainly triggered by soil pH and is not influenced significantly by the type of land use, the soil type, or the geographic position of the site, which was observed for sites with acido-neutral soil pH.
Discovery of phosphonic acid natural products by mining the genomes of 10,000 actinomycetes

PubMed Central

Ju, Kou-San; Gao, Jiangtao; Doroghazi, James R.; Wang, Kwo-Kwang A.; Thibodeaux, Christopher J.; Li, Steven; Metzger, Emily; Fudala, John; Su, Joleen; Zhang, Jun Kai; Lee, Jaeheon; Cioni, Joel P.; Evans, Bradley S.; Hirota, Ryuichi; Labeda, David P.; van der Donk, Wilfred A.; Metcalf, William W.

2015-01-01

Although natural products have been a particularly rich source of human medicines, activity-based screening results in a very high rate of rediscovery of known molecules. Based on the large number of natural product biosynthetic genes in microbial genomes, many have proposed “genome mining” as an alternative approach for discovery efforts; however, this idea has yet to be performed experimentally on a large scale. Here, we demonstrate the feasibility of large-scale, high-throughput genome mining by screening a collection of over 10,000 actinomycetes for the genetic potential to make phosphonic acids, a class of natural products with diverse and useful bioactivities. Genome sequencing identified a diverse collection of phosphonate biosynthetic gene clusters within 278 strains. These clusters were classified into 64 distinct groups, of which 55 are likely to direct the synthesis of unknown compounds. Characterization of strains within five of these groups resulted in the discovery of a new archetypical pathway for phosphonate biosynthesis, the first (to our knowledge) dedicated pathway for H-phosphinates, and 11 previously undescribed phosphonic acid natural products. Among these compounds are argolaphos, a broad-spectrum antibacterial phosphonopeptide composed of aminomethylphosphonate in peptide linkage to a rare amino acid N5-hydroxyarginine; valinophos, an N-acetyl l-Val ester of 2,3-dihydroxypropylphosphonate; and phosphonocystoximate, an unusual thiohydroximate-containing molecule representing a new chemotype of sulfur-containing phosphonate natural products. Analysis of the genome sequences from the remaining strains suggests that the majority of the phosphonate biosynthetic repertoire of Actinobacteria has been captured at the gene level. This dereplicated strain collection now provides a reservoir of numerous, as yet undiscovered, phosphonate natural products. PMID:26324907

A novel cysteine-rich antifungal peptide ToAMP4 from Taraxacum officinale Wigg. flowers.

PubMed

Astafieva, A A; Rogozhin, Eugene A; Andreev, Yaroslav A; Odintsova, T I; Kozlov, S A; Grishin, Eugene V; Egorov, Tsezi A

2013-09-01

A novel peptide named ToAMP4 was isolated from Taraxacum officinale Wigg. flowers by a combination of acetic acid extraction and different types of chromatography: affinity, size-exclusion, and RP-HPLC. The amino acid sequence of ToAMP4 was determined by automated Edman degradation. The peptide is basic, consists of 41 amino acids, and incorporates three disulphide bonds. Due to the unusual cysteine spacing pattern, ToAMP4 does not belong to any known plant AMP family, but classifies together with two other antimicrobial peptides ToAMP1 and ToAMP2 previously isolated from the dandelion flowers. To study the biological activity of ToAMP4, it was successfully produced in a prokaryotic expression system as a fusion protein with thioredoxin. The recombinant peptide was shown to be identical to the native ToAMP4 by chromatographic behavior, molecular mass, and N-terminal amino acid sequence. The peptide displays broad-spectrum antifungal activity against important phytopathogens. Two ToAMP4-mediated inhibition strategies depending on the fungus were demonstrated. The results obtained add to our knowledge on the structural and functional diversity of AMPs in plants. Copyright © 2013 Elsevier Masson SAS. All rights reserved.
Design and Synthesis of a Library of Tetracyclic Hydroazulenoisoindoles

PubMed Central

Brummond, Kay M.; Mao, Shuli; Shinde, Sunita N.; Johnston, Paul J.; Day, Billy W.

2009-01-01

Forty-four tetracyclic hydroazulenoisoindoles were synthesized via a tandem cyclopropanation/Cope rearrangement followed by a Diels-Alder sequence from easily available five-membered cyclic cross-conjugated trienones. These trienones were obtained from two different routes depending upon whether R1 and R2 are alkyl or amino acid derived functional groups, via a rhodium(I)-catalyzed cycloisomerization reaction. In order to increase diversity, four maleimides and two 1,2,4-triazoline-3,5-diones were used as dienophiles in the Diels-Alder step. Several Diels-Alder adducts were further reacted under palladium-catalyzed hydrogenation conditions, leading to a diastereoselective reduction of the trisubstituted double bond. This library has demonstrated rapid access to a variety of structurally complex natural product-like compounds via stereochemical diversity and building block diversity approaches. PMID:19366169
Diversity and Functional Analysis of Bacterial Communities Associated with Natural Hydrocarbon Seeps in Acidic Soils at Rainbow Springs, Yellowstone National Park

PubMed Central

Hamamura, Natsuko; Olson, Sarah H.; Ward, David M.; Inskeep, William P.

2005-01-01

In this paper we describe the bacterial communities associated with natural hydrocarbon seeps in nonthermal soils at Rainbow Springs, Yellowstone National Park. Soil chemical analysis revealed high sulfate concentrations and low pH values (pH 2.8 to 3.8), which are characteristic of acid-sulfate geothermal activity. The hydrocarbon composition of the seep soils consisted almost entirely of saturated, acyclic alkanes (e.g., n-alkanes with chain lengths of C15 to C30, as well as branched alkanes, predominately pristane and phytane). Bacterial populations present in the seep soils were phylogenetically characterized by 16S rRNA gene clone library analysis. The majority of the sequences recovered (>75%) were related to sequences of heterotrophic acidophilic bacteria, including Acidisphaera spp. and Acidiphilium spp. of the α-Proteobacteria. Clones related to the iron- and sulfur-oxidizing chemolithotroph Acidithiobacillus spp. were also recovered from one of the seep soils. Hydrocarbon-amended soil-sand mixtures were established to examine [14C]hexadecane mineralization and corresponding changes in the bacterial populations using denaturing gradient gel electrophoresis (DGGE) of 16S rRNA gene fragments. Approximately 50% of the [14C]hexadecane added was recovered as 14CO2 during an 80-day incubation, and this was accompanied by detection of heterotrophic acidophile-related sequences as dominant DGGE bands. An alkane-degrading isolate was cultivated, whose 16S rRNA gene sequence was identical to the sequence of a dominant DGGE band in the soil-sand mixture, as well as the clone sequence recovered most frequently from the original soil. This and the presence of an alkB gene homolog in this isolate confirmed the alkane degradation capability of one population indigenous to acidic hydrocarbon seep soils. PMID:16204508
Diversity and functional analysis of bacterial communities associated with natural hydrocarbon seeps in acidic soils at Rainbow Springs, Yellowstone National Park.

PubMed

Hamamura, Natsuko; Olson, Sarah H; Ward, David M; Inskeep, William P

2005-10-01

In this paper we describe the bacterial communities associated with natural hydrocarbon seeps in nonthermal soils at Rainbow Springs, Yellowstone National Park. Soil chemical analysis revealed high sulfate concentrations and low pH values (pH 2.8 to 3.8), which are characteristic of acid-sulfate geothermal activity. The hydrocarbon composition of the seep soils consisted almost entirely of saturated, acyclic alkanes (e.g., n-alkanes with chain lengths of C15 to C30, as well as branched alkanes, predominately pristane and phytane). Bacterial populations present in the seep soils were phylogenetically characterized by 16S rRNA gene clone library analysis. The majority of the sequences recovered (>75%) were related to sequences of heterotrophic acidophilic bacteria, including Acidisphaera spp. and Acidiphilium spp. of the alpha-Proteobacteria. Clones related to the iron- and sulfur-oxidizing chemolithotroph Acidithiobacillus spp. were also recovered from one of the seep soils. Hydrocarbon-amended soil-sand mixtures were established to examine [14C]hexadecane mineralization and corresponding changes in the bacterial populations using denaturing gradient gel electrophoresis (DGGE) of 16S rRNA gene fragments. Approximately 50% of the [14C]hexadecane added was recovered as 14CO2 during an 80-day incubation, and this was accompanied by detection of heterotrophic acidophile-related sequences as dominant DGGE bands. An alkane-degrading isolate was cultivated, whose 16S rRNA gene sequence was identical to the sequence of a dominant DGGE band in the soil-sand mixture, as well as the clone sequence recovered most frequently from the original soil. This and the presence of an alkB gene homolog in this isolate confirmed the alkane degradation capability of one population indigenous to acidic hydrocarbon seep soils.
On the conservative nature of intragenic recombination

PubMed Central

Drummond, D. Allan; Silberg, Jonathan J.; Meyer, Michelle M.; Wilke, Claus O.; Arnold, Frances H.

2005-01-01

Intragenic recombination rapidly creates protein sequence diversity compared with random mutation, but little is known about the relative effects of recombination and mutation on protein function. Here, we compare recombination of the distantly related β-lactamases PSE-4 and TEM-1 to mutation of PSE-4. We show that, among β-lactamase variants containing the same number of amino acid substitutions, variants created by recombination retain function with a significantly higher probability than those generated by random mutagenesis. We present a simple model that accurately captures the differing effects of mutation and recombination in real and simulated proteins with only four parameters: (i) the amino acid sequence distance between parents, (ii) the number of substitutions, (iii) the average probability that random substitutions will preserve function, and (iv) the average probability that substitutions generated by recombination will preserve function. Our results expose a fundamental functional enrichment in regions of protein sequence space accessible by recombination and provide a framework for evaluating whether the relative rates of mutation and recombination observed in nature reflect the underlying imbalance in their effects on protein function. PMID:15809422
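The abstract above names the model's parameters but does not give its functional form. As a rough illustration only, the sketch below assumes each substitution independently preserves function with a fixed average probability, so the fraction of functional variants decays geometrically with the number of substitutions. This independence assumption, the function name, and the probability values are hypothetical and are not taken from the paper; the sketch also ignores the sequence distance between parents.

```python
def fraction_functional(p_preserve: float, n_substitutions: int) -> float:
    """Expected fraction of variants retaining function after n substitutions,
    assuming each substitution independently preserves function with
    probability p_preserve (an illustrative assumption)."""
    if not 0.0 <= p_preserve <= 1.0:
        raise ValueError("p_preserve must be a probability in [0, 1]")
    if n_substitutions < 0:
        raise ValueError("n_substitutions must be non-negative")
    return p_preserve ** n_substitutions


# Hypothetical per-substitution probabilities, chosen only to show the trend.
p_mutation = 0.6       # random mutagenesis
p_recombination = 0.8  # substitutions introduced by recombination

for m in (1, 5, 10):
    print(m, fraction_functional(p_mutation, m), fraction_functional(p_recombination, m))
```

Under these assumed values, the gap between the two curves widens as the number of substitutions grows, which is the qualitative pattern the abstract describes for recombination versus random mutagenesis.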
On the conservative nature of intragenic recombination.

PubMed

Drummond, D Allan; Silberg, Jonathan J; Meyer, Michelle M; Wilke, Claus O; Arnold, Frances H

2005-04-12

Intragenic recombination rapidly creates protein sequence diversity compared with random mutation, but little is known about the relative effects of recombination and mutation on protein function. Here, we compare recombination of the distantly related beta-lactamases PSE-4 and TEM-1 to mutation of PSE-4. We show that, among beta-lactamase variants containing the same number of amino acid substitutions, variants created by recombination retain function with a significantly higher probability than those generated by random mutagenesis. We present a simple model that accurately captures the differing effects of mutation and recombination in real and simulated proteins with only four parameters: (i) the amino acid sequence distance between parents, (ii) the number of substitutions, (iii) the average probability that random substitutions will preserve function, and (iv) the average probability that substitutions generated by recombination will preserve function. Our results expose a fundamental functional enrichment in regions of protein sequence space accessible by recombination and provide a framework for evaluating whether the relative rates of mutation and recombination observed in nature reflect the underlying imbalance in their effects on protein function.
The Thiamine-Pyrophosphate-Motif

NASA Technical Reports Server (NTRS)

Ciszak, Ewa; Dominiak, Paulina

2004-01-01

Thiamin pyrophosphate (TPP), a derivative of vitamin B1, is a cofactor for enzymes performing catalysis in pathways of energy production including the well known decarboxylation of α-keto acid dehydrogenases followed by transketolation. TPP-dependent enzymes constitute a structurally and functionally diverse group exhibiting multimeric subunit organization, multiple domains and two chemically equivalent catalytic centers. Annotation of functional TPP-dependent enzymes, therefore, has not been trivial due to low sequence similarity related to this complex organization. Our approach to analysis of structures of known TPP-dependent enzymes reveals for the first time features common to this group, which we have termed the TPP-motif. The TPP-motif consists of specific spatial arrangements of structural elements and their specific contacts to provide for a flip-flop, or alternate site, enzymatic mechanism of action. Analysis of structural elements entrained in the flip-flop action displayed by TPP-dependent enzymes reveals a novel definition of the common amino acid sequences. These sequences allow for annotation of TPP-dependent enzymes, thus advancing functional proteomics. Further details of three-dimensional structures of TPP-dependent enzymes will be discussed.
Antarctic ice core samples: culturable bacterial diversity.

PubMed

Shivaji, Sisinthy; Begum, Zareena; Shiva Nageswara Rao, Singireesu Soma; Vishnu Vardhan Reddy, Puram V; Manasa, Poorna; Sailaja, Buddi; Prathiba, Mambatta S; Thamban, Meloth; Krishnan, Kottekkatu P; Singh, Shiv M; Srinivas, Tanuku N R

2013-01-01

Culturable bacterial abundance at 11 different depths of a 50.26 m ice core from the Tallaksenvarden Nunatak, Antarctica, varied from 0.02 to 5.8 × 10(3) CFU ml(-1) of the melt water. A total of 138 bacterial strains were recovered from the 11 different depths of the ice core. Based on 16S rRNA gene sequence analyses, the 138 isolates could be categorized into 25 phylotypes belonging to phyla Actinobacteria, Bacteroidetes, Firmicutes and Proteobacteria. All isolates had 16S rRNA sequences similar to previously determined sequences (97.2-100%). No correlation was observed in the distribution of the isolates at the various depths either at the phylum, genus or species level. The 25 phylotypes varied in growth temperature range, tolerance to NaCl, growth pH range and ability to produce eight different extracellular enzymes at either 4 or 18 °C. Iso-, anteiso-, unsaturated and saturated fatty acids together constituted a significant proportion of the total fatty acid composition. Copyright © 2012 Institut Pasteur. Published by Elsevier Masson SAS. All rights reserved.
Modified RNA-seq method for microbial community and diversity analysis using rRNA in different types of environmental samples

PubMed Central

Yan, Yong-Wei; Zou, Bin; Zhu, Ting; Hozzein, Wael N.

2017-01-01

RNA-seq-based SSU (small subunit) rRNA (ribosomal RNA) analysis has provided a better understanding of potentially active microbial community within environments. However, for RNA-seq library construction, high quantities of purified RNA are typically required. We propose a modified RNA-seq method for SSU rRNA-based microbial community analysis that depends on the direct ligation of a 5’ adaptor to RNA before reverse-transcription. The method requires only a low-input quantity of RNA (10–100 ng) and does not require a DNA removal step. The method was initially tested on three mock communities synthesized with enriched SSU rRNA of archaeal, bacterial and fungal isolates at different ratios, and was subsequently used for environmental samples of high or low biomass. For high-biomass salt-marsh sediments, enriched SSU rRNA and total nucleic acid-derived RNA-seq datasets revealed highly consistent community compositions for all of the SSU rRNA sequences, and as much as 46.4%-59.5% of 16S rRNA sequences were suitable for OTU (operational taxonomic unit)-based community and diversity analyses with complete coverage of V1-V2 regions. OTU-based community structures for the two datasets were also highly consistent with those determined by all of the 16S rRNA reads. For low-biomass samples, total nucleic acid-derived RNA-seq datasets were analyzed, and highly active bacterial taxa were also identified by the OTU-based method, notably including members of the previously underestimated genus Nitrospira and phylum Acidobacteria in tap water, members of the phylum Actinobacteria on a shower curtain, and members of the phylum Cyanobacteria on leaf surfaces. More than half of the bacterial 16S rRNA sequences covered the complete region of primer 8F, and non-coverage rates as high as 38.7% were obtained for phylum-unclassified sequences, providing many opportunities to identify novel bacterial taxa. 
This modified RNA-seq method will provide a better snapshot of diverse microbial communities, most notably by OTU-based analysis, even communities with low-biomass samples. PMID:29016661
DOE Office of Scientific and Technical Information (OSTI.GOV)

Boore, Jeffrey L.; Staton, Joseph

We have determined the sequence of about half (7470 nts) of the mitochondrial genome of the sipunculid Phascolopsis gouldii, the first representative of this phylum to be so studied. All of the 19 identified genes are transcribed from the same DNA strand. The arrangement of these genes is remarkably similar to that of the oligochaete annelid Lumbricus terrestris. Comparison of both the inferred amino acid sequences and the gene arrangements of a variety of diverse metazoan taxa reveals that the phylum Sipuncula is more closely related to Annelida than to Mollusca. This requires reinterpretation of the homology of several embryological features and of patterns of animal body plan evolution.
Sequence-specific unusual (1-->2)-type helical turns in alpha/beta-hybrid peptides.

PubMed

Prabhakaran, Panchami; Kale, Sangram S; Puranik, Vedavati G; Rajamohanan, P R; Chetina, Olga; Howard, Judith A K; Hofmann, Hans-Jörg; Sanjayan, Gangadhar J

2008-12-31

This article describes novel conformationally ordered alpha/beta-hybrid peptides consisting of repeating l-proline-anthranilic acid building blocks. These oligomers adopt a compact, right-handed helical architecture determined by the intrinsic conformational preferences of the individual amino acid residues. The striking feature of these oligomers is their ability to display an unusual periodic pseudo beta-turn network of nine-membered hydrogen-bonded rings formed in the forward direction of the sequence by 1-->2 amino acid interactions both in solid-state and in solution. Conformational investigations of several of these oligomers by single-crystal X-ray diffraction, solution-state NMR, and ab initio MO theory suggest that the characteristic steric and dihedral angle restraints exerted by proline are essential for stabilizing the unusual pseudo beta-turn network found in these oligomers. Replacing proline by the conformationally flexible analogue alanine (Ala) or by the conformationally more constrained alpha-amino isobutyric acid (Aib) had an adverse effect on the stabilization of this structural architecture. These findings increase the potential to design novel secondary structure elements profiting from the steric and dihedral angle constraints of the amino acid constituents and help to augment the conformational space available for synthetic oligomer design with diverse backbone structures.
Diversity of interferon inducible Mx gene in horses and association of variations with susceptibility vis-à-vis resistance against equine influenza infection.

PubMed

Manuja, Balvinder K; Manuja, Anju; Dahiya, Rajni; Singh, Sandeep; Sharma, R C; Gahlot, S K

2014-10-01

Equine influenza (EI) is primarily an infection of the upper respiratory tract and is one of the major infectious respiratory diseases of economic importance in equines. Re-emergence of the disease, species jumping by H3N8 virus in canines and possible threat of human pandemic due to the unpredictable nature of the virus have necessitated research on devising strategies for preventing the disease. The myxovirus resistance protein (Mx) has been reported to confer resistance to Orthomyxovirus infection by modifying cellular functions needed along the viral replication pathway. Polymorphisms and differential antiviral activities of Mx gene have been reported in pigs and chickens. Here we report the diversity of the Mx gene, its expression in response to stimulation with interferon (IFN) α/β, and their association with EI resistance and susceptibility in Marwari horses. Blood samples were collected from horses declared positive for equine influenza and from in-contact animals with a history of no clinical signs. The Mx gene was amplified by reverse transcription from total RNA isolated from peripheral blood mononuclear cells (PBMCs) stimulated with IFN α/β using gene-specific primers. The amplified gene products from representative samples were cloned and sequenced. Nucleotide sequences and deduced amino acid sequences were analyzed. Out of a total of 24 amino acid substitutions, sorting intolerant from tolerant (SIFT) analysis predicted 13 substitutions with functional consequences. Five substitutions (V67A, W123L, E346Y, N347Y, S689N) were observed only in resistant animals. Evolutionary distances based on nucleotide sequences ranged between 0.3-2.0% within equines and 20-24% with other species. On phylogenetic analysis all equine sequences clustered together while other species formed separate clades. Copyright © 2014 Elsevier B.V. All rights reserved.
DNA Microarray Profiling of a Diverse Collection of Nosocomial Methicillin-Resistant Staphylococcus aureus Isolates Assigns the Majority to the Correct Sequence Type and Staphylococcal Cassette Chromosome mec (SCCmec) Type and Results in the Subsequent Identification and Characterization of Novel SCCmec-SCCM1 Composite Islands

PubMed Central

Brennan, Orla M.; Deasy, Emily C.; Rossney, Angela S.; Kinnevey, Peter M.; Ehricht, Ralf; Monecke, Stefan; Coleman, David C.

2012-01-01

One hundred seventy-five isolates representative of methicillin-resistant Staphylococcus aureus (MRSA) clones that predominated in Irish hospitals between 1971 and 2004 and that previously underwent multilocus sequence typing (MLST) and staphylococcal cassette chromosome mec (SCCmec) typing were characterized by spa typing (175 isolates) and DNA microarray profiling (107 isolates). The isolates belonged to 26 sequence type (ST)-SCCmec types and subtypes and 35 spa types. The array assigned all isolates to the correct MLST clonal complex (CC), and 94% (100/107) were assigned an ST, with 98% (98/100) correlating with MLST. The array assigned all isolates to the correct SCCmec type, but subtyping of only some SCCmec elements was possible. Additional SCCmec/SCC genes or DNA sequence variation not detected by SCCmec typing was detected by array profiling, including the SCC-fusidic acid resistance determinant Q6GD50/fusC. Novel SCCmec/SCC composite islands (CIs) were detected among CC8 isolates and comprised SCCmec IIA-IIE, IVE, IVF, or IVg and a ccrAB4-SCC element with 99% DNA sequence identity to SCCM1 from ST8/t024-MRSA, SCCmec VIII, and SCC-CI in Staphylococcus epidermidis. The array showed that the majority of isolates harbored one or more superantigen (94%; 100/107) and immune evasion cluster (91%; 97/107) genes. Apart from fusidic acid and trimethoprim resistance, the correlation between isolate antimicrobial resistance phenotype and the presence of specific resistance genes was ≥97%. Array profiling allowed high-throughput, accurate assignment of MRSA to CCs/STs and SCCmec types and provided further evidence of the diversity of SCCmec/SCC. In most cases, array profiling can accurately predict the resistance phenotype of an isolate. PMID:22869569
Feature selection using a one dimensional naïve Bayes’ classifier increases the accuracy of support vector machine classification of CDR3 repertoires

PubMed Central

Cinelli, Mattia; Sun, Yuxin; Best, Katharine; Heather, James M.; Reich-Zeliger, Shlomit; Shifrut, Eric; Friedman, Nir; Shawe-Taylor, John; Chain, Benny

2017-01-01

Abstract Motivation: Somatic DNA recombination, the hallmark of vertebrate adaptive immunity, has the potential to generate a vast diversity of antigen receptor sequences. How this diversity captures antigen specificity remains incompletely understood. In this study we use high throughput sequencing to compare the global changes in T cell receptor β chain complementarity determining region 3 (CDR3β) sequences following immunization with ovalbumin administered with complete Freund’s adjuvant (CFA) or CFA alone. Results: The CDR3β sequences were deconstructed into short stretches of overlapping contiguous amino acids. The motifs were ranked according to a one-dimensional Bayesian classifier score comparing their frequency in the repertoires of the two immunization classes. The top ranking motifs were selected and used to create feature vectors which were used to train a support vector machine. The support vector machine achieved high classification scores in a leave-one-out validation test reaching >90% in some cases. Summary: The study describes a novel two-stage classification strategy combining a one-dimensional Bayesian classifier with a support vector machine. Using this approach we demonstrate that the frequency of a small number of linear motifs three amino acids in length can accurately identify a CD4 T cell response to ovalbumin against a background response to the complex mixture of antigens which characterize Complete Freund’s Adjuvant. Availability and implementation: The sequence data is available at www.ncbi.nlm.nih.gov/sra/?term=SRP075893. The Decombinator package is available at github.com/innate2adaptive/Decombinator. The R package e1071 is available at the CRAN repository https://cran.r-project.org/web/packages/e1071/index.html. Contact: b.chain@ucl.ac.uk Supplementary information: Supplementary data are available at Bioinformatics online. PMID:28073756
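The first stage described in the abstract above (splitting each CDR3β sequence into overlapping contiguous amino acid motifs and comparing motif frequencies between two repertoires) can be illustrated with a minimal Python sketch. The function names, the toy sequences in the usage note, and the pseudocount are all hypothetical, and the log frequency ratio is only a simple stand-in for ranking; it is not the one-dimensional Bayesian classifier score used by the authors, whose exact form the abstract does not give.

```python
from collections import Counter
from math import log


def kmers(seq, k=3):
    """Return all overlapping contiguous k-mers of an amino acid sequence."""
    return [seq[i:i + k] for i in range(len(seq) - k + 1)]


def motif_frequencies(repertoire, k=3):
    """Relative frequency of each k-mer across a list of sequences."""
    counts = Counter(m for seq in repertoire for m in kmers(seq, k))
    total = sum(counts.values())
    return {m: c / total for m, c in counts.items()}


def log_ratio_scores(rep_a, rep_b, k=3, pseudo=1e-6):
    """Illustrative score: log ratio of motif frequencies in two repertoires.

    Assumption: this is a stand-in ranking score, not the published
    classifier. The pseudocount avoids division by zero for motifs
    seen in only one repertoire.
    """
    fa = motif_frequencies(rep_a, k)
    fb = motif_frequencies(rep_b, k)
    motifs = set(fa) | set(fb)
    return {m: log((fa.get(m, 0.0) + pseudo) / (fb.get(m, 0.0) + pseudo))
            for m in motifs}
```

For example, kmers("CASSL") yields the three triplets CAS, ASS and SSL (CASSL is a made-up toy string, not data from the study). Motifs with the largest absolute scores would then be the candidates for building feature vectors for a downstream classifier such as a support vector machine.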
Pervasive adaptive protein evolution apparent in diversity patterns around amino acid substitutions in Drosophila simulans.

PubMed

Sattath, Shmuel; Elyashiv, Eyal; Kolodny, Oren; Rinott, Yosef; Sella, Guy

2011-02-10

In Drosophila, multiple lines of evidence converge in suggesting that beneficial substitutions to the genome may be common. All suffer from confounding factors, however, such that the interpretation of the evidence (in particular, conclusions about the rate and strength of beneficial substitutions) remains tentative. Here, we use genome-wide polymorphism data in D. simulans and sequenced genomes of its close relatives to construct a readily interpretable characterization of the effects of positive selection: the shape of average neutral diversity around amino acid substitutions. As expected under recurrent selective sweeps, we find a trough in diversity levels around amino acid but not around synonymous substitutions, a distinctive pattern that is not expected under alternative models. This characterization is richer than previous approaches, which relied on limited summaries of the data (e.g., the slope of a scatter plot), and relates to underlying selection parameters in a straightforward way, allowing us to make more reliable inferences about the prevalence and strength of adaptation. Specifically, we develop a coalescent-based model for the shape of the entire curve and use it to infer adaptive parameters by maximum likelihood. Our inference suggests that ∼13% of amino acid substitutions cause selective sweeps. Interestingly, it reveals two classes of beneficial fixations: a minority (approximately 3%) that appears to have had large selective effects and accounts for most of the reduction in diversity, and the remaining 10%, which seem to have had very weak selective effects. These estimates therefore help to reconcile the apparent conflict among previously published estimates of the strength of selection. More generally, our findings provide unequivocal evidence for strongly beneficial substitutions in Drosophila and illustrate how the rapidly accumulating genome-wide data can be leveraged to address enduring questions about the genetic basis of adaptation.
Mining for Nonribosomal Peptide Synthetase and Polyketide Synthase Genes Revealed a High Level of Diversity in the Sphagnum Bog Metagenome.

PubMed

Müller, Christina A; Oberauner-Wappis, Lisa; Peyman, Armin; Amos, Gregory C A; Wellington, Elizabeth M H; Berg, Gabriele

2015-08-01

Sphagnum bog ecosystems are among the oldest vegetation forms harboring a specific microbial community and are known to produce an exceptionally wide variety of bioactive substances. Although the Sphagnum metagenome shows a rich secondary metabolism, the genes have not yet been explored. To analyze nonribosomal peptide synthetases (NRPSs) and polyketide synthases (PKSs), the diversity of NRPS and PKS genes in Sphagnum-associated metagenomes was investigated by in silico data mining and sequence-based screening (PCR amplification of 9,500 fosmid clones). The in silico Illumina-based metagenomic approach resulted in the identification of 279 NRPSs and 346 PKSs, as well as 40 PKS-NRPS hybrid gene sequences. The occurrence of NRPS sequences was strongly dominated by the members of the Proteobacteria phylum, especially by species of the Burkholderia genus, while PKS sequences were mainly affiliated with Actinobacteria. Thirteen novel NRPS-related sequences were identified by PCR amplification screening, displaying amino acid identities of 48% to 91% to annotated sequences of members of the phyla Proteobacteria, Actinobacteria, and Cyanobacteria. Some of the identified metagenomic clones showed the closest similarity to peptide synthases from Burkholderia or Lysobacter, which are emerging bacterial sources of as-yet-undescribed bioactive metabolites. This report highlights the role of the extreme natural ecosystems as a promising source for detection of secondary compounds and enzymes, serving as a source for biotechnological applications. Copyright © 2015, American Society for Microbiology. All Rights Reserved.
Comparative analysis of the feline immunoglobulin repertoire.

PubMed

Steiniger, Sebastian C J; Glanville, Jacob; Harris, Douglas W; Wilson, Thomas L; Ippolito, Gregory C; Dunham, Steven A

2017-03-01

Next-Generation Sequencing combined with bioinformatics is a powerful tool for analyzing the large number of DNA sequences present in the expressed antibody repertoire and these data sets can be used to advance a number of research areas including antibody discovery and engineering. The accurate measurement of the immune repertoire sequence composition, diversity and abundance is important for understanding the repertoire response in infections, vaccinations and cancer immunology and could also be useful for elucidating novel molecular targets. In this study 4 individual domestic cats (Felis catus) were subjected to antibody repertoire sequencing, generating a total of 1079863 sequences for IgG VH, 1050824 for IgM VH, 569518 for VK and 450195 for VL. Our analysis suggests that similar VDJ expression patterns exist across all cats. Similar to the canine repertoire, the feline repertoire is dominated by a single subgroup, namely VH3. The antibody paratope of felines showed similar amino acid variation when compared to human, mouse and canine counterparts. All animals show a similarly skewed VH CDR-H3 profile and, when compared to canine, human and mouse, distinct differences are observed. Our study represents the first attempt to characterize sequence diversity in the expressed feline antibody repertoire and this demonstrates the utility of using NGS to elucidate entire antibody repertoires from individual animals. These data provide significant insight into understanding the feline immune system function. Copyright © 2017 International Alliance for Biological Standardization. Published by Elsevier Ltd. All rights reserved.
Propionibacterium acnes: Disease-Causing Agent or Common Contaminant? Detection in Diverse Patient Samples by Next-Generation Sequencing

PubMed Central

Friis-Nielsen, Jens; Vinner, Lasse; Hansen, Thomas Arn; Richter, Stine Raith; Fridholm, Helena; Herrera, Jose Alejandro Romero; Lund, Ole; Brunak, Søren; Izarzugaza, Jose M. G.; Mourier, Tobias; Nielsen, Lars Peter

2016-01-01

Propionibacterium acnes is the most abundant bacterium on human skin, particularly in sebaceous areas. P. acnes is suggested to be an opportunistic pathogen involved in the development of diverse medical conditions but is also a proven contaminant of human clinical samples and surgical wounds. Its significance as a pathogen is consequently a matter of debate. In the present study, we investigated the presence of P. acnes DNA in 250 next-generation sequencing data sets generated from 180 samples of 20 different sample types, mostly of cancerous origin. The samples were subjected to either microbial enrichment, involving nuclease treatment to reduce the amount of host nucleic acids, or shotgun sequencing. We detected high proportions of P. acnes DNA in enriched samples, particularly skin tissue-derived and other tissue samples, with the levels being higher in enriched samples than in shotgun-sequenced samples. P. acnes reads were detected in most samples analyzed, though the proportions in most shotgun-sequenced samples were low. Our results show that P. acnes can be detected in practically all sample types when molecular methods, such as next-generation sequencing, are employed. The possibility of contamination from the patient or other sources, including laboratory reagents or environment, should therefore always be considered carefully when P. acnes is detected in clinical samples. We advocate that detection of P. acnes always be accompanied by experiments validating the association between this bacterium and any clinical condition. PMID:26818667
Evolutionary Pattern of the FAE1 Gene in Brassicaceae and Its Correlation with the Erucic Acid Trait

PubMed Central

Li, Mimi; Peng, Bin; Guo, Haisong; Yan, Qinqin; Hang, Yueyu

2013-01-01

The fatty acid elongase 1 (FAE1) gene catalyzes the initial condensation step in the elongation pathway of VLCFA (very long chain fatty acid) biosynthesis and is thus a key gene in erucic acid biosynthesis. Based on a worldwide collection of 62 accessions representing 14 tribes, 31 genera, 51 species, 4 subspecies and 7 varieties, we conducted a phylogenetic reconstruction and correlation analysis between genetic variations in the FAE1 gene and the erucic acid trait, attempting to gain insight into the evolutionary patterns and the correlations between genetic variations in FAE1 and trait variations. The five clear, deeply diverged clades detected in the phylogenetic reconstruction are largely congruent with a previous multiple gene-derived phylogeny. The Ka/Ks ratio (<1) and overall low level of nucleotide diversity in the FAE1 gene suggest that purifying selection is the major evolutionary force acting on this gene. Sequence variations in FAE1 show a strong correlation with the content of erucic acid in seeds, suggesting a causal link between the two. Furthermore, we detected 16 mutations that were fixed between the low and high phenotypes of the FAE1 gene, which constitute candidate active sites in this gene for altering the content of erucic acid in seeds. Our findings begin to shed light on the evolutionary pattern of this important gene and represent the first step in elucidating how the sequence variations impact the production of erucic acid in plants. PMID:24358289
Genetic diversity of influenza A(H1N1)2009 virus circulating during the season 2010-2011 in Spain.

PubMed

Ledesma, Juan; Pozo, Francisco; Reina, Gabriel; Blasco, Miriam; Rodríguez, Guadalupe; Montes, Milagrosa; López-Miragaya, Isabel; Salvador, Carmen; Reina, Jordi; Ortíz de Lejarazu, Raúl; Egido, Pilar; López Barba, José; Delgado, Concepción; Cuevas, María Teresa; Casas, Inmaculada

2012-01-01

Genetic diversity of influenza A(H1N1)2009 viruses has been reported since the pandemic virus emerged in April 2009. Different genetic clades have been identified and defined based on amino acid substitutions found in the haemagglutinin (HA) protein sequences. In Spain, circulating influenza viruses are monitored each season by the regional laboratories enrolled in the Spanish Influenza Surveillance System (SISS). The analysis of the HA gene sequence helps to detect the genetic diversity and viral evolution. The objective of this study was to analyse the genetic diversity of influenza A(H1N1)2009 viruses circulating in Spain during the season 2010-2011 based on the HA gene sequence. Phylogenetic analysis based on the HA1 subunit of the haemagglutinin gene was carried out on 220 influenza A(H1N1)2009 viruses circulating during the season 2010-2011. Six different genetic groups were identified among circulating A(H1N1)2009 viruses, five of which had been previously reported during season 2010-2011. A new group, characterized by E172K and K308E changes and a proline at position 83, was observed in 12.27% of the Spanish viruses. Co-circulation of six different genetic groups of influenza A(H1N1)2009 viruses was identified in Spain during the season 2010-2011. Nevertheless, at this stage, none of the groups identified to date have resulted in significant antigenic changes according to data collected by World Health Organization Collaborating Centres for influenza surveillance. Copyright © 2011 Elsevier B.V. All rights reserved.

SeSaM-Tv-II generates a protein sequence space that is unobtainable by epPCR.

PubMed

Mundhada, Hemanshu; Marienhagen, Jan; Scacioc, Andreea; Schenk, Alexander; Roccatano, Danilo; Schwaneberg, Ulrich

2011-07-04

Generating high-quality mutant libraries in which each amino acid is equally targeted and substituted in a chemically diverse manner is crucial to obtain improved variants in small mutant libraries. The sequence saturation mutagenesis method (SeSaM-Tv(+)) offers the opportunity to generate such high-quality mutant libraries by introducing consecutive mutations and by enriching transversions. In this study, automated gel electrophoresis, real-time quantitative PCR, and a phosphorimager quantification system were developed and employed to optimize each step of the previously reported SeSaM-Tv(+) method. Advancements of the SeSaM-Tv(+) protocol and the use of a novel DNA polymerase quadrupled the number of transversions, by doubling the fraction of consecutive mutations (from 16.7 to 37.1 %). About 33 % of all amino acid substitutions observed in a model library are rarely introduced by epPCR methods, and around 10 % of all clones carried amino acid substitutions that are unobtainable by epPCR. Copyright © 2011 WILEY-VCH Verlag GmbH & Co. KGaA, Weinheim.
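The distinction between transitions and transversions that the abstract above relies on can be made concrete with a short Python sketch. A transition swaps a purine for a purine (A, G) or a pyrimidine for a pyrimidine (C, T), while a transversion swaps one class for the other. The function name is hypothetical and the code only illustrates the standard definition; it is not part of the SeSaM protocol.

```python
PURINES = {"A", "G"}
PYRIMIDINES = {"C", "T"}
BASES = PURINES | PYRIMIDINES


def classify_substitution(ref, alt):
    """Classify a single-nucleotide DNA substitution.

    Returns "transition" when both bases belong to the same chemical
    class (purine or pyrimidine) and "transversion" otherwise.
    """
    ref, alt = ref.upper(), alt.upper()
    if ref not in BASES or alt not in BASES:
        raise ValueError("bases must be one of A, C, G, T")
    if ref == alt:
        raise ValueError("ref and alt are identical; no substitution")
    same_class = (ref in PURINES) == (alt in PURINES)
    return "transition" if same_class else "transversion"


# Count the two kinds among all 12 possible single-base substitutions.
all_subs = [(r, a) for r in sorted(BASES) for a in sorted(BASES) if r != a]
n_transitions = sum(classify_substitution(r, a) == "transition"
                    for r, a in all_subs)
n_transversions = len(all_subs) - n_transitions
```

Of the 12 possible single-base substitutions, 4 are transitions and 8 are transversions, so a method that enriches transversions gives access to a larger share of the possible base changes.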
Two Theileria parva CD8 T Cell Antigen Genes Are More Variable in Buffalo than Cattle Parasites, but Differ in Pattern of Sequence Diversity

PubMed Central

Pelle, Roger; Graham, Simon P.; Njahira, Moses N.; Osaso, Julius; Saya, Rosemary M.; Odongo, David O.; Toye, Philip G.; Spooner, Paul R.; Musoke, Anthony J.; Mwangi, Duncan M.; Taracha, Evans L. N.; Morrison, W. Ivan; Weir, William; Silva, Joana C.; Bishop, Richard P.

2011-01-01

Background Theileria parva causes an acute fatal disease in cattle, but infections are asymptomatic in the African buffalo (Syncerus caffer). Cattle can be immunized against the parasite by infection and treatment, but immunity is partially strain specific. Available data indicate that CD8+ T lymphocyte responses mediate protection and, recently, several parasite antigens recognised by CD8+ T cells have been identified. This study set out to determine the nature and extent of polymorphism in two of these antigens, Tp1 and Tp2, which contain defined CD8+ T-cell epitopes, and to analyse the sequences for evidence of selection. Methodology/Principal Findings Partial sequencing of the Tp1 gene and the full-length Tp2 gene from 82 T. parva isolates revealed extensive polymorphism in both antigens, including the epitope-containing regions. Single nucleotide polymorphisms were detected at 51 positions (∼12%) in Tp1 and in 320 positions (∼61%) in Tp2. Together with two short indels in Tp1, these resulted in 30 and 42 protein variants of Tp1 and Tp2, respectively. Although evidence of positive selection was found for multiple amino acid residues, there was no preferential involvement of T cell epitope residues. Overall, the extent of diversity was much greater in T. parva isolates originating from buffalo than in isolates known to be transmissible among cattle. Conclusions/Significance The results indicate that T. parva parasites maintained in cattle represent a subset of the overall T. parva population, which has become adapted for tick transmission between cattle. The absence of obvious enrichment for positively selected amino acid residues within defined epitopes indicates either that diversity is not predominantly driven by selection exerted by host T cells, or that such selection is not detectable by the methods employed due to unidentified epitopes elsewhere in the antigens. Further functional studies are required to address this latter point. PMID:21559495
Two Theileria parva CD8 T cell antigen genes are more variable in buffalo than cattle parasites, but differ in pattern of sequence diversity.

PubMed

Pelle, Roger; Graham, Simon P; Njahira, Moses N; Osaso, Julius; Saya, Rosemary M; Odongo, David O; Toye, Philip G; Spooner, Paul R; Musoke, Anthony J; Mwangi, Duncan M; Taracha, Evans L N; Morrison, W Ivan; Weir, William; Silva, Joana C; Bishop, Richard P

2011-04-29

Theileria parva causes an acute fatal disease in cattle, but infections are asymptomatic in the African buffalo (Syncerus caffer). Cattle can be immunized against the parasite by infection and treatment, but immunity is partially strain specific. Available data indicate that CD8(+) T lymphocyte responses mediate protection and, recently, several parasite antigens recognised by CD8(+) T cells have been identified. This study set out to determine the nature and extent of polymorphism in two of these antigens, Tp1 and Tp2, which contain defined CD8(+) T-cell epitopes, and to analyse the sequences for evidence of selection. Partial sequencing of the Tp1 gene and the full-length Tp2 gene from 82 T. parva isolates revealed extensive polymorphism in both antigens, including the epitope-containing regions. Single nucleotide polymorphisms were detected at 51 positions (∼12%) in Tp1 and in 320 positions (∼61%) in Tp2. Together with two short indels in Tp1, these resulted in 30 and 42 protein variants of Tp1 and Tp2, respectively. Although evidence of positive selection was found for multiple amino acid residues, there was no preferential involvement of T cell epitope residues. Overall, the extent of diversity was much greater in T. parva isolates originating from buffalo than in isolates known to be transmissible among cattle. The results indicate that T. parva parasites maintained in cattle represent a subset of the overall T. parva population, which has become adapted for tick transmission between cattle. The absence of obvious enrichment for positively selected amino acid residues within defined epitopes indicates either that diversity is not predominantly driven by selection exerted by host T cells, or that such selection is not detectable by the methods employed due to unidentified epitopes elsewhere in the antigens. Further functional studies are required to address this latter point.
Microbial Community Structure and Arsenic Biogeochemistry in an Acid Vapor-Formed Spring in Tengchong Geothermal Area, China.

PubMed

Jiang, Zhou; Li, Ping; Jiang, Dawei; Dai, Xinyue; Zhang, Rui; Wang, Yanhong; Wang, Yanxin

2016-01-01

Arsenic biogeochemistry has been studied extensively in acid sulfate-chloride hot springs, but not in acid sulfate hot springs with low chloride. In this study, Zhenzhuquan in Tengchong geothermal area, a representative acid sulfate hot spring with low chloride, was chosen to study arsenic geochemistry and microbial community structure using Illumina MiSeq sequencing. Over 0.3 million 16S rRNA sequence reads were obtained from 6-paired parallel water and sediment samples along its outflow channel. Arsenic oxidation occurred in the Zhenzhuquan pool, with distinctly high ratios of arsenate to total dissolved arsenic (0.73-0.86). Coupled with iron and sulfur oxidation along the outflow channel, arsenic accumulated in downstream sediments with concentrations up to 16.44 g/kg and appeared to significantly constrain their microbial community diversity. These oxidations might be correlated with the appearance of some putative functional microbial populations, such as Aquificae and Pseudomonas (arsenic oxidation), Sulfolobus (sulfur and iron oxidation), Metallosphaera and Acidicaldus (iron oxidation). Temperature, total organic carbon and dissolved oxygen significantly shaped the microbial community structure of upstream and downstream samples. In the upstream outflow channel region, most microbial populations were microaerophilic/anaerobic thermophiles and hyperthermophiles, such as Sulfolobus, Nocardia, Fervidicoccus, Delftia, and Ralstonia. In the downstream region, aerobic heterotrophic mesophiles and thermophiles were identified, including Ktedonobacteria, Acidicaldus, Chthonomonas and Sphingobacteria. A total of 72.41-95.91% unassigned-genus sequences were derived from the downstream high arsenic sediments 16S rRNA clone libraries. This study could enable us to achieve an integrated understanding of arsenic biogeochemistry in acid hot springs.
Molecular modelling of the Norrie disease protein predicts a cystine knot growth factor tertiary structure.

PubMed

Meitinger, T; Meindl, A; Bork, P; Rost, B; Sander, C; Haasemann, M; Murken, J

1993-12-01

The X-linked gene for Norrie disease, which is characterized by blindness, deafness and mental retardation, has been cloned recently. This gene has been thought to code for a putative extracellular factor; its predicted amino acid sequence is homologous to the C-terminal domain of diverse extracellular proteins. Sequence pattern searches and three-dimensional modelling now suggest that the Norrie disease protein (NDP) has a tertiary structure similar to that of transforming growth factor beta (TGF beta). Our model identifies NDP as a member of an emerging family of growth factors containing a cystine knot motif, with direct implications for the physiological role of NDP. The model also sheds light on sequence-related domains such as the C-terminal domain of mucins and of von Willebrand factor.
Algorithm to find distant repeats in a single protein sequence

PubMed Central

Banerjee, Nirjhar; Sarani, Rangarajan; Ranjani, Chellamuthu Vasuki; Sowmiya, Govindaraj; Michael, Daliah; Balakrishnan, Narayanasamy; Sekar, Kanagaraj

2008-01-01

Distant repeats in protein sequences play an important role in various aspects of protein analysis. A careful analysis of distant repeats would help establish how the repeats relate to function and three-dimensional structure over the course of evolution. Further, it sheds light on the diversity of duplication during evolution. To this end, an algorithm has been developed to find all distant repeats in a protein sequence. Scores from the Point Accepted Mutation (PAM) matrix are used to identify amino acid substitutions while detecting the distant repeats. Due to the biological importance of distant repeats, the proposed algorithm will be of importance to structural biologists, molecular biologists, biochemists and researchers involved in phylogenetic and evolutionary studies. PMID:19052663
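The abstract above does not spell out how the repeats are scored, so the following is only a minimal sketch of the general idea, assuming that fixed-length windows of one sequence are compared pairwise and that substituted residues still earn partial credit. The function names, the window and threshold parameters, and the toy scoring scheme are hypothetical; in particular, `toy_score` is a crude stand-in for a real PAM matrix, not the authors' scoring.

```python
from itertools import combinations

# Assumption: rough physicochemical groups used as a stand-in for PAM scores.
GROUPS = ["AVLIM", "FWY", "KRH", "DE", "STNQ", "GP", "C"]


def toy_score(a: str, b: str) -> int:
    """Score 2 for identical residues, 1 for same-group residues, -1 otherwise."""
    if a == b:
        return 2
    if any(a in g and b in g for g in GROUPS):
        return 1
    return -1


def find_repeats(seq: str, window: int, min_score: int, score=toy_score):
    """Return (i, j, total) for each pair of non-overlapping windows
    starting at i < j whose summed substitution score is >= min_score."""
    hits = []
    starts = range(len(seq) - window + 1)
    for i, j in combinations(starts, 2):
        if j - i < window:  # skip windows that overlap each other
            continue
        total = sum(
            score(a, b)
            for a, b in zip(seq[i:i + window], seq[j:j + window])
        )
        if total >= min_score:
            hits.append((i, j, total))
    return hits


if __name__ == "__main__":
    # "KVLA" and "RILA" differ at two positions but both substitutions
    # are conservative under the toy scheme, so the pair is reported.
    print(find_repeats("KVLAGGGGGRILA", window=4, min_score=6))
```

Swapping `toy_score` for a lookup into a real PAM table would bring the sketch closer to what the abstract describes; the all-pairs comparison is quadratic in sequence length and is meant for illustration only.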
Sequence variation of the glycoprotein gene identifies three distinct lineages within field isolates of viral hemorrhagic septicemia virus, a fish rhabdovirus

USGS Publications Warehouse

Benmansour, A.; Basurco, B.; Monnier, A.F.; Vende, P.; Winton, J.R.; de Kinkelin, P.

1997-01-01

To evaluate the genetic diversity of viral haemorrhagic septicaemia virus (VHSV), the sequences of the glycoprotein (G) genes of 11 North American and European isolates were determined. Comparison with the G protein of representative members of the family Rhabdoviridae suggested that VHSV was a different virus species from infectious haematopoietic necrosis virus (IHNV) and Hirame rhabdovirus (HIRRV). At a higher taxonomic level, VHSV, IHNV and HIRRV formed a group which was genetically closest to the genus Lyssavirus. Compared with each other, the G genes of VHSV displayed a dissimilar overall genetic diversity which correlated with differences in geographical origin. The multiple sequence alignment of the complete G protein showed that the divergent positions were not uniformly distributed along the sequence. A central region (amino acid positions 245-300) accumulated substitutions and appeared to be highly variable. The genetic heterogeneity within a single isolate was high, with an apparent internal mutation frequency of 1.2 x 10(-3) per nucleotide site, attesting to the quasispecies nature of the viral population. The phylogeny separated VHSV strains according to the major geographical area of isolation: genotype I for continental Europe, genotype II for the British Isles, and genotype III for North America. Isolates from continental Europe exhibited the highest genetic variability, with sub-groups correlated partially with the serological classification. Neither neutralizing polyclonal sera nor monoclonal antibodies were able to discriminate between the genotypes. The overall structure of the phylogenetic tree suggests that VHSV genetic diversity and evolution fit within the model of random change and positive selection operating on quasispecies.
Antimicrobial peptides containing unnatural amino acid exhibit potent bactericidal activity against ESKAPE pathogens.

PubMed

Hicks, R P; Abercrombie, J J; Wong, R K; Leung, K P

2013-01-01

A series of 36 synthetic antimicrobial peptides containing unnatural amino acids were screened to determine their effectiveness against Enterococcus faecium, Staphylococcus aureus, Klebsiella pneumoniae, Acinetobacter baumannii, Pseudomonas aeruginosa, and Enterobacter species (ESKAPE) pathogens, which are known to commonly infect chronic wounds. The primary amino acid sequences of these peptides incorporate either three or six dipeptide units consisting of the unnatural amino acids Tetrahydroisoquinolinecarboxylic acid (Tic) and Octahydroindolecarboxylic acid (Oic). The Tic-Oic dipeptide units are separated by SPACER amino acids with specific physicochemical properties that control how these peptides interact with bacterial cell membranes of different chemical compositions. These peptides exhibited minimum inhibitory concentrations (MIC) against these pathogens in the range from >100 to 6.25 μg/mL. The observed diversity of MIC values for these peptides against the various bacterial strains is consistent with our hypothesis that the complementarity of the physicochemical properties of the peptide and the lipid of the bacteria's cell membrane determines the resulting antibacterial activity of the peptide. Published by Elsevier Ltd.
Multiple Genetic Mechanisms Contribute to Visual Sensitivity Variation in the Labridae

PubMed Central

Phillips, Genevieve A.C.; Carleton, Karen L.; Marshall, N. Justin

2016-01-01

Coral reefs are one of the most spectrally diverse environments, both in terms of habitat and animal color. Species identity, sex, and camouflage are drivers of the phenotypic diversity seen in coral reef fishes, but how the phenotypic diversity is reflected in the genotype remains to be answered. The labrids are a large, polyphyletic family of coral reef fishes that display a diverse range of colors, including developmental color morphs and extensive behavioral ecologies. Here, we assess the opsin sequence and expression diversity among labrids from the Great Barrier Reef, Australia. We found that labrids express a diverse palette of visual opsins, with gene duplications in both RH2 and LWS genes. The majority of opsins expressed were within the mid-to-long wavelength sensitive classes (RH2 and LWS). Three of the labrid species expressed SWS1 (ultra-violet sensitive) opsins with the majority expressing the violet-sensitive SWS2B gene and none expressing SWS2A. We used knowledge about spectral tuning sites to calculate approximate spectral sensitivities (λmax) for individual species’ visual pigments, which corresponded well with previously published λmax values for closely related species (SWS1: 356–370 nm; SWS2B: 421–451 nm; RH2B: 452–492 nm; RH2A: 516–528 nm; LWS1: 554–555 nm; LWS2: 561–562 nm). In contrast to the phenotypic diversity displayed via color patterns and feeding ecology, there was little amino acid diversity within the known opsin sequence tuning sites. However, gene duplications and differential expression provide alternative mechanisms for tuning visual pigments, resulting in variable visual sensitivities among labrid species. PMID:26464127
DNA Sequence Polymorphism of the Lactate Dehydrogenase Gene from Iranian Plasmodium vivax and Plasmodium falciparum Isolates.

PubMed

Getacher Feleke, Daniel; Nateghpour, Mehdi; Motevalli Haghi, Afsaneh; Hajjaran, Homa; Farivar, Leila; Mohebali, Mehdi; Raoofian, Reza

2015-01-01

Parasite lactate dehydrogenase (pLDH) is extensively employed in malaria rapid diagnostic tests (RDTs). Moreover, it is a well-known drug target candidate. However, the genetic diversity of this gene might influence the performance of RDT kits and its drug target candidacy. This study aimed to determine the polymorphism of the pLDH gene in Iranian isolates of P. vivax and P. falciparum. Genomic DNA was extracted from whole blood of microscopically confirmed P. vivax and P. falciparum infected patients. The pLDH gene of P. falciparum and P. vivax was amplified using conventional PCR from 43 symptomatic malaria patients from Sistan and Baluchistan Province, Southeast Iran, from 2012 to 2013. Sequence analysis of 15 P. vivax LDH sequences showed that fourteen had 100% identity with the P. vivax Sal-1 and Belem strains. Two nucleotide substitutions were detected, only one of which resulted in an amino acid change. Analysis of P. falciparum LDH sequences showed that six of the seven sequences had 100% homology with P. falciparum 3D7 and Mzr-1. Moreover, PfLDH displayed three nucleotide changes that resulted in a change to only one amino acid. PvLDH and PfLDH showed 75%-76% nucleotide and 90.4%-90.76% amino acid homology. The pLDH gene from Iranian P. falciparum and P. vivax isolates displayed 98.8-100% homology with 1-3 nucleotide substitutions. This indicated that the gene was relatively conserved. Additional studies are needed to determine whether this genetic variation can influence the performance of pLDH-based RDTs.
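Percentages such as those reported above are typically computed column by column over a pairwise alignment. The sketch below shows one common convention, assuming the two sequences are already aligned to equal length and that columns containing a gap in either sequence are excluded; the function name and the gap handling are illustrative assumptions, since the abstract does not say which convention the authors used.

```python
def percent_identity(a: str, b: str) -> float:
    """Percent of aligned columns with identical residues.

    Assumes `a` and `b` are rows of the same alignment; columns where
    either sequence has a gap ('-') are skipped.
    """
    if len(a) != len(b):
        raise ValueError("sequences must be aligned to the same length")
    pairs = [
        (x, y)
        for x, y in zip(a.upper(), b.upper())
        if x != "-" and y != "-"
    ]
    if not pairs:
        raise ValueError("no comparable positions")
    matches = sum(x == y for x, y in pairs)
    return 100.0 * matches / len(pairs)


if __name__ == "__main__":
    # One mismatch in eight aligned nucleotides.
    print(percent_identity("ATGCATGC", "ATGCATGA"))
```

Other conventions divide by the full alignment length or by the shorter sequence, which can shift the result by a fraction of a percent, so the denominator is worth stating whenever such figures are compared across studies.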
Cloning and sequence analysis of sucrose phosphate synthase gene from varieties of Pennisetum species.

PubMed

Li, H C; Lu, H B; Yang, F Y; Liu, S J; Bai, C J; Zhang, Y W

2015-03-31

Sucrose phosphate synthase (SPS) is an enzyme used by higher plants for sucrose synthesis. In this study, three primer sets were designed on the basis of known SPS sequences from maize (GenBank: NM_001112224.1) and sugarcane (GenBank: JN584485.1), and five novel SPS genes were identified by RT-PCR from the genomes of Pennisetum spp (the hybrid P. americanum x P. purpureum, P. purpureum Schum., P. purpureum Schum. cv. Red, P. purpureum Schum. cv. Taiwan, and P. purpureum Schum. cv. Mott). The cloned sequences showed 99.9% identity and 80-88% similarity to the SPS sequences of other plants. The SPS gene of hybrid Pennisetum had one nucleotide and four amino acid polymorphisms compared to the other four germplasms, and cluster analysis was performed to assess genetic diversity in this species. Additional characterization of the SPS gene product can potentially allow Pennisetum to be exploited as a biofuel source.
Microbial Diversity and Its Relationship to Physicochemical Characteristics of the Water in Two Extreme Acidic Pit Lakes from the Iberian Pyrite Belt (SW Spain)

PubMed Central

López-Pamo, Enrique; Gomariz, María; Amils, Ricardo; Aguilera, Ángeles

2013-01-01

The Iberian Pyrite Belt (IPB) hosts one of the world’s largest accumulations of acidic mine wastes and pit lakes. The mineralogical and textural characteristics of the IPB ores have favored the oxidation and dissolution of metallic sulfides, mainly pyrite, and the subsequent formation of acidic mining drainages. This work reports the physical properties, hydrogeochemical characteristics, and microbial diversity of two pit lakes located in the IPB. Both pit lakes are acidic and showed high concentrations of sulfate and dissolved metals. Concentrations of sulfate and heavy metals were higher in the Nuestra Señora del Carmen lake (NSC) by one order of magnitude than in the Concepción (CN) lake. The hydrochemical characteristics of NSC were typical of acid mine waters and can be compared with other acidic environments. When compared to other IPB acidic pit lakes, the superficial water of CN is more diluted than that of any of the others due, probably, to the strong influence of runoff water. Both pit lakes showed chemical and thermal stratification with well defined chemoclines. One particular characteristic of NSC is that it has developed a chemocline very close to the surface (2 m depth). Microbial community composition of the water column was analyzed by 16S and 18S rRNA gene cloning and sequencing. The microorganisms detected in NSC were characteristic of acid mine drainage (AMD), including iron oxidizing bacteria (Leptospirillum, Acidithiobacillus ferrooxidans) and facultative iron reducing bacteria and archaea (Acidithiobacillus ferrooxidans, Acidiphilium, Actinobacteria, Acidimicrobiales, Ferroplasma) detected in the bottom layer. Diversity in CN was higher than in NSC. Microorganisms known from AMD systems (Acidiphilium, Acidobacteria and Ferrovum) and microorganisms never reported from AMD systems were identified. 
Taking into consideration the hydrochemical characteristics of these pit lakes and the spatial distribution of the identified microorganisms, a model explaining their geomicrobiology is advanced. PMID:23840525
Ecological roles of dominant and rare prokaryotes in acid mine drainage revealed by metagenomics and metatranscriptomics.

PubMed

Hua, Zheng-Shuang; Han, Yu-Jiao; Chen, Lin-Xing; Liu, Jun; Hu, Min; Li, Sheng-Jin; Kuang, Jia-Liang; Chain, Patrick S G; Huang, Li-Nan; Shu, Wen-Sheng

2015-06-01

High-throughput sequencing is expanding our knowledge of microbial diversity in the environment. Still, understanding the metabolic potentials and ecological roles of rare and uncultured microbes in natural communities remains a major challenge. To this end, we applied a 'divide and conquer' strategy that partitioned a massive metagenomic data set (>100 Gbp) into subsets based on K-mer frequency in sequence assembly to a low-diversity acid mine drainage (AMD) microbial community and, by integrating with an additional metatranscriptomic assembly, successfully obtained 11 draft genomes most of which represent yet uncultured and/or rare taxa (relative abundance <1%). We report the first genome of a naturally occurring Ferrovum population (relative abundance >90%) and its metabolic potentials and gene expression profile, providing initial molecular insights into the ecological role of these lesser known, but potentially important, microorganisms in the AMD environment. Gene transcriptional analysis of the active taxa revealed major metabolic capabilities executed in situ, including carbon- and nitrogen-related metabolisms associated with syntrophic interactions, iron and sulfur oxidation, which are key in energy conservation and AMD generation, and the mechanisms of adaptation and response to the environmental stresses (heavy metals, low pH and oxidative stress). Remarkably, nitrogen fixation and sulfur oxidation were performed by the rare taxa, indicating their critical roles in the overall functioning and assembly of the AMD community. Our study demonstrates the potential of the 'divide and conquer' strategy in high-throughput sequencing data assembly for genome reconstruction and functional partitioning analysis of both dominant and rare species in natural microbial assemblages.
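The abstract above gives no detail on how the data set was partitioned by K-mer frequency, so the following is only a toy sketch of the general idea: reads whose k-mers occur often across the data set are likely to come from abundant populations, and separating them from low-frequency reads lets each subset be assembled on its own. The function names, the use of the median k-mer count per read, and the single cutoff are all assumptions made for illustration, not the authors' pipeline.

```python
from collections import Counter
from statistics import median


def kmers(read: str, k: int) -> list[str]:
    """All overlapping substrings of length k in a read."""
    return [read[i:i + k] for i in range(len(read) - k + 1)]


def partition_reads(reads: list[str], k: int, cutoff: float):
    """Split reads into (high, low) bins by median k-mer count.

    A read goes to `high` when the median data-set-wide count of its
    k-mers is >= cutoff; reads shorter than k go to `low`.
    """
    counts = Counter(km for r in reads for km in kmers(r, k))
    high, low = [], []
    for r in reads:
        kms = kmers(r, k)
        if not kms:
            low.append(r)
            continue
        m = median(counts[km] for km in kms)
        (high if m >= cutoff else low).append(r)
    return high, low


if __name__ == "__main__":
    reads = ["AAAA", "AAAA", "AAAA", "ACGT"]
    high, low = partition_reads(reads, k=3, cutoff=2)
    print(len(high), len(low))
```

Real data sets of the size mentioned in the abstract would need memory-efficient k-mer counting and several abundance bins rather than an in-memory `Counter` and one cutoff.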
Diversity and abundance of the arsenite oxidase gene aioA in geothermal areas of Tengchong, Yunnan, China.

PubMed

Jiang, Zhou; Li, Ping; Jiang, Dawei; Wu, Geng; Dong, Hailiang; Wang, Yanhong; Li, Bing; Wang, Yanxin; Guo, Qinghai

2014-01-01

A total of 12 samples were collected from the Tengchong geothermal areas of Yunnan, China, with the goal to assess the arsenite (AsIII) oxidation potential of the extant microbial communities as inferred by the abundance and diversity of the AsIII oxidase large subunit gene aioA relative to geochemical context. Arsenic concentrations were higher (on average 251.68 μg/L) in neutral or alkaline springs than in acidic springs (on average 30.88 μg/L). aioA abundance ranged from 1.63 × 10(1) to 7.08 × 10(3) per ng of DNA and positively correlated with sulfide and the ratios of arsenate (AsV):total dissolved arsenic (AsTot). Based on qPCR estimates of bacterial and archaeal 16S rRNA gene abundance, aioA-harboring organisms comprised as much as ~15% of the total community. Phylogenetically, the major aioA sequences (270 total) in the acidic hot springs (pH 3.3-4.4) were affiliated with Aquificales and Rhizobiales, while those in neutral or alkaline springs (pH 6.6-9.1) were inferred to be primarily bacteria related to Thermales and Burkholderiales. Interestingly, aioA abundance at one site greatly exceeded bacterial 16S rRNA gene abundance, suggesting these aioA genes were archaeal even though phylogenetically these aioA sequences were most similar to the Aquificales. In summary, this study described novel aioA sequences in geothermal features geographically far removed from those in the heavily studied Yellowstone geothermal complex.
Formation of conjugated delta8,delta10-double bonds by delta12-oleic-acid desaturase-related enzymes: biosynthetic origin of calendic acid.

PubMed

Cahoon, E B; Ripp, K G; Hall, S E; Kinney, A J

2001-01-26

Divergent forms of the plant Delta(12)-oleic-acid desaturase (FAD2) have previously been shown to catalyze the formation of acetylenic bonds, epoxy groups, and conjugated Delta(11),Delta(13)-double bonds by modification of an existing Delta(12)-double bond in C(18) fatty acids. Here, we report a class of FAD2-related enzymes that modifies a Delta(9)-double bond to produce the conjugated trans-Delta(8),trans-Delta(10)-double bonds found in calendic acid (18:3Delta(8trans,10trans,12cis)), the major component of the seed oil of Calendula officinalis. Using an expressed sequence tag approach, cDNAs for two closely related FAD2-like enzymes, designated CoFADX-1 and CoFADX-2, were identified from a C. officinalis developing seed cDNA library. The deduced amino acid sequences of these polypeptides share 40-50% identity with those of other FAD2 and FAD2-related enzymes. Expression of either CoFADX-1 or CoFADX-2 in somatic soybean embryos resulted in the production of calendic acid. In embryos expressing CoFADX-2, calendic acid accumulated to as high as 22% (w/w) of the total fatty acids. In addition, expression of CoFADX-1 and CoFADX-2 in Saccharomyces cerevisiae was accompanied by calendic acid accumulation when induced cells were supplied exogenous linoleic acid (18:2Delta(9cis,12cis)). These results are thus consistent with a route of calendic acid synthesis involving modification of the Delta(9)-double bond of linoleic acid. Regiospecificity for Delta(9)-double bonds is unprecedented among FAD2-related enzymes and further expands the functional diversity found in this family of enzymes.
Nucleic Acid Immunity.

PubMed

Hartmann, G

2017-01-01

Organisms throughout biology need to maintain the integrity of their genome. From bacteria to vertebrates, life has established sophisticated mechanisms to detect and eliminate foreign genetic material or to restrict its function and replication. Tremendous progress has been made in the understanding of these mechanisms which keep foreign or unwanted nucleic acids from viruses or phages in check. Mechanisms reach from restriction-modification systems and CRISPR/Cas in bacteria and archaea to RNA interference and immune sensing of nucleic acids, altogether integral parts of a system which is now appreciated as nucleic acid immunity. With inherited receptors and acquired sequence information, nucleic acid immunity comprises innate and adaptive components. Effector functions include diverse nuclease systems, intrinsic activities to directly restrict the function of foreign nucleic acids (e.g., PKR, ADAR1, IFIT1), and extrinsic pathways to alert the immune system and to elicit cytotoxic immune responses. These effects act in concert to restrict viral replication and to eliminate virus-infected cells. The principles of nucleic acid immunity are highly relevant for human disease. Besides its essential contribution to antiviral defense and restriction of endogenous retroelements, dysregulation of nucleic acid immunity can also lead to erroneous detection and response to self nucleic acids then causing sterile inflammation and autoimmunity. Even mechanisms of nucleic acid immunity which are not established in vertebrates are relevant for human disease when they are present in pathogens such as bacteria, parasites, or helminths or in pathogen-transmitting organisms such as insects. 
This review aims to provide an overview of the diverse mechanisms of nucleic acid immunity which mostly have been looked at separately in the past and to integrate them under the framework nucleic acid immunity as a basic principle of life, the understanding of which has great potential to advance medicine. © 2017 Elsevier Inc. All rights reserved.
HBC-Evo: predicting human breast cancer by exploiting amino acid sequence-based feature spaces and evolutionary ensemble system.

PubMed

Majid, Abdul; Ali, Safdar

2015-01-01

We developed a genetic programming (GP)-based evolutionary ensemble system for the early diagnosis, prognosis and prediction of human breast cancer. This system has effectively exploited the diversity in feature and decision spaces. First, individual learners are trained in different feature spaces using physicochemical properties of protein amino acids. Their predictions are then stacked to develop the best solution during the GP evolution process. Finally, results for the HBC-Evo system are obtained with an optimal threshold, which is computed using particle swarm optimization. Our novel approach has demonstrated promising results compared to state-of-the-art approaches.
Cardiorespiratory fitness as a predictor of intestinal microbial diversity and distinct metagenomic functions.

PubMed

Estaki, Mehrbod; Pither, Jason; Baumeister, Peter; Little, Jonathan P; Gill, Sandeep K; Ghosh, Sanjoy; Ahmadi-Vand, Zahra; Marsden, Katelyn R; Gibson, Deanna L

2016-08-08

Reduced microbial diversity in human intestines has been implicated in various conditions such as diabetes, colorectal cancer, and inflammatory bowel disease. The role of physical fitness in the context of human intestinal microbiota is currently not known. We used high-throughput sequencing to analyze fecal microbiota of 39 healthy participants with similar age, BMI, and diets but with varying cardiorespiratory fitness levels. Fecal short-chain fatty acids were analyzed using gas chromatography. We showed that peak oxygen uptake (VO2peak), the gold standard measure of cardiorespiratory fitness, can account for more than 20% of the variation in taxonomic richness, after accounting for all other factors, including diet. While VO2peak did not explain variation in beta diversity, it did play a significant role in explaining variation in the microbiomes' predicted metagenomic functions, aligning positively with genes related to bacterial chemotaxis, motility, and fatty acid biosynthesis. These predicted functions were supported by measured increases in production of fecal butyrate, a short-chain fatty acid associated with improved gut health, amongst physically fit participants. We also identified increased abundances of key butyrate-producing taxa (Clostridiales, Roseburia, Lachnospiraceae, and Erysipelotrichaceae) amongst these individuals, which likely contributed to the observed increases in butyrate levels. Results from this study show that cardiorespiratory fitness is correlated with increased microbial diversity in healthy humans and that the associated changes are anchored around a set of functional cores rather than specific taxa. The microbial profiles of fit individuals favor the production of butyrate. As increased microbiota diversity and butyrate production are associated with overall host health, our findings warrant the use of exercise prescription as an adjuvant therapy in combating dysbiosis-associated diseases.
A description of the lactic acid bacteria microbiota associated with the production of traditional fermented vegetables in Vietnam.

PubMed

Nguyen, Doan Thi Lam; Van Hoorde, Koenraad; Cnockaert, Margo; De Brandt, Evie; Aerts, Maarten; Binh Thanh, Le; Vandamme, Peter

2013-04-15

Fermented vegetables constitute an important part of the daily nourishment in Vietnam. Bacteria, and especially lactic acid bacteria, play a central role in the production of many fermented vegetables. The current study was conducted to investigate the diversity of native lactic acid bacteria (LAB) populations in 'dua muoi' (mustard and beet fermentation) and 'ca muoi' (eggplant fermentation), three types of popular traditional fermented vegetables of Vietnamese origin. To this end a polyphasic approach combining matrix-assisted laser desorption/ionization time-of-flight mass spectrometry (MALDI-TOF MS) and pheS gene sequence analysis was used. In addition, denaturing gradient gel electrophoresis was performed as a culture-independent method to complement the observed culturable diversity data. A total of 881 LAB isolates were recovered from 21 different samples. Predominant LAB associated with 'dua muoi' and 'ca muoi' were identified as Lactobacillus fermentum (56.6%), Lactobacillus pentosus (24.4%) and Lactobacillus plantarum (17.1%). Less abundant species were Pediococcus pentosaceus (1.0%) and Lactobacillus brevis (0.5%). Species present at less than 0.1% included Lactobacillus paracasei, Lactobacillus pantheris and Pediococcus acidilactici. In contrast to fermented mustard and beet, which had the highest prevalence of L. fermentum, the species most often recovered from fermented eggplant samples was L. pentosus. In addition, an important degree of genetic variability within the different predominant species was observed, and strain dependency correlating with the type of fermented vegetable or location of production could be demonstrated using multivariate statistics. This research gives an extensive and detailed inventory of the LAB diversity associated with the production of diverse Vietnamese fermented vegetables and demonstrates the influence of type of raw material and/or production location and conditions on this diversity. Copyright © 2013 Elsevier B.V. All rights reserved.
Activity and Phylogenetic Diversity of Bacterial Cells with High and Low Nucleic Acid Content and Electron Transport System Activity in an Upwelling Ecosystem

PubMed Central

Longnecker, K.; Sherr, B. F.; Sherr, E. B.

2005-01-01

We evaluated whether bacteria with higher cell-specific nucleic acid content (HNA) or an active electron transport system, i.e., positive for reduction of 5-cyano-2,3-ditolyl tetrazolium chloride (CTC), were responsible for the bulk of bacterioplankton metabolic activity. We also examined whether the phylogenetic diversity of HNA and CTC-positive cells differed from the diversity of Bacteria with low nucleic acid content (LNA). Bacterial assemblages were sampled both in eutrophic shelf waters and in mesotrophic offshore waters in the Oregon coastal upwelling region. Cytometrically sorted HNA, LNA, and CTC-positive cells were assayed for their cell-specific [3H]leucine incorporation rates. Phylogenetic diversity in sorted non-radioactively labeled samples was assayed using denaturing gradient gel electrophoresis (DGGE) of PCR-amplified 16S rRNA genes. Cell-specific rates of leucine incorporation of HNA and CTC-positive cells were on average only slightly greater than the cell-specific rates of LNA cells. HNA cells accounted for most bacterioplankton substrate incorporation due to high abundances, while the low abundances of CTC-positive cells resulted in only a small contribution by these cells to total bacterial activity. The proportion of the total bacterial leucine incorporation attributable to LNA cells was higher in offshore regions than in shelf waters. Sequence data obtained from DGGE bands showed broadly similar phylogenetic diversity across HNA, LNA, and CTC-positive cells, with between-sample and between-region variability in the distribution of phylotypes. Our results suggest that LNA bacteria are not substantially different from HNA bacteria in either cell-specific rates of substrate incorporation or phylogenetic composition and that they can be significant contributors to bacterial metabolism in the sea. PMID:16332746

Activity and phylogenetic diversity of bacterial cells with high and low nucleic acid content and electron transport system activity in an upwelling ecosystem.

PubMed

Longnecker, K; Sherr, B F; Sherr, E B

2005-12-01

We evaluated whether bacteria with higher cell-specific nucleic acid content (HNA) or an active electron transport system, i.e., positive for reduction of 5-cyano-2,3-ditolyl tetrazolium chloride (CTC), were responsible for the bulk of bacterioplankton metabolic activity. We also examined whether the phylogenetic diversity of HNA and CTC-positive cells differed from the diversity of Bacteria with low nucleic acid content (LNA). Bacterial assemblages were sampled both in eutrophic shelf waters and in mesotrophic offshore waters in the Oregon coastal upwelling region. Cytometrically sorted HNA, LNA, and CTC-positive cells were assayed for their cell-specific [3H]leucine incorporation rates. Phylogenetic diversity in sorted non-radioactively labeled samples was assayed using denaturing gradient gel electrophoresis (DGGE) of PCR-amplified 16S rRNA genes. Cell-specific rates of leucine incorporation of HNA and CTC-positive cells were on average only slightly greater than the cell-specific rates of LNA cells. HNA cells accounted for most bacterioplankton substrate incorporation due to high abundances, while the low abundances of CTC-positive cells resulted in only a small contribution by these cells to total bacterial activity. The proportion of the total bacterial leucine incorporation attributable to LNA cells was higher in offshore regions than in shelf waters. Sequence data obtained from DGGE bands showed broadly similar phylogenetic diversity across HNA, LNA, and CTC-positive cells, with between-sample and between-region variability in the distribution of phylotypes. Our results suggest that LNA bacteria are not substantially different from HNA bacteria in either cell-specific rates of substrate incorporation or phylogenetic composition and that they can be significant contributors to bacterial metabolism in the sea.
Identification of random nucleic acid sequence aberrations using dual capture probes which hybridize to different chromosome regions

DOEpatents

Lucas, J.N.; Straume, T.; Bogen, K.T.

1998-03-24

A method is provided for detecting nucleic acid sequence aberrations using two immobilization steps. According to the method, a nucleic acid sequence aberration is detected by detecting nucleic acid sequences having both a first nucleic acid sequence type (e.g., from a first chromosome) and a second nucleic acid sequence type (e.g., from a second chromosome), the presence of the first and the second nucleic acid sequence type on the same nucleic acid sequence indicating the presence of a nucleic acid sequence aberration. In the method, immobilization of a first hybridization probe is used to isolate a first set of nucleic acids in the sample which contain the first nucleic acid sequence type. Immobilization of a second hybridization probe is then used to isolate a second set of nucleic acids from within the first set of nucleic acids which contain the second nucleic acid sequence type. The second set of nucleic acids are then detected, their presence indicating the presence of a nucleic acid sequence aberration. 14 figs.
Identification of random nucleic acid sequence aberrations using dual capture probes which hybridize to different chromosome regions

DOEpatents

Lucas, Joe N.; Straume, Tore; Bogen, Kenneth T.

1998-01-01

A method is provided for detecting nucleic acid sequence aberrations using two immobilization steps. According to the method, a nucleic acid sequence aberration is detected by detecting nucleic acid sequences having both a first nucleic acid sequence type (e.g., from a first chromosome) and a second nucleic acid sequence type (e.g., from a second chromosome), the presence of the first and the second nucleic acid sequence type on the same nucleic acid sequence indicating the presence of a nucleic acid sequence aberration. In the method, immobilization of a first hybridization probe is used to isolate a first set of nucleic acids in the sample which contain the first nucleic acid sequence type. Immobilization of a second hybridization probe is then used to isolate a second set of nucleic acids from within the first set of nucleic acids which contain the second nucleic acid sequence type. The second set of nucleic acids are then detected, their presence indicating the presence of a nucleic acid sequence aberration.
T7 lytic phage-displayed peptide libraries exhibit less sequence bias than M13 filamentous phage-displayed peptide libraries.

PubMed

Krumpe, Lauren R H; Atkinson, Andrew J; Smythers, Gary W; Kandel, Andrea; Schumacher, Kathryn M; McMahon, James B; Makowski, Lee; Mori, Toshiyuki

2006-08-01

We investigated whether the T7 system of phage display could produce peptide libraries of greater diversity than the M13 system of phage display due to the differing processes of lytic and filamentous phage morphogenesis. Using a bioinformatics-assisted computational approach, collections of random peptide sequences obtained from a T7 12-mer library (X(12)) and a T7 7-mer disulfide-constrained library (CX(7)C) were analyzed and compared with peptide populations obtained from New England BioLabs' M13 Ph.D.-12 and Ph.D.-C7C libraries. Based on this analysis, peptide libraries constructed with the T7 system have fewer amino acid biases, increased peptide diversity, and more normal distributions of peptide net charge and hydropathy than the M13 libraries. The greater diversity of T7-displayed libraries provides a potential resource of novel binding peptides for new as well as previously studied molecular targets. To demonstrate their utility, several of the T7-displayed peptide libraries were screened for streptavidin- and neutravidin-binding phage. Novel binding motifs were identified for each protein.
Method for identifying and quantifying nucleic acid sequence aberrations

DOEpatents

Lucas, Joe N.; Straume, Tore; Bogen, Kenneth T.

1998-01-01

A method for detecting nucleic acid sequence aberrations by detecting nucleic acid sequences having both a first and a second nucleic acid sequence type, the presence of the first and second sequence type on the same nucleic acid sequence indicating the presence of a nucleic acid sequence aberration. The method uses a first hybridization probe which includes a nucleic acid sequence that is complementary to a first sequence type and a first complexing agent capable of attaching to a second complexing agent and a second hybridization probe which includes a nucleic acid sequence that selectively hybridizes to the second nucleic acid sequence type over the first sequence type and includes a detectable marker for detecting the second hybridization probe.
Method for identifying and quantifying nucleic acid sequence aberrations

DOEpatents

Lucas, J.N.; Straume, T.; Bogen, K.T.

1998-07-21

A method is disclosed for detecting nucleic acid sequence aberrations by detecting nucleic acid sequences having both a first and a second nucleic acid sequence type, the presence of the first and second sequence type on the same nucleic acid sequence indicating the presence of a nucleic acid sequence aberration. The method uses a first hybridization probe which includes a nucleic acid sequence that is complementary to a first sequence type and a first complexing agent capable of attaching to a second complexing agent and a second hybridization probe which includes a nucleic acid sequence that selectively hybridizes to the second nucleic acid sequence type over the first sequence type and includes a detectable marker for detecting the second hybridization probe. 11 figs.
Separation of Single-stranded DNA, Double-stranded DNA and RNA from an Environmental Viral Community Using Hydroxyapatite Chromatography

PubMed Central

Fadrosh, Douglas W.; Andrews-Pfannkoch, Cynthia; Williamson, Shannon J.

2011-01-01

Viruses, particularly bacteriophages (phages), are the most numerous biological entities on Earth [1,2]. Viruses modulate host cell abundance and diversity, contribute to the cycling of nutrients, alter host cell phenotype, and influence the evolution of both host cell and viral communities through the lateral transfer of genes [3]. Numerous studies have highlighted the staggering genetic diversity of viruses and their functional potential in a variety of natural environments. Metagenomic techniques have been used to study the taxonomic diversity and functional potential of complex viral assemblages whose members contain single-stranded DNA (ssDNA), double-stranded DNA (dsDNA) and RNA genotypes [4-9]. Current library construction protocols used to study environmental DNA-containing or RNA-containing viruses require an initial nuclease treatment in order to remove nontargeted templates [10]. However, a comprehensive understanding of the collective gene complement of the virus community and virus diversity requires knowledge of all members regardless of genome composition. Fractionation of purified nucleic acid subtypes provides an effective mechanism by which to study viral assemblages without sacrificing a subset of the community’s genetic signature. Hydroxyapatite, a crystalline form of calcium phosphate, has been employed in the separation of nucleic acids, as well as proteins and microbes, since the 1960s [11]. By exploiting the charge interaction between the positively charged Ca2+ ions of the hydroxyapatite and the negatively charged phosphate backbone of the nucleic acid subtypes, it is possible to preferentially elute each nucleic acid subtype independent of the others. We recently employed this strategy to independently fractionate the genomes of ssDNA, dsDNA and RNA-containing viruses in preparation for DNA sequencing [12].
Here, we present a method for the fractionation and recovery of ssDNA, dsDNA and RNA viral nucleic acids from mixed viral assemblages using hydroxyapatite chromatography. PMID:21989424
CML24, Regulated in Expression by Diverse Stimuli, Encodes a Potential Ca2+ Sensor That Functions in Responses to Abscisic Acid, Daylength, and Ion Stress

PubMed Central

Delk, Nikkí A.; Johnson, Keith A.; Chowdhury, Naweed I.; Braam, Janet

2005-01-01

Changes in intracellular calcium (Ca2+) levels serve to signal responses to diverse stimuli. Ca2+ signals are likely perceived through proteins that bind Ca2+, undergo conformation changes following Ca2+ binding, and interact with target proteins. The 50-member calmodulin-like (CML) Arabidopsis (Arabidopsis thaliana) family encodes proteins containing the predicted Ca2+-binding EF-hand motif. The functions of virtually all these proteins are unknown. CML24, also known as TCH2, shares over 40% amino acid sequence identity with calmodulin, has four EF hands, and undergoes Ca2+-dependent changes in hydrophobic interaction chromatography and migration rate through denaturing gel electrophoresis, indicating that CML24 binds Ca2+ and, as a consequence, undergoes conformational changes. CML24 expression occurs in all major organs, and transcript levels are increased from 2- to 15-fold in plants subjected to touch, darkness, heat, cold, hydrogen peroxide, abscisic acid (ABA), and indole-3-acetic acid. However, CML24 protein accumulation changes were not detectable. The putative CML24 regulatory region confers reporter expression at sites of predicted mechanical stress; in regions undergoing growth; in vascular tissues and various floral organs; and in stomata, trichomes, and hydathodes. CML24-underexpressing transgenics are resistant to ABA inhibition of germination and seedling growth, are defective in long-day induction of flowering, and have enhanced tolerance to CoCl2, molybdic acid, ZnSO4, and MgCl2. MgCl2 tolerance is not due to reduced uptake or to elevated Ca2+ accumulation. Together, these data present evidence that CML24, a gene expressed in diverse organs and responsive to diverse stimuli, encodes a potential Ca2+ sensor that may function to enable responses to ABA, daylength, and presence of various salts. PMID:16113225
Diversity and distribution of culturable lactic acid bacterial species in Indonesian Sayur Asin.

PubMed

Mangunwardoyo, Wibowo; Abinawanto; Salamah, Andi; Sukara, Endang; Sulistiani; Dinoto, Achmad

2016-08-01

Lactic acid bacteria (LAB) play important roles in the processing of Sayur Asin (spontaneously fermented mustard). However, information about LAB in Indonesian Sayur Asin prepared by traditional manufacturers, which is important as baseline data for maintaining food quality and safety, remains unclear. The aim of this study was to describe the diversity and distribution of culturable lactic acid bacteria in Sayur Asin of Indonesia. Four Sayur Asin samples (fermentation liquor and fermented mustard) were collected at harvesting times (3-7 days after fermentation) from two traditional manufacturers in Tulung Agung (TA) and Kediri (KDR), East Java provinces, Indonesia. LAB strains were isolated on MRS agar supplemented with 1% CaCO3 and characterized morphologically. The strains were identified based on 16S rDNA analysis, and a phylogenetic tree was drawn to understand the phylogenetic relationships of the collected strains. Different profiles were detected in the total plate count, salinity and pH of the fermenting liquor of Sayur Asin in TA and KDR provinces. A total of 172 LAB isolates were successfully isolated and identified based on their 16S rDNA sequences. Phylogenetic analysis of 27 representative LAB strains from Sayur Asin showed that these strains belonged to 5 distinct species, namely Lactobacillus farciminis (N=32), L. fermentum (N=4), L. namurensis (N=15), L. plantarum (N=118) and L. parafarraginis (N=1). Strains D5-S-2013 and B4-S-2013 showed a close phylogenetic relationship with L. composti and L. paralimentarius, respectively, but their sequence similarity was slightly below 99%, suggesting that they may represent novel species and warrant further investigation given the significant differences in their nucleotide sequences. Lactobacillus plantarum was dominant in all Sayur Asin samples. 
Lactobacilli were recognized as the major group of lactic acid bacteria in Sayur Asin, including 5 known and 2 novel candidate species. The distribution of LAB species was associated with the manufacturers where Sayur Asin is produced.
Diversity and distribution of culturable lactic acid bacterial species in Indonesian Sayur Asin

PubMed Central

Mangunwardoyo, Wibowo; Abinawanto; Salamah, Andi; Sukara, Endang; Sulistiani; Dinoto, Achmad

2016-01-01

Background and Objectives: Lactic acid bacteria (LAB) play important roles in the processing of Sayur Asin (spontaneously fermented mustard). However, information about LAB in Indonesian Sayur Asin prepared by traditional manufacturers, which is important as baseline data for maintaining food quality and safety, remains unclear. The aim of this study was to describe the diversity and distribution of culturable lactic acid bacteria in Sayur Asin of Indonesia. Materials and Methods: Four Sayur Asin samples (fermentation liquor and fermented mustard) were collected at harvesting times (3–7 days after fermentation) from two traditional manufacturers in Tulung Agung (TA) and Kediri (KDR), East Java provinces, Indonesia. LAB strains were isolated on MRS agar supplemented with 1% CaCO3 and characterized morphologically. The strains were identified based on 16S rDNA analysis, and a phylogenetic tree was drawn to understand the phylogenetic relationships of the collected strains. Results: Different profiles were detected in the total plate count, salinity and pH of the fermenting liquor of Sayur Asin in TA and KDR provinces. A total of 172 LAB isolates were successfully isolated and identified based on their 16S rDNA sequences. Phylogenetic analysis of 27 representative LAB strains from Sayur Asin showed that these strains belonged to 5 distinct species, namely Lactobacillus farciminis (N=32), L. fermentum (N=4), L. namurensis (N=15), L. plantarum (N=118) and L. parafarraginis (N=1). Strains D5-S-2013 and B4-S-2013 showed a close phylogenetic relationship with L. composti and L. paralimentarius, respectively, but their sequence similarity was slightly below 99%, suggesting that they may represent novel species and warrant further investigation given the significant differences in their nucleotide sequences. Lactobacillus plantarum was dominant in all Sayur Asin samples. 
Conclusion: Lactobacilli were recognized as the major group of lactic acid bacteria in Sayur Asin, including 5 known and 2 novel candidate species. The distribution of LAB species was associated with the manufacturers where Sayur Asin is produced. PMID:28210467
Genetic Diversity of Avian Paramyxovirus Type 6 Isolated from Wild Ducks in the Republic of Korea.

PubMed

Choi, Kang-Seuk; Kim, Ji-Ye; Lee, Hyun-Jeong; Jang, Min-Jun; Kwon, Hyuk-Moo; Sung, Haan-Woo

2018-03-08

Eleven avian paramyxovirus type 6 (APMV-6) isolates from Eurasian Wigeon (n=5; Anas penelope), Mallards (n=2; Anas platyrhynchos), and unknown species of wild ducks (n=4) from Korea were analyzed based on the nucleotide (nt) and deduced amino acid (aa) sequences of the fusion (F) gene. Fecal samples were collected in 2010-2014. Genotypes were assigned based on phylogenetic analyses. Our results revealed that APMV-6 could be classified into at least two distinct genotypes, G1 and G2. The open reading frame (ORF) of the G1 genotype was 1,668 nt in length, and the putative F0 cleavage site sequence was 113 PAPEPRL 119. The G2 genotype viruses included five isolates from Eurasian wigeons and four isolates from unknown waterfowl species, together with two reference APMV-6 strains from the Red-necked Stint (Calidris ruficollis) from Japan and an unknown duck from Italy. There was an N-truncated ORF (1,638 nt), due to an N-terminal truncation of 30 nt in the signal peptide region of the F gene, and the putative F0 cleavage site sequence was 103 SIREPRL 109. The genetic diversity and ecology of APMV-6 are discussed.
Microbial Iron Cycling in Acidic Geothermal Springs of Yellowstone National Park: Integrating Molecular Surveys, Geochemical Processes, and Isolation of Novel Fe-Active Microorganisms

PubMed Central

Kozubal, Mark A.; Macur, Richard E.; Jay, Zackary J.; Beam, Jacob P.; Malfatti, Stephanie A.; Tringe, Susannah G.; Kocar, Benjamin D.; Borch, Thomas; Inskeep, William P.

2012-01-01

Geochemical, molecular, and physiological analyses of microbial isolates were combined to study the geomicrobiology of acidic iron oxide mats in Yellowstone National Park. Nineteen sampling locations from 11 geothermal springs were studied ranging in temperature from 53 to 88°C and pH 2.4 to 3.6. All iron oxide mats exhibited high diversity of crenarchaeal sequences from the Sulfolobales, Thermoproteales, and Desulfurococcales. The predominant Sulfolobales sequences were highly similar to Metallosphaera yellowstonensis str. MK1, previously isolated from one of these sites. Other groups of archaea were consistently associated with different types of iron oxide mats, including undescribed members of the phyla Thaumarchaeota and Euryarchaeota. Bacterial sequences were dominated by relatives of Hydrogenobaculum spp. above 65–70°C, but increased in diversity below 60°C. Cultivation of relevant iron-oxidizing and iron-reducing microbial isolates included Sulfolobus str. MK3, Sulfobacillus str. MK2, Acidicaldus str. MK6, and a new candidate genus in the Sulfolobales referred to as Sulfolobales str. MK5. Strains MK3 and MK5 are capable of oxidizing ferrous iron autotrophically, while strain MK2 oxidizes iron mixotrophically. Similar rates of iron oxidation were measured for M. yellowstonensis str. MK1 and Sulfolobales str. MK5. Biomineralized phases of ferric iron varied among cultures and field sites, and included ferric oxyhydroxides, K-jarosite, goethite, hematite, and scorodite depending on geochemical conditions. Strains MK5 and MK6 are capable of reducing ferric iron under anaerobic conditions with complex carbon sources. The combination of geochemical and molecular data as well as physiological observations of isolates suggests that the community structure of acidic Fe mats is linked with Fe cycling across temperatures ranging from 53 to 88°C. PMID:22470372
Identification of genes from pattern formation, tyrosine kinase, and potassium channel families by DNA amplification

DOE Office of Scientific and Technical Information (OSTI.GOV)

Kamb, A.; Weir, M.; Rudy, B.

1989-06-01

The study of gene family members has been aided by the isolation of related genes on the basis of DNA homology. The authors have adapted the polymerase chain reaction to screen animal genomes very rapidly and reliably for likely gene family members. Using conserved amino acid sequences to design degenerate oligonucleotide primers, they have shown that the genome of the nematode Caenorhabditis elegans contains sequences homologous to many Drosophila genes involved in pattern formation, including the segment polarity gene wingless (vertebrate int-1), and homeobox sequences characteristic of the Antennapedia, engrailed, and paired families. In addition, they have used this method to show that C. elegans contains at least five different sequences homologous to genes in the tyrosine kinase family. Lastly, they have isolated six potassium channel sequences from humans, a result that validates the utility of the method with large genomes and suggests that human potassium channel gene diversity may be extensive.
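As a rough illustration of why conserved amino acid stretches are chosen with care when designing degenerate primers, the sketch below counts how many distinct nucleotide sequences could encode a short peptide under the standard genetic code and picks the least degenerate window. The peptide strings and function names are hypothetical and are not taken from the record above; this is a minimal sketch, not the authors' procedure.

```python
# Number of codons per amino acid in the standard genetic code (61 sense codons).
CODON_COUNTS = {
    "A": 4, "R": 6, "N": 2, "D": 2, "C": 2, "Q": 2, "E": 2, "G": 4,
    "H": 2, "I": 3, "L": 6, "K": 2, "M": 1, "F": 2, "P": 4, "S": 6,
    "T": 4, "W": 1, "Y": 2, "V": 4,
}


def primer_degeneracy(peptide):
    """Count the distinct coding sequences for a peptide (product of codon counts)."""
    degeneracy = 1
    for aa in peptide.upper():
        degeneracy *= CODON_COUNTS[aa]
    return degeneracy


def least_degenerate_window(peptide, length):
    """Return the window of the given length with the fewest possible coding sequences."""
    windows = [peptide[i:i + length] for i in range(len(peptide) - length + 1)]
    return min(windows, key=primer_degeneracy)


if __name__ == "__main__":
    conserved = "LSRMWKDE"  # hypothetical conserved motif, for illustration only
    best = least_degenerate_window(conserved, 5)
    print(best, primer_degeneracy(best))
```

Windows rich in Met and Trp (one codon each) keep the primer pool small, whereas Leu, Ser and Arg (six codons each) inflate it quickly.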
Nitrous Oxide Reductase (nosZ) Gene Fragments Differ between Native and Cultivated Michigan Soils

PubMed Central

Stres, Blaž; Mahne, Ivan; Avguštin, Gorazd; Tiedje, James M.

2004-01-01

The effect of standard agricultural management on the genetic heterogeneity of nitrous oxide reductase (nosZ) fragments from denitrifying prokaryotes in native and cultivated soil was explored. Thirty-six soil cores were composited from each of the two soil management conditions. nosZ gene fragments were amplified from triplicate samples, and PCR products were cloned and screened by restriction fragment length polymorphism (RFLP). The total nosZ RFLP profiles increased in similarity with soil sample size until triplicate 3-g samples produced visually identical RFLP profiles for each treatment. Large differences in total nosZ profiles were observed between the native and cultivated soils. The fragments representing major groups of clones encountered at least twice and four randomly selected clones with unique RFLP patterns were sequenced to verify nosZ identity. The sequence diversity of nosZ clones from the cultivated field was higher, and only eight patterns were found in clone libraries from both soils among the 182 distinct nosZ RFLP patterns identified from the two soils. A group of clones that comprised 32% of all clones dominated the gene library of native soil, whereas many minor groups were observed in the gene library of cultivated soil. The 95% confidence intervals of the Chao1 nonparametric richness estimator for nosZ RFLP data did not overlap, indicating that the levels of species richness are significantly different in the two soils, the cultivated soil having higher diversity. Phylogenetic analysis of deduced amino acid sequences grouped the majority of nosZ clones into an interleaved Michigan soil cluster whose cultured members are α-Proteobacteria. Only four nosZ sequences from cultivated soil and one from the native soil were related to sequences found in γ-Proteobacteria. Sequences from the native field formed a distinct, closely related cluster (Dmean = 0.16) containing 91.6% of the native clones. 
Clones from the cultivated field were more distantly related to each other (Dmean = 0.26), and 65% were found outside of the cluster from the native soil, further indicating a difference in the two communities. Overall, there appears to be a relationship between use and richness, diversity, and the phylogenetic position of nosZ sequences, indicating that agricultural use of soil caused a shift to a more diverse denitrifying community. PMID:14711656
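For readers unfamiliar with the Chao1 richness estimator mentioned above, the following minimal sketch computes its commonly used bias-corrected form from a list of per-pattern clone counts. The counts in the example are invented for illustration and do not reproduce the study's data; the confidence-interval calculation used in the record is not shown.

```python
def chao1(counts):
    """Bias-corrected Chao1 richness estimate.

    counts: number of clones observed for each distinct pattern (e.g. RFLP type).
    Estimate = S_obs + F1 * (F1 - 1) / (2 * (F2 + 1)), where F1 and F2 are the
    numbers of patterns seen exactly once and exactly twice.
    """
    observed = [c for c in counts if c > 0]
    s_obs = len(observed)
    f1 = sum(1 for c in observed if c == 1)  # singletons
    f2 = sum(1 for c in observed if c == 2)  # doubletons
    return s_obs + f1 * (f1 - 1) / (2 * (f2 + 1))


if __name__ == "__main__":
    # Hypothetical clone library: seven patterns, three of them singletons.
    library = [10, 5, 2, 2, 1, 1, 1]
    print(chao1(library))
```

A library dominated by one or two groups with few singletons yields an estimate close to the observed richness, while a library with many rare patterns yields a much higher one.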
From algae to angiosperms–inferring the phylogeny of green plants (Viridiplantae) from 360 plastid genomes

PubMed Central

2014-01-01

Background: Next-generation sequencing has provided a wealth of plastid genome sequence data from an increasingly diverse set of green plants (Viridiplantae). Although these data have helped resolve the phylogeny of numerous clades (e.g., green algae, angiosperms, and gymnosperms), their utility for inferring relationships across all green plants is uncertain. Viridiplantae originated 700-1500 million years ago and may comprise as many as 500,000 species. This clade represents a major source of photosynthetic carbon and contains an immense diversity of life forms, including some of the smallest and largest eukaryotes. Here we explore the limits and challenges of inferring a comprehensive green plant phylogeny from available complete or nearly complete plastid genome sequence data. Results: We assembled protein-coding sequence data for 78 genes from 360 diverse green plant taxa with complete or nearly complete plastid genome sequences available from GenBank. Phylogenetic analyses of the plastid data recovered well-supported backbone relationships and strong support for relationships that were not observed in previous analyses of major subclades within Viridiplantae. However, there also is evidence of systematic error in some analyses. In several instances we obtained strongly supported but conflicting topologies from analyses of nucleotides versus amino acid characters, and the considerable variation in GC content among lineages and within single genomes affected the phylogenetic placement of several taxa. Conclusions: Analyses of the plastid sequence data recovered a strongly supported framework of relationships for green plants. 
This framework includes: i) the placement of Zygnematophyceae as sister to land plants (Embryophyta), ii) a clade of extant gymnosperms (Acrogymnospermae) with cycads + Ginkgo sister to remaining extant gymnosperms and with gnetophytes (Gnetophyta) sister to non-Pinaceae conifers (Gnecup trees), and iii) within the monilophyte clade (Monilophyta), Equisetales + Psilotales are sister to Marattiales + leptosporangiate ferns. Our analyses also highlight the challenges of using plastid genome sequences in deep-level phylogenomic analyses, and we provide suggestions for future analyses that will likely incorporate plastid genome sequence data for thousands of species. We particularly emphasize the importance of exploring the effects of different partitioning and character coding strategies. PMID:24533922
Genetic Variation and Its Reflection on Posttranslational Modifications in Frequency Clock and Mating Type a-1 Proteins in Sordaria fimicola

PubMed Central

Arif, Rabia; Akram, Faiza; Jamil, Tazeen; Lee, Siu Fai

2017-01-01

Posttranslational modifications (PTMs) occur in all essential proteins taking command of their functions. There are many domains inside proteins where modifications take place on side-chains of amino acids through various enzymes to generate different species of proteins. In this manuscript we have, for the first time, predicted posttranslational modifications of frequency clock and mating type a-1 proteins in Sordaria fimicola collected from different sites to see the effect of environment on proteins or various amino acids pickings and their ultimate impact on consensus sequences present in mating type proteins using bioinformatics tools. Furthermore, we have also measured and walked through genomic DNA of various Sordaria strains to determine genetic diversity by genotyping the short sequence repeats (SSRs) of wild strains of S. fimicola collected from contrasting environments of two opposing slopes (harsh and xeric south facing slope and mild north facing slope) of Evolution Canyon (EC), Israel. Based on the whole genome sequence of S. macrospora, we targeted 20 genomic regions in S. fimicola which contain short sequence repeats (SSRs). Our data revealed genetic variations in strains from south facing slope and these findings assist in the hypothesis that genetic variations caused by stressful environments lead to evolution. PMID:28717646
Genetic Variation and Its Reflection on Posttranslational Modifications in Frequency Clock and Mating Type a-1 Proteins in Sordaria fimicola.

PubMed

Arif, Rabia; Akram, Faiza; Jamil, Tazeen; Mukhtar, Hamid; Lee, Siu Fai; Saleem, Muhammad

2017-01-01

Posttranslational modifications (PTMs) occur in all essential proteins taking command of their functions. There are many domains inside proteins where modifications take place on side-chains of amino acids through various enzymes to generate different species of proteins. In this manuscript we have, for the first time, predicted posttranslational modifications of frequency clock and mating type a-1 proteins in Sordaria fimicola collected from different sites to see the effect of environment on proteins or various amino acids pickings and their ultimate impact on consensus sequences present in mating type proteins using bioinformatics tools. Furthermore, we have also measured and walked through genomic DNA of various Sordaria strains to determine genetic diversity by genotyping the short sequence repeats (SSRs) of wild strains of S. fimicola collected from contrasting environments of two opposing slopes (harsh and xeric south facing slope and mild north facing slope) of Evolution Canyon (EC), Israel. Based on the whole genome sequence of S. macrospora, we targeted 20 genomic regions in S. fimicola which contain short sequence repeats (SSRs). Our data revealed genetic variations in strains from south facing slope and these findings assist in the hypothesis that genetic variations caused by stressful environments lead to evolution.
Analysis of the beak and feather disease viral genome indicates the existence of several genotypes which have a complex psittacine host specificity.

PubMed

de Kloet, E; de Kloet, S R

2004-12-01

A study was made of the phylogenetic relationships between fifteen complete nucleotide sequences as well as 43 nucleotide sequences of the putative coat protein gene of different strains belonging to the virus species Beak and feather disease virus obtained from 39 individuals of 16 psittacine species. The species included, among others, cockatoos (Cacatuini), African grey parrots (Psittacus erithacus) and peach-faced lovebirds (Agapornis roseicollis), which were infected at different geographical locations, within and outside Australia, the native origin of the virus. The derived amino acid sequences of the putative coat protein were highly diverse, with differences between some strains amounting to 50 of the 250 amino acids. Phylogenetic analysis demonstrated that the putative coat gene sequences form six clusters which show a varying degree of psittacine species specificity. Most, but not all, strains infecting African grey parrots formed a single cluster, as did the strains infecting the cockatoos. Strains infecting the lovebirds clustered with those infecting such Australasian species as Eclectus roratus, Psittacula kramerii and Psephotus haematogaster. Although individual birds included in this study were, where studied, often infected by closely related strains, infection by highly diverged strains was also detected. The possible relationship between BFD viral strains and clinical disease signs is discussed.
Sequence-based prediction of protein-binding sites in DNA: comparative study of two SVM models.

PubMed

Park, Byungkyu; Im, Jinyong; Tuvshinjargal, Narankhuu; Lee, Wook; Han, Kyungsook

2014-11-01

As many structures of protein-DNA complexes have been known in the past years, several computational methods have been developed to predict DNA-binding sites in proteins. However, its inverse problem (i.e., predicting protein-binding sites in DNA) has received much less attention. One of the reasons is that the differences between the interaction propensities of nucleotides are much smaller than those between amino acids. Another reason is that DNA exhibits less diverse sequence patterns than protein. Therefore, predicting protein-binding DNA nucleotides is much harder than predicting DNA-binding amino acids. We computed the interaction propensity (IP) of nucleotide triplets with amino acids using an extensive dataset of protein-DNA complexes, and developed two support vector machine (SVM) models that predict protein-binding nucleotides from sequence data alone. One SVM model predicts protein-binding nucleotides using DNA sequence data alone, and the other SVM model predicts protein-binding nucleotides using both DNA and protein sequences. In a 10-fold cross-validation with 1519 DNA sequences, the SVM model that uses DNA sequence data only predicted protein-binding nucleotides with an accuracy of 67.0%, an F-measure of 67.1%, and a Matthews correlation coefficient (MCC) of 0.340. With an independent dataset of 181 DNAs that were not used in training, it achieved an accuracy of 66.2%, an F-measure 66.3% and a MCC of 0.324. Another SVM model that uses both DNA and protein sequences achieved an accuracy of 69.6%, an F-measure of 69.6%, and a MCC of 0.383 in a 10-fold cross-validation with 1519 DNA sequences and 859 protein sequences. With an independent dataset of 181 DNAs and 143 proteins, it showed an accuracy of 67.3%, an F-measure of 66.5% and a MCC of 0.329. Both in cross-validation and independent testing, the second SVM model that used both DNA and protein sequence data showed better performance than the first model that used DNA sequence data. 
To the best of our knowledge, this is the first attempt to predict protein-binding nucleotides in a given DNA sequence from the sequence data alone. Copyright © 2014 Elsevier Ireland Ltd. All rights reserved.
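The accuracy, F-measure and Matthews correlation coefficient (MCC) reported above are standard functions of a binary confusion matrix. A minimal sketch of how they are computed is given below; the confusion-matrix counts are made up for illustration and are not the study's results, and the function name is hypothetical.

```python
import math


def binary_metrics(tp, fp, tn, fn):
    """Return (accuracy, F-measure, MCC) for a binary confusion matrix."""
    total = tp + fp + tn + fn
    accuracy = (tp + tn) / total
    precision = tp / (tp + fp) if (tp + fp) else 0.0
    recall = tp / (tp + fn) if (tp + fn) else 0.0
    # F-measure: harmonic mean of precision and recall.
    if precision + recall:
        f_measure = 2 * precision * recall / (precision + recall)
    else:
        f_measure = 0.0
    # MCC ranges from -1 to 1; 0 corresponds to chance-level prediction.
    denom = math.sqrt((tp + fp) * (tp + fn) * (tn + fp) * (tn + fn))
    mcc = (tp * tn - fp * fn) / denom if denom else 0.0
    return accuracy, f_measure, mcc


if __name__ == "__main__":
    # Hypothetical counts: binding nucleotides are the positive class.
    acc, f1, mcc = binary_metrics(tp=50, fp=10, tn=30, fn=10)
    print(f"accuracy={acc:.3f} F={f1:.3f} MCC={mcc:.3f}")
```

Because MCC uses all four cells of the confusion matrix, it is less flattering than accuracy when classes are imbalanced, which is one reason it is often reported alongside accuracy and F-measure.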
Archaeal β diversity patterns under the seafloor along geochemical gradients

NASA Astrophysics Data System (ADS)

Koyano, Hitoshi; Tsubouchi, Taishi; Kishino, Hirohisa; Akutsu, Tatsuya

2014-09-01

Recently, deep drilling into the seafloor has revealed that there are vast sedimentary ecosystems of diverse microorganisms, particularly archaea, in subsurface areas. We investigated the β diversity patterns of archaeal communities in sediment layers under the seafloor and their determinants. This study was accomplished by analyzing large environmental samples of 16S ribosomal RNA gene sequences and various geochemical data collected from a sediment core of 365.3 m, obtained by drilling into the seafloor off the east coast of the Shimokita Peninsula. To extract the maximum amount of information from these environmental samples, we first developed a method for measuring β diversity using sequence data by applying probability theory on a set of strings developed by two of the authors in a previous publication. We introduced an index of β diversity between sequence populations from which the sequence data were sampled. We then constructed an estimator of the β diversity index based on the sequence data and demonstrated that it converges to the β diversity index between sequence populations with probability of 1 as the number of sampled sequences increases. Next, we applied this new method to quantify β diversities between archaeal sequence populations under the seafloor and constructed a quantitative model of the estimated β diversity patterns. Nearly 90% of the variation in the archaeal β diversity was explained by a model that included as variables the differences in the abundances of chlorine, iodine, and carbon between the sediment layers.

On the Role of Aggregation Prone Regions in Protein Evolution, Stability, and Enzymatic Catalysis: Insights from Diverse Analyses

PubMed Central

Buck, Patrick M.; Kumar, Sandeep; Singh, Satish K.

2013-01-01

The various roles that aggregation prone regions (APRs) are capable of playing in proteins are investigated here via comprehensive analyses of multiple non-redundant datasets containing randomly generated amino acid sequences, monomeric proteins, intrinsically disordered proteins (IDPs) and catalytic residues. Results from this study indicate that the aggregation propensities of monomeric protein sequences have been minimized compared to random sequences with uniform and natural amino acid compositions, as observed by a lower average aggregation propensity and fewer APRs that are shorter in length and more often punctuated by gate-keeper residues. However, evidence for evolutionary selective pressure to disrupt these sequence regions among homologous proteins is inconsistent. APRs are less conserved than average sequence identity among closely related homologues (≥80% sequence identity with a parent) but APRs are more conserved than average sequence identity among homologues that have at least 50% sequence identity with a parent. Structural analyses of APRs indicate that APRs are three times more likely to contain ordered versus disordered residues and that APRs frequently contribute more towards stabilizing proteins than equal length segments from the same protein. Catalytic residues and APRs were also found to be in structural contact significantly more often than expected by random chance. Our findings suggest that proteins have evolved by optimizing their risk of aggregation for cellular environments by both minimizing aggregation prone regions and by conserving those that are important for folding and function. In many cases, these sequence optimizations are insufficient to develop recombinant proteins into commercial products. Rational design strategies aimed at improving protein solubility for biotechnological purposes should carefully evaluate the contributions made by candidate APRs, targeted for disruption, towards protein structure and activity. PMID:24146608
Microbial stratification in low pH oxic and suboxic macroscopic growths along an acid mine drainage

PubMed Central

Méndez-García, Celia; Mesa, Victoria; Sprenger, Richard R; Richter, Michael; Diez, María Suárez; Solano, Jennifer; Bargiela, Rafael; Golyshina, Olga V; Manteca, Ángel; Ramos, Juan Luis; Gallego, José R; Llorente, Irene; Martins dos Santos, Vitor AP; Jensen, Ole N; Peláez, Ana I; Sánchez, Jesús; Ferrer, Manuel

2014-01-01

Macroscopic growths at geographically separated acid mine drainages (AMDs) exhibit distinct populations. Yet, local heterogeneities are poorly understood. To gain novel mechanistic insights into this, we used OMICs tools to profile microbial populations coexisting in a single pyrite gallery AMD (pH ∼2) in three distinct compartments: two from a stratified streamer (uppermost oxic and lowermost anoxic sediment-attached strata) and one from a submerged anoxic non-stratified mat biofilm. The communities colonising pyrite and those in the mature formations appear to be populated by the greatest diversity of bacteria and archaea (including ‘ARMAN’ (archaeal Richmond Mine acidophilic nano-organisms)-related), as compared with the known AMD, with ∼44.9% unclassified sequences. We propose that the thick polymeric matrix may provide a safety shield against the prevailing extreme condition and also a massive carbon source, enabling non-typical acidophiles to develop more easily. Only 1 of 39 species was shared, suggesting a high metabolic heterogeneity in local microenvironments, defined by the O2 concentration, spatial location and biofilm architecture. The suboxic mats, compositionally most similar to each other, are more diverse and active for S, CO2, CH4, fatty acid and lipopolysaccharide metabolism. The oxic stratum of the streamer, displaying a higher diversity of the so-called ‘ARMAN’-related Euryarchaeota, shows a higher expression level of proteins involved in signal transduction, cell growth and N, H2, Fe, aromatic amino acids, sphingolipid and peptidoglycan metabolism. Our study is the first to highlight profound taxonomic and functional shifts in single AMD formations, as well as new microbial species and the importance of H2 in acidic suboxic macroscopic growths. PMID:24430486
Estimates of Soil Bacterial Ribosome Content and Diversity Are Significantly Affected by the Nucleic Acid Extraction Method Employed

PubMed Central

Wüst, Pia K.; Nacke, Heiko; Kaiser, Kristin; Marhan, Sven; Sikorski, Johannes; Kandeler, Ellen; Daniel, Rolf

2016-01-01

Modern sequencing technologies allow high-resolution analyses of total and potentially active soil microbial communities based on their DNA and RNA, respectively. In the present study, quantitative PCR and 454 pyrosequencing were used to evaluate the effects of different extraction methods on the abundance and diversity of 16S rRNA genes and transcripts recovered from three different types of soils (leptosol, stagnosol, and gleysol). The quality and yield of nucleic acids varied considerably with respect to both the applied extraction method and the analyzed type of soil. The bacterial ribosome content (calculated as the ratio of 16S rRNA transcripts to 16S rRNA genes) can serve as an indicator of the potential activity of bacterial cells and differed by 2 orders of magnitude between nucleic acid extracts obtained by the various extraction methods. Depending on the extraction method, the relative abundances of dominant soil taxa, in particular Actinobacteria and Proteobacteria, varied by a factor of up to 10. Through this systematic approach, the present study allows guidelines to be deduced for the selection of the appropriate extraction protocol according to the specific soil properties, the nucleic acid of interest, and the target organisms. PMID:26896137
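The abstract defines bacterial ribosome content as the ratio of 16S rRNA transcripts to 16S rRNA genes and compares it across extraction methods. A minimal sketch of that arithmetic follows; the function names and the copy numbers are hypothetical placeholders, not values from the study.

```python
import math

def ribosome_content(transcript_copies, gene_copies):
    """Ratio of 16S rRNA transcript copies to 16S rRNA gene copies."""
    if gene_copies <= 0:
        raise ValueError("gene_copies must be positive")
    return transcript_copies / gene_copies

def orders_of_magnitude(ratio_a, ratio_b):
    """Absolute log10 difference between two ribosome content estimates."""
    return abs(math.log10(ratio_a / ratio_b))

# Hypothetical qPCR copy numbers per gram of soil for two extraction methods.
method_a = ribosome_content(transcript_copies=5e9, gene_copies=1e9)
method_b = ribosome_content(transcript_copies=5e11, gene_copies=1e9)
print(method_a, method_b, orders_of_magnitude(method_a, method_b))
```

Expressing the comparison on a log10 scale makes a statement such as "differed by 2 orders of magnitude" directly checkable against the raw qPCR counts.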
Diversity of predominant lactic acid bacteria associated with cocoa fermentation in Nigeria.

PubMed

Kostinek, Melanie; Ban-Koffi, Louis; Ottah-Atikpo, Margaret; Teniola, David; Schillinger, Ulrich; Holzapfel, Wilhelm H; Franz, Charles M A P

2008-04-01

The fermentation of cocoa relies on a complex succession of bacteria and filamentous fungi, all of which can have an impact on cocoa flavor. So far, few investigations have focused on the diversity of lactic acid bacteria involved in cocoa fermentation, and many earlier investigations did not rely on polyphasic taxonomical approaches, which take both phenotypic and genotypic characterization techniques into account. In our study, we characterized predominant lactic acid bacteria from cocoa fermentations in Nigeria, using a combination of phenotypic tests, repetitive extragenic palindromic PCR, and sequencing of the 16S rRNA gene of representative strains for accurate species identification. Thus, of a total of 193 lactic acid bacteria (LAB) strains isolated from common media used to cultivate LAB, 40 (20.7%) were heterofermentative and consisted of either L. brevis or L. fermentum strains. The majority of the isolates were homofermentative rods (110 strains; 57% of isolates) which were characterized as L. plantarum strains. The homofermentative cocci consisted predominantly of 35 (18.1% of isolates) Pediococcus acidilactici strains. Thus, the LAB populations derived from these media in this study were accurately described. This can contribute to the further assessment of the effect of common LAB strains on the flavor characteristics of fermenting cocoa in further studies.
Bacterial diversity and bioprospecting for cold-active enzymes from culturable bacteria associated with sediment from a melt water stream of Midtre Lovenbreen glacier, an Arctic glacier.

PubMed

Vardhan Reddy, Puram Vishnu; Shiva Nageswara Rao, Singireesu Soma; Pratibha, Mambatta Shankaranarayanan; Sailaja, Buddhi; Kavya, Bakka; Manorama, Ravoori Ruth; Singh, Shiv Mohan; Radha Srinivas, Tanuku Naga; Shivaji, Sisinthy

2009-10-01

Culturable bacterial diversity of Midtre Lovenbreen glacier, an Arctic glacier, was studied using 12 sediment samples collected from different points, along a transect, from the snout of Midtre Lovenbreen glacier up to the convergence point of the melt water stream with the sea. Bacterial abundance appeared to be closer to the convergence point of the glacial melt water stream with the sea than at the snout of the glacier. A total of 117 bacterial strains were isolated from the sediment samples. Based on 16S rRNA gene sequence analyses, the isolates (n=117) could be categorised into 32 groups, with each group representing a different taxon belonging to 4 phyla (Actinobacteria, Bacilli, Flavobacteria and Proteobacteria). Representatives of the 32 groups varied in their growth temperature range (4-37 degrees C), in their tolerance to NaCl (0.1-1M NaCl) and in the growth pH range (2-13). Only 14 of 32 representative strains exhibited amylase, lipase and (or) protease activity and only one isolate (AsdM4-6) showed all three enzyme activities at 5 and 20 degrees C respectively. More than half of the isolates were pigmented. Fatty acid profile studies indicated that short-chain fatty acids, unsaturated fatty acids, branched fatty acids, cyclic and cis fatty acids are predominant in the psychrophilic bacteria.
Dysbiosis of the microbiome in gastric carcinogenesis.

PubMed

Castaño-Rodríguez, Natalia; Goh, Khean-Lee; Fock, Kwong Ming; Mitchell, Hazel M; Kaakoush, Nadeem O

2017-11-21

The gastric microbiome has been proposed as an etiological factor in gastric carcinogenesis. We compared the gastric microbiota in subjects presenting with gastric cancer (GC, n = 12) and controls (functional dyspepsia (FD), n = 20) from a high GC risk population in Singapore and Malaysia. cDNA from 16S rRNA transcripts was amplified (515F-806R) and sequenced using Illumina MiSeq 2 × 250 bp chemistry. Increased richness and phylogenetic diversity but not Shannon's diversity was found in GC as compared to controls. nMDS clustered GC and FD subjects separately, with PERMANOVA confirming a significant difference between the groups. H. pylori serological status had a significant impact on gastric microbiome α-diversity and composition. Several bacterial taxa were enriched in GC, including Lactococcus, Veillonella, and Fusobacteriaceae (Fusobacterium and Leptotrichia). Prediction of bacterial metabolic contribution indicated that serological status had a significant impact on metabolic function, while carbohydrate digestion and pathways were enriched in GC. Our findings highlight three mechanisms of interest in GC, including enrichment of pro-inflammatory oral bacterial species, increased abundance of lactic acid producing bacteria, and enrichment of short chain fatty acid production pathways.
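The finding that richness increased while Shannon's diversity did not is possible because the two indices weight rare taxa differently. The sketch below uses the textbook definitions of both indices on invented count vectors; the helper names and the counts are hypothetical, and phylogenetic diversity is not covered because it requires a tree.

```python
import math

def richness(counts):
    """Number of taxa observed at least once."""
    return sum(1 for c in counts if c > 0)

def shannon(counts):
    """Shannon index H = -sum(p * ln p) over taxa with non-zero counts."""
    total = sum(counts)
    return -sum((c / total) * math.log(c / total) for c in counts if c > 0)

# Hypothetical read counts per taxon (not data from the study).
even_community = [25, 25, 25, 25]          # 4 taxa, perfectly even
skewed_community = [85, 5, 4, 3, 2, 1]     # 6 taxa, one dominant

for name, counts in [("even", even_community), ("skewed", skewed_community)]:
    print(name, richness(counts), round(shannon(counts), 3))
```

The skewed community has more taxa but a lower Shannon index, because a single dominant taxon reduces evenness more than the extra rare taxa add.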
Solution-Phase Synthesis of a Tricyclic Pyrrole-2-Carboxamide Discovery Library Applying a Stetter-Paal-Knorr Reaction Sequence

PubMed Central

Iyer, Pravin S.; Fodor, Matthew D.; Coleman, Claire M.; Twining, Leslie A.; Mitasev, Branko

2012-01-01

The solution phase synthesis of a discovery library of 178 tricyclic pyrrole-2-carboxamides was accomplished in nine steps and seven purifications starting with three benzoyl protected amino acid methyl esters. Further diversity was introduced by two glyoxaldehydes and forty-one primary amines. The combination of Pauson-Khand, Stetter and microwave assisted Paal Knorr reactions was applied as a key sequence. The discovery library was designed with the help of QikProp 2.1 and physicochemical data are presented for all pyrroles. Library members were synthesized and purified in parallel and analyzed by LC-MS. Selected compounds were fully characterized. PMID:16677007
Chemical identification of the mammalian oxytocin in a holocephalian fish, the ratfish (Hydrolagus colliei).

PubMed

Michel, G; Chauvet, J; Chauvet, M T; Clarke, C; Bern, H; Acher, R

1993-11-01

The neurohypophysial hormones of the ratfish (Hydrolagus colliei), a species belonging to the subclass Holocephali of cartilaginous fishes, have been investigated. An oxytocin-like hormone has been isolated from acetone-desiccated pituitary glands by using successively molecular sieving and high-pressure liquid chromatography. The peptide has been identified as oxytocin by coelution with synthetic oxytocin in HPLC, amino acid sequencing, mass spectrometry, and C-terminal sequencing through carboxypeptidase Y. Vasotocin may be present in a very small amount. Cartilaginous fishes appear to display a great diversity in their oxytocin-like hormones since five different peptides have been identified in rays and sharks that belong to the second subclass Selachii.
Anoxic carbon flux in photosynthetic microbial mats as revealed by metatranscriptomics and NanoSIMS

DOE PAGES

Burow, Luke C.; Woebken, Dagmar; Marshall, Ian PG; ...

2012-11-29

Photosynthetic microbial mats possess extraordinary phylogenetic and functional diversity that makes linking specific pathways with individual microbial populations a daunting task. Close metabolic and spatial relationships between Cyanobacteria and Chloroflexi have previously been observed in diverse microbial mats. Here in this paper, we report that an expressed metabolic pathway for the anoxic catabolism of photosynthate involving Cyanobacteria and Chloroflexi in microbial mats can be reconstructed through metatranscriptomic sequencing of mats collected at Elkhorn Slough, Monterey Bay, CA, USA. In this reconstruction, Microcoleus spp., the most abundant cyanobacterial group in the mats, ferment photosynthate to organic acids, CO2 and H2 through multiple pathways, and an uncultivated lineage of the Chloroflexi take up these organic acids to store carbon as polyhydroxyalkanoates. The metabolic reconstruction is consistent with metabolite measurements and single cell microbial imaging with fluorescence in situ hybridization and NanoSIMS.
Anoxic carbon flux in photosynthetic microbial mats as revealed by metatranscriptomics and NanoSIMS

DOE Office of Scientific and Technical Information (OSTI.GOV)

Burow, Luke C.; Woebken, Dagmar; Marshall, Ian PG

Photosynthetic microbial mats possess extraordinary phylogenetic and functional diversity that makes linking specific pathways with individual microbial populations a daunting task. Close metabolic and spatial relationships between Cyanobacteria and Chloroflexi have previously been observed in diverse microbial mats. Here in this paper, we report that an expressed metabolic pathway for the anoxic catabolism of photosynthate involving Cyanobacteria and Chloroflexi in microbial mats can be reconstructed through metatranscriptomic sequencing of mats collected at Elkhorn Slough, Monterey Bay, CA, USA. In this reconstruction, Microcoleus spp., the most abundant cyanobacterial group in the mats, ferment photosynthate to organic acids, CO2 and H2 through multiple pathways, and an uncultivated lineage of the Chloroflexi take up these organic acids to store carbon as polyhydroxyalkanoates. The metabolic reconstruction is consistent with metabolite measurements and single cell microbial imaging with fluorescence in situ hybridization and NanoSIMS.
Cracking the ANP32 whips: Important functions, unequal requirement, and hints at disease implications

PubMed Central

Reilly, Patrick T; Yu, Yun; Hamiche, Ali; Wang, Lishun

2014-01-01

The acidic (leucine-rich) nuclear phosphoprotein 32 kDa (ANP32) family is composed of small, evolutionarily conserved proteins characterized by an N-terminal leucine-rich repeat domain and a C-terminal low-complexity acidic region. The mammalian family members (ANP32A, ANP32B, and ANP32E) are ascribed physiologically diverse functions including chromatin modification and remodelling, apoptotic caspase modulation, protein phosphatase inhibition, as well as regulation of intracellular transport. In addition to reviewing the widespread literature on the topic, we present a concept of the ANP32s as having a whip-like structure. We also present hypotheses that ANP32C and other intronless sequences should not currently be considered bona fide family members, that their disparate necessity in development may be due to compensatory mechanisms, that their contrasting roles in cancer are likely context-dependent, along with an underlying hypothesis that ANP32s represent an important node of physiological regulation by virtue of their diverse biochemical activities. PMID:25156960
Functional genomics of lipid metabolism in the oleaginous yeast Rhodosporidium toruloides

DOE Office of Scientific and Technical Information (OSTI.GOV)

Coradetti, Samuel T.; Pinel, Dominic; Geiselman, Gina M.

The basidiomycete yeast Rhodosporidium toruloides (also known as Rhodotorula toruloides) accumulates high concentrations of lipids and carotenoids from diverse carbon sources. It has great potential as a model for the cellular biology of lipid droplets and for sustainable chemical production. We developed a method for high-throughput genetics (RB-TDNAseq), using sequence-barcoded Agrobacterium tumefaciens T-DNA insertions. We identified 1,337 putative essential genes with low T-DNA insertion rates. We functionally profiled genes required for fatty acid catabolism and lipid accumulation, validating results with 35 targeted deletion strains. We identified a high-confidence set of 150 genes affecting lipid accumulation, including genes with predicted function in signaling cascades, gene expression, protein modification and vesicular trafficking, autophagy, amino acid synthesis and tRNA modification, and genes of unknown function. Lastly, these results greatly advance our understanding of lipid metabolism in this oleaginous species and demonstrate a general approach for barcoded mutagenesis that should enable functional genomics in diverse fungi.
Functional genomics of lipid metabolism in the oleaginous yeast Rhodosporidium toruloides

PubMed Central

Geiselman, Gina M; Ito, Masakazu; Mondo, Stephen J; Reilly, Morgann C; Cheng, Ya-Fang; Bauer, Stefan; Grigoriev, Igor V; Gladden, John M; Simmons, Blake A; Brem, Rachel B

2018-01-01

The basidiomycete yeast Rhodosporidium toruloides (also known as Rhodotorula toruloides) accumulates high concentrations of lipids and carotenoids from diverse carbon sources. It has great potential as a model for the cellular biology of lipid droplets and for sustainable chemical production. We developed a method for high-throughput genetics (RB-TDNAseq), using sequence-barcoded Agrobacterium tumefaciens T-DNA insertions. We identified 1,337 putative essential genes with low T-DNA insertion rates. We functionally profiled genes required for fatty acid catabolism and lipid accumulation, validating results with 35 targeted deletion strains. We identified a high-confidence set of 150 genes affecting lipid accumulation, including genes with predicted function in signaling cascades, gene expression, protein modification and vesicular trafficking, autophagy, amino acid synthesis and tRNA modification, and genes of unknown function. These results greatly advance our understanding of lipid metabolism in this oleaginous species and demonstrate a general approach for barcoded mutagenesis that should enable functional genomics in diverse fungi. PMID:29521624
Functional genomics of lipid metabolism in the oleaginous yeast Rhodosporidium toruloides

DOE PAGES

Coradetti, Samuel T.; Pinel, Dominic; Geiselman, Gina M.; ...

2018-03-09

The basidiomycete yeast Rhodosporidium toruloides (also known as Rhodotorula toruloides) accumulates high concentrations of lipids and carotenoids from diverse carbon sources. It has great potential as a model for the cellular biology of lipid droplets and for sustainable chemical production. We developed a method for high-throughput genetics (RB-TDNAseq), using sequence-barcoded Agrobacterium tumefaciens T-DNA insertions. We identified 1,337 putative essential genes with low T-DNA insertion rates. We functionally profiled genes required for fatty acid catabolism and lipid accumulation, validating results with 35 targeted deletion strains. We identified a high-confidence set of 150 genes affecting lipid accumulation, including genes with predicted function in signaling cascades, gene expression, protein modification and vesicular trafficking, autophagy, amino acid synthesis and tRNA modification, and genes of unknown function. Lastly, these results greatly advance our understanding of lipid metabolism in this oleaginous species and demonstrate a general approach for barcoded mutagenesis that should enable functional genomics in diverse fungi.
Genetic diversity of rice tungro spherical virus in tungro-endemic provinces of the Philippines and Indonesia.

PubMed

Azzam, O; Yambao, M L; Muhsin, M; McNally, K L; Umadhay, K M

2000-01-01

The two adjacent genes of coat protein 1 and 2 of rice tungro spherical virus (RTSV) were amplified from total RNA extracts of serologically indistinguishable field isolates from the Philippines and Indonesia, using reverse transcriptase polymerase chain reaction (RT-PCR). Digestion with HindIII and BstYI restriction endonucleases differentiated the amplified DNA products into eight distinct coat protein genotypes. These genotypes were then used as indicators of virus diversity in the field. Inter- and intra-site diversities were determined over three cropping seasons. At each of the sites surveyed, one or two main genotypes prevailed together with other related minor or mixed genotypes that did not replace the main genotype over the sampling time. The cluster of genotypes found at the Philippines sites was significantly different from the one at the Indonesia sites, suggesting geographic isolation for virus populations. Phylogenetic studies based on the nucleotide sequences of 38 selected isolates confirm the spatial distribution of RTSV virus populations but show that gene flow may occur between populations. Under the present conditions, rice varieties do not seem to exert selective pressure on the virus populations. Based on the selective constraints in the coat protein amino acid sequences and the virus genetic composition per site, a negative selection model followed by random-sampling events due to vector transmissions is proposed to explain the inter-site diversity observed.
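The genotyping step described above depends on where HindIII and BstYI cut the amplified coat protein DNA. As a rough illustration, the sketch below locates the recognition sites of both enzymes (HindIII: AAGCTT; BstYI: RGATCY, with R = A or G and Y = C or T) in a sequence and summarizes them as a simple site-count pattern. The function names and the toy sequences are hypothetical, and counting sites is a simplification of the fragment-pattern comparison used in the study.

```python
import re

# Recognition sequences written as regular expressions.
SITES = {
    "HindIII": "AAGCTT",
    "BstYI": "[AG]GATC[CT]",
}

def site_positions(seq, pattern):
    """Return 0-based start positions of every (possibly overlapping) site."""
    return [m.start() for m in re.finditer(f"(?=({pattern}))", seq.upper())]

def site_pattern(seq):
    """Summarize a sequence as (number of HindIII sites, number of BstYI sites)."""
    return tuple(len(site_positions(seq, SITES[e])) for e in ("HindIII", "BstYI"))

# Toy amplicons invented for illustration; the second lacks the HindIII site.
amplicon_1 = "CCAAGCTTGGAGATCTCC"
amplicon_2 = "CCAAGCATGGAGATCTCC"
print(site_pattern(amplicon_1), site_pattern(amplicon_2))
```

Two isolates whose amplicons give different site patterns would yield different restriction fragment profiles on a gel, which is the basis for separating them into distinct genotypes.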
Diverse archaeal community of a bat guano pile in Domica Cave (Slovak Karst, Slovakia).

PubMed

Chronáková, A; Horák, A; Elhottová, D; Kristůfek, V

2009-09-01

The molecular diversity of Archaea in a bat guano pile in Domica Cave (Slovakia), a temperate cave ecosystem with a significant bat colony (about 1600 individuals), was examined. The guano pile was created mainly by the activity of the Mediterranean horseshoe bat (Rhinolophus euryale) and provides a source of organic carbon and other nutrients in the oligotrophic subsurface ecosystem. The upper and the basal parts of the guano surface were sampled; the latter had a higher pH, a higher admixture of limestone bedrock, and increased colonization by invertebrates. The relative proportion of Archaea determined using CARD-FISH in both parts was 3.5-3.9% (the basal and upper part, respectively). The archaeal community was dominated by non-thermophilic Crenarchaeota (99% of clones). Phylogenetic analysis of 115 16S rDNA sequences revealed the presence of Crenarchaeota previously isolated from temperate surface soils (group 1.1b, 62 clones), deep subsurface acid waters (group 1.1a, 52 clones) and Euryarchaeota (1 clone). Four of the analyzed sequences were found to have little similarity to those in public databases. The composition of the two archaeal communities differed, with a higher diversity of Archaea in the upper part of the bat guano pile. A highly diverse archaeal population is present in the bat guano deposit and consists of both soil- and subsurface-borne Crenarchaeota.
Cow teat skin, a potential source of diverse microbial populations for cheese production.

PubMed

Verdier-Metz, Isabelle; Gagne, Geneviève; Bornes, Stéphanie; Monsallier, Françoise; Veisseire, Philippe; Delbès-Paus, Céline; Montel, Marie-Christine

2012-01-01

The diversity of the microbial community on cow teat skin was evaluated using a culture-dependent method based on the use of different dairy-specific media, followed by the identification of isolates by 16S rRNA gene sequencing. This was combined with a direct molecular approach by cloning and 16S rRNA gene sequencing. This study highlighted the large diversity of the bacterial community that may be found on teat skin, where 79.8% of clones corresponded to various unidentified species as well as 66 identified species, mainly belonging to those commonly found in raw milk (Enterococcus, Pediococcus, Enterobacter, Pantoea, Aerococcus, and Staphylococcus). Several of them, such as nonstarter lactic acid bacteria (NSLAB), Staphylococcus, and Actinobacteria, may contribute to the development of the sensory characteristics of cheese during ripening. Therefore, teat skin could be an interesting source or vector of biodiversity for milk. Variations of microbial counts and diversity between the farms studied have been observed. Moreover, Staphylococcus auricularis, Staphylococcus devriesei, Staphylococcus arlettae, Streptococcus bovis, Streptococcus equinus, Clavibacter michiganensis, Coprococcus catus, or Arthrobacter gandavensis commensal bacteria of teat skin and teat canal, as well as human skin, are not common in milk, suggesting that there is a breakdown of microbial flow from animal to milk. It would then be interesting to thoroughly study this microbial flow from teat to milk.
Strategies for Achieving High Sequencing Accuracy for Low Diversity Samples and Avoiding Sample Bleeding Using Illumina Platform

PubMed Central

Mitra, Abhishek; Skrzypczak, Magdalena; Ginalski, Krzysztof; Rowicka, Maga

2015-01-01

Sequencing microRNA, reduced representation sequencing, Hi-C technology and any method requiring the use of in-house barcodes result in sequencing libraries with low initial sequence diversity. Sequencing such data on the Illumina platform typically produces low quality data due to the limitations of the Illumina cluster calling algorithm. Moreover, even in the case of diverse samples, these limitations are causing substantial inaccuracies in multiplexed sample assignment (sample bleeding). Such inaccuracies are unacceptable in clinical applications, and in some other fields (e.g. detection of rare variants). Here, we discuss how both problems with quality of low-diversity samples and sample bleeding are caused by incorrect detection of clusters on the flowcell during initial sequencing cycles. We propose simple software modifications (Long Template Protocol) that overcome this problem. We present experimental results showing that our Long Template Protocol remarkably increases data quality for low diversity samples, as compared with the standard analysis protocol; it also substantially reduces sample bleeding for all samples. For comprehensiveness, we also discuss and compare experimental results from alternative approaches to sequencing low diversity samples. First, we discuss how the low diversity problem, if caused by barcodes, can be avoided altogether at the barcode design stage. Second and third, we present modified guidelines, which are more stringent than the manufacturer’s, for mixing low diversity samples with diverse samples and lowering cluster density, which in our experience consistently produces high quality data from low diversity samples. Fourth and fifth, we present rescue strategies that can be applied when sequencing results in low quality data and when there is no more biological material available. In such cases, we propose that the flowcell be re-hybridized and sequenced again using our Long Template Protocol. Alternatively, we discuss how analysis can be repeated from saved sequencing images using the Long Template Protocol to increase accuracy. PMID:25860802
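"Low initial sequence diversity" means that, in the first sequencing cycles, most reads carry the same base at the same position (for example because of a shared in-house barcode). One simple way to see this in a set of reads is to tabulate base fractions per cycle. The sketch below is a generic diagnostic, not the Long Template Protocol itself; the function names, the toy reads and the 0.8 threshold are assumptions made for illustration.

```python
from collections import Counter

def per_cycle_composition(reads, n_cycles):
    """Fraction of A, C, G and T observed at each of the first n_cycles positions."""
    composition = []
    for i in range(n_cycles):
        counts = Counter(read[i] for read in reads if len(read) > i)
        total = sum(counts.values())
        composition.append(
            {base: (counts.get(base, 0) / total if total else 0.0) for base in "ACGT"}
        )
    return composition

def low_diversity_cycles(reads, n_cycles, max_fraction=0.8):
    """Cycles where a single base exceeds max_fraction (an assumed cut-off)."""
    composition = per_cycle_composition(reads, n_cycles)
    return [i for i, fractions in enumerate(composition)
            if max(fractions.values()) > max_fraction]

# Toy reads sharing a hypothetical 3-base barcode "ACG" followed by diverse sequence.
reads = ["ACGTA", "ACGCT", "ACGGC", "ACGAG"]
print(low_diversity_cycles(reads, n_cycles=5))
```

On these toy reads the first three cycles are flagged, which mirrors the situation in which cluster detection during the initial cycles sees almost identical signals from every cluster.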
Diversity and dynamics of lactic acid bacteria in Atole agrio, a traditional maize-based fermented beverage from South-Eastern Mexico, analysed by high throughput sequencing and culturing.

PubMed

Pérez-Cataluña, Alba; Elizaquível, Patricia; Carrasco, Purificación; Espinosa, Judith; Reyes, Dolores; Wacher, Carmen; Aznar, Rosa

2018-03-01

The purpose of this work was to analyse the diversity and dynamics of lactic acid bacteria (LAB) throughout the fermentation process in Atole agrio, a traditional maize-based food of Mexican origin. Samples of different fermentation times were analysed using culture-dependent and -independent approaches. Identification of LAB isolates revealed the presence of members of the genera Pediococcus, Weissella, Lactobacillus, Leuconostoc and Lactococcus, and the predominance of Pediococcus pentosaceus and Weissella confusa in liquid and solid batches, respectively. High-throughput sequencing (HTS) of the 16S rRNA gene confirmed the predominance of Lactobacillaceae and Leuconostocaceae at the beginning of the process. In liquid fermentation, Acetobacteraceae dominated after 4 h as pH decreased. In contrast, Leuconostocaceae dominated the solid fermentation except at 12 h, when they were overgrown by Acetobacteraceae. Regarding LAB genera, Lactobacillus dominated the liquid fermentation except at 12 h, when Weissella, Lactococcus and Streptococcus were the most abundant. In solid fermentation, Weissella predominated throughout the process. HTS determined that Lactobacillus plantarum and W. confusa dominated in the liquid and solid batches, respectively. Two oligotypes have been identified for L. plantarum and W. confusa populations, differing in a single nucleotide position each. Only one of the oligotypes was detected among the isolates obtained from each species, the biological significance of which remains unclear.
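Oligotypes that differ "in a single nucleotide position" can be told apart by comparing aligned sequences position by position. The sketch below shows that comparison on invented sequences; the function name and the two toy sequences are hypothetical, and real oligotyping pipelines select informative positions from entropy profiles rather than from a single pairwise comparison.

```python
def differing_positions(seq_a, seq_b):
    """Return 0-based positions where two aligned, equal-length sequences differ."""
    if len(seq_a) != len(seq_b):
        raise ValueError("sequences must be aligned to the same length")
    return [i for i, (a, b) in enumerate(zip(seq_a, seq_b)) if a != b]

# Two hypothetical 16S rRNA gene fragments standing in for a pair of oligotypes.
oligotype_1 = "ACGTTGCAAGGCT"
oligotype_2 = "ACGTTGCGAGGCT"
print(differing_positions(oligotype_1, oligotype_2))
```

A result containing exactly one position corresponds to the single-nucleotide difference described for each pair of oligotypes.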
Comparative microbiota assessment of wilted Italian ryegrass, whole crop corn, and wilted alfalfa silage using denaturing gradient gel electrophoresis and next-generation sequencing.

PubMed

Ni, Kuikui; Minh, Tang Thuy; Tu, Tran Thi Minh; Tsuruta, Takeshi; Pang, Huili; Nishino, Naoki

2017-02-01

The microbiota of pre-ensiled crop and silage were examined using denaturing gradient gel electrophoresis (DGGE) and next-generation sequencing (NGS). Wilted Italian ryegrass (IR), whole crop corn (WC), and wilted alfalfa (AL) silages stored for 2 months were examined. All silages contained lactic acid as a predominant fermentation product. Across the three crop species, DGGE detected 36 and 28 bands, and NGS identified 253 and 259 genera in the pre-ensiled crops and silages, respectively. The NGS demonstrated that, although lactic acid bacteria (LAB) became prevalent in all silages after 2 months of storage, the major groups were different between crops: Leuconostoc spp. and Pediococcus spp. for IR silage, Lactobacillus spp. for WC silage, and Enterococcus spp. for AL silage. The predominant silage LAB genera were also detected by DGGE, but the presence of diverse non-LAB species in pre-ensiled crops was far better detected by NGS. Likewise, good survival of Agrobacterium spp., Methylobacterium spp., and Sphingomonas spp. in IR and AL silages was demonstrated by NGS. The diversity of the microbiota described by principal coordinate analysis was similar between DGGE and NGS. Our finding that analysis of pre-ensiled crop microbiota did not help predict silage microbiota was true for both DGGE and NGS.

The alphabet of intrinsic disorder

PubMed Central

Theillet, Francois-Xavier; Kalmar, Lajos; Tompa, Peter; Han, Kyou-Hoon; Selenko, Philipp; Dunker, A. Keith; Daughdrill, Gary W.; Uversky, Vladimir N

2013-01-01

A significant fraction of every proteome is occupied by biologically active proteins that do not form unique three-dimensional structures. These intrinsically disordered proteins (IDPs) and IDP regions (IDPRs) have essential biological functions and are characterized by extensive structural plasticity. Such structural and functional behavior is encoded in the amino acid sequences of IDPs/IDPRs, which are enriched in disorder-promoting residues and depleted in order-promoting residues. In fact, amino acid residues can be arranged according to their disorder-promoting tendency to form an alphabet of intrinsic disorder that defines the structural complexity and diversity of IDPs/IDPRs. This review is the first in a series of publications dedicated to the roles that different amino acid residues play in defining the phenomenon of protein intrinsic disorder. We start with proline because data suggests that of the 20 common amino acid residues, this one is the most disorder-promoting. PMID:28516008
Genome-wide comparison of medieval and modern Mycobacterium leprae.

PubMed

Schuenemann, Verena J; Singh, Pushpendra; Mendum, Thomas A; Krause-Kyora, Ben; Jäger, Günter; Bos, Kirsten I; Herbig, Alexander; Economou, Christos; Benjak, Andrej; Busso, Philippe; Nebel, Almut; Boldsen, Jesper L; Kjellström, Anna; Wu, Huihai; Stewart, Graham R; Taylor, G Michael; Bauer, Peter; Lee, Oona Y-C; Wu, Houdini H T; Minnikin, David E; Besra, Gurdyal S; Tucker, Katie; Roffey, Simon; Sow, Samba O; Cole, Stewart T; Nieselt, Kay; Krause, Johannes

2013-07-12

Leprosy was endemic in Europe until the Middle Ages. Using DNA array capture, we have obtained genome sequences of Mycobacterium leprae from skeletons of five medieval leprosy cases from the United Kingdom, Sweden, and Denmark. In one case, the DNA was so well preserved that full de novo assembly of the ancient bacterial genome could be achieved through shotgun sequencing alone. The ancient M. leprae sequences were compared with those of 11 modern strains, representing diverse genotypes and geographic origins. The comparisons revealed remarkable genomic conservation during the past 1000 years, a European origin for leprosy in the Americas, and the presence of an M. leprae genotype in medieval Europe now commonly associated with the Middle East. The exceptional preservation of M. leprae biomarkers, both DNA and mycolic acids, in ancient skeletons has major implications for palaeomicrobiology and human pathogen evolution.
Evolution of the arginase fold and functional diversity

PubMed Central

Dowling, Daniel P.; Costanzo, Luigi Di; Gennadios, Heather A.; Christianson, David W.

2009-01-01

The large number of protein structures deposited in the Protein Data Bank allows for the identification of novel structural superfamilies based on conservation of fold in addition to conservation of amino acid sequence. Since sequence diverges more rapidly than fold in protein evolution, proteins with little or no significant sequence identity are occasionally observed to adopt similar folds, thereby reflecting unanticipated evolutionary relationships. Here, we review the unique α/β fold first observed in the manganese metalloenzyme rat liver arginase, consisting of a parallel 8 stranded β-sheet surrounded by several helices, and its evolutionary relationship with the zinc-requiring and/or iron-requiring histone deacetylases and acetylpolyamine amidohydrolases. Structural comparisons reveal key features of the core α/β fold that contribute to the divergent metal ion specificity and stoichiometry required for the chemical and biological functions of these enzymes. PMID:18360740
Searching for evidence of selection in avian DNA barcodes.

PubMed

Kerr, Kevin C R

2011-11-01

The barcode of life project has assembled a tremendous number of mitochondrial cytochrome c oxidase I (COI) sequences. Although these sequences were gathered to develop a DNA-based system for species identification, it has been suggested that further biological inferences may also be derived from this wealth of data. Recurrent selective sweeps have been invoked as an evolutionary mechanism to explain limited intraspecific COI diversity, particularly in birds, but this hypothesis has not been formally tested. In this study, I collated COI sequences from previous barcoding studies on birds and tested them for evidence of selection. Using this expanded data set, I re-examined the relationships between intraspecific diversity and interspecific divergence and sampling effort, respectively. I employed the McDonald-Kreitman test to test for neutrality in sequence evolution between closely related pairs of species. Because amino acid sequences were generally constrained between closely related pairs, I also included broader intra-order comparisons to quantify patterns of protein variation in avian COI sequences. Lastly, using 22 published whole mitochondrial genomes, I compared the evolutionary rate of COI against the other 12 protein-coding mitochondrial genes to assess intragenomic variability. I found no conclusive evidence of selective sweeps. Most evidence pointed to an overall trend of strong purifying selection and functional constraint. The COI protein did vary across the class Aves, but to a very limited extent. COI was the least variable gene in the mitochondrial genome, suggesting that other genes might be more informative for probing factors constraining mitochondrial variation within species. © 2011 Blackwell Publishing Ltd.
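The McDonald-Kreitman test mentioned in the abstract above compares the ratio of non-synonymous to synonymous changes within species (polymorphisms) with the same ratio between species (fixed differences). The following is a minimal sketch of that comparison, assuming a two-sided Fisher exact test on the 2x2 table and the commonly used neutrality index; the function names and the counts in the usage note are illustrative and are not taken from the study.

```python
from math import comb

def fisher_exact_two_sided(a, b, c, d):
    """Two-sided Fisher exact p-value for the 2x2 table [[a, b], [c, d]]."""
    row1, row2, col1, n = a + b, c + d, a + c, a + b + c + d

    def prob(x):
        # Hypergeometric probability of x in the top-left cell, margins fixed
        return comb(row1, x) * comb(row2, col1 - x) / comb(n, col1)

    lo, hi = max(0, col1 - row2), min(row1, col1)
    p_obs = prob(a)
    # Sum the probabilities of all tables at least as extreme as the observed one
    total = sum(prob(x) for x in range(lo, hi + 1) if prob(x) <= p_obs * (1 + 1e-9))
    return min(1.0, total)

def mcdonald_kreitman(pn, ps, dn, ds):
    """Return (neutrality index, p-value).

    pn, ps: non-synonymous / synonymous polymorphisms within species
    dn, ds: non-synonymous / synonymous fixed differences between species
    (all four counts are assumed to be non-zero in this sketch)
    """
    neutrality_index = (pn / ps) / (dn / ds)
    p_value = fisher_exact_two_sided(dn, ds, pn, ps)
    return neutrality_index, p_value
```

For example, `mcdonald_kreitman(pn=2, ps=42, dn=7, ds=17)` gives a neutrality index below 1, meaning an excess of fixed non-synonymous differences in these made-up counts, whereas an index above 1 would point towards purifying selection of the kind the abstract reports.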
Method for isolating chromosomal DNA in preparation for hybridization in suspension

DOEpatents

Lucas, Joe N.

2000-01-01

A method is provided for detecting nucleic acid sequence aberrations using two immobilization steps. According to the method, a nucleic acid sequence aberration is detected by detecting nucleic acid sequences having both a first nucleic acid sequence type (e.g., from a first chromosome) and a second nucleic acid sequence type (e.g., from a second chromosome), the presence of the first and the second nucleic acid sequence type on the same nucleic acid sequence indicating the presence of a nucleic acid sequence aberration. In the method, immobilization of a first hybridization probe is used to isolate a first set of nucleic acids in the sample which contain the first nucleic acid sequence type. Immobilization of a second hybridization probe is then used to isolate a second set of nucleic acids from within the first set of nucleic acids which contain the second nucleic acid sequence type. The second set of nucleic acids are then detected, their presence indicating the presence of a nucleic acid sequence aberration. Chromosomal DNA in a sample containing cell debris is prepared for hybridization in suspension by treating the mixture with RNase. The treated DNA can also be fixed prior to hybridization.
Identification of fungi in shotgun metagenomics datasets

PubMed Central

Donovan, Paul D.; Gonzalez, Gabriel; Higgins, Desmond G.

2018-01-01

Metagenomics uses nucleic acid sequencing to characterize species diversity in different niches such as environmental biomes or the human microbiome. Most studies have used 16S rRNA amplicon sequencing to identify bacteria. However, the decreasing cost of sequencing has resulted in a gradual shift away from amplicon analyses and towards shotgun metagenomic sequencing. Shotgun metagenomic data can be used to identify a wide range of species, but have rarely been applied to fungal identification. Here, we develop a sequence classification pipeline, FindFungi, and use it to identify fungal sequences in public metagenome datasets. We focus primarily on animal metagenomes, especially those from pig and mouse microbiomes. We identified fungi in 39 of 70 datasets comprising 71 fungal species. At least 11 pathogenic species with zoonotic potential were identified, including Candida tropicalis. We identified Pseudogymnoascus species from 13 Antarctic soil samples initially analyzed for the presence of bacteria capable of degrading diesel oil. We also show that Candida tropicalis and Candida loboi are likely the same species. In addition, we identify several examples where contaminating DNA was erroneously included in fungal genome assemblies. PMID:29444186
Identifying functionally informative evolutionary sequence profiles.

PubMed

Gil, Nelson; Fiser, Andras

2018-04-15

Multiple sequence alignments (MSAs) can provide essential input to many bioinformatics applications, including protein structure prediction and functional annotation. However, the optimal selection of sequences to obtain biologically informative MSAs for such purposes is poorly explored, and has traditionally been performed manually. We present Selection of Alignment by Maximal Mutual Information (SAMMI), an automated, sequence-based approach to objectively select an optimal MSA from a large set of alternatives sampled from a general sequence database search. The hypothesis of this approach is that the mutual information among MSA columns will be maximal for those MSAs that contain the most diverse set possible of the most structurally and functionally homogeneous protein sequences. SAMMI was tested to select MSAs for functional site residue prediction by analysis of conservation patterns on a set of 435 proteins obtained from protein-ligand (peptides, nucleic acids and small substrates) and protein-protein interaction databases. Availability and implementation: A freely accessible program, including source code, implementing SAMMI is available at https://github.com/nelsongil92/SAMMI.git. Contact: andras.fiser@einstein.yu.edu. Supplementary information: Supplementary data are available at Bioinformatics online.
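The quantity at the centre of the abstract above, mutual information between MSA columns, can be illustrated with a short sketch. This is not the SAMMI implementation; it only computes the standard mutual information (in bits) between two columns of a toy alignment, and the function name and example sequences are hypothetical.

```python
from collections import Counter
from math import log2

def column_mutual_information(msa, i, j):
    """Mutual information (bits) between columns i and j of an alignment.

    msa is a list of equal-length aligned sequences (strings).
    """
    n = len(msa)
    count_i = Counter(seq[i] for seq in msa)             # marginal counts, column i
    count_j = Counter(seq[j] for seq in msa)             # marginal counts, column j
    count_ij = Counter((seq[i], seq[j]) for seq in msa)  # joint counts
    mi = 0.0
    for (a, b), c in count_ij.items():
        p_ab = c / n
        mi += p_ab * log2(p_ab / ((count_i[a] / n) * (count_j[b] / n)))
    return mi
```

With the toy alignment `["AKL", "AKL", "GEL", "GEL"]`, columns 0 and 1 covary perfectly and share one bit of information, while the fully conserved column 2 shares none with either of them.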
Partial bisulfite conversion for unique template sequencing.

PubMed

Kumar, Vijay; Rosenbaum, Julie; Wang, Zihua; Forcier, Talitha; Ronemus, Michael; Wigler, Michael; Levy, Dan

2018-01-25

We introduce a new protocol, mutational sequencing or muSeq, which uses sodium bisulfite to randomly deaminate unmethylated cytosines at a fixed and tunable rate. The muSeq protocol marks each initial template molecule with a unique mutation signature that is present in every copy of the template, and in every fragmented copy of a copy. In the sequenced read data, this signature is observed as a unique pattern of C-to-T or G-to-A nucleotide conversions. Clustering reads with the same conversion pattern enables accurate count and long-range assembly of initial template molecules from short-read sequence data. We explore count and low-error sequencing by profiling 135 000 restriction fragments in a PstI representation, demonstrating that muSeq improves copy number inference and significantly reduces sporadic sequencer error. We explore long-range assembly in the context of cDNA, generating contiguous transcript clusters greater than 3,000 bp in length. The muSeq assemblies reveal transcriptional diversity not observable from short-read data alone. © The Author(s) 2017. Published by Oxford University Press on behalf of Nucleic Acids Research.
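The read-clustering idea in the abstract above can be sketched in a few lines. This is a heavily simplified illustration, not the muSeq pipeline: it assumes every read is already aligned end to end against a known reference, considers only C-to-T conversions, and ignores sequencing error and partial overlap. All names and sequences are hypothetical.

```python
from collections import defaultdict

def conversion_signature(reference, read):
    """Positions where the reference has C and the read shows T."""
    return tuple(
        pos
        for pos, (ref_base, read_base) in enumerate(zip(reference, read))
        if ref_base == "C" and read_base == "T"
    )

def cluster_by_signature(reference, reads):
    """Group reads that share the same C-to-T conversion pattern."""
    clusters = defaultdict(list)
    for read in reads:
        clusters[conversion_signature(reference, read)].append(read)
    return dict(clusters)
```

Under these assumptions, the number of distinct signatures approximates the number of initial template molecules: for the reference `"ACCGTCAC"`, the reads `"ATCGTCAC"`, `"ATCGTCAC"` and `"ACCGTTAT"` fall into two clusters.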
Bioinformatic flowchart and database to investigate the origins and diversity of Clan AA peptidases

PubMed Central

Llorens, Carlos; Futami, Ricardo; Renaud, Gabriel; Moya, Andrés

2009-01-01

Background: Clan AA of aspartic peptidases relates the family of pepsin monomers evolutionarily with all dimeric peptidases encoded by eukaryotic LTR retroelements. Recent findings describing various pools of single-domain nonviral host peptidases, in prokaryotes and eukaryotes, indicate that the diversity of clan AA is larger than previously thought. The ensuing approach to investigate this enzyme group is by studying its phylogeny. However, clan AA is a difficult case to study due to the low similarity and different rates of evolution. This work is an ongoing attempt to investigate the different clan AA families to understand the cause of their diversity. Results: In this paper, we describe an in-progress database and bioinformatic flowchart designed to characterize the clan AA protein domain based on all possible protein families through ancestral reconstructions, sequence logos, and hidden Markov models (HMMs). The flowchart includes the characterization of a major consensus sequence based on 6 amino acid patterns with correspondence with Andreeva's model, the structural template describing the clan AA peptidase fold. The set of tools is a work in progress that we have organized in a database within the GyDB project, referred to as the Clan AA Reference Database. Conclusion: The pre-existing classification combined with the evolutionary history of LTR retroelements permits a consistent taxonomical collection of sequence logos and HMMs. This set is useful for gene annotation and also as a reference to evaluate the diversity of, and the relationships among, the different families. Comparisons among HMMs suggest a common ancestor for all dimeric clan AA peptidases that is halfway between single-domain nonviral peptidases and those coded by Ty3/Gypsy LTR retroelements. Sequence logos reveal how all clan AA families follow a similar protein domain architecture related to the peptidase fold. In particular, each family nucleates a particular consensus motif in the sequence position related to the flap. The different motifs constitute a network where an alanine-asparagine-like variable motif predominates, instead of the canonical flap of the HIV-1 peptidase and closer relatives. Reviewers: This article was reviewed by Daniel H. Haft, Vladimir Kapitonov (nominated by Jerry Jurka), and Ben M. Dunn (nominated by Claus Wilke). PMID:19173708
Genetic clustering and polymorphism of the merozoite surface protein-3 of Plasmodium knowlesi clinical isolates from Peninsular Malaysia.

PubMed

De Silva, Jeremy Ryan; Lau, Yee Ling; Fong, Mun Yik

2017-01-03

The simian malaria parasite Plasmodium knowlesi has been reported to cause significant numbers of human infection in South East Asia. Its merozoite surface protein-3 (MSP3) is a protein that belongs to a multi-gene family of proteins first found in Plasmodium falciparum. Several studies have evaluated the potential of P. falciparum MSP3 as a vaccine candidate. However, to date no detailed studies have been carried out on the P. knowlesi MSP3 gene (pkmsp3). The present study investigates the genetic diversity and haplotype groups of pkmsp3 in P. knowlesi clinical samples from Peninsular Malaysia. Blood samples were collected from P. knowlesi malaria patients within a period of 4 years (2008-2012). The pkmsp3 gene of the isolates was amplified via PCR, and subsequently cloned and sequenced. The full length pkmsp3 sequence was divided into Domain A and Domain B. Natural selection, genetic diversity, and haplotypes of pkmsp3 were analysed using MEGA6 and DnaSP ver. 5.10.00 programmes. From 23 samples, 48 pkmsp3 sequences were successfully obtained. At the nucleotide level, 101 synonymous and 238 non-synonymous mutations were observed. Tests of neutrality were not significant for the full length, Domain A or Domain B sequences. However, the dN/dS ratio of Domain B indicates purifying selection for this domain. Analysis of the deduced amino acid sequences revealed 42 different haplotypes. Neighbour Joining phylogenetic tree and haplotype network analyses revealed that the haplotypes clustered into two distinct groups. A moderate level of genetic diversity was observed in pkmsp3, and only the C-terminal region (Domain B) appeared to be under purifying selection. The separation of pkmsp3 into two haplotype groups provides further evidence of the existence of two distinct P. knowlesi types or lineages. Future studies should investigate the diversity of pkmsp3 among P. knowlesi isolates in North Borneo, where large numbers of human knowlesi malaria infection still occur.
Assessment of the microbial community in a constructed wetland that receives acid coal mine drainage

DOE Office of Scientific and Technical Information (OSTI.GOV)

Nicomrat, D.; Dick, W.A.; Tuovinen, O.H.

2006-01-15

Constructed wetlands are used to treat acid drainage from surface or underground coal mines. However, little is known about the microbial communities in the receiving wetland cells. The purpose of this work was to characterize the microbial population present in a wetland that was receiving acid coal mine drainage (AMD). Samples were collected from the oxic sediment zone of a constructed wetland cell in southeastern Ohio that was treating acid drainage from an underground coal mine seep. Samples comprised Fe(III) precipitates and were pretreated with ammonium oxalate to remove interfering iron, and the DNA was extracted and purified by agarose gel electrophoresis prior to amplification of portions of the 16S rRNA gene. Amplified products were separated by denaturing gradient gel electrophoresis and DNA from seven distinct bands was excised from the gel and sequenced. The sequences were matched to sequences in the GenBank bacterial 16S rDNA database. The DNA in two of the bands yielded matches with Acidithiobacillus ferrooxidans and the DNA in each of the remaining five bands was consistent with one of the following microorganisms: Acidithiobacillus thiooxidans, strain TRA3-20 (a eubacterium), strain BEN-4 (an arsenite-oxidizing bacterium), an Alcaligenes sp., and a Bordetella sp. Low bacterial diversity in these samples reflects the highly inorganic nature of the oxic sediment layer where high abundance of iron- and sulfur-oxidizing bacteria would be expected. The results we obtained by molecular methods supported our findings, obtained using culture methods, that the dominant microbial species in an acid receiving, oxic wetland are A. thiooxidans and A. ferrooxidans.
Ampicillin-Resistant Non-β-Lactamase-Producing Haemophilus influenzae in Spain: Recent Emergence of Clonal Isolates with Increased Resistance to Cefotaxime and Cefixime

PubMed Central

García-Cobos, Silvia; Campos, José; Lázaro, Edurne; Román, Federico; Cercenado, Emilia; García-Rey, César; Pérez-Vázquez, María; Oteo, Jesús; de Abajo, Francisco

2007-01-01

The sequence of the ftsI gene encoding the transpeptidase domain of penicillin-binding protein 3 (PBP 3) was determined for 354 nonconsecutive Haemophilus influenzae isolates from Spain; 17.8% of them were ampicillin susceptible, 56% were β-lactamase nonproducing ampicillin resistant (BLNAR), 15.8% were β-lactamase producers and ampicillin resistant, and 10.4% displayed both resistance mechanisms. The ftsI gene sequences had 28 different mutation patterns and amino acid substitutions at 23 positions. Some 93.2% of the BLNAR strains had amino acid substitutions at the Lys-Thr-Gly (KTG) motif, the two most common being Asn526 to Lys (83.9%) and Arg517 to His (9.3%). Amino acid substitutions at positions 377, 385, and 389, which conferred cefotaxime and cefixime MICs 10 to 60 times higher than those of susceptible strains, were found for the first time in Europe. In 72 isolates for which the repressor acrR gene of the AcrAB efflux pump was sequenced, numerous amino acid substitutions were found. Eight isolates with ampicillin MICs of 0.25 to 2 μg/ml showed changes that predicted the early termination of the acrR reading frame. Pulsed-field gel electrophoresis analysis demonstrated that most BLNAR strains were genetically diverse, although clonal dissemination was detected in a group of isolates presenting with increased resistance to cefotaxime and cefixime. Background antibiotic use at the community level revealed a marked trend toward increased amoxicillin-clavulanic acid consumption. BLNAR H. influenzae strains have arisen by vertical and horizontal spread and have evolved to adapt rapidly to the increased selective pressures posed by the use of oral penicillins and cephalosporins. PMID:17470649
Microbial Community Structure and Arsenic Biogeochemistry in an Acid Vapor-Formed Spring in Tengchong Geothermal Area, China

PubMed Central

Jiang, Zhou; Li, Ping; Jiang, Dawei; Dai, Xinyue; Zhang, Rui; Wang, Yanhong; Wang, Yanxin

2016-01-01

Arsenic biogeochemistry has been studied extensively in acid sulfate-chloride hot springs, but not in acid sulfate hot springs with low chloride. In this study, Zhenzhuquan in Tengchong geothermal area, a representative acid sulfate hot spring with low chloride, was chosen to study arsenic geochemistry and microbial community structure using Illumina MiSeq sequencing. Over 0.3 million 16S rRNA sequence reads were obtained from 6-paired parallel water and sediment samples along its outflow channel. Arsenic oxidation occurred in the Zhenzhuquan pool, with distinctly high ratios of arsenate to total dissolved arsenic (0.73–0.86). Coupled with iron and sulfur oxidation along the outflow channel, arsenic accumulated in downstream sediments with concentrations up to 16.44 g/kg and appeared to significantly constrain their microbial community diversity. These oxidations might be correlated with the appearance of some putative functional microbial populations, such as Aquificae and Pseudomonas (arsenic oxidation), Sulfolobus (sulfur and iron oxidation), Metallosphaera and Acidicaldus (iron oxidation). Temperature, total organic carbon and dissolved oxygen significantly shaped the microbial community structure of upstream and downstream samples. In the upstream outflow channel region, most microbial populations were microaerophilic/anaerobic thermophiles and hyperthermophiles, such as Sulfolobus, Nocardia, Fervidicoccus, Delftia, and Ralstonia. In the downstream region, aerobic heterotrophic mesophiles and thermophiles were identified, including Ktedonobacteria, Acidicaldus, Chthonomonas and Sphingobacteria. A total of 72.41–95.91% unassigned-genus sequences were derived from the downstream high arsenic sediments 16S rRNA clone libraries. This study could enable us to achieve an integrated understanding of arsenic biogeochemistry in acid hot springs. PMID:26761709
Bats host diverse parvoviruses as possible origin of mammalian dependoparvoviruses and source for bat-swine interspecies transmission.

PubMed

Lau, Susanna K P; Ahmed, Syed Shakeel; Tsoi, Hoi-Wah; Yeung, Hazel C; Li, Kenneth S M; Fan, Rachel Y Y; Zhao, Pyrear S H; Lau, Candy C C; Lam, Carol S F; Choi, Kelvin K F; Chan, Ben C H; Cai, Jian-Piao; Wong, Samson S Y; Chen, Honglin; Zhang, Hai-Lin; Zhang, Libiao; Wang, Ming; Woo, Patrick C Y; Yuen, Kwok-Yung

2017-11-06

Compared to the enormous species diversity of bats, relatively few parvoviruses have been reported. We detected diverse and potentially novel parvoviruses from bats in Hong Kong and mainland China. Parvoviruses belonging to Amdoparvovirus, Bocaparvovirus and Dependoparvovirus were detected in alimentary, liver and spleen samples from 16 different chiropteran species of five families by PCR. Phylogenetic analysis of partial helicase sequences showed that they potentially belonged to 25 bocaparvovirus, three dependoparvovirus and one amdoparvovirus species. Nearly complete genome sequencing confirmed the existence of at least four novel bat bocaparvovirus species (Rp-BtBoV1 and Rp-BtBoV2 from Rhinolophus pusillus, Rs-BtBoV2 from Rhinolophus sinicus and Rol-BtBoV1 from Rousettus leschenaultii) and two novel bat dependoparvovirus species (Rp-BtAAV1 from Rhinolophus pusillus and Rs-BtAAV1 from Rhinolophus sinicus). Rs-BtBoV2 was closely related to Ungulate bocaparvovirus 5 with 93, 72.1 and 78.7 % amino acid identities in the NS1, NP1 and VP1/VP2 genes, respectively. The detection of bat bocaparvoviruses, including Rs-BtBoV2, closely related to porcine bocaparvoviruses, suggests recent interspecies transmission of bocaparvoviruses between bats and swine. Moreover, Rp-BtAAV1 and Rs-BtAAV1 were most closely related to human AAV1 with 48.7 and 57.5 % amino acid identities in the rep gene. The phylogenetic relationship between BtAAVs and other mammalian AAVs suggests bats as the ancestral origin of mammalian AAVs. Furthermore, parvoviruses of the same species were detected from multiple bat species or families, supporting the ability of bat parvoviruses to cross species barriers. The results extend our knowledge on the diversity of bat parvoviruses and the role of bats in parvovirus evolution and emergence in humans and animals.
High quality draft genome sequence of Olivibacter sitiensis type strain (AW-6T), a diphenol degrader with genes involved in the catechol pathway

PubMed Central

Ntougias, Spyridon; Lapidus, Alla; Han, James; Mavromatis, Konstantinos; Pati, Amrita; Chen, Amy; Klenk, Hans-Peter; Woyke, Tanja; Fasseas, Constantinos; Kyrpides, Nikos C.; Zervakis, Georgios I.

2014-01-01

Olivibacter sitiensis Ntougias et al. 2007 is a member of the family Sphingobacteriaceae, phylum Bacteroidetes. Members of the genus Olivibacter are phylogenetically diverse and of significant interest. They occur in diverse habitats, such as rhizosphere and contaminated soils, viscous wastes, composts, biofilter clean-up facilities on contaminated sites and cave environments, and they are involved in the degradation of complex and toxic compounds. Here we describe the features of O. sitiensis AW-6T, together with the permanent-draft genome sequence and annotation. The organism was sequenced under the Genomic Encyclopedia for Bacteria and Archaea (GEBA) project at the DOE Joint Genome Institute and is the first genome sequence of a species within the genus Olivibacter. The genome is 5,053,571 bp long and is comprised of 110 scaffolds with an average GC content of 44.61%. Of the 4,565 genes predicted, 4,501 were protein-coding genes and 64 were RNA genes. Most protein-coding genes (68.52%) were assigned to a putative function. The identification of 2-keto-4-pentenoate hydratase/2-oxohepta-3-ene-1,7-dioic acid hydratase-coding genes indicates involvement of this organism in the catechol catabolic pathway. In addition, genes encoding for β-1,4-xylanases and β-1,4-xylosidases reveal the xylanolytic action of O. sitiensis. PMID:25197463
Comparative “Omics” of the Fusarium fujikuroi Species Complex Highlights Differences in Genetic Potential and Metabolite Synthesis

PubMed Central

Niehaus, Eva-Maria; Münsterkötter, Martin; Proctor, Robert H.; Brown, Daren W.; Sharon, Amir; Idan, Yifat; Oren-Young, Liat; Sieber, Christian M.; Novák, Ondřej; Pěnčík, Aleš; Tarkowská, Danuše; Hromadová, Kristýna; Freeman, Stanley; Maymon, Marcel; Elazar, Meirav; Youssef, Sahar A.; El-Shabrawy, El Said M.; Shalaby, Abdel Baset A.; Houterman, Petra; Brock, Nelson L.; Burkhardt, Immo; Tsavkelova, Elena A.; Dickschat, Jeroen S.; Galuszka, Petr; Güldener, Ulrich; Tudzynski, Bettina

2016-01-01

Species of the Fusarium fujikuroi species complex (FFC) cause a wide spectrum of often devastating diseases on diverse agricultural crops, including coffee, fig, mango, maize, rice, and sugarcane. Although species within the FFC are difficult to distinguish by morphology, and their genes often share 90% sequence similarity, they can differ in host plant specificity and life style. FFC species can also produce structurally diverse secondary metabolites (SMs), including the mycotoxins fumonisins, fusarins, fusaric acid, and beauvericin, and the phytohormones gibberellins, auxins, and cytokinins. The spectrum of SMs produced can differ among closely related species, suggesting that SMs might be determinants of host specificity. To date, genomes of only a limited number of FFC species have been sequenced. Here, we provide draft genome sequences of three more members of the FFC: a single isolate of F. mangiferae, the cause of mango malformation, and two isolates of F. proliferatum, one a pathogen of maize and the other an orchid endophyte. We compared these genomes to publicly available genome sequences of three other FFC species. The comparisons revealed species-specific and isolate-specific differences in the composition and expression (in vitro and in planta) of genes involved in SM production including those for phytohormone biosynthesis. Such differences have the potential to impact host specificity and, as in the case of F. proliferatum, the pathogenic versus endophytic life style. PMID:28040774
A novel approach to tracking antigen-experienced CD4 T cells into functional compartments via tandem deep and shallow TCR clonotyping.

PubMed

Estorninho, Megan; Gibson, Vivienne B; Kronenberg-Versteeg, Deborah; Liu, Yuk-Fun; Ni, Chester; Cerosaletti, Karen; Peakman, Mark

2013-12-01

Extensive diversity in the human repertoire of TCRs for Ag is both a cornerstone of effective adaptive immunity that enables host protection against a multiplicity of pathogens and a weakness that gives rise to potential pathological self-reactivity. The complexity arising from diversity makes detection and tracking of single Ag-specific CD4 T cells (ASTs) involved in these immune responses challenging. We report a tandem, multistep process to quantify rare TCRβ-chain variable sequences of ASTs in large polyclonal populations. The approach combines deep high-throughput sequencing (HTS) within functional CD4 T cell compartments, such as naive/memory cells, with shallow, multiple identifier-based HTS of ASTs identified by activation marker upregulation after short-term Ag stimulation in vitro. We find that clonotypes recognizing HLA class II-restricted epitopes of both pathogen-derived Ags and self-Ags are oligoclonal and typically private. Clonotype tracking within an individual reveals private AST clonotypes resident in the memory population, as would be expected, representing clonal expansions (identical nucleotide sequence; "ultraprivate"). Other AST clonotypes share CDR3β amino acid sequences through convergent recombination and are found in memory populations of multiple individuals. Tandem HTS-based clonotyping will facilitate studying AST dynamics, epitope spreading, and repertoire changes that arise postvaccination and following Ag-specific immunotherapies for cancer and autoimmune disease.
Multi-Omics Reveals that Lead Exposure Disturbs Gut Microbiome Development, Key Metabolites, and Metabolic Pathways.

PubMed

Gao, Bei; Chi, Liang; Mahbub, Ridwan; Bian, Xiaoming; Tu, Pengcheng; Ru, Hongyu; Lu, Kun

2017-04-17

Lead exposure remains a global public health issue, and the recent Flint water crisis has renewed public concern about lead toxicity. The toxicity of lead has been well established in a variety of systems and organs. The gut microbiome has been shown to be highly involved in many critical physiological processes, including food digestion, immune system development, and metabolic homeostasis. However, despite the key role of the gut microbiome in human health, the functional impact of lead exposure on the gut microbiome has not been studied. The aim of this study is to define gut microbiome toxicity induced by lead exposure in C57BL/6 mice using multiomics approaches, including 16S rRNA sequencing, whole genome metagenomics sequencing, and gas chromatography-mass spectrometry (GC-MS) metabolomics. 16S rRNA sequencing revealed that lead exposure altered the gut microbiome trajectory and phylogenetic diversity. Metagenomics sequencing and metabolomics profiling showed that numerous metabolic pathways, including vitamin E, bile acids, nitrogen metabolism, energy metabolism, oxidative stress, and the defense/detoxification mechanism, were significantly disturbed by lead exposure. These perturbed molecules and pathways may have important implications for lead toxicity in the host. Taken together, these results demonstrated that lead exposure not only altered the gut microbiome community structures/diversity but also greatly affected metabolic functions, leading to gut microbiome toxicity.
Microbial Culturomics Broadens Human Vaginal Flora Diversity: Genome Sequence and Description of Prevotella lascolaii sp. nov. Isolated from a Patient with Bacterial Vaginosis.

PubMed

Diop, Khoudia; Diop, Awa; Levasseur, Anthony; Mediannikov, Oleg; Robert, Catherine; Armstrong, Nicholas; Couderc, Carine; Bretelle, Florence; Raoult, Didier; Fournier, Pierre-Edouard; Fenollar, Florence

2018-03-01

Microbial culturomics is a new subfield of postgenomic medicine and omics biotechnology application that has broadened our awareness on bacterial diversity of the human microbiome, including the human vaginal flora bacterial diversity. Using culturomics, a new obligate anaerobic Gram-stain-negative rod-shaped bacterium designated strain khD1 T was isolated in the vagina of a patient with bacterial vaginosis and characterized using taxonogenomics. The most abundant cellular fatty acids were C 15:0 anteiso (36%), C 16:0 (19%), and C 15:0 iso (10%). Based on an analysis of the full-length 16S rRNA gene sequences, phylogenetic analysis showed that the strain khD1 T exhibited 90% sequence similarity with Prevotella loescheii, the phylogenetically closest validated Prevotella species. With 3,763,057 bp length, the genome of strain khD1 T contained (mol%) 48.7 G + C and 3248 predicted genes, including 3194 protein-coding and 54 RNA genes. Given the phenotypical and biochemical characteristic results as well as genome sequencing, strain khD1 T is considered to represent a novel species within the genus Prevotella, for which the name Prevotella lascolaii sp. nov. is proposed. The type strain is khD1 T ( = CSUR P0109, = DSM 101754). These results show that microbial culturomics greatly improves the characterization of the human microbiome repertoire by isolating potential putative new species. Further studies will certainly clarify the microbial mechanisms of pathogenesis of these new microbes and their role in health and disease. Microbial culturomics is an important new addition to the diagnostic medicine toolbox and warrants attention in future medical, global health, and integrative biology postgraduate teaching curricula.
Evolutionary trends of European bat lyssavirus type 2 including genetic characterization of Finnish strains of human and bat origin 24 years apart.

PubMed

Jakava-Viljanen, Miia; Nokireki, Tiina; Sironen, Tarja; Vapalahti, Olli; Sihvonen, Liisa; Huovilainen, Anita

2015-06-01

Among other lyssaviruses, Daubenton's and pond-bat-related European bat lyssavirus type 2 (EBLV-2) can cause human rabies. To investigate the diversity and evolutionary trends of EBLV-2, complete genome sequences of two Finnish isolates were analysed. One originated from a human case in 1985, and the other originated from a bat in 2009. The overall nucleotide and deduced amino acid sequence identity of the two Finnish isolates was high, as was the similarity to fully sequenced EBLV-2 strains originating from the UK and the Netherlands. In phylogenetic analysis, the EBLV-2 strains formed a monophyletic group that was separate from other bat-type lyssaviruses, with significant support. EBLV-2 shared the most recent common ancestry with Bokeloh bat lyssavirus (BBLV) and Khujand virus (KHUV). EBLV-2 showed limited diversity compared to RABV and appears to be well adapted to its host bat species. The slow tempo of viral evolution was evident in the estimations of divergence times for EBLV-2: the current diversity was estimated to have built up during the last 2000 years, and EBLV-2 diverged from KHUV about 8000 years ago. In a phylogenetic tree of partial N gene sequences, the Finnish EBLV-2 strains clustered with strains from Central Europe, supporting the hypothesis that EBLV-2 circulating in Finland might have a Central European origin. The Finnish EBLV-2 strains and a Swiss strain were estimated to have diverged from other EBLV-2 strains during the last 1000 years, and the two Finnish strains appear to have evolved from a common ancestor during the last 200 years.

Ranking viruses: measures of positional importance within networks define core viruses for rational polyvalent vaccine development.

PubMed

Anderson, Tavis K; Laegreid, William W; Cerutti, Francesco; Osorio, Fernando A; Nelson, Eric A; Christopher-Hennings, Jane; Goldberg, Tony L

2012-06-15

The extraordinary genetic and antigenic variability of RNA viruses is arguably the greatest challenge to the development of broadly effective vaccines. No single viral variant can induce sufficiently broad immunity, and incorporating all known naturally circulating variants into one multivalent vaccine is not feasible. Furthermore, no objective strategies currently exist to select actual viral variants that should be included or excluded in polyvalent vaccines. To address this problem, we demonstrate a method based on graph theory that quantifies the relative importance of viral variants. We demonstrate our method through application to the envelope glycoprotein gene of a particularly diverse RNA virus of pigs: porcine reproductive and respiratory syndrome virus (PRRSV). Using distance matrices derived from sequence nucleotide difference, amino acid difference and evolutionary distance, we constructed viral networks and used common network statistics to assign each sequence an objective ranking of relative 'importance'. To validate our approach, we use an independent published algorithm to score our top-ranked wild-type variants for coverage of putative T-cell epitopes across the 9383 sequences in our dataset. Top-ranked viruses achieve significantly higher coverage than low-ranked viruses, and top-ranked viruses achieve nearly equal coverage as a synthetic mosaic protein constructed in silico from the same set of 9383 sequences. Our approach relies on the network structure of PRRSV but applies to any diverse RNA virus because it identifies subsets of viral variants that are most important to overall viral diversity. We suggest that this method, through the objective quantification of variant importance, provides criteria for choosing viral variants for further characterization, diagnostics, surveillance and ultimately polyvalent vaccine development.
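The network idea in this abstract can be sketched in a few lines. The abstract does not say which network statistics or distance cut-offs were used, so the sketch below assumes the simplest choice: link two aligned sequences when the fraction of differing positions is at or below a threshold, then rank sequences by degree (number of neighbours). The sequences, the threshold, and the function names are hypothetical and are not taken from the study.

```python
from itertools import combinations


def hamming_fraction(a, b):
    """Fraction of aligned positions at which two equal-length sequences differ."""
    if len(a) != len(b):
        raise ValueError("sequences must be aligned to equal length")
    return sum(x != y for x, y in zip(a, b)) / len(a)


def rank_by_degree(seqs, threshold):
    """Rank sequence names by how many other sequences lie within `threshold`."""
    degree = {name: 0 for name in seqs}
    for (n1, s1), (n2, s2) in combinations(seqs.items(), 2):
        if hamming_fraction(s1, s2) <= threshold:
            degree[n1] += 1
            degree[n2] += 1
    # Highest degree first; ties are broken alphabetically by name.
    return sorted(degree.items(), key=lambda kv: (-kv[1], kv[0]))


# Toy aligned sequences (hypothetical, not PRRSV data).
toy = {
    "v1": "ACGTACGT",
    "v2": "ACGTACGA",
    "v3": "ACGTACTT",
    "v4": "TTTTAAAA",
}
ranking = rank_by_degree(toy, threshold=0.2)
print(ranking)
```

In this toy set, v1 sits within the threshold of both v2 and v3 and therefore ranks first, while the outlier v4 has no neighbours. Other centrality measures (closeness, betweenness) could be substituted for degree without changing the overall approach.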
Microbial Community Responses to Glycine Addition in Kansas Prairie Soils

NASA Astrophysics Data System (ADS)

Bottos, E.; Roy Chowdhury, T.; White, R. A., III; Brislawn, C.; Fansler, S.; Kim, Y. M.; Metz, T. O.; McCue, L. A.; Jansson, J.

2015-12-01

Advances in sequencing technologies are rapidly expanding our abilities to unravel aspects of microbial community structure and function in complex systems like soil; however, characterizing the highly diverse communities is problematic, due primarily to challenges in data analysis. To tackle this problem, we aimed to constrain the microbial diversity in a soil by enriching for particular functional groups within a community through addition of "trigger substrates". Such trigger substrates, characterized by low molecular weight, readily soluble and diffusible in soil solution, representative of soil organic matter derivatives, would also be rapidly degradable. A relatively small energy investment to maintain the cell in a state of metabolic alertness for such substrates would be a better evolutionary strategy and presumably select for a cohort of microorganisms with the energetics and cellular machinery for utilization and growth. We chose glycine, a free amino acid (AA) known to have short turnover times (in the range of hours) in soil. As such, AAs are a good source of nitrogen and easily degradable, and can serve as building blocks for microbial proteins and other biomass components. We hypothesized that the addition of glycine as a trigger substrate will decrease microbial diversity and evenness, as taxa capable of metabolizing it are enriched in relation to those that are not. We tested this hypothesis by incubating three Kansas native prairie soils with glycine for 24 hours at 21 °C, and measured community-level responses by 16S rRNA gene sequencing, metagenomics, and metatranscriptomics. Preliminary evaluation of 16S rRNA gene sequences revealed minor changes in bacterial community composition in response to glycine addition. We will also present data on functional gene abundance and expression. The results of these analyses will be useful in designing sequencing strategies aimed at dissecting and deciphering complex microbial communities.
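The hypothesis above is stated in terms of diversity and evenness. The abstract does not name the indices used, so the sketch below assumes two common ones: the Shannon index H' and Pielou's evenness J' = H' / ln(S), where S is the number of observed taxa. The read counts are invented for illustration and are not study data.

```python
import math


def shannon_diversity(counts):
    """Shannon index H' = -sum(p_i * ln p_i) over taxa with nonzero counts."""
    total = sum(counts)
    return -sum((c / total) * math.log(c / total) for c in counts if c > 0)


def pielou_evenness(counts):
    """Pielou's J' = H' / ln(S); defined here as 0.0 when fewer than 2 taxa are seen."""
    richness = sum(1 for c in counts if c > 0)
    if richness < 2:
        return 0.0
    return shannon_diversity(counts) / math.log(richness)


# Hypothetical read counts per taxon (illustrative numbers only).
control = [25, 25, 25, 25]    # perfectly even community
enriched = [70, 10, 10, 10]   # one taxon enriched after substrate addition

for label, counts in (("control", control), ("enriched", enriched)):
    print(label, round(shannon_diversity(counts), 3), round(pielou_evenness(counts), 3))
```

With these toy counts, enrichment of a single taxon lowers both indices relative to the even community, which is the direction of change the hypothesis predicts.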
A world in one dimension: Linus Pauling, Francis Crick and the central dogma of molecular biology.

PubMed

Strasser, Bruno J

2006-01-01

In 1957, Francis Crick outlined a startling vision of life in which the great diversity of forms and shapes of macromolecules was encoded in the one-dimensional sequence of nucleic acids. This paper situates Crick's new vision in the debates of the 1950s about protein synthesis and gene action. After exploring the reception of Crick's ideas, it shows how they differed radically from a different model of protein synthesis which enjoyed wide currency in that decade. In this alternative model, advocated by Linus Pauling and other luminaries, three-dimensional templates directed the folding of proteins. Even though it was always considered somewhat speculative, this theory was supported by a number of empirical results originating in different experimental systems. It was eventually replaced by a model in which the forms and shapes of macromolecules resulted solely from their amino acid sequence, dramatically simplifying the problem of protein synthesis which Crick was attempting to solve in 1957.
The mitochondrial genome of Aspergillus nidulans contains reading frames homologous to the human URFs 1 and 4.

PubMed Central

Brown, T A; Davies, R W; Ray, J A; Waring, R B; Scazzocchio, C

1983-01-01

A 2830-bp segment of the mitochondrial genome of the fungus Aspergillus nidulans was sequenced and shown to contain two unidentified reading frames (URFs). These reading frames are 352 and 488 codons in length, and would specify unmodified proteins of mol. wts. 39,000 and 54,000, respectively. The derived amino acid sequences indicate that these genes are equivalent to the human mitochondrial URFs 1 and 4, with 39% amino acid homology for URF1 and 26% for URF4. Both URFs were shown by secondary structure predictions to code for predominantly beta-sheeted proteins with strong structural conservation between the fungal and human homologues. Counterparts of mammalian URFs have not previously been identified in non-mammalian genomes, and the discovery that A. nidulans possesses reading frames so closely homologous with URF1 and URF4 shows that these genes are of general functional importance in the mitochondria of diverse species. PMID:11894959
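The molecular weights quoted above can be checked with a back-of-the-envelope calculation. The sketch below assumes an average residue mass of about 110 Da, a common rule of thumb that is not stated in the abstract, and treats every codon as one residue; the constant and function name are illustrative.

```python
AVG_RESIDUE_MASS_DA = 110.0  # assumed average mass per amino acid residue


def approx_protein_mass(codons, residue_mass=AVG_RESIDUE_MASS_DA):
    """Rough mass in daltons of the unmodified protein encoded by `codons` codons."""
    return codons * residue_mass


for codons in (352, 488):
    # Round to the nearest thousand daltons, matching the precision of the abstract.
    print(codons, round(approx_protein_mass(codons), -3))
```

Under this assumption, 352 and 488 codons give roughly 38,700 and 53,700 Da, which round to the 39,000 and 54,000 reported in the abstract.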
GOLabeler: Improving Sequence-based Large-scale Protein Function Prediction by Learning to Rank.

PubMed

You, Ronghui; Zhang, Zihan; Xiong, Yi; Sun, Fengzhu; Mamitsuka, Hiroshi; Zhu, Shanfeng

2018-03-07

Gene Ontology (GO) has been widely used to annotate functions of proteins and understand their biological roles. Currently only <1% of more than 70 million proteins in UniProtKB have experimental GO annotations, implying the strong necessity of automated function prediction (AFP) of proteins, where AFP is a hard multilabel classification problem because one protein can carry a varying number of GO terms. Most of these proteins have only sequences as input information, indicating the importance of sequence-based AFP (SAFP: sequences are the only input). Furthermore, homology-based SAFP tools are competitive in AFP competitions, while they do not necessarily work well for so-called difficult proteins, which have <60% sequence identity to proteins with annotations already. Thus the vital and challenging problem now is how to develop a method for SAFP, particularly for difficult proteins. The key to this method is to extract not only homology information but also diverse, deep-rooted information/evidence from sequence inputs and integrate them into a predictor in a both effective and efficient manner. We propose GOLabeler, which integrates five component classifiers, trained from different features, including GO term frequency, sequence alignment, amino acid trigram, domains and motifs, and biophysical properties, etc., in the framework of learning to rank (LTR), a paradigm of machine learning, especially powerful for multilabel classification. The empirical results obtained by examining GOLabeler extensively and thoroughly by using large-scale datasets revealed numerous favorable aspects of GOLabeler, including significant performance advantage over state-of-the-art AFP methods. Availability: http://datamining-iip.fudan.edu.cn/golabeler. Contact: zhusf@fudan.edu.cn. Supplementary data are available at Bioinformatics online.
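One of the feature types named above, the amino acid trigram, is easy to illustrate. The sketch below only counts overlapping three-residue windows in a sequence; how GOLabeler encodes, weights, or feeds such counts to its component classifier is not described in the abstract, so the function name and the example peptide are assumptions.

```python
from collections import Counter


def trigram_counts(sequence):
    """Count overlapping amino acid trigrams in a protein sequence."""
    seq = sequence.upper()
    # A sequence shorter than 3 residues yields an empty Counter.
    return Counter(seq[i:i + 3] for i in range(len(seq) - 2))


# Hypothetical peptide used only to show the output shape.
counts = trigram_counts("MKVLAMKV")
print(counts.most_common(3))
```

A count vector of this kind has at most 20^3 = 8,000 dimensions for the standard amino acids and can be passed to any conventional classifier.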
Insights from the metagenome of an acid salt lake: the role of biology in an extreme depositional environment.

PubMed

Johnson, Sarah Stewart; Chevrette, Marc Gerard; Ehlmann, Bethany L; Benison, Kathleen Counter

2015-01-01

The extremely acidic brine lakes of the Yilgarn Craton of Western Australia are home to some of the most biologically challenging waters on Earth. In this study, we employed metagenomic shotgun sequencing to generate a microbial profile of the depositional environment associated with the sulfur-rich sediments of one such lake. Of the 1.5 M high-quality reads generated, 0.25 M were mapped to protein features, which in turn provide new insights into the metabolic function of this community. In particular, 45 diverse genes associated with sulfur metabolism were identified, the majority of which were linked to either the conversion of sulfate to adenylylsulfate and the subsequent production of sulfide from sulfite or the oxidation of sulfide, elemental sulfur, and thiosulfate via the sulfur oxidation (Sox) system. This is the first metagenomic study of an acidic, hypersaline depositional environment, and we present evidence for a surprisingly high level of microbial diversity. Our findings also illuminate the possibility that we may be meaningfully underestimating the effects of biology on the chemistry of these sulfur-rich sediments, thereby influencing our understanding of past geobiological conditions that may have been present on Earth as well as early Mars.
[Microbial diversity and ammonia-oxidizing microorganism of a soil sample near an acid mine drainage lake].

PubMed

Liu, Ying; Wang, Li-Hua; Hao, Chun-Bo; Li, Lu; Li, Si-Yuan; Feng, Chuan-Ping

2014-06-01

The main physicochemical parameters of a soil sample collected near an acid mine drainage reservoir in Anhui province were analyzed. The microbial diversity and community structure were studied through the construction of bacterial and archaeal 16S rRNA gene clone libraries and an archaeal ammonia monooxygenase gene clone library. The functional groups responsible for the process of ammonia oxidation were also discussed. The results indicated that the soil sample had an extremely low pH value (pH < 3) and high ion concentrations, influenced by the acid mine drainage (AMD). All the 16S rRNA gene sequences of the bacterial clone library fell into 11 phyla, and Acidobacteria played the most significant role in the ecosystem, followed by Verrucomicrobia. A great number of acidophilic bacteria existed in the soil sample, such as Candidatus Koribacter versatilis and Holophaga sp. The archaeal clone library consisted of 2 phyla (Thaumarchaeota and Euryarchaeota). The abundance of Thaumarchaeota was remarkably higher than that of Euryarchaeota. The ammonia oxidation in the soil environment was probably driven by ammonia-oxidizing archaea, and new species of ammonia-oxidizing archaea existed in the soil sample.
Subset of Kappa and Lambda Germline Sequences Result in Light Chains with a Higher Molecular Mass Phenotype.

PubMed

Barnidge, David R; Lundström, Susanna L; Zhang, Bo; Dasari, Surendra; Murray, David L; Zubarev, Roman A

2015-12-04

In our previous work, we showed that electrospray ionization of intact polyclonal kappa and lambda light chains isolated from normal serum generates two distinct, Gaussian-shaped, molecular mass distributions representing the light-chain repertoire. During the analysis of a large (>100) patient sample set, we noticed a low-intensity molecular mass distribution with a mean of approximately 24 250 Da, roughly 800 Da higher than the mean of the typical kappa molecular-mass distribution mean of 23 450 Da. We also observed distinct clones in this region that did not appear to contain any typical post-translational modifications that would account for such a large mass shift. To determine the origin of the high molecular mass clones, we performed de novo bottom-up mass spectrometry on a purified IgM monoclonal light chain that had a calculated molecular mass of 24 275.03 Da. The entire sequence of the monoclonal light chain was determined using multienzyme digestion and de novo sequence-alignment software and was found to belong to the germline allele IGKV2-30. The alignment of kappa germline sequences revealed ten IGKV2 and one IGKV4 sequences that contained additional amino acids in their CDR1 region, creating the high-molecular-mass phenotype. We also performed an alignment of lambda germline sequences, which showed additional amino acids in the CDR2 region, and the FR3 region of functional germline sequences that result in a high-molecular-mass phenotype. The work presented here illustrates the ability of mass spectrometry to provide information on the diversity of light-chain molecular mass phenotypes in circulation, which reflects the germline sequences selected by the immunoglobulin-secreting B-cell population.
Expanding the biotechnology potential of lactobacilli through comparative genomics of 213 strains and associated genera.

PubMed

Sun, Zhihong; Harris, Hugh M B; McCann, Angela; Guo, Chenyi; Argimón, Silvia; Zhang, Wenyi; Yang, Xianwei; Jeffery, Ian B; Cooney, Jakki C; Kagawa, Todd F; Liu, Wenjun; Song, Yuqin; Salvetti, Elisa; Wrobel, Agnieszka; Rasinkangas, Pia; Parkhill, Julian; Rea, Mary C; O'Sullivan, Orla; Ritari, Jarmo; Douillard, François P; Paul Ross, R; Yang, Ruifu; Briner, Alexandra E; Felis, Giovanna E; de Vos, Willem M; Barrangou, Rodolphe; Klaenhammer, Todd R; Caufield, Page W; Cui, Yujun; Zhang, Heping; O'Toole, Paul W

2015-09-29

Lactobacilli are a diverse group of species that occupy diverse nutrient-rich niches associated with humans, animals, plants and food. They are used widely in biotechnology and food preservation, and are being explored as therapeutics. Exploiting lactobacilli has been complicated by metabolic diversity, unclear species identity and uncertain relationships between them and other commercially important lactic acid bacteria. The capacity for biotransformations catalysed by lactobacilli is an untapped biotechnology resource. Here we report the genome sequences of 213 Lactobacillus strains and associated genera, and their encoded genetic catalogue for modifying carbohydrates and proteins. In addition, we describe broad and diverse presence of novel CRISPR-Cas immune systems in lactobacilli that may be exploited for genome editing. We rationalize the phylogenomic distribution of host interaction factors and bacteriocins that affect their natural and industrial environments, and mechanisms to withstand stress during technological processes. We present a robust phylogenomic framework of existing species and for classifying new species.
Expanding the biotechnology potential of lactobacilli through comparative genomics of 213 strains and associated genera

PubMed Central

Sun, Zhihong; Harris, Hugh M. B.; McCann, Angela; Guo, Chenyi; Argimón, Silvia; Zhang, Wenyi; Yang, Xianwei; Jeffery, Ian B; Cooney, Jakki C.; Kagawa, Todd F.; Liu, Wenjun; Song, Yuqin; Salvetti, Elisa; Wrobel, Agnieszka; Rasinkangas, Pia; Parkhill, Julian; Rea, Mary C.; O'Sullivan, Orla; Ritari, Jarmo; Douillard, François P.; Paul Ross, R.; Yang, Ruifu; Briner, Alexandra E.; Felis, Giovanna E.; de Vos, Willem M.; Barrangou, Rodolphe; Klaenhammer, Todd R.; Caufield, Page W.; Cui, Yujun; Zhang, Heping; O'Toole, Paul W.

2015-01-01

Lactobacilli are a diverse group of species that occupy diverse nutrient-rich niches associated with humans, animals, plants and food. They are used widely in biotechnology and food preservation, and are being explored as therapeutics. Exploiting lactobacilli has been complicated by metabolic diversity, unclear species identity and uncertain relationships between them and other commercially important lactic acid bacteria. The capacity for biotransformations catalysed by lactobacilli is an untapped biotechnology resource. Here we report the genome sequences of 213 Lactobacillus strains and associated genera, and their encoded genetic catalogue for modifying carbohydrates and proteins. In addition, we describe broad and diverse presence of novel CRISPR-Cas immune systems in lactobacilli that may be exploited for genome editing. We rationalize the phylogenomic distribution of host interaction factors and bacteriocins that affect their natural and industrial environments, and mechanisms to withstand stress during technological processes. We present a robust phylogenomic framework of existing species and for classifying new species. PMID:26415554
Lactic acid bacteria involved in cocoa beans fermentation from Ivory Coast: Species diversity and citrate lyase production.

PubMed

Ouattara, Hadja D; Ouattara, Honoré G; Droux, Michel; Reverchon, Sylvie; Nasser, William; Niamke, Sébastien L

2017-09-01

Microbial fermentation is an indispensable process for producing high-quality chocolate from cocoa bean raw material. Lactic acid bacteria (LAB) are among the major microorganisms responsible for cocoa fermentation, but their exact role remains to be elucidated. In this study, we analyzed the diversity of LAB in six cocoa-producing regions of Ivory Coast. Ribosomal 16S gene sequence analysis showed that Lactobacillus plantarum and Leuconostoc mesenteroides are the dominant LAB species in these six regions. In addition, other species were identified as the minor microbial population, namely Lactobacillus curieae, Enterococcus faecium, Fructobacillus pseudoficulneus, Lactobacillus casei, Weissella paramesenteroides and Weissella cibaria. However, in each region, the LAB microbial population was composed of a restricted number of species (maximum 5 species), which varied between the different regions. LAB implication in the breakdown of citric acid was investigated as a fundamental property for a successful cocoa fermentation process. High citrate lyase producer strains were characterized by rapid citric acid consumption, as revealed by a 4-fold decrease in citric acid concentration in the growth medium within 12 h, concomitant with an increase in acetic acid and lactic acid concentration. The production of citrate lyase was strongly dependent on environmental conditions, with optimum production at acidic pH (pH<5) and moderate temperature (30-40°C), which corresponds to conditions prevailing in the early stage of natural cocoa fermentation. This study reveals that one of the major roles of LAB in the cocoa fermentation process involves the breakdown of citric acid during the early stage of cocoa fermentation through the activity of citrate lyase.
Ecological roles of dominant and rare prokaryotes in acid mine drainage revealed by metagenomics and metatranscriptomics

DOE Office of Scientific and Technical Information (OSTI.GOV)

Hua, Zheng-Shuang; Han, Yu-Jiao; Chen, Lin-Xing

Here we report that high-throughput sequencing is expanding our knowledge of microbial diversity in the environment. Still, understanding the metabolic potentials and ecological roles of rare and uncultured microbes in natural communities remains a major challenge. To this end, we applied a ‘divide and conquer’ strategy that partitioned a massive metagenomic data set (>100 Gbp) into subsets based on K-mer frequency in sequence assembly to a low-diversity acid mine drainage (AMD) microbial community and, by integrating with an additional metatranscriptomic assembly, successfully obtained 11 draft genomes most of which represent yet uncultured and/or rare taxa (relative abundance <1%). We report the first genome of a naturally occurring Ferrovum population (relative abundance >90%) and its metabolic potentials and gene expression profile, providing initial molecular insights into the ecological role of these lesser known, but potentially important, microorganisms in the AMD environment. Gene transcriptional analysis of the active taxa revealed major metabolic capabilities executed in situ, including carbon- and nitrogen-related metabolisms associated with syntrophic interactions, iron and sulfur oxidation, which are key in energy conservation and AMD generation, and the mechanisms of adaptation and response to the environmental stresses (heavy metals, low pH and oxidative stress). Remarkably, nitrogen fixation and sulfur oxidation were performed by the rare taxa, indicating their critical roles in the overall functioning and assembly of the AMD community. Finally, our study demonstrates the potential of the ‘divide and conquer’ strategy in high-throughput sequencing data assembly for genome reconstruction and functional partitioning analysis of both dominant and rare species in natural microbial assemblages.
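The abstract says only that reads were partitioned into subsets based on K-mer frequency; it gives no algorithmic detail. The toy sketch below shows one way such a split could work, assuming that reads from abundant genomes are made of k-mers seen many times in the data set. The median-abundance rule, the cut-off, and all names are hypothetical and do not reproduce the authors' pipeline.

```python
from collections import Counter
from statistics import median


def kmers(read, k):
    """Return all overlapping k-mers of a read."""
    return [read[i:i + k] for i in range(len(read) - k + 1)]


def partition_reads(reads, k, cutoff):
    """Split reads into (high, low) bins by the median abundance of their k-mers."""
    abundance = Counter(km for read in reads for km in kmers(read, k))
    high, low = [], []
    for read in reads:
        read_kmers = kmers(read, k)
        if not read_kmers:
            continue  # read shorter than k: cannot be scored
        score = median(abundance[km] for km in read_kmers)
        (high if score >= cutoff else low).append(read)
    return high, low


# Toy reads (hypothetical): three copies from an abundant genome, one from a rare genome.
reads = ["ACGTAC", "ACGTAC", "ACGTAC", "GGCATT"]
high, low = partition_reads(reads, k=4, cutoff=2)
print(len(high), len(low))
```

Each bin could then be assembled separately, so that low-coverage reads from rare taxa are not swamped by reads from the dominant population.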
The reduced genomes of Parcubacteria (OD1) contain signatures of a symbiotic lifestyle

PubMed Central

Nelson, William C.; Stegen, James C.

2015-01-01

Candidate phylum OD1 bacteria (also referred to as Parcubacteria) have been identified in a broad range of anoxic environments through community survey analysis. Although none of these species have been isolated in the laboratory, several genome sequences have been reconstructed from metagenomic sequence data and single-cell sequencing. The organisms have small (generally <1 Mb) genomes with severely reduced metabolic capabilities. We have reconstructed 8 partial to near-complete OD1 genomes from oxic groundwater samples, and compared them against existing genomic data. The conserved core gene set comprises 202 genes, or ~28% of the genomic complement. “Housekeeping” genes and genes for biosynthesis of peptidoglycan and Type IV pilus production are conserved. Gene sets for biosynthesis of cofactors, amino acids, nucleotides, and fatty acids are absent entirely or greatly reduced. The only aspects of energy metabolism conserved are the non-oxidative branch of the pentose-phosphate shunt and central glycolysis. These organisms also lack some activities conserved in almost all other known bacterial genomes, including signal recognition particle, pseudouridine synthase A, and FAD synthase. Pan-genome analysis indicates a broad genotypic diversity and perhaps a highly fluid gene complement, indicating historical adaptation to a wide range of growth environments and a high degree of specialization. The genomes were examined for signatures suggesting either a free-living, streamlined lifestyle, or a symbiotic lifestyle. The lack of biosynthetic capabilities and DNA repair, along with the presence of potential attachment and adhesion proteins suggest that the Parcubacteria are ectosymbionts or parasites of other organisms. The wide diversity of genes that potentially mediate cell-cell contact suggests a broad range of partner/prey organisms across the phylum. PMID:26257709
The reduced genomes of Parcubacteria (OD1) contain signatures of a symbiotic lifestyle

DOE PAGES

Nelson, William C.; Stegen, James C.

2015-07-21

Candidate phylum OD1 bacteria (also referred to as Parcubacteria) have been identified in a broad range of anoxic environments through community survey analysis. Although none of these species have been isolated in the laboratory, several genome sequences have been reconstructed from metagenomic sequence data and single-cell sequencing. The organisms have small (generally <1 Mb) genomes with severely reduced metabolic capabilities. We have reconstructed 8 partial to near-complete OD1 genomes from oxic groundwater samples, and compared them against existing genomic data. The conserved core gene set comprises 202 genes, or ~28% of the genomic complement. “Housekeeping” genes and genes for biosynthesis of peptidoglycan and Type IV pilus production are conserved. Gene sets for biosynthesis of cofactors, amino acids, nucleotides, and fatty acids are absent entirely or greatly reduced. The only aspects of energy metabolism conserved are the non-oxidative branch of the pentose-phosphate shunt and central glycolysis. These organisms also lack some activities conserved in almost all other known bacterial genomes, including signal recognition particle, pseudouridine synthase A, and FAD synthase. Pan-genome analysis indicates a broad genotypic diversity and perhaps a highly fluid gene complement, indicating historical adaptation to a wide range of growth environments and a high degree of specialization. The genomes were examined for signatures suggesting either a free-living, streamlined lifestyle, or a symbiotic lifestyle. The lack of biosynthetic capabilities and DNA repair, along with the presence of potential attachment and adhesion proteins suggest that the Parcubacteria are ectosymbionts or parasites of other organisms. The wide diversity of genes that potentially mediate cell-cell contact suggests a broad range of partner/prey organisms across the phylum.
Ecological roles of dominant and rare prokaryotes in acid mine drainage revealed by metagenomics and metatranscriptomics

DOE PAGES

Hua, Zheng-Shuang; Han, Yu-Jiao; Chen, Lin-Xing; ...

2014-11-07

Here we report that high-throughput sequencing is expanding our knowledge of microbial diversity in the environment. Still, understanding the metabolic potentials and ecological roles of rare and uncultured microbes in natural communities remains a major challenge. To this end, we applied a ‘divide and conquer’ strategy that partitioned a massive metagenomic data set (>100 Gbp) into subsets based on K-mer frequency in sequence assembly to a low-diversity acid mine drainage (AMD) microbial community and, by integrating with an additional metatranscriptomic assembly, successfully obtained 11 draft genomes most of which represent yet uncultured and/or rare taxa (relative abundance <1%). We report the first genome of a naturally occurring Ferrovum population (relative abundance >90%) and its metabolic potentials and gene expression profile, providing initial molecular insights into the ecological role of these lesser known, but potentially important, microorganisms in the AMD environment. Gene transcriptional analysis of the active taxa revealed major metabolic capabilities executed in situ, including carbon- and nitrogen-related metabolisms associated with syntrophic interactions, iron and sulfur oxidation, which are key in energy conservation and AMD generation, and the mechanisms of adaptation and response to the environmental stresses (heavy metals, low pH and oxidative stress). Remarkably, nitrogen fixation and sulfur oxidation were performed by the rare taxa, indicating their critical roles in the overall functioning and assembly of the AMD community. Finally, our study demonstrates the potential of the ‘divide and conquer’ strategy in high-throughput sequencing data assembly for genome reconstruction and functional partitioning analysis of both dominant and rare species in natural microbial assemblages.
The reduced genomes of Parcubacteria (OD1) contain signatures of a symbiotic lifestyle

DOE Office of Scientific and Technical Information (OSTI.GOV)

Nelson, William C.; Stegen, James C.

2015-07-21

Candidate phylum OD1 bacteria (also referred to as Parcubacteria) have been identified in a broad range of anoxic environments through community survey analysis. Although none of these species have been isolated in the laboratory, several genome sequences have been reconstructed from metagenomic sequence data and single-cell sequencing. The organisms have small (generally <1 Mb) genomes with severely reduced metabolic capabilities. We have reconstructed 8 partial to near-complete OD1 genomes from oxic groundwater samples, and compared them against existing genomic data. The conserved core gene set comprises 202 genes, or ~28% of the genomic complement. ‘Housekeeping’ genes and genes for biosynthesis of peptidoglycan and Type IV pilus production are conserved. Gene sets for biosynthesis of cofactors, amino acids, nucleotides and fatty acids are absent entirely or greatly reduced. The only aspects of energy metabolism conserved are the non-oxidative branch of the pentose-phosphate shunt and central glycolysis. These organisms also lack some activities conserved in almost all other known bacterial genomes, including signal recognition particle, pseudouridine synthase A, and FAD synthase. Pan-genome analysis indicates a broad genotypic diversity and perhaps a highly fluid gene complement, indicating historical adaptation to a wide range of growth environments and a high degree of specialization. The genomes were examined for signatures suggesting either a free-living, streamlined lifestyle or a symbiotic lifestyle. The lack of biosynthetic capabilities and DNA repair, along with the presence of potential attachment and adhesion proteins suggest the Parcubacteria are ectosymbionts or parasites of other organisms. The wide diversity of genes that potentially mediate cell-cell contact suggests a broad range of partner/prey organisms across the phylum.
Genetic and Chemical Profiling of Gymnema sylvestre Accessions from Central India: Its Implication for Quality Control and Therapeutic Potential of Plant

PubMed Central

Verma, Ashutosh Kumar; Dhawan, Sunita Singh; Singh, Seema; Bharati, Kumar Avinash; Jyotsana

2016-01-01

Background: Gymnema sylvestre, a vulnerable plant species, is mentioned in Indian Pharmacopeia as an antidiabetic drug. Objective: Study of genetic and chemical diversity and its implications in accessions of G. sylvestre. Materials and Methods: Fourteen accessions of G. sylvestre were collected from Central India, and their genetic and chemical diversity was assessed using ISSR (inter simple sequence repeat) and HPLC (high performance liquid chromatography) fingerprinting methods. Results: Among the 40 ISSR primers screened, 15 were found to be polymorphic and collectively produced nine unique accession-specific bands. The maximum and minimum numbers of amplicons were noted for ISSR-15 and ISSR-11, respectively. ISSR-11 and ISSR-13 revealed 100% polymorphism. HPLC chromatograms showed that the accessions possess secondary metabolites of mid-polarity with considerable variability. Unknown peaks with retention times of 2.63, 3.41, 23.83, 24.50, and 44.67 were found to be of universal type. Comparative hierarchical clustering analysis based on the aforesaid fingerprints indicates that both techniques have equal potential to discriminate accessions according to the percentage of gymnemic acid in their leaf tissue. The second approach was noted to be more efficient for separating accessions according to their agro-climatic/collection site. Conclusion: Highly polymorphic ISSRs could be utilized as molecular probes for further selection of high gymnemic acid yielding accessions. The observed accession-specific bands may be used as descriptors for the protection of plant accessions and converted into sequence tagged site markers. The five identified universal type peaks could be helpful in the identification of various G. sylvestre-based herbal preparations. SUMMARY: Nine accession-specific unique bands. Five marker peaks for G. sylvestre. Suitability of genetic and chemical fingerprinting. Abbreviations used: HPLC: High Performance Liquid Chromatography, ISSR: Inter Simple Sequence Repeats, CTAB: Cetyl Trimethylammonium Bromide, DNTP: Deoxynucleotide Triphosphates. PMID:27761067
Genetic and Chemical Profiling of Gymnema sylvestre Accessions from Central India: Its Implication for Quality Control and Therapeutic Potential of Plant.

PubMed

Verma, Ashutosh Kumar; Dhawan, Sunita Singh; Singh, Seema; Bharati, Kumar Avinash; Jyotsana

2016-07-01

Gymnema sylvestre, a vulnerable plant species, is mentioned in Indian Pharmacopeia as an antidiabetic drug. Study of genetic and chemical diversity and its implications in accessions of G. sylvestre. Fourteen accessions of G. sylvestre were collected from Central India, and their genetic and chemical diversity was assessed using ISSR (inter simple sequence repeat) and HPLC (high performance liquid chromatography) fingerprinting methods. Among the 40 ISSR primers screened, 15 were found to be polymorphic and collectively produced nine unique accession-specific bands. The maximum and minimum numbers of amplicons were noted for ISSR-15 and ISSR-11, respectively. ISSR-11 and ISSR-13 revealed 100% polymorphism. HPLC chromatograms showed that the accessions possess secondary metabolites of mid-polarity with considerable variability. Unknown peaks with retention times of 2.63, 3.41, 23.83, 24.50, and 44.67 were found to be of universal type. Comparative hierarchical clustering analysis based on the aforesaid fingerprints indicates that both techniques have equal potential to discriminate accessions according to the percentage of gymnemic acid in their leaf tissue. The second approach was noted to be more efficient for separating accessions according to their agro-climatic/collection site. Highly polymorphic ISSRs could be utilized as molecular probes for further selection of high gymnemic acid yielding accessions. The observed accession-specific bands may be used as descriptors for the protection of plant accessions and converted into sequence tagged site markers. The five identified universal type peaks could be helpful in the identification of various G. sylvestre-based herbal preparations. Nine accession-specific unique bands. Five marker peaks for G. sylvestre. Suitability of genetic and chemical fingerprinting. Abbreviations used: HPLC: High Performance Liquid Chromatography, ISSR: Inter Simple Sequence Repeats, CTAB: Cetyl Trimethylammonium Bromide, DNTP: Deoxynucleotide Triphosphates.
A Signature in HIV-1 Envelope Leader Peptide Associated with Transition from Acute to Chronic Infection Impacts Envelope Processing and Infectivity

PubMed Central

Asmal, Mohammed; Hellmann, Ina; Liu, Weimin; Keele, Brandon F.; Perelson, Alan S.; Bhattacharya, Tanmoy; Gnanakaran, S.; Daniels, Marcus; Haynes, Barton F.; Korber, Bette T.; Hahn, Beatrice H.; Shaw, George M.; Letvin, Norman L.

2011-01-01

Mucosal transmission of the human immunodeficiency virus (HIV) results in a bottleneck in viral genetic diversity. Gnanakaran and colleagues used a computational strategy to identify signature amino acids at particular positions in Envelope that were associated either with transmitted sequences sampled very early in infection, or sequences sampled during chronic infection. Among the strongest signatures observed was an enrichment for the stable presence of histidine at position 12 at transmission and in early infection, and a recurrent loss of histidine at position 12 in chronic infection. This amino acid lies within the leader peptide of Envelope, a region of the protein that has been shown to influence envelope glycoprotein expression and virion infectivity. We show a strong association between a positively charged amino acid like histidine at position 12 in transmitted/founder viruses with more efficient trafficking of the nascent envelope polypeptide to the endoplasmic reticulum and higher steady-state glycoprotein expression compared to viruses that have a non-basic position 12 residue, a substitution that was enriched among viruses sampled from chronically infected individuals. When expressed in the context of other viral proteins, transmitted envelopes with a basic amino acid position 12 were incorporated at higher density into the virus and exhibited higher infectious titers than did non-signature envelopes. These results support the potential utility of using a computational approach to examine large viral sequence data sets for functional signatures and indicate the importance of Envelope expression levels for efficient HIV transmission. PMID:21876761
Molecular classification based on apomorphic amino acids (Arthropoda, Hexapoda): Integrative taxonomy in the era of phylogenomics.

PubMed

Wu, Hao-Yang; Wang, Yan-Hui; Xie, Qiang; Ke, Yun-Ling; Bu, Wen-Jun

2016-06-17

With the great development of sequencing technologies and systematic methods, our understanding of evolutionary relationships at deeper levels within the tree of life has greatly improved over the last decade. However, the current taxonomic methodology is insufficient to describe the growing levels of diversity in both a standardised and general way due to the limitations of using only morphological traits to describe clades. Herein, we propose the idea of a molecular classification based on hierarchical and discrete amino acid characters. Clades are classified based on the results of phylogenetic analyses and described using amino acids with group specificity in phylograms. Practices based on the recently published phylogenomic datasets of insects together with 15 de novo sequenced transcriptomes in this study demonstrate that such a methodology can accommodate various higher ranks of taxonomy. Such an approach has the advantage of describing organisms in a standard and discrete way within a phylogenetic framework, thereby facilitating the recognition of clades from the view of the whole lineage, as indicated by PhyloCode. By combining identification keys and phylogenies, the molecular classification based on hierarchical and discrete characters may greatly boost the progress of integrative taxonomy.

Molecular classification based on apomorphic amino acids (Arthropoda, Hexapoda): Integrative taxonomy in the era of phylogenomics

PubMed Central

Wu, Hao-Yang; Wang, Yan-Hui; Xie, Qiang; Ke, Yun-Ling; Bu, Wen-Jun

2016-01-01

With the great development of sequencing technologies and systematic methods, our understanding of evolutionary relationships at deeper levels within the tree of life has greatly improved over the last decade. However, the current taxonomic methodology is insufficient to describe the growing levels of diversity in both a standardised and general way due to the limitations of using only morphological traits to describe clades. Herein, we propose the idea of a molecular classification based on hierarchical and discrete amino acid characters. Clades are classified based on the results of phylogenetic analyses and described using amino acids with group specificity in phylograms. Practices based on the recently published phylogenomic datasets of insects together with 15 de novo sequenced transcriptomes in this study demonstrate that such a methodology can accommodate various higher ranks of taxonomy. Such an approach has the advantage of describing organisms in a standard and discrete way within a phylogenetic framework, thereby facilitating the recognition of clades from the view of the whole lineage, as indicated by PhyloCode. By combining identification keys and phylogenies, the molecular classification based on hierarchical and discrete characters may greatly boost the progress of integrative taxonomy. PMID:27312960
Endophyte Microbiome Diversity in Micropropagated Atriplex canescens and Atriplex torreyi var griffithsii

PubMed Central

Lucero, Mary E.; Unc, Adrian; Cooke, Peter; Dowd, Scot; Sun, Shulei

2011-01-01

Microbial diversity associated with micropropagated Atriplex species was assessed using microscopy, isolate culturing, and sequencing. Light, electron, and confocal microscopy revealed microbial cells in aseptically regenerated leaves and roots. Clone libraries and tag-encoded FLX amplicon pyrosequencing (TEFAP) analysis amplified sequences from callus homologous to diverse fungal and bacterial taxa. Culturing isolated some seed borne endophyte taxa which could be readily propagated apart from the host. Microbial cells were observed within biofilm-like residues associated with plant cell surfaces and intercellular spaces. Various universal primers amplified both plant and microbial sequences, with different primers revealing different patterns of fungal diversity. Bacterial and fungal TEFAP followed by alignment with sequences from curated databases revealed 7 bacterial and 17 ascomycete taxa in A. canescens, and 5 bacterial taxa in A. torreyi. Additional diversity was observed among isolates and clone libraries. Micropropagated Atriplex retains a complex, intimately associated microbiome which includes diverse strains well poised to interact in manners that influence host physiology. Microbiome analysis was facilitated by high throughput sequencing methods, but primer biases continue to limit recovery of diverse sequences from even moderately complex communities. PMID:21437280
Development of Genomic Microsatellite Markers in Carthamus tinctorius L. (Safflower) Using Next Generation Sequencing and Assessment of Their Cross-Species Transferability and Utility for Diversity Analysis

PubMed Central

Variath, Murali Tottekkad; Joshi, Gopal; Bali, Sapinder; Agarwal, Manu; Kumar, Amar; Jagannath, Arun; Goel, Shailendra

2015-01-01

Background: Safflower (Carthamus tinctorius L.), an Asteraceae member, yields high quality edible oil rich in unsaturated fatty acids and is resilient to dry conditions. The crop holds tremendous potential for improvement through concerted molecular breeding programs due to the availability of significant genetic and phenotypic diversity. Genomic resources that could facilitate such breeding programs remain largely underdeveloped in the crop. The present study was initiated to develop a large set of novel microsatellite markers for safflower using next generation sequencing. Principal Findings: Low throughput genome sequencing of safflower was performed using Illumina paired end technology providing ~3.5X coverage of the genome. Analysis of sequencing data allowed identification of 23,067 regions harboring perfect microsatellite loci. The safflower genome was found to be rich in dinucleotide repeats followed by tri-, tetra-, penta- and hexa-nucleotides. Primer pairs were designed for 5,716 novel microsatellite sequences with repeat length ≥ 20 bases and optimal flanking regions. A subset of 325 microsatellite loci was tested for amplification, of which 294 loci produced robust amplification. The validated primers were used for assessment of 23 safflower accessions belonging to diverse agro-climatic zones of the world leading to identification of 93 polymorphic primers (31.6%). The numbers of observed alleles at each locus ranged from two to four and mean polymorphism information content was found to be 0.3075. The polymorphic primers were tested for cross-species transferability on nine wild relatives of cultivated safflower. All primers except one showed amplification in at least two wild species while 25 primers amplified across all the nine species. The UPGMA dendrogram clustered C. tinctorius accessions and wild species separately into two major groups. The proposed progenitor species of safflower, C. oxyacantha and C. palaestinus were genetically closer to cultivated safflower and formed a distinct cluster. The cluster analysis also distinguished diploid and tetraploid wild species of safflower. Conclusion: Next generation sequencing of safflower genome generated a large set of microsatellite markers. The novel markers developed in this study will add to the existing repertoire of markers and can be used for diversity analysis, synteny studies, construction of linkage maps and marker-assisted selection. PMID:26287743
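The mean polymorphism information content (PIC) of 0.3075 reported in the safflower record is a per-locus statistic computed from allele frequencies. The abstract does not say which formula was used; the sketch below assumes the commonly used definition PIC = 1 - Σp_i² - ΣΣ 2·p_i²·p_j² (summed over allele pairs i < j). The function name and the example frequencies are illustrative and are not taken from the study.

```python
from typing import Sequence


def polymorphism_information_content(freqs: Sequence[float]) -> float:
    """PIC for one locus, assuming the definition
    1 - sum(p_i^2) - sum_{i<j} 2 * p_i^2 * p_j^2 (an assumption; see lead-in)."""
    if any(p < 0 for p in freqs):
        raise ValueError("allele frequencies must be non-negative")
    if abs(sum(freqs) - 1.0) > 1e-9:
        raise ValueError("allele frequencies must sum to 1")
    squares = [p * p for p in freqs]
    homozygosity = sum(squares)
    # Cross term over every unordered pair of distinct alleles.
    cross = sum(
        2.0 * squares[i] * squares[j]
        for i in range(len(squares))
        for j in range(i + 1, len(squares))
    )
    return 1.0 - homozygosity - cross


if __name__ == "__main__":
    # Hypothetical locus with two equally frequent alleles.
    print(polymorphism_information_content([0.5, 0.5]))
```

Under this definition a locus with a single allele has a PIC of 0, and the value rises as alleles become more numerous and more evenly distributed, which is consistent with the modest mean reported for loci carrying only two to four alleles.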
The LANL hemorrhagic fever virus database, a new platform for analyzing biothreat viruses.

PubMed

Kuiken, Carla; Thurmond, Jim; Dimitrijevic, Mira; Yoon, Hyejin

2012-01-01

Hemorrhagic fever viruses (HFVs) are a diverse set of over 80 viral species, found in 10 different genera comprising five different families: arena-, bunya-, flavi-, filo- and togaviridae. All these viruses are highly variable and evolve rapidly, making them elusive targets for the immune system and for vaccine and drug design. About 55,000 HFV sequences exist in the public domain today. A central website that provides annotated sequences and analysis tools will be helpful to HFV researchers worldwide. The HFV sequence database collects and stores sequence data and provides a user-friendly search interface and a large number of sequence analysis tools, following the model of the highly regarded and widely used Los Alamos HIV database [Kuiken, C., B. Korber, and R.W. Shafer, HIV sequence databases. AIDS Rev, 2003. 5: p. 52-61]. The database uses an algorithm that aligns each sequence to a species-wide reference sequence. The NCBI RefSeq database [Sayers et al. (2011) Database resources of the National Center for Biotechnology Information. Nucleic Acids Res., 39, D38-D51.] is used for this; if a reference sequence is not available, a Blast search finds the best candidate. Using this method, sequences in each genus can be retrieved pre-aligned. The HFV website can be accessed via http://hfv.lanl.gov.
Decreased plant productivity resulting from plant group removal experiment constrains soil microbial functional diversity.

PubMed

Zhang, Ximei; Johnston, Eric R; Barberán, Albert; Ren, Yi; Lü, Xiaotao; Han, Xingguo

2017-10-01

Anthropogenic environmental changes are accelerating the rate of biodiversity loss on Earth. Plant diversity loss is predicted to reduce soil microbial diversity primarily due to the decreased variety of carbon/energy resources. However, this intuitive hypothesis is supported by sparse empirical evidence, and most underlying mechanisms remain underexplored or obscure altogether. We constructed four diversity gradients (0-3) in a five-year plant functional group removal experiment in a steppe ecosystem in Inner Mongolia, China, and quantified microbial taxonomic and functional diversity with shotgun metagenome sequencing. The treatments had little effect on microbial taxonomic diversity, but were found to decrease functional gene diversity. However, the observed decrease in functional gene diversity was more attributable to a loss in plant productivity, rather than to the loss of any individual plant functional group per se. Reduced productivity limited fresh plant resources supplied to microorganisms, and thus, intensified the pressure of ecological filtering, favoring genes responsible for energy production/conversion, material transport/metabolism and amino acid recycling, and accordingly disfavored many genes with other functions. Furthermore, microbial respiration was correlated with the variation in functional composition but not taxonomic composition. Overall, the amount of carbon/energy resources driving microbial gene diversity was identified to be the critical linkage between above- and belowground communities, contrary to the traditional framework of linking plant clade/taxonomic diversity to microbial taxonomic diversity. © 2017 John Wiley & Sons Ltd.
Class IIa Bacteriocins: Diversity and New Developments

PubMed Central

Cui, Yanhua; Zhang, Chao; Wang, Yunfeng; Shi, John; Zhang, Lanwei; Ding, Zhongqing; Qu, Xiaojun; Cui, Hongyu

2012-01-01

Class IIa bacteriocins are heat-stable, unmodified peptides with a conserved amino acid sequence YGNGV on their N-terminal domains, and have received much attention due to their generally recognized as safe (GRAS) status, their high biological activity, and their excellent heat stability. They are promising and attractive agents that could function as biopreservatives in the food industry. This review summarizes the new developments in the area of class IIa bacteriocins and aims to provide up-to-date information that can be used in designing future research. PMID:23222636
Diversity of the microbiota involved in wine and organic apple cider submerged vinegar production as revealed by DHPLC analysis and next-generation sequencing.

PubMed

Trček, Janja; Mahnič, Aleksander; Rupnik, Maja

2016-04-16

Unfiltered vinegar samples were collected from three oxidation cycles of the submerged industrial production of each of red wine and organic apple cider vinegars at a Slovene vinegar-producing company. The samples were systematically collected from the beginning to the end of an oxidation cycle and used for culture-independent microbial analyses carried out by denaturing high pressure liquid chromatography (DHPLC) and Illumina MiSeq sequencing of 16S rRNA gene variable regions. Both approaches showed a very homogeneous bacterial structure during wine vinegar production but a more heterogeneous one during organic apple cider vinegar production. In all wine vinegar samples Komagataeibacter oboediens (formerly Gluconacetobacter oboediens) was the predominant species. In apple cider vinegar the acetic acid and lactic acid bacteria were the two major groups of bacteria. The acetic acid bacterial consortium was composed of Acetobacter and Komagataeibacter, with the Komagataeibacter genus outcompeting Acetobacter in all apple cider vinegar samples at the end of the oxidation cycle. Among the lactic acid bacterial consortium two dominant genera were identified, Lactobacillus and Oenococcus, with Oenococcus prevailing with increasing concentration of acetic acid in vinegars. Unexpectedly, a minor genus of the acetic acid bacterial consortium in organic apple cider vinegar was Gluconobacter, suggesting a possible development of the Gluconobacter population with a tolerance against ethanol and acetic acid. Among the accompanying bacteria of the wine vinegar, the genus Rhodococcus was detected, but it decreased substantially by the end of oxidation cycles. Copyright © 2016 Elsevier B.V. All rights reserved.
Molecular Aspects and Comparative Genomics of Bacteriophage Endolysins

PubMed Central

Oliveira, Hugo; Melo, Luís D. R.; Santos, Sílvio B.; Nóbrega, Franklin L.; Ferreira, Eugénio C.; Cerca, Nuno; Azeredo, Joana

2013-01-01

Phages are recognized as the most abundant and diverse entities on the planet. Their diversity is determined predominantly by their dynamic adaptation capacities when confronted with different selective pressures in an endless cycle of coevolution with a widespread group of bacterial hosts. At the end of the infection cycle, progeny virions are confronted with a rigid cell wall that hinders their release into the environment and the opportunity to start a new infection cycle. Consequently, phages encode hydrolytic enzymes, called endolysins, to digest the peptidoglycan. In this work, we bring to light all phage endolysins found in completely sequenced double-stranded nucleic acid phage genomes and uncover clues that explain the phage-endolysin-host ecology that led phages to recruit unique and specialized endolysins. PMID:23408602
Genes encoding two Theileria parva antigens recognized by CD8+ T-cells exhibit sequence diversity in South Sudanese cattle populations but the majority of alleles are similar to the Muguga component of the live vaccine cocktail

PubMed Central

Pelle, Roger; Mwacharo, Joram M.; Njahira, Moses N.; Marcellino, Wani L.; Kiara, Henry; Malak, Agol K.; EL Hussein, Abdel Rahim M.; Bishop, Richard; Skilton, Robert A.

2017-01-01

East Coast fever (ECF), caused by Theileria parva infection, is a frequently fatal disease of cattle in eastern, central and southern Africa, and an emerging disease in South Sudan. Immunization using the infection and treatment method (ITM) is increasingly being used for control in countries affected by ECF, but not yet in South Sudan. It has been reported that CD8+ T-cell lymphocytes specific for parasitized cells play a central role in the immunity induced by ITM and a number of T. parva antigens recognized by parasite-specific CD8+ T-cells have been identified. In this study we determined the sequence diversity among two of these antigens, Tp1 and Tp2, which are under evaluation as candidates for inclusion in a sub-unit vaccine. T. parva samples (n = 81) obtained from cattle in four geographical regions of South Sudan were studied for sequence polymorphism in partial sequences of the Tp1 and Tp2 genes. Eight positions (1.97%) in Tp1 and 78 positions (15.48%) in Tp2 were shown to be polymorphic, giving rise to four and 14 antigen variants in Tp1 and Tp2, respectively. The overall nucleotide diversity in the Tp1 and Tp2 genes was π = 1.65% and π = 4.76%, respectively. The parasites were sampled from regions approximately 300 km apart, but there was limited evidence for genetic differentiation between populations. Analyses of the sequences revealed limited numbers of amino acid polymorphisms both overall and in residues within the mapped CD8+ T-cell epitopes. Although novel epitopes were identified in the samples from South Sudan, a large number of the samples harboured several epitopes in both antigens that were similar to those in the T. parva Muguga reference stock, which is a key component in the widely used live vaccine cocktail. PMID:28231338
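The nucleotide diversity values reported for Tp1 and Tp2 (π = 1.65% and π = 4.76%) describe the average proportion of sites that differ between two sequences drawn from the sample. A minimal sketch of that calculation follows, assuming equal-length, pre-aligned sequences and no special handling of gaps or ambiguous bases; the function name and the toy sequences are hypothetical, and the study's own software and settings are not stated in the abstract.

```python
from itertools import combinations
from typing import Sequence


def nucleotide_diversity(seqs: Sequence[str]) -> float:
    """Average number of pairwise differences per site across all sequence pairs."""
    if len(seqs) < 2:
        raise ValueError("need at least two sequences")
    length = len(seqs[0])
    if length == 0 or any(len(s) != length for s in seqs):
        raise ValueError("sequences must be aligned, non-empty and of equal length")
    pairs = list(combinations(seqs, 2))
    # Count mismatching positions for every unordered pair of sequences.
    total_differences = sum(
        sum(a != b for a, b in zip(s1, s2)) for s1, s2 in pairs
    )
    return total_differences / (len(pairs) * length)


if __name__ == "__main__":
    # Toy alignment, not data from the study.
    toy = ["AAAA", "AAAT", "AATT"]
    print(f"pi = {nucleotide_diversity(toy):.4f}")
```

Multiplying the result by 100 gives the percentage form used in the record.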
Sequence analyses of fimbriae subunit FimA proteins on Actinomyces naeslundii genospecies 1 and 2 and Actinomyces odontolyticus with variant carbohydrate binding specificities

PubMed Central

Drobni, Mirva; Hallberg, Kristina; Öhman, Ulla; Birve, Anna; Persson, Karina; Johansson, Ingegerd; Strömberg, Nicklas

2006-01-01

Background: Actinomyces naeslundii genospecies 1 and 2 express type-2 fimbriae (FimA subunit polymers) with variant Galβ binding specificities and Actinomyces odontolyticus a sialic acid specificity to colonize different oral surfaces. However, the fimbrial nature of the sialic acid binding property and sequence information about FimA proteins from multiple strains are lacking. Results: Here we have sequenced fimA genes from strains of A. naeslundii genospecies 1 (n = 4) and genospecies 2 (n = 4), both of which harboured variant Galβ-dependent hemagglutination (HA) types, and from A. odontolyticus PK984 with a sialic acid-dependent HA pattern. Three unique subtypes of FimA proteins with 63.8–66.4% sequence identity were present in strains of A. naeslundii genospecies 1 and 2 and A. odontolyticus. The generally high FimA sequence identity (>97.2%) within a genospecies revealed species specific sequences or segments that coincided with binding specificity. All three FimA protein variants contained a signal peptide, pilin motif, E box, proline-rich segment and an LPXTG sorting motif among other conserved segments for secretion, assembly and sorting of fimbrial proteins. The highly conserved pilin, E box and LPXTG motifs are present in fimbriae proteins from other Gram-positive bacteria. Moreover, only strains of genospecies 1 were agglutinated with type-2 fimbriae antisera derived from A. naeslundii genospecies 1 strain 12104, emphasizing that the overall folding of FimA may generate different functionalities. Western blot analyses with FimA antisera revealed monomers and oligomers of FimA in whole cell protein extracts and a purified recombinant FimA preparation, indicating a sortase-independent oligomerization of FimA. Conclusion: The genus Actinomyces involves a diversity of unique FimA proteins with conserved pilin, E box and LPXTG motifs, depending on subspecies and associated binding specificity. In addition, a sortase independent oligomerization of FimA subunit proteins in solution was indicated. PMID:16686953
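The sequence identity figures quoted for FimA (63.8–66.4% between subtypes, >97.2% within a genospecies) are pairwise comparisons of aligned protein sequences. The sketch below shows one common way to compute such a value. It assumes the two sequences are already aligned, uses '-' as the gap character, and excludes gapped columns from the denominator; other conventions exist, and the abstract does not state which one the authors used. The function name and example strings are hypothetical.

```python
def percent_identity(aligned_a: str, aligned_b: str, gap: str = "-") -> float:
    """Percent identity over alignment columns where neither sequence has a gap."""
    if len(aligned_a) != len(aligned_b):
        raise ValueError("aligned sequences must have the same length")
    compared = 0
    matches = 0
    for x, y in zip(aligned_a, aligned_b):
        if x == gap or y == gap:
            continue  # assumption: gapped columns are not counted
        compared += 1
        if x == y:
            matches += 1
    if compared == 0:
        raise ValueError("no ungapped columns to compare")
    return 100.0 * matches / compared


if __name__ == "__main__":
    # Toy aligned peptides, not FimA data.
    print(percent_identity("MKT-LV", "MKTALI"))
```

Because the choice of denominator (ungapped columns, alignment length, or shorter sequence length) changes the result, identity values from different studies are only comparable when the same convention is used.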
Evolution of the deaminase fold and multiple origins of eukaryotic editing and mutagenic nucleic acid deaminases from bacterial toxin systems

PubMed Central

Iyer, Lakshminarayan M.; Zhang, Dapeng; Rogozin, Igor B.; Aravind, L.

2011-01-01

The deaminase-like fold includes, in addition to nucleic acid/nucleotide deaminases, several catalytic domains such as the JAB domain, and others involved in nucleotide and ADP-ribose metabolism. Using sensitive sequence and structural comparison methods, we develop a comprehensive natural classification of the deaminase-like fold and show that its ancestral version was likely to operate on nucleotides or nucleic acids. Consequently, we present evidence that a specific group of JAB domains are likely to possess a DNA repair function, distinct from the previously known deubiquitinating peptidase activity. We also identified numerous previously unknown clades of nucleic acid deaminases. Using inference based on contextual information, we suggest that most of these clades are toxin domains of two distinct classes of bacterial toxin systems, namely polymorphic toxins implicated in bacterial interstrain competition and those that target distantly related cells. Genome context information suggests that these toxins might be delivered via diverse secretory systems, such as Type V, Type VI, PVC and a novel PrsW-like intramembrane peptidase-dependent mechanism. We propose that certain deaminase toxins might be deployed by diverse extracellular and intracellular pathogens, as well as endosymbionts, as effectors targeting nucleic acids of host cells. Our analysis suggests that these toxin deaminases have been acquired by eukaryotes on several independent occasions and recruited as organellar or nucleo-cytoplasmic RNA modifiers, operating on tRNAs, mRNAs and short non-coding RNAs, and also as mutators of hyper-variable genes, viruses and selfish elements. This scenario potentially explains the origin of mutagenic AID/APOBEC-like deaminases, including novel versions from Caenorhabditis, Nematostella and diverse algae and a large class of fast-evolving fungal deaminases. These observations greatly expand the distribution of possible unidentified mutagenic processes catalyzed by nucleic acid deaminases. PMID:21890906
Genome-Wide Analysis of the RAV Family in Soybean and Functional Identification of GmRAV-03 Involvement in Salt and Drought Stresses and Exogenous ABA Treatment

PubMed Central

Zhao, Shu-Ping; Xu, Zhao-Shi; Zheng, Wei-Jun; Zhao, Wan; Wang, Yan-Xia; Yu, Tai-Fei; Chen, Ming; Zhou, Yong-Bin; Min, Dong-Hong; Ma, You-Zhi; Chai, Shou-Cheng; Zhang, Xiao-Hong

2017-01-01

Transcription factors play vital roles in plant growth and in plant responses to abiotic stresses. The RAV transcription factors contain a B3 DNA binding domain and/or an APETALA2 (AP2) DNA binding domain. Although genome-wide analyses of RAV family genes have been performed in several species, little is known about the family in soybean (Glycine max L.). In this study, a total of 13 RAV genes, named GmRAVs, were identified in the soybean genome. We predicted and analyzed the amino acid compositions, phylogenetic relationships, and folding states of conserved domain sequences of soybean RAV transcription factors. These soybean RAV transcription factors were phylogenetically clustered into three classes based on their amino acid sequences. Subcellular localization analysis revealed that the soybean RAV proteins were located in the nucleus. The expression patterns of the 13 RAV genes were analyzed by quantitative real-time PCR. Under drought stress, the RAV genes showed diverse expression, being either up- or down-regulated. Following NaCl treatments, all RAV genes were down-regulated except GmRAV-03, which was up-regulated. Under abscisic acid (ABA) treatment, the expression of all of the soybean RAV genes increased dramatically. These results suggested that the soybean RAV genes may be involved in diverse signaling pathways and may be responsive to abiotic stresses and exogenous ABA. Further analysis indicated that GmRAV-03 could increase the resistance of transgenic lines to high salt and drought and render the transgenic plants insensitive to exogenous ABA. The present study provides valuable information for understanding the classification and putative functions of the RAV transcription factors in soybean. PMID:28634481
Naturally occurring resistance mutations within the core and NS5B regions in hepatitis C genotypes, particularly genotype 5a, in South Africa.

PubMed

Prabdial-Sing, N; Blackard, J T; Puren, A J; Mahomed, A; Abuelhassan, W; Mahlangu, J; Vermeulen, M; Bowyer, S M

2016-03-01

Approximately 1 million South Africans are infected with Hepatitis C virus (HCV). The standard of care (SOC) in South Africa is combination therapy (pegylated interferon and ribavirin). HCV genotypes and/or mutations in the core/non-structural regions have been associated with response to therapy and/or disease progression. This study examines mutations in the core (29-280 amino acids, including ~90 E1 amino acids) and NS5B (241-306 amino acids) regions on pre-treatment isolates from patients attending Johannesburg hospitals or asymptomatic South African blood donors. Diversity within known CD4+ and CD8+ T-cell epitopes was also explored. Samples grouped into subtypes 1a (N = 10), 1b (N = 12), 3a (N = 5), 4a (N = 3) and 5a (N = 61). Two mutations associated with interferon resistance (R70Q and T110N) were present in 29 genotype 5a core sequences. No mutation conferring resistance to the NS5B nucleotide inhibitor sofosbuvir was found. Six putative CD8+ and one CD4+ T-cell epitope sequence in the core region showed binding scores of <300 IC50nM to HLA alleles frequently observed in the South African population. No known CD8+ and CD4+ T-cell epitopes were mapped in the NS5B region. The analysis raises the question of whether those infected with genotype 5a will benefit more from interferon-free combination therapies. This study provides new insight into one of the lesser studied HCV genotypes and compares the diversity seen in a large pre-treatment cohort with other subtypes. Copyright © 2015 Elsevier B.V. All rights reserved.
Phylogenetic analysis of the cytochrome P450 3 (CYP3) gene family.

PubMed

McArthur, Andrew G; Hegelund, Tove; Cox, Rachel L; Stegeman, John J; Liljenberg, Mette; Olsson, Urban; Sundberg, Per; Celander, Malin C

2003-08-01

Cytochrome P450 genes (CYP) constitute a superfamily with members known from the Bacteria, Archaea, and Eukarya. The CYP3 gene family includes the CYP3A and CYP3B subfamilies. Members of the CYP3A subfamily represent the dominant CYP forms expressed in the digestive and respiratory tracts of vertebrates. The CYP3A enzymes metabolize a wide variety of chemically diverse lipophilic organic compounds. To understand vertebrate CYP3 diversity better, we determined the killifish (Fundulus heteroclitus) CYP3A30 and CYP3A56 and the ball python (Python regius) CYP3A42 sequences. We performed phylogenetic analyses of 45 vertebrate CYP3 amino acid sequences using a Bayesian approach. Our analyses indicate that teleost, diapsid, and mammalian CYP3A genes have undergone independent diversification and that the ancestral vertebrate genome contained a single CYP3A gene. Most CYP3A diversity is the product of recent gene duplication events. There is strong support for placement of the guinea pig CYP3A genes within the rodent CYP3A diversification. The rat, mouse, and hamster CYP3A genes are mixed among several rodent CYP3A subclades, indicative of a complex history involving speciation and gene duplication.
Sequence diversity of NanA manifests in distinct enzyme kinetics and inhibitor susceptibility

NASA Astrophysics Data System (ADS)

Xu, Zhongli; von Grafenstein, Susanne; Walther, Elisabeth; Fuchs, Julian E.; Liedl, Klaus R.; Sauerbrei, Andreas; Schmidtke, Michaela

2016-04-01

Streptococcus pneumoniae is the leading pathogen causing bacterial pneumonia and meningitis. Its surface-associated virulence factor neuraminidase A (NanA) promotes the bacterial colonization by removing the terminal sialyl residues from glycoconjugates on eukaryotic cell surface. The predominant role of NanA in the pathogenesis of pneumococci renders it an attractive target for therapeutic intervention. Despite the highly conserved activity of NanA, our alignment of the 11 NanAs revealed the evolutionary diversity of this enzyme. The amino acid substitutions we identified, particularly those in the lectin domain and in the insertion domain next to the catalytic centre triggered our special interest. We synthesised the representative NanAs and the mutagenized derivatives from E. coli for enzyme kinetics study and neuraminidase inhibitor susceptibility test. Via molecular docking we got a deeper insight into the differences between the two major variants of NanA and their influence on the ligand-target interactions. In addition, our molecular dynamics simulations revealed a prominent intrinsic flexibility of the linker between the active site and the insertion domain, which influences the inhibitor binding. Our findings for the first time associated the primary sequence diversity of NanA with the biochemical properties of the enzyme and with the inhibitory efficiency of neuraminidase inhibitors.
DOE Office of Scientific and Technical Information (OSTI.GOV)

Klenk, Hans-Peter; Lu, Megan; Lucas, Susan

Saccharomonospora marina Liu et al. 2010 is a member of the genomically so far poorly characterized genus Saccharomonospora in the family Pseudonocardiaceae. Members of the genus Saccharomonospora are of interest because they originate from diverse habitats, such as leaf litter, manure, compost, surface of peat, moist, over-heated grain, and ocean sediment, where they might play a role in the primary degradation of plant material by attacking hemicellulose. Organisms belonging to the genus are usually Gram-positive staining, non-acid fast, and classify among the actinomycetes. Next to S. viridis and S. azurea, S. marina is the third member of the genus Saccharomonospora for which a completely sequenced (permanent draft status) type strain genome will be published. Here we describe the features of this organism, together with the complete genome sequence, and annotation. The 5,965,593 bp long chromosome with its 5,727 protein-coding and 57 RNA genes was sequenced as part of the DOE funded Community Sequencing Program (CSP) 2010 at the Joint Genome Institute (JGI).
Microbial genomic taxonomy

PubMed Central

2013-01-01

A need for a genomic species definition is emerging from several independent studies worldwide. In this commentary paper, we discuss recent studies on the genomic taxonomy of diverse microbial groups and a unified species definition based on genomics. Accordingly, strains from the same microbial species share >95% Average Amino Acid Identity (AAI) and Average Nucleotide Identity (ANI), >95% identity based on multiple alignment genes, <10 in Karlin genomic signature, and > 70% in silico Genome-to-Genome Hybridization similarity (GGDH). Species of the same genus will form monophyletic groups on the basis of 16S rRNA gene sequences, Multilocus Sequence Analysis (MLSA) and supertree analysis. In addition to the established requirements for species descriptions, we propose that new taxa descriptions should also include at least a draft genome sequence of the type strain in order to obtain a clear outlook on the genomic landscape of the novel microbe. The application of the new genomic species definition put forward here will allow researchers to use genome sequences to define simultaneously coherent phenotypic and genomic groups. PMID:24365132
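The thresholds listed in this abstract can be read as a simple conjunction of tests on a pair of genomes. The following is a minimal sketch of that reading, not the authors' software: the class name, field names and example values are hypothetical, and the metrics themselves (AAI, ANI, Karlin signature, GGDH) are assumed to have been computed elsewhere.

```python
from dataclasses import dataclass


@dataclass
class GenomePairMetrics:
    """Pairwise metrics for two strains (all names here are illustrative)."""
    aai: float           # Average Amino Acid Identity, percent
    ani: float           # Average Nucleotide Identity, percent
    mla_identity: float  # identity over multiple-alignment genes, percent
    karlin: float        # Karlin genomic signature dissimilarity
    ggdh: float          # in silico genome-to-genome hybridization similarity, percent


def same_species(m: GenomePairMetrics) -> bool:
    """Apply the cut-offs quoted in the abstract; all must hold at once."""
    return (
        m.aai > 95
        and m.ani > 95
        and m.mla_identity > 95
        and m.karlin < 10
        and m.ggdh > 70
    )


# Made-up values for two strain pairs, purely for illustration.
close_pair = GenomePairMetrics(aai=97.0, ani=96.5, mla_identity=98.0, karlin=4.0, ggdh=82.0)
distant_pair = GenomePairMetrics(aai=97.0, ani=96.5, mla_identity=98.0, karlin=12.0, ggdh=82.0)
print(same_species(close_pair), same_species(distant_pair))
```

Treating the criteria as a strict conjunction is an assumption of this sketch; the commentary itself does not say how conflicting metrics should be resolved.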
Hepatitis C Virus Antigenic Convergence

PubMed Central

Campo, David S.; Dimitrova, Zoya; Yokosawa, Jonny; Hoang, Duc; Perez, Nestor O.; Ramachandran, Sumathi; Khudyakov, Yury

2012-01-01

Vaccine development against hepatitis C virus (HCV) is hindered by poor understanding of factors defining cross-immunoreactivity among heterogeneous epitopes. Using synthetic peptides and mouse immunization as a model, we conducted a quantitative analysis of cross-immunoreactivity among variants of the HCV hypervariable region 1 (HVR1). Analysis of 26,883 immunological reactions among pairs of peptides showed that the distribution of cross-immunoreactivity among HVR1 variants was skewed, with antibodies against a few variants reacting with all tested peptides. The HVR1 cross-immunoreactivity was accurately modeled based on amino acid sequence alone. The tested peptides were mapped in the HVR1 sequence space, which was visualized as a network of 11,319 sequences. The HVR1 variants with a greater network centrality showed a broader cross-immunoreactivity. The entire sequence space is explored by each HCV genotype and subtype. These findings indicate that HVR1 antigenic diversity is extensively convergent and effectively limited, suggesting significant implications for vaccine development. PMID:22355779
Evaluation of cysteine proteases of Plasmodium vivax as antimalarial drug targets: sequence analysis and sensitivity to cysteine protease inhibitors.

PubMed

Na, Byoung-Kuk; Kim, Tong-Soo; Rosenthal, Philip J; Lee, Jong-Koo; Kong, Yoon

2004-10-01

Cysteine proteases perform critical roles in the life cycles of malaria parasites. In Plasmodium falciparum, treatment with cysteine protease inhibitors inhibits hemoglobin hydrolysis and blocks parasite development in vitro and in vivo, suggesting that plasmodial cysteine proteases may be interesting targets for new chemotherapeutics. To determine whether sequence diversity may limit chemotherapy against Plasmodium vivax, we analyzed sequence variations in the genes encoding three cysteine proteases, vivapain-1, -2 and -3, in 22 wild isolates of P. vivax. The sequences were highly conserved among wild isolates. A small number of substitutions leading to amino acid changes were found, but they did not modify residues essential for the function or structure of the enzymes. The substrate specificities and sensitivities to synthetic cysteine protease inhibitors of vivapain-2 and -3 from wild isolates were also very similar. These results support the suggestion that cysteine proteases of P. vivax are promising antimalarial chemotherapeutic targets.
Tracking global changes induced in the CD4 T-cell receptor repertoire by immunization with a complex antigen using short stretches of CDR3 protein sequence.

PubMed

Thomas, Niclas; Best, Katharine; Cinelli, Mattia; Reich-Zeliger, Shlomit; Gal, Hilah; Shifrut, Eric; Madi, Asaf; Friedman, Nir; Shawe-Taylor, John; Chain, Benny

2014-11-15

The clonal theory of adaptive immunity proposes that immunological responses are encoded by increases in the frequency of lymphocytes carrying antigen-specific receptors. In this study, we measure the frequency of different T-cell receptors (TcR) in CD4+ T cell populations of mice immunized with a complex antigen, killed Mycobacterium tuberculosis, using high throughput parallel sequencing of the TcRβ chain. Our initial hypothesis that immunization would induce repertoire convergence proved to be incorrect, and therefore an alternative approach was developed that allows accurate stratification of TcR repertoires and provides novel insights into the nature of CD4+ T-cell receptor recognition. To track the changes induced by immunization within this heterogeneous repertoire, the sequence data were classified by counting the frequency of different clusters of short (3 or 4) continuous stretches of amino acids within the antigen binding complementarity determining region 3 (CDR3) repertoire of different mice. Both unsupervised (hierarchical clustering) and supervised (support vector machine) analyses of these different distributions of sequence clusters differentiated between immunized and unimmunized mice with 100% efficiency. The CD4+ TcR repertoires of mice 5 and 14 days postimmunization were clearly different from that of unimmunized mice but were not distinguishable from each other. However, the repertoires of mice 60 days postimmunization were distinct both from naive mice and the day 5/14 animals. Our results reinforce the remarkable diversity of the TcR repertoire, resulting in many diverse private TcRs contributing to the T-cell response even in genetically identical mice responding to the same antigen. However, specific motifs defined by short stretches of amino acids within the CDR3 region may determine TcR specificity and define a new approach to TcR sequence classification. The analysis was implemented in R and Python, and source code can be found in Supplementary Data. Contact: b.chain@ucl.ac.uk. Supplementary data are available at Bioinformatics online. © The Author 2014. Published by Oxford University Press.
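The first step described in this abstract, counting short contiguous stretches of amino acids within CDR3 sequences, can be sketched as follows. This is an illustrative reconstruction rather than the authors' supplementary code: the function names and the two example CDR3 strings are hypothetical, and the later clustering and classification steps are not reproduced.

```python
from collections import Counter


def kmer_counts(cdr3_sequences, k=3):
    """Count every contiguous stretch of k amino acids across all sequences."""
    counts = Counter()
    for seq in cdr3_sequences:
        for i in range(len(seq) - k + 1):
            counts[seq[i:i + k]] += 1
    return counts


def kmer_frequencies(cdr3_sequences, k=3):
    """Normalize counts so repertoires of different depth can be compared."""
    counts = kmer_counts(cdr3_sequences, k)
    total = sum(counts.values())
    if total == 0:
        return {}
    return {kmer: n / total for kmer, n in counts.items()}


# Hypothetical CDR3 amino acid strings, used only to show the output shape.
repertoire = ["CASSL", "CASSQ"]
triplet_counts = kmer_counts(repertoire, k=3)
triplet_freqs = kmer_frequencies(repertoire, k=3)
print(triplet_counts.most_common(3))
```

A frequency vector of this kind, one per mouse, is the sort of input that hierarchical clustering or a support vector machine could then operate on.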

Low Lactobacilli abundance and polymicrobial diversity in the lower reproductive tract of female rhesus monkeys do not compromise their reproductive success.

PubMed

Amaral, Wellington Z; Lubach, Gabriele R; Kapoor, Amita; Proctor, Alexandra; Phillips, Gregory J; Lyte, Mark; Coe, Christopher L

2017-10-01

The lower reproductive tract of nonhuman primates is colonized with a diverse microbiota, resembling bacterial vaginosis (BV), a gynecological condition associated with negative reproductive outcomes in women. Our 4 aims were to: (i) assess the prevalence of low Lactobacilli and a BV-like profile in female rhesus monkeys; (ii) quantify cytokines in their cervicovaginal fluid (CVF); (iii) examine the composition and structure of their mucosal microbiota with culture-independent sequencing methods; and (iv) evaluate the potential influence on reproductive success. CVF specimens were obtained from 27 female rhesus monkeys for Gram's staining, and to determine acidity (pH), and quantify proinflammatory cytokines. Based on Nugent's classification, 40% had a score of 7 or higher, which would be indicative of BV in women. Nugent scores were significantly correlated with the pH of the CVF. Interleukin-1ß was present at high concentrations, but not further elevated by high Nugent scores. Vaginal swabs were obtained from eight additional females to determine microbial diversity by rRNA gene amplicon sequencing. At the phylum level, the Firmicutes/Bacteroidetes ratio was low. The relative abundance of Lactobacilli was also low (between 3% and 17%), and 11 other genera were present at >1%. However, neither the microbial diversity in the community structure, nor high Nugent scores, was associated with reduced fecundity. Female monkeys provide an opportunity to understand how reproductive success can be sustained in the presence of a diverse polymicrobial community in the reproductive tract. © 2017 Wiley Periodicals, Inc.
The Contribution of DNA Metabarcoding to Fungal Conservation: Diversity Assessment, Habitat Partitioning and Mapping Red-Listed Fungi in Protected Coastal Salix repens Communities in the Netherlands

PubMed Central

Geml, József; Gravendeel, Barbara; van der Gaag, Kristiaan J.; Neilen, Manon; Lammers, Youri; Raes, Niels; Semenova, Tatiana A.; de Knijff, Peter; Noordeloos, Machiel E.

2014-01-01

Western European coastal sand dunes are highly important for nature conservation. Communities of the creeping willow (Salix repens) represent one of the most characteristic and diverse vegetation types in the dunes. We report here the results of the first kingdom-wide fungal diversity assessment in S. repens coastal dune vegetation. We carried out massively parallel pyrosequencing of ITS rDNA from soil samples taken at ten sites in an extended area of joined nature reserves located along the North Sea coast of the Netherlands, representing habitats with varying soil pH and moisture levels. Fungal communities in Salix repens beds are highly diverse and we detected 1211 non-singleton fungal 97% sequence similarity OTUs after analyzing 688,434 ITS2 rDNA sequences. Our comparison along a north-south transect indicated strong correlation between soil pH and fungal community composition. The total fungal richness and the number of OTUs of most fungal taxonomic groups negatively correlated with higher soil pH, with some exceptions. With regard to ecological groups, dark-septate endophytic fungi were more diverse in acidic soils, ectomycorrhizal fungi were represented by more OTUs in calcareous sites, while the detected arbuscular mycorrhizal genera showed opposing trends regarding pH. Furthermore, we detected numerous red listed species in our samples, often from previously unknown locations, indicating that some of the fungal species currently considered rare may be more abundant in Dutch S. repens communities than previously thought. PMID:24937200
Microbial diversity and component variation in Xiaguan Tuo Tea during pile fermentation

PubMed Central

Li, Min; Yang, Xinrui; Gui, Xin; Chen, Guofeng; Chu, Jiuyun; He, Xingwang; Wang, Weitao; Han, Feng

2018-01-01

Xiaguan Tuo Tea is largely consumed by the Chinese, but there is little research into the microbial diversity and component changes during the fermentation of this tea. In this study, we first used fluorescence in situ hybridization (FISH), next-generation sequencing (NGS) and chemical analysis methods to determine the microbial abundance and diversity and the chemical composition during fermentation. The FISH results showed that the total number of microorganisms ranges from 2.3×10² to 4.0×10⁸ cells per gram of sample during fermentation and is mainly dominated by fungi. In the early fermentation stages, molds are dominant (0.6×10²~2.8×10⁶ cells/g, 0~35 d). However, in the late stages of fermentation, yeasts are dominant (3.6×10⁴~9.6×10⁶ cells/g, 35~56 d). The bacteria have little effect during the fermentation of tea (10²~10³ cells/g, <1% of fungus values). Of these fungi, A. niger (Aspergillus niger) and B. adeninivorans (Blastobotrys adeninivorans) are identified as the two most common strains, based on Next-generation Sequencing (NGS) analysis. Peak diversity in tea was observed at day 35 of fermentation (Shannon–Weaver index: 1.195857), and lower diversity was observed on days 6 and 56 of fermentation (Shannon–Weaver index 0.860589 and 1.119106, respectively). During the microbial fermentation, compared to the unfermented tea, the tea polyphenol content decreased by 54%, and the caffeine content increased by 59%. Theanine and free amino acid contents were reduced during fermentation by 81.1 and 92.85%, respectively. PMID:29462204
Microbial diversity and component variation in Xiaguan Tuo Tea during pile fermentation.

PubMed

Li, Haizhou; Li, Min; Yang, Xinrui; Gui, Xin; Chen, Guofeng; Chu, Jiuyun; He, Xingwang; Wang, Weitao; Han, Feng; Li, Ping

2018-01-01

Xiaguan Tuo Tea is largely consumed by the Chinese, but there is little research into the microbial diversity and component changes during the fermentation of this tea. In this study, we first used fluorescence in situ hybridization (FISH), next-generation sequencing (NGS) and chemical analysis methods to determine the microbial abundance and diversity and the chemical composition during fermentation. The FISH results showed that the total number of microorganisms ranges from 2.3×10² to 4.0×10⁸ cells per gram of sample during fermentation and is mainly dominated by fungi. In the early fermentation stages, molds are dominant (0.6×10²~2.8×10⁶ cells/g, 0~35 d). However, in the late stages of fermentation, yeasts are dominant (3.6×10⁴~9.6×10⁶ cells/g, 35~56 d). The bacteria have little effect during the fermentation of tea (10²~10³ cells/g, <1% of fungus values). Of these fungi, A. niger (Aspergillus niger) and B. adeninivorans (Blastobotrys adeninivorans) are identified as the two most common strains, based on Next-generation Sequencing (NGS) analysis. Peak diversity in tea was observed at day 35 of fermentation (Shannon-Weaver index: 1.195857), and lower diversity was observed on days 6 and 56 of fermentation (Shannon-Weaver index 0.860589 and 1.119106, respectively). During the microbial fermentation, compared to the unfermented tea, the tea polyphenol content decreased by 54%, and the caffeine content increased by 59%. Theanine and free amino acid contents were reduced during fermentation by 81.1 and 92.85%, respectively.
Serotype and genetic diversity of human rhinovirus strains that circulated in Kenya in 2008.

PubMed

Milanoi, Sylvia; Ongus, Juliette R; Gachara, George; Coldren, Rodney; Bulimo, Wallace

2016-05-01

Human rhinoviruses (HRVs) are a well-established cause of the common cold and recent studies indicated that they may be associated with severe acute respiratory illnesses (SARIs) like pneumonia, asthma, and bronchiolitis. Despite global studies on the genetic diversity of the virus, the serotype diversity of these viruses across diverse geographic regions in Kenya has not been characterized. This study sought to characterize the serotype diversity of HRV strains that circulated in Kenya in 2008. A total of 517 archived nasopharyngeal samples collected in a previous respiratory virus surveillance program across Kenya in 2008 were selected. Participants enrolled were outpatients who presented with influenza-like (ILI) symptoms. Real-time RT-PCR was employed for preliminary HRV detection. HRV-positive samples were amplified using RT-PCR and thereafter the nucleotide sequences of the amplicons were determined followed by phylogenetic analysis. Twenty-five percent of the samples tested positive for HRV. Phylogenetic analysis revealed that the Kenyan HRVs clustered into three main species comprising HRV-A (54%), HRV-B (12%), and HRV-C (35%). Overall, 20 different serotypes were identified. Intrastrain sequence homology among the Kenyan strains ranged from 58% to 100% at the nucleotide level and 55% to 100% at the amino acid level. These results show that a wide range of HRV serotypes with different levels of nucleotide variation were present in Kenya. Furthermore, our data show that HRVs contributed substantially to influenza-like illness in Kenya in 2008. © 2016 The Authors. Influenza and Other Respiratory Viruses Published by John Wiley & Sons Ltd.
Nucleic acid arrays and methods of synthesis

DOEpatents

Sabanayagam, Chandran R.; Sano, Takeshi; Misasi, John; Hatch, Anson; Cantor, Charles

2001-01-01

The present invention generally relates to high density nucleic acid arrays and methods of synthesizing nucleic acid sequences on a solid surface. Specifically, the present invention contemplates the use of stabilized nucleic acid primer sequences immobilized on solid surfaces, and circular nucleic acid sequence templates combined with the use of isothermal rolling circle amplification to thereby increase nucleic acid sequence concentrations in a sample or on an array of nucleic acid sequences.
Proteogenomic Investigation of Strain Variation in Clinical Mycobacterium tuberculosis Isolates.

PubMed

Heunis, Tiaan; Dippenaar, Anzaan; Warren, Robin M; van Helden, Paul D; van der Merwe, Ruben G; Gey van Pittius, Nicolaas C; Pain, Arnab; Sampson, Samantha L; Tabb, David L

2017-10-06

Mycobacterium tuberculosis consists of a large number of different strains that display unique virulence characteristics. Whole-genome sequencing has revealed substantial genetic diversity among clinical M. tuberculosis isolates, and elucidating the phenotypic variation encoded by this genetic diversity will be of the utmost importance to fully understand M. tuberculosis biology and pathogenicity. In this study, we integrated whole-genome sequencing and mass spectrometry (GeLC-MS/MS) to reveal strain-specific characteristics in the proteomes of two clinical M. tuberculosis Latin American-Mediterranean isolates. Using this approach, we identified 59 peptides containing single amino acid variants, which covered ∼9% of all coding nonsynonymous single nucleotide variants detected by whole-genome sequencing. Furthermore, we identified 29 distinct peptides that mapped to a hypothetical protein not present in the M. tuberculosis H37Rv reference proteome. Here, we provide evidence for the expression of this protein in the clinical M. tuberculosis SAWC3651 isolate. The strain-specific databases enabled confirmation of genomic differences (i.e., large genomic regions of difference and nonsynonymous single nucleotide variants) in these two clinical M. tuberculosis isolates and allowed strain differentiation at the proteome level. Our results contribute to the growing field of clinical microbial proteogenomics and can improve our understanding of phenotypic variation in clinical M. tuberculosis isolates.
Occurrence and activity of a type II CRISPR-Cas system in Lactobacillus gasseri.

PubMed

Sanozky-Dawes, Rosemary; Selle, Kurt; O'Flaherty, Sarah; Klaenhammer, Todd; Barrangou, Rodolphe

2015-09-01

Bacteria encode clustered regularly interspaced short palindromic repeats (CRISPRs) and CRISPR-associated genes (cas), which collectively form an RNA-guided adaptive immune system against invasive genetic elements. In silico surveys have revealed that lactic acid bacteria harbour a prolific and diverse set of CRISPR-Cas systems. Thus, the natural evolutionary role of CRISPR-Cas systems may be investigated in these ecologically, industrially, scientifically and medically important microbes. In this study, 17 Lactobacillus gasseri strains were investigated and 6 harboured a type II-A CRISPR-Cas system, with considerable diversity in array size and spacer content. Several of the spacers showed similarity to phage and plasmid sequences, which are typical targets of CRISPR-Cas immune systems. Aligning the protospacers facilitated inference of the protospacer adjacent motif sequence, determined to be 5'-NTAA-3' flanking the 3' end of the protospacer. The system in L. gasseri JV-V03 and NCK 1342 interfered with transforming plasmids containing sequences matching the most recently acquired CRISPR spacers in each strain. We report the distribution and function of a native type II-A CRISPR-Cas system in the commensal species L. gasseri. Collectively, these results open avenues for applications for bacteriophage protection and genome modification in L. gasseri, and contribute to the fundamental understanding of CRISPR-Cas systems in bacteria.
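The motif reported here, 5'-NTAA-3' directly flanking the 3' end of the protospacer, suggests a straightforward search: find where a spacer sequence occurs in a target and keep only the hits that are immediately followed by NTAA. The sketch below assumes plain uppercase or lowercase DNA strings and searches only the strand as given; the function name and the toy sequences are hypothetical and are not taken from the study.

```python
import re

# N is any nucleotide, so NTAA becomes one wildcard base followed by TAA.
PAM_REGEX = "[ACGT]TAA"


def find_pam_flanked_targets(spacer, target):
    """Return (start, pam) for each spacer match followed directly by NTAA.

    A lookahead is used so that overlapping matches are all reported.
    Only the given strand is scanned; the reverse complement is not checked.
    """
    spacer = spacer.upper()
    target = target.upper()
    pattern = re.compile(f"(?=({re.escape(spacer)})({PAM_REGEX}))")
    return [(m.start(1), m.group(2)) for m in pattern.finditer(target)]


# Toy sequences: the first target carries GTAA after the match, the second does not.
with_pam = find_pam_flanked_targets("GATTACA", "CCGATTACAGTAACC")
without_pam = find_pam_flanked_targets("GATTACA", "CCGATTACAGTTACC")
print(with_pam, without_pam)
```

Exact matching is a simplification; real spacer-to-protospacer comparisons usually tolerate some mismatches, which this sketch does not attempt.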
Genetic Diversity in Oxytocin Ligands and Receptors in New World Monkeys

PubMed Central

Ren, Dongren; Lu, Guoqing; Moriyama, Hideaki; Mustoe, Aaryn C.; Harrison, Emily B.; French, Jeffrey A.

2015-01-01

Oxytocin (OXT) is an important neurohypophyseal hormone that influences a wide spectrum of reproductive and social processes. Eutherian mammals possess a highly conserved sequence of OXT (Cys-Tyr-Ile-Gln-Asn-Cys-Pro-Leu-Gly). However, in this study, we sequenced the coding region for OXT in 22 species covering all New World monkey (NWM) genera and clades, and characterized five OXT variants, including consensus mammalian Leu8-OXT, major variant Pro8-OXT, and three previously unreported variants: Ala8-OXT, Thr8-OXT, and Phe2-OXT. Pro8-OXT shows clear structural and physicochemical differences from Leu8-OXT. We report multiple predicted amino acid substitutions in the G protein-coupled OXT receptor (OXTR), especially in the critical N-terminus, which is crucial for OXT recognition and binding. Genera with the same Pro8-OXT tend to cluster together on a phylogenetic tree based on OXTR sequence, and we demonstrate significant coevolution between OXT and OXTR. NWM species are characterized by a high incidence of social monogamy, and we document an association between OXTR phylogeny and social monogamy. Our results demonstrate remarkable genetic diversity in the NWM OXT/OXTR system, which can provide a foundation for molecular, pharmacological, and behavioral studies of the role of OXT signaling in regulating complex social phenotypes. PMID:25938568
Cardiomyocytes In Vitro Adhesion Is Actively Influenced by Biomimetic Synthetic Peptides for Cardiac Tissue Engineering

PubMed Central

Huerta-Cantillo, Rocio; Comisso, Marina; Danesin, Roberta; Ghezzo, Francesca; Naso, Filippo; Gastaldello, Alessandra; Schittullo, Eleonora; Buratto, Edward; Spina, Michele; Gerosa, Gino; Dettin, Monica

2012-01-01

Scaffolds for tissue engineering must be designed to direct desired events such as cell attachment, growth, and differentiation. The incorporation of extracellular matrix-derived peptides into biomaterials has been proposed to mimic biochemical signals. In this study, three synthetic fragments of fibronectin, vitronectin, and stromal-derived factor-1 were investigated for the first time as potential adhesive sequences for cardiomyocytes (CMs) compared to smooth muscle cells. CMs are responsive to all peptides to differing degrees, demonstrating the existence of diverse adhesion mechanisms. The pretreatment of nontissue culture well surfaces with the (Arginine-Glycine-Aspartic Acid) RGD sequence anticipated the appearance of CMs' contractility compared to the control (fibronectin-coated well) and doubled the length of cell viability. Future prospects are the inclusion of these sequences into biomaterial formulation with the improvement in cell adhesion that could play an important role in cell retention during dynamic cell seeding. PMID:22011064
Comparison of a High-Resolution Melting Assay to Next-Generation Sequencing for Analysis of HIV Diversity

PubMed Central

Cousins, Matthew M.; Ou, San-San; Wawer, Maria J.; Munshaw, Supriya; Swan, David; Magaret, Craig A.; Mullis, Caroline E.; Serwadda, David; Porcella, Stephen F.; Gray, Ronald H.; Quinn, Thomas C.; Donnell, Deborah; Eshleman, Susan H.

2012-01-01

Next-generation sequencing (NGS) has recently been used for analysis of HIV diversity, but this method is labor-intensive, costly, and requires complex protocols for data analysis. We compared diversity measures obtained using NGS data to those obtained using a diversity assay based on high-resolution melting (HRM) of DNA duplexes. The HRM diversity assay provides a single numeric score that reflects the level of diversity in the region analyzed. HIV gag and env from individuals in Rakai, Uganda, were analyzed in a previous study using NGS (n = 220 samples from 110 individuals). Three sequence-based diversity measures were calculated from the NGS sequence data (percent diversity, percent complexity, and Shannon entropy). The amplicon pools used for NGS were analyzed with the HRM diversity assay. HRM scores were significantly associated with sequence-based measures of HIV diversity for both gag and env (P < 0.001 for all measures). The level of diversity measured by the HRM diversity assay and NGS increased over time in both regions analyzed (P < 0.001 for all measures except for percent complexity in gag), and similar amounts of diversification were observed with both methods (P < 0.001 for all measures except for percent complexity in gag). Diversity measures obtained using the HRM diversity assay were significantly associated with those from NGS, and similar increases in diversity over time were detected by both methods. The HRM diversity assay is faster and less expensive than NGS, facilitating rapid analysis of large studies of HIV diversity and evolution. PMID:22785188
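Of the three sequence-based measures named in this abstract, Shannon entropy has a standard closed form, H = -Σ p_i ln p_i, where p_i is the fraction of reads belonging to variant i. The sketch below computes it from a list of reads; the use of the natural logarithm and the grouping of reads by exact sequence identity are assumptions of this example, since the abstract does not specify either. Percent diversity and percent complexity are not implemented because their definitions are not given here.

```python
import math
from collections import Counter


def shannon_entropy(reads):
    """Shannon entropy (natural log) of the variant distribution in `reads`.

    Identical strings are treated as the same variant. An empty input or a
    sample with a single variant yields 0.
    """
    counts = Counter(reads)
    total = sum(counts.values())
    # p * ln(1/p) is the same term as -p * ln(p), written to avoid a -0.0 result.
    return sum((n / total) * math.log(total / n) for n in counts.values())


# Hypothetical read sets: a clonal sample and an evenly mixed two-variant sample.
clonal = ["ACGT"] * 4
mixed = ["ACGT", "ACGA", "ACGT", "ACGA"]
print(shannon_entropy(clonal), shannon_entropy(mixed))
```

Entropy rises both with the number of variants and with how evenly reads are spread across them, which is why it is often reported alongside simpler counts of distinct sequences.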
High-Throughput Sequencing and Metagenomics: Moving Forward in the Culture-Independent Analysis of Food Microbial Ecology

PubMed Central

2013-01-01

Following recent trends in environmental microbiology, food microbiology has benefited from the advances in molecular biology and adopted novel strategies to detect, identify, and monitor microbes in food. An in-depth study of the microbial diversity in food can now be achieved by using high-throughput sequencing (HTS) approaches after direct nucleic acid extraction from the sample to be studied. In this review, the workflow of applying culture-independent HTS to food matrices is described. The current scenario and future perspectives of HTS uses to study food microbiota are presented, and the decision-making process leading to the best choice of working conditions to fulfill the specific needs of food research is described. PMID:23475615
Characteristics common to a cytokine family spanning five orders of insects.

PubMed

Matsumoto, Hitoshi; Tsuzuki, Seiji; Date-Ito, Atsuko; Ohnishi, Atsushi; Hayakawa, Yoichi

2012-06-01

Growth-blocking peptide (GBP) is a member of an insect cytokine family with diverse functions including growth and immunity controls. Members of this cytokine family have been reported in 15 species of Lepidoptera, and we have recently identified GBP-like peptides in Diptera such as Lucilia cuprina and Drosophila melanogaster, indicating that this peptide family is not specific to Lepidoptera. In order to extend our knowledge of this peptide family, we purified the same family peptide from one of the tenebrionids, Zophobas atratus, isolated its cDNA, and sequenced it. The Z. atratus GBP sequence together with reported sequence data of peptides from the same family enabled us to perform BLAST searches against EST and genome databases of several insect species including Coleoptera, Diptera, Hymenoptera, and Hemiptera and identify homologous peptide genes. Here we report conserved structural features in these sequence data. They consist of 19-30 amino acid residues encoded at the C terminus of a 73-152 amino acid precursor and contain the motif C-x(2)-G-x(4,6)-G-x(1,2)-C-[KR], which shares a certain similarity with the motif in the mammalian EGF peptide family. These data indicate that these small cytokines belonging to one family are present in at least five insect orders. Copyright © 2012 Elsevier Ltd. All rights reserved.
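The motif C-x(2)-G-x(4,6)-G-x(1,2)-C-[KR] is written in PROSITE-style pattern syntax, where x is any residue, a number in parentheses is a repeat count or range, and square brackets list alternatives. A minimal sketch of translating that subset of the syntax into a regular expression follows; the converter handles only the elements that appear in this motif, and the test peptides are made up for illustration.

```python
import re

# One element: "x", a single residue letter, or a bracketed set,
# optionally followed by (n) or (m,n).
ELEMENT = re.compile(r"(x|[A-Z]|\[[A-Z]+\])(?:\((\d+)(?:,(\d+))?\))?")


def prosite_to_regex(pattern):
    """Translate a simple PROSITE-style pattern into a Python regex string."""
    parts = []
    for element in pattern.split("-"):
        m = ELEMENT.fullmatch(element)
        if m is None:
            raise ValueError(f"unsupported pattern element: {element!r}")
        residue, low, high = m.groups()
        token = "." if residue == "x" else residue
        if low is not None:
            token += "{" + low + ("," + high if high else "") + "}"
        parts.append(token)
    return "".join(parts)


gbp_regex = prosite_to_regex("C-x(2)-G-x(4,6)-G-x(1,2)-C-[KR]")
print(gbp_regex)

# Made-up peptides: the first fits the motif, the second has too short a
# spacer between the two glycines.
hit = re.search(gbp_regex, "AACLLGAAAAGSCKAA")
miss = re.search(gbp_regex, "AACLLGAAGSCKAA")
```

A regex of this kind could be scanned across translated EST or genome sequences to shortlist candidates, with the caveat that a pattern match alone does not establish homology.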
Genomic perspectives of spider silk genes through target capture sequencing: Conservation of stabilization mechanisms and homology-based structural models of spidroin terminal regions.

PubMed

Collin, Matthew A; Clarke, Thomas H; Ayoub, Nadia A; Hayashi, Cheryl Y

2018-07-01

A powerful system for studying protein aggregation, particularly rapid self-assembly, is spider silk. Spider silks are proteinaceous and silk proteins are synthesized and stored within silk glands as liquid dope. As needed, liquid dope is near-instantaneously transformed into solid fibers or viscous adhesives. The dominant constituents of silks are spidroins (spider fibroins) and their terminal domains are vital for the tight control of silk self-assembly. To better understand spidroin termini, we used target capture and deep sequencing to identify spidroin gene sequences from six species representing the araneoid families of Araneidae, Nephilidae, and Theridiidae. We obtained 145 terminal regions, of which 103 are newly annotated here, as well as novel variants within nine diverse spidroin types. Our comparative analyses demonstrated the conservation of acidic, basic, and cysteine amino acid residues across spidroin types that had been proposed to be important for monomer stability, dimer formation, and self-assembly from a limited sampling of spidroins. Computational, protein homology modeling revealed areas of spidroin terminal regions that are highly conserved in three-dimensions despite sequence divergence across spidroin types. Analyses of our dense sampling of terminal regions suggest that most spidroins share stabilization mechanisms, dimer formation, and tertiary structure, despite producing functionally distinct materials. Copyright © 2018 The Authors. Published by Elsevier B.V. All rights reserved.
Unexpected Diversity of Escherichia coli Sialate O-Acetyl Esterase NanS

PubMed Central

Rangel, Ariel; Steenbergen, Susan M.

2016-01-01

ABSTRACT The sialic acids (N-acylneuraminates) are a group of nine-carbon keto-sugars existing mainly as terminal residues on animal glycoprotein and glycolipid carbohydrate chains. Bacterial commensals and pathogens exploit host sialic acids for nutrition, adhesion, or antirecognition, where N-acetyl- or N-glycolylneuraminic acids are the two predominant chemical forms of sialic acids. Each form may be modified by acetyl esters at carbon position 4, 7, 8, or 9 and by a variety of less-common modifications. Modified sialic acids produce challenges for colonizing bacteria, because the chemical alterations to N-acetylneuraminic acid (Neu5Ac) confer increased resistance to sialidase and aldolase activities essential for the catabolism of host sialic acids. Bacteria with O-acetyl sialate esterase(s) utilize acetylated sialic acids for growth, thereby gaining a presumed metabolic advantage over competitors lacking this activity. Here, we demonstrate the esterase activity of Escherichia coli NanS after purifying it as a C-terminal HaloTag fusion. Using a similar approach, we show that E. coli strain O157:H7 Stx prophage or prophage remnants invariably include paralogs of nanS often located downstream of the Shiga-like toxin genes. These paralogs may include sequences encoding N- or C-terminal domains of unknown function where the NanS domains can act as sialate O-acetyl esterases, as shown by complementation of an E. coli strain K-12 nanS mutant and the unimpaired growth of an E. coli O157 nanS mutant on O-acetylated sialic acid. We further demonstrate that nanS homologs in Streptococcus spp. also encode active esterase, demonstrating an unexpected diversity of bacterial sialate O-acetyl esterase. IMPORTANCE The sialic acids are a family of over 40 naturally occurring 9-carbon keto-sugars that function in a variety of host-bacterium interactions. These sugars occur primarily as terminal carbohydrate residues on host glycoproteins and glycolipids. 
Available evidence indicates that diverse bacterial species use host sialic acids for adhesion or as sources of carbon and nitrogen. Our results show that the catabolism of the diacetylated form of host sialic acid requires a specialized esterase, NanS. Our results further show that nanS homologs exist in bacteria other than Escherichia coli, as well as part of toxigenic E. coli prophage. The unexpected diversity of these enzymes suggests new avenues for investigating host-bacterium interactions. Therefore, these original results extend our previous studies of nanS to include mucosal pathogens, prophage, and prophage remnants. This expansion of the nanS superfamily suggests important, although as-yet-unknown, functions in host-microbe interactions. PMID:27481927
Enteral High Fat-Polyunsaturated Fatty Acid Blend Alters the Pathogen Composition of the Intestinal Microbiome in Premature Infants with an Enterostomy

PubMed Central

Younge, Noelle; Yang, Qing; Seed, Patrick C.

2016-01-01

Objective To determine the effect of enteral fish oil and safflower oil supplementation on the intestinal microbiome in premature infants with an enterostomy. Study design Premature infants with an enterostomy were randomized to receive early enteral supplementation with a high fat-polyunsaturated fatty acid (HF-PUFA) blend of fish oil and safflower oil versus standard nutritional therapy. We used 16S rRNA gene sequencing for longitudinal profiling of the microbiome from the time of study entry until bowel reanastomosis. We used weighted gene co-expression network analysis to identify microbial community modules that differed between study groups over time. We performed imputed metagenomic analysis to determine metabolic pathways associated with the microbial genes. Results Sixteen infants were randomized to receive enteral HF-PUFA supplementation and 16 infants received standard care. The intestinal microbiota of infants in the treatment group differed from those in the control group, with greater bacterial diversity and lower abundance of Streptococcus, Clostridium, and many pathogenic genera within the Enterobacteriaceae family. We identified four microbial community modules with significant differences between groups over time. Imputed metagenomic analysis of the microbial genes revealed metabolic pathways that differed between groups, including metabolism of amino acids, carbohydrates, fatty acids, and secondary bile acid synthesis. Conclusion Enteral HF-PUFA supplementation was associated with decreased abundance of pathogenic bacteria, greater bacterial diversity, and shifts in the potential metabolic functions of intestinal microbiota. Trial registration ClinicalTrials.gov: NCT01306838 PMID:27856001
Enteral High Fat-Polyunsaturated Fatty Acid Blend Alters the Pathogen Composition of the Intestinal Microbiome in Premature Infants with an Enterostomy.

PubMed

Younge, Noelle; Yang, Qing; Seed, Patrick C

2017-02-01

To determine the effect of enteral fish oil and safflower oil supplementation on the intestinal microbiome in infants with an enterostomy born premature. Infants with an enterostomy born premature were randomized to receive early enteral supplementation with a high-fat polyunsaturated fatty acid (HF-PUFA) blend of fish oil and safflower oil vs standard nutritional therapy. We used 16S rRNA gene sequencing for longitudinal profiling of the microbiome from the time of study entry until bowel reanastomosis. We used weighted gene coexpression network analysis to identify microbial community modules that differed between study groups over time. We performed imputed metagenomic analysis to determine metabolic pathways associated with the microbial genes. Sixteen infants were randomized to receive enteral HF-PUFA supplementation, and 16 infants received standard care. The intestinal microbiota of infants in the treatment group differed from those in the control group, with greater bacterial diversity and lower abundance of Streptococcus, Clostridium, and many pathogenic genera within the Enterobacteriaceae family. We identified 4 microbial community modules with significant differences between groups over time. Imputed metagenomic analysis of the microbial genes revealed metabolic pathways that differed between groups, including metabolism of amino acids, carbohydrates, fatty acids, and secondary bile acid synthesis. Enteral HF-PUFA supplementation was associated with decreased abundance of pathogenic bacteria, greater bacterial diversity, and shifts in the potential metabolic functions of intestinal microbiota. ClinicalTrials.gov:NCT01306838. Copyright © 2016 Elsevier Inc. All rights reserved.
Identification and subspecific differentiation of Mycobacterium scrofulaceum by automated sequencing of a region of the gene (hsp65) encoding a 65-kilodalton heat shock protein.

PubMed Central

Swanson, D S; Pan, X; Musser, J M

1996-01-01

Mycobacterium scrofulaceum is most commonly recovered from children with cervical lymphadenitis, although it also accounts for approximately 2% of the mycobacterial infections in AIDS patients. Species assignment of M. scrofulaceum isolated by conventional techniques can be difficult and time-consuming. To develop a strategy for rapid species assignment of these organisms, a 360-bp region of the gene (hsp65) encoding a 65-kDa heat shock protein in 37 isolates from diverse sources was sequenced. Eight hsp65 alleles were identified, and these sequences formed phylogenetic clusters and lineages largely distinct from other Mycobacterium species. There was incomplete correlation between serovar designation and hsp65 allele assignment. The hsp65 data correlated strongly with the results of sequence analysis of the gene coding for 16S rRNA. Automated DNA sequencing of a 360-bp region of the hsp65 gene provides a rapid and unambiguous method for species assignment of these acid-fast organisms for diagnostic purposes. PMID:8940463
Sequence diversity of the leukotoxin (lktA) gene in caprine and ovine strains of Mannheimia haemolytica.

PubMed

Vougidou, C; Sandalakis, V; Psaroulaki, A; Petridou, E; Ekateriniadou, L

2013-04-20

Mannheimia haemolytica is the aetiological agent of pneumonic pasteurellosis in small ruminants. The primary virulence factor of the bacterium is a leukotoxin (LktA), which induces apoptosis in susceptible cells via mitochondrial targeting. It has been previously shown that certain lktA alleles are associated either with cattle or sheep. The objective of the present study was to investigate lktA sequence variation among ovine and caprine M. haemolytica strains isolated from pneumonic lungs, revealing any potential adaptation for the caprine host, for which no data are available. Furthermore, we investigated amino acid variation in the N-terminal part of the sequences and its effect on targeting mitochondria. Data analysis showed that the prevalent caprine genotype differed at a single non-synonymous site from a previously described uncommon bovine allele, whereas the ovine sequences represented new, distinct alleles. N-terminal sequence differences did not affect the mitochondrial targeting ability of the isolates; interestingly, in one case, mitochondrial matrix targeting was indicated rather than membrane association, suggesting an alternative LktA trafficking pattern.
Artificial Intelligence, DNA Mimicry, and Human Health.

PubMed

Stefano, George B; Kream, Richard M

2017-08-14

The molecular evolution of genomic DNA across diverse plant and animal phyla involved dynamic registrations of sequence modifications to maintain existential homeostasis to increasingly complex patterns of environmental stressors. As an essential corollary, driver effects of positive evolutionary pressure are hypothesized to effect concerted modifications of genomic DNA sequences to meet expanded platforms of regulatory controls for successful implementation of advanced physiological requirements. It is also clearly apparent that preservation of updated registries of advantageous modifications of genomic DNA sequences requires coordinate expansion of convergent cellular proofreading/error correction mechanisms that are encoded by reciprocally modified genomic DNA. Computational expansion of operationally defined DNA memory extends to coordinate modification of coding and previously under-emphasized noncoding regions that now appear to represent essential reservoirs of untapped genetic information amenable to evolutionary driven recruitment into the realm of biologically active domains. Additionally, expansion of DNA memory potential via chemical modification and activation of noncoding sequences is targeted to vertical augmentation and integration of an expanded cadre of transcriptional and epigenetic regulatory factors affecting linear coding of protein amino acid sequences within open reading frames.

Insight into the bacterial diversity of fermentation woad dye vats as revealed by PCR-DGGE and pyrosequencing.

PubMed

Milanović, Vesna; Osimani, Andrea; Taccari, Manuela; Garofalo, Cristiana; Butta, Alessandro; Clementi, Francesca; Aquilanti, Lucia

2017-07-01

The bacterial diversity in fermenting dye vats with woad (Isatis tinctoria L.) prepared and maintained in a functional state for approximately 12 months was examined using a combination of culture-dependent and -independent PCR-DGGE analyses and next-generation sequencing of 16S rRNA amplicons. An extremely complex ecosystem including taxa potentially contributing to both indigo reduction and formation, as well as indigo degradation was found. PCR-DGGE analyses revealed the presence of Paenibacillus lactis, Sporosarcina koreensis, Bacillus licheniformis, and Bacillus thermoamylovorans, while Bacillus thermolactis, Bacillus pumilus and Bacillus megaterium were also identified but with sequence identities lower than 97%. Dominant operational taxonomic units (OTUs) identified by pyrosequencing included Clostridium ultunense, Tissierella spp., Alcaligenes faecalis, Erysipelothrix spp., Enterococcus spp., Virgibacillus spp. and Virgibacillus panthothenicus, while sub-dominant OTUs included clostridia, alkaliphiles, halophiles, bacilli, moderately thermophilic bacteria, lactic acid bacteria, Enterobacteriaceae, aerobes, and even photosynthetic bacteria. Based on the current knowledge of indigo-reducing bacteria, it is considered that indigo-reducing bacteria constituted only a small fraction in the unique microcosm detected in the natural indigo dye vats.
A global view of structure–function relationships in the tautomerase superfamily

PubMed Central

Davidson, Rebecca; Baas, Bert-Jan; Akiva, Eyal; Holliday, Gemma L.; Polacco, Benjamin J.; LeVieux, Jake A.; Pullara, Collin R.; Zhang, Yan Jessie; Whitman, Christian P.

2018-01-01

The tautomerase superfamily (TSF) consists of more than 11,000 nonredundant sequences present throughout the biosphere. Characterized members have attracted much attention because of the unusual and key catalytic role of an N-terminal proline. These few characterized members catalyze a diverse range of chemical reactions, but the full scale of their chemical capabilities and biological functions remains unknown. To gain new insight into TSF structure–function relationships, we performed a global analysis of similarities across the entire superfamily and computed a sequence similarity network to guide classification into distinct subgroups. Our results indicate that TSF members are found in all domains of life, with most being present in bacteria. The eukaryotic members of the cis-3-chloroacrylic acid dehalogenase subgroup are limited to fungal species, whereas the macrophage migration inhibitory factor subgroup has wide eukaryotic representation (including mammals). Unexpectedly, we found that 346 TSF sequences lack Pro-1, of which 85% are present in the malonate semialdehyde decarboxylase subgroup. The computed network also enabled the identification of similarity paths, namely sequences that link functionally diverse subgroups and exhibit transitional structural features that may help explain reaction divergence. A structure-guided comparison of these linker proteins identified conserved transitions between them, and kinetic analysis paralleled these observations. Phylogenetic reconstruction of the linker set was consistent with these findings. Our results also suggest that contemporary TSF members may have evolved from a short 4-oxalocrotonate tautomerase–like ancestor followed by gene duplication and fusion. Our new linker-guided strategy can be used to enrich the discovery of sequence/structure/function transitions in other enzyme superfamilies. PMID:29184004
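The sequence similarity network mentioned in this abstract can be pictured as a graph in which sequences are nodes and an edge is kept only when a pairwise similarity score passes a chosen threshold; groups of connected nodes then suggest subgroups. The Python below is a rough sketch of that idea only. The scores, sequence labels, thresholds, and function names are hypothetical, and the abstract does not state which scoring scheme or cutoff the authors used.

```python
def similarity_network(scores, threshold):
    """Build an undirected graph keeping edges with score >= threshold."""
    graph = {}
    for (a, b), score in scores.items():
        graph.setdefault(a, set())  # register both nodes even if no edge is kept
        graph.setdefault(b, set())
        if score >= threshold:
            graph[a].add(b)
            graph[b].add(a)
    return graph


def connected_components(graph):
    """Return the connected components as sorted lists of node names."""
    seen, components = set(), []
    for start in sorted(graph):
        if start in seen:
            continue
        stack, members = [start], set()
        while stack:
            node = stack.pop()
            if node in members:
                continue
            members.add(node)
            stack.extend(graph[node] - members)
        seen |= members
        components.append(sorted(members))
    return components


# Hypothetical pairwise similarity scores (arbitrary units), for illustration only.
toy_scores = {
    ("s1", "s2"): 80,
    ("s2", "s3"): 75,
    ("s3", "s4"): 35,  # a weak link between two otherwise separate groups
    ("s4", "s5"): 90,
    ("s1", "s5"): 10,
}

print(connected_components(similarity_network(toy_scores, threshold=50)))
print(connected_components(similarity_network(toy_scores, threshold=30)))
```

In this toy data, relaxing the threshold lets the weak s3-s4 edge join the two groups, which loosely mirrors the abstract's notion of sequences that link otherwise distinct subgroups.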
Immunoglobulin from Antarctic fish species of Rajidae family.

PubMed

Coscia, Maria Rosaria; Cocca, Ennio; Giacomelli, Stefano; Cuccaro, Fausta; Oreste, Umberto

2012-03-01

Immunoglobulins (Ig) of Chondrichthyes have been extensively studied in sharks; in contrast, in skates investigations on Ig remain scarce and fragmentary despite the high occurrence of skates in all of the major oceans of the world. To focus on Rajidae Igμ, the most abundant heavy chain isotype, we have chosen the Antarctic species Bathyraja eatonii, Bathyraja albomaculata, Bathyraja brachyurops, and Amblyraja georgiana which live at high latitudes in the Southern Ocean, and at very low temperatures. We prepared mRNA from the spleen of individuals of each species and performed RT-PCR experiments using two oligonucleotides designed on the alignment of various elasmobranch Igμ heavy chain sequences available in GenBank. The PCR products, about 1400-nt long, were cloned and sequenced. Nucleotide sequence identities calculated for the constant region domains ranged from 88.5% to 97.5% between species, and from 91.1% to 99.7% within species. In a distance tree, including also Raja erinacea sequences, two major branches were obtained, one containing Arhynchobatinae sequences, the other one Rajinae sequences. Four presumptive D gene segments were identified in the region of the VH/D/JH recombination; two different D segments were often found in the same sequence. Moreover, 5-15 genomic fragments of different lengths, carrying the gene locus encoding Igμ chain were revealed by Southern blotting analysis. B. eatonii amino acid sequences were analyzed for the positional diversity by Shannon entropy analysis, showing CH4 as the most conserved domain, and CH3 as the most variable one. B. eatonii CDR3 region length varied between 11 and 15 amino acid residues; the mean length (13.4 aa) was greater than that of Leucoraja eglanteria sequences (7.7 aa). An alignment of representative sequences of Antarctic species and R. erinacea showed that more cysteine residues not involved in the intradomain disulfide bridges were present in Antarctic species. Copyright © 2011 Elsevier B.V. All rights reserved.
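Positional diversity by Shannon entropy, as mentioned in this abstract, is commonly computed per alignment column as H = sum over residues of p * log2(1/p), where p is the frequency of each residue in that column. The Python below is a minimal sketch under that assumption; the toy alignment is hypothetical, gap characters would simply be counted as another symbol, and the abstract does not describe the authors' exact procedure.

```python
import math
from collections import Counter


def column_entropy(column):
    """Shannon entropy (in bits) of the residues in one alignment column."""
    counts = Counter(column)
    total = len(column)
    return sum((n / total) * math.log2(total / n) for n in counts.values())


def alignment_entropy(sequences):
    """Per-position entropy for equal-length aligned sequences."""
    if len({len(seq) for seq in sequences}) != 1:
        raise ValueError("aligned sequences must all have the same length")
    return [column_entropy(column) for column in zip(*sequences)]


# Hypothetical four-sequence toy alignment, for illustration only.
toy_alignment = ["ACDA", "ACEC", "ACDD", "ACEE"]
print(alignment_entropy(toy_alignment))
```

A fully conserved column scores 0 bits and a column with four equally frequent residues scores 2 bits, so averaging the values over the positions of a domain gives one simple way to rank domains from most conserved to most variable.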
[Diversity of cultivable actinobacteria in Xinghu wetland sediments].

PubMed

Xue, Dong; Zhao, Guozhen; Yao, Qing; Zhao, Haiquan; Zhu, Honghui

2015-11-04

The aim was to study the diversity of cultivable actinobacteria in Xinghu wetland and to screen for actinobacteria with pharmaceutical potential for producing biologically active secondary metabolites. We studied the diversity of actinobacteria isolated from Xinghu wetland by using different selective isolation media and methods. Highly bioactive actinobacteria were identified and further investigated for the presence of polyketide synthases (PKS-I, PKS-II), nonribosomal peptide synthetases (NRPS), 3-amino-5-hydroxybenzoic acid synthases (AHBA) and 3-hydroxy-3-methylglutaryl Coenzyme A (HMG CoA) sequences by specific amplification. More than 300 actinobacteria were isolated, and 135 isolates were selected on the basis of their morphologies on different media and were further characterized by 16S rRNA gene sequencing. The isolates belonged to 7 orders, 10 families, and 13 genera; Streptomyces was the most frequently isolated genus, followed by the genera Micromonospora and Nocardia. Twenty-four isolates showed high activity against Staphylococcus aureus and Escherichia coli, but no strain displayed antagonistic activity against Salmonella sp. High frequencies of positive PCR amplification were obtained for PKS-I (16.7%, 4/24), PKS-II (62.5%, 15/24), NRPS (16.7%, 4/24), HMG CoA (29.2%, 7/24) and AHBA (12.5%, 3/24) biosynthetic systems. High-performance liquid chromatography showed that strains XD7, XD114, and XD128 produce numerous secondary metabolites. This study indicated that actinobacteria isolated from Xinghu wetland are abundant and have potentially beneficial and diverse bioactivities, which should be pursued for their biotechnological promise.
Structure-Specific Ribonucleases for MS-Based Elucidation of Higher-Order RNA Structure

NASA Astrophysics Data System (ADS)

Scalabrin, Matteo; Siu, Yik; Asare-Okai, Papa Nii; Fabris, Daniele

2014-07-01

Supported by high-throughput sequencing technologies, structure-specific nucleases are experiencing a renaissance as biochemical probes for genome-wide mapping of nucleic acid structure. This report explores the benefits and pitfalls of the application of Mung bean (Mb) and V1 nuclease, which attack specifically single- and double-stranded regions of nucleic acids, as possible structural probes to be employed in combination with MS detection. Both enzymes were found capable of operating in ammonium-based solutions that are preferred for high-resolution analysis by direct infusion electrospray ionization (ESI). Sequence analysis by tandem mass spectrometry (MS/MS) was performed to confirm mapping assignments and to resolve possible ambiguities arising from the concomitant formation of isobaric products with identical base composition and different sequences. The observed products grouped together into ladder-type series that facilitated their assignment to unique regions of the substrate, but revealed also a certain level of uncertainty in identifying the boundaries between paired and unpaired regions. Various experimental factors that are known to stabilize nucleic acid structure, such as higher ionic strength, presence of Mg(II), etc., increased the accuracy of cleavage information, but did not completely eliminate deviations from expected results. These observations suggest extreme caution in interpreting the results afforded by these types of reagents. Regardless of the analytical platform of choice, the results highlighted the need to repeat probing experiments under the most diverse possible conditions to recognize potential artifacts and to increase the level of confidence in the observed structural information.
Molecular characterization and transcriptional analysis of the female-enriched chondroitin proteoglycan 2 of Toxocara canis.

PubMed

Ma, G X; Zhou, R Q; Hu, L; Luo, Y L; Luo, Y F; Zhu, H H

2018-03-01

Toxocara canis is an important but neglected zoonotic parasite, and is the causative agent of human toxocariasis. Chondroitin proteoglycans are biological macromolecules, widely distributed in extracellular matrices, with a great diversity of functions in mammals. However, there is limited information regarding chondroitin proteoglycans in nematode parasites. In the present study, a female-enriched chondroitin proteoglycan 2 gene of T. canis (Tc-cpg-2) was cloned and characterized. Quantitative real-time polymerase chain reaction (qRT-PCR) was employed to measure the transcription levels of Tc-cpg-2 among tissues of male and female adult worms. A 485-amino-acid (aa) polypeptide was predicted from a continuous 1458-nucleotide open reading frame and designated as TcCPG2, which contains a 21-aa signal peptide. Conserved domain searching indicated three chitin-binding peritrophin-A (CBM_14) domains in the amino acid sequence of TcCPG2. Multiple alignment with the inferred amino acid sequences of Caenorhabditis elegans and Ascaris suum showed that CBM_14 domains were well conserved among these species. Phylogenetic analysis suggested that TcCPG2 was closely related to the sequence of chondroitin proteoglycan 2 of A. suum. Interestingly, a high level of Tc-cpg-2 was detected in female germline tissues, particularly in the oviduct, suggesting potential roles of this gene in reproduction (e.g. oogenesis and embryogenesis) of adult T. canis. The functional roles of Tc-cpg-2 in reproduction and development in this parasite and related parasitic nematodes warrant further functional studies.
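The two lengths reported in this abstract are consistent with each other if the open reading frame is counted with its stop codon: (485 codons + 1 stop codon) x 3 nucleotides = 1458 nucleotides. The short Python check below sketches that arithmetic; the function name is made up for illustration, and the assumption that the reported length includes the stop codon is an inference from the numbers, not something the abstract states.

```python
def orf_length_nt(protein_length_aa: int, include_stop: bool = True) -> int:
    """Nucleotide length of an ORF encoding a protein of the given length."""
    codons = protein_length_aa + (1 if include_stop else 0)  # one codon per residue
    return 3 * codons  # three nucleotides per codon


print(orf_length_nt(485))                      # length counting the stop codon
print(orf_length_nt(485, include_stop=False))  # coding codons only
```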
Microbial and genomic characterization of Geobacillus thermodenitrificans OS27, a marine thermophile that degrades diverse raw seaweeds.

PubMed

Fujii, Kenta; Tominaga, Yurie; Okunaka, Jyumpei; Yagi, Hisashi; Ohshiro, Takashi; Suzuki, Hirokazu

2018-06-01

Seaweeds are a nonlignocellulosic biomass, but they are often abundant in unique polysaccharides that common microbes can hardly utilize; therefore, polysaccharide degradation is key for the full utilization of seaweed biomass. Here, we isolated 13 thermophiles from seaweed homogenates that had been incubated at high temperature. All of the isolates were Gram-positive and preferentially grew at 60-70 °C. Most formed endospores and were tolerant to seawater salinity. Despite different sources, all isolates were identical regarding 16S rRNA gene sequences and were categorized as Geobacillus thermodenitrificans. Their growth occurred on seaweed polysaccharides with different profiles but required amino acids and/or vitamins, implying that they existed as proliferative cells by utilizing nutrients on seaweed viscous surfaces. Among 13 isolates, strain OS27 was further characterized to show that it can utilize a diverse range of seaweed polysaccharides and hemicelluloses. Notably, strain OS27 degraded raw seaweeds while releasing soluble saccharides. The degradation seemed to depend on enzymes that were extracellularly produced in an inducible manner. The strain could be genetically modified to produce heterologous endoglucanase, providing a transformant that degrades more diverse seaweeds with higher efficiency. The draft sequences of the OS27 genome contained 3766 coding sequences, which included intact genes for 28 glycoside hydrolases and many hypothetical proteins unusual among G. thermodenitrificans. These results suggest that G. thermodenitrificans OS27 serves as a genetic resource for thermostable enzymes to degrade seaweeds and potentially as a microbial platform for high temperature seaweed biorefinery via genetic modification.
Molecular identification and genetic diversity of open reading frame 7 field isolated porcine reproductive and respiratory syndrome in North Sumatera, Indonesia, in the period of 2008-2014.

PubMed

Faisal, Faisal; Widayanti, Rini; Haryanto, Aris; Tabu, Charles Rangga

2015-07-01

This study aimed to identify, at the molecular level, field isolates of porcine reproductive and respiratory syndrome virus (PRRSV) from North Sumatera, Indonesia, collected in 2008-2014, and to assess the genetic diversity of their open reading frame 7 (ORF7). A total of 47 PRRSV samples were collected from pigs that had died, in different districts of North Sumatera province, during 2008-2014. Two primer pairs were designed to amplify ORF7 of Type 1 and Type 2 PRRSV based on the sequences of the reference viruses VR2332 and Lelystad. Viral RNA was extracted from the samples using the PureLink™ Micro-to-Midi total RNA purification system (Invitrogen). ORF7 was amplified by cDNA synthesis and DNA amplification using reverse transcription polymerase chain reaction (RT-PCR) followed by nested PCR. The PCR products were then sequenced, and phylogenetic analysis was performed with Molecular Evolutionary Genetics Analysis (MEGA) version 6.0 software. RT-PCR and nested PCR detected PRRSV in 18 samples, with amplification products of 703 bp and 508 bp, respectively. Sequencing of ORF7 showed that the 18 PRRS viruses isolated from North Sumatera belonged to the North American (NA) type, comprising JXA1-like and classic NA-type viruses. Several mutations were detected, particularly in the nuclear localization signal regions NLS1 and NLS2. In the local viruses, which were closely related to the JXA1 virus, there were two amino acid differences, at positions 12 and 43 of ORF7. In our tested viruses, positions 12 and 43 were occupied by asparagine and arginine, whereas in the reference viruses (VR2332, Lelystad, and JXA1) both positions were occupied by lysine. These two amino acid differences at positions 12 and 43 indicate that the viruses from North Sumatera are distinctive while being closely related to the highly pathogenic PRRS (HP-PRRS) virus JXA1. The results demonstrated that North Sumatera-type PRRS virus caused PRRS outbreaks in pigs in North Sumatera between 2008 and 2014. The JXA1-like viruses had unique amino acid residues at positions 12 and 43 (asparagine and lysine), and these were genetic determinants distinguishing North Sumatera viruses from other PRRS viruses.
Diversity of Prdm9 Zinc Finger Array in Wild Mice Unravels New Facets of the Evolutionary Turnover of this Coding Minisatellite

PubMed Central

Buard, Jérôme; Rivals, Eric; Dunoyer de Segonzac, Denis; Garres, Charlotte; Caminade, Pierre; de Massy, Bernard; Boursot, Pierre

2014-01-01

In humans and mice, meiotic recombination events cluster into narrow hotspots whose genomic positions are defined by the PRDM9 protein via its DNA binding domain constituted of an array of zinc fingers (ZnFs). High polymorphism and rapid divergence of the Prdm9 gene ZnF domain appear to involve positive selection at DNA-recognition amino-acid positions, but the nature of the underlying evolutionary pressures remains a puzzle. Here we explore the variability of the Prdm9 ZnF array in wild mice, and uncover a high allelic diversity of both ZnF copy number and identity with the characterization of 113 alleles. We analyze features of the diversity of ZnF identity, which is mostly due to non-synonymous changes at codons −1, 3 and 6 of each ZnF, corresponding to amino-acids involved in DNA binding. Using methods adapted to the minisatellite structure of the ZnF array, we infer a phylogenetic tree of these alleles. We find the sister species Mus spicilegus and M. macedonicus as well as the three house mouse (Mus musculus) subspecies to be polyphyletic. However, some sublineages have expanded independently in Mus musculus musculus and M. m. domesticus, the latter further showing phylogeographic substructure. Compared to random genomic regions and non-coding minisatellites, none of these patterns appears exceptional. In silico prediction of DNA binding sites for each allele, overlap of their alignments to the genome and relative coverage of the different families of interspersed repeated elements suggest a large diversity between PRDM9 variants with a potential for highly divergent distributions of recombination events in the genome with little correlation to evolutionary distance. By compiling PRDM9 ZnF protein sequences in Primates, Muridae and Equids, we find different diversity patterns among the three amino-acids most critical for the DNA-recognition function, suggesting different diversification timescales. PMID:24454780
Characterization of a stearoyl-acyl carrier protein desaturase gene family from chocolate tree, Theobroma cacao L.

PubMed

Zhang, Yufan; Maximova, Siela N; Guiltinan, Mark J

2015-01-01

In plants, the conversion of stearoyl-ACP to oleoyl-ACP is catalyzed by a plastid-localized soluble stearoyl-acyl carrier protein (ACP) desaturase (SAD). The activity of SAD significantly impacts the ratio of saturated and unsaturated fatty acids, and is thus a major determinant of fatty acid composition. The cacao genome contains eight putative SAD isoforms with high amino acid sequence similarities and functional domain conservation with SAD genes from other species. Sequence variation in known functional domains between different SAD family members suggested that these eight SAD isoforms might have distinct functions in plant development, a hypothesis supported by their diverse expression patterns in various cacao tissues. Notably, TcSAD1 is universally expressed across all the tissues, and its expression pattern in seeds is highly correlated with the dramatic change in fatty acid composition during seed maturation. Interestingly, TcSAD3 and TcSAD4 appear to be exclusively and highly expressed in flowers, and their functions remain unknown. To test the function of TcSAD1 in vivo, transgenic complementation of the Arabidopsis ssi2 mutant was performed, demonstrating that TcSAD1 successfully rescued all AtSSI2-related phenotypes, further supporting the functional orthology between these two genes. The identification of the major SAD gene responsible for cocoa butter biosynthesis provides new strategies for screening for novel genotypes with desirable fatty acid compositions, and for use in breeding programs to help pyramid genes for quality and other traits such as disease resistance.
Big defensins, a diverse family of antimicrobial peptides that follows different patterns of expression in hemocytes of the oyster Crassostrea gigas.

PubMed

Rosa, Rafael D; Santini, Adrien; Fievet, Julie; Bulet, Philippe; Destoumieux-Garzón, Delphine; Bachère, Evelyne

2011-01-01

Big defensin is an antimicrobial peptide composed of a highly hydrophobic N-terminal region and a cationic C-terminal region containing six cysteine residues involved in three internal disulfide bridges. While big defensin sequences have been reported in various mollusk species, few studies have been devoted to their sequence diversity, gene organization and their expression in response to microbial infections. Using the high-throughput Digital Gene Expression approach, we have identified in Crassostrea gigas oysters several sequences coding for big defensins induced in response to a Vibrio infection. We showed that the oyster big defensin family is composed of three members (named Cg-BigDef1, Cg-BigDef2 and Cg-BigDef3) that are encoded by distinct genomic sequences. All Cg-BigDefs contain a hydrophobic N-terminal domain and a cationic C-terminal domain that resembles vertebrate β-defensins. Both domains are encoded by separate exons. We found that big defensins form a group predominantly present in mollusks and closer to vertebrate defensins than to invertebrate and fungi CSαβ-containing defensins. Moreover, we showed that Cg-BigDefs are expressed in oyster hemocytes only and follow different patterns of gene expression. While Cg-BigDef3 is non-regulated, both Cg-BigDef1 and Cg-BigDef2 transcripts are strongly induced in response to bacterial challenge. Induction was dependent on pathogen associated molecular patterns but not damage-dependent. The inducibility of Cg-BigDef1 was confirmed by HPLC and mass spectrometry, since ions with a molecular mass compatible with mature Cg-BigDef1 (10.7 kDa) were present in immune-challenged oysters only. From our biochemical data, native Cg-BigDef1 would result from the elimination of a prepropeptide sequence and the cyclization of the resulting N-terminal glutamine residue into a pyroglutamic acid. 
We provide here the first report showing that big defensins form a family of antimicrobial peptides diverse not only in terms of sequences but also in terms of genomic organization and regulation of gene expression.
Big Defensins, a Diverse Family of Antimicrobial Peptides That Follows Different Patterns of Expression in Hemocytes of the Oyster Crassostrea gigas

PubMed Central

Rosa, Rafael D.; Santini, Adrien; Fievet, Julie; Bulet, Philippe; Destoumieux-Garzón, Delphine; Bachère, Evelyne

2011-01-01

Background Big defensin is an antimicrobial peptide composed of a highly hydrophobic N-terminal region and a cationic C-terminal region containing six cysteine residues involved in three internal disulfide bridges. While big defensin sequences have been reported in various mollusk species, few studies have been devoted to their sequence diversity, gene organization and their expression in response to microbial infections. Findings Using the high-throughput Digital Gene Expression approach, we have identified in Crassostrea gigas oysters several sequences coding for big defensins induced in response to a Vibrio infection. We showed that the oyster big defensin family is composed of three members (named Cg-BigDef1, Cg-BigDef2 and Cg-BigDef3) that are encoded by distinct genomic sequences. All Cg-BigDefs contain a hydrophobic N-terminal domain and a cationic C-terminal domain that resembles vertebrate β-defensins. Both domains are encoded by separate exons. We found that big defensins form a group predominantly present in mollusks and closer to vertebrate defensins than to invertebrate and fungi CSαβ-containing defensins. Moreover, we showed that Cg-BigDefs are expressed in oyster hemocytes only and follow different patterns of gene expression. While Cg-BigDef3 is non-regulated, both Cg-BigDef1 and Cg-BigDef2 transcripts are strongly induced in response to bacterial challenge. Induction was dependent on pathogen associated molecular patterns but not damage-dependent. The inducibility of Cg-BigDef1 was confirmed by HPLC and mass spectrometry, since ions with a molecular mass compatible with mature Cg-BigDef1 (10.7 kDa) were present in immune-challenged oysters only. From our biochemical data, native Cg-BigDef1 would result from the elimination of a prepropeptide sequence and the cyclization of the resulting N-terminal glutamine residue into a pyroglutamic acid. 
Conclusions We provide here the first report showing that big defensins form a family of antimicrobial peptides diverse not only in terms of sequences but also in terms of genomic organization and regulation of gene expression. PMID:21980497
RubisCO Gene Clusters Found in a Metagenome Microarray from Acid Mine Drainage

PubMed Central

Guo, Xue; Yin, Huaqun; Cong, Jing; Dai, Zhimin; Liang, Yili

2013-01-01

The enzyme responsible for carbon dioxide fixation in the Calvin cycle, ribulose-1,5-bisphosphate carboxylase/oxygenase (RubisCO), is always detected as a phylogenetic marker to analyze the distribution and activity of autotrophic bacteria. However, such an approach provides no indication as to the significance of genomic content and organization. Horizontal transfers of RubisCO genes occurring in eubacteria and plastids may seriously affect the credibility of this approach. Here, we presented a new method to analyze the diversity and genomic content of RubisCO genes in acid mine drainage (AMD). A metagenome microarray containing 7,776 large-insertion fosmids was constructed to quickly screen genome fragments containing RubisCO form I large-subunit genes (cbbL). Forty-six cbbL-containing fosmids were detected, and six fosmids were fully sequenced. To evaluate the reliability of the metagenome microarray and understand the microbial community in AMD, the diversities of cbbL and the 16S rRNA gene were analyzed. Fosmid sequences revealed that the form I RubisCO gene cluster could be subdivided into form IA and IB RubisCO gene clusters in AMD, because of significant divergences in molecular phylogenetics and conservative genomic organization. Interestingly, the form I RubisCO gene cluster coexisted with the form II RubisCO gene cluster in one fosmid genomic fragment. Phylogenetic analyses revealed that horizontal transfers of RubisCO genes may occur widely in AMD, which makes the evolutionary history of RubisCO difficult to reconcile with organismal phylogeny. PMID:23335778
Phylloplane bacteria of Jatropha curcas: diversity, metabolic characteristics, and growth-promoting attributes towards vigor of maize seedling.

PubMed

Dubey, Garima; Kollah, Bharati; Ahirwar, Usha; Mandal, Asit; Thakur, Jyoti Kumar; Patra, Ashok Kumar; Mohanty, Santosh Ranjan

2017-10-01

The complex role of phylloplane microorganisms is less understood than that of rhizospheric microorganisms in lieu of their pivotal role in plant's sustainability. This experiment aims to study the diversity of the culturable phylloplane bacteria of Jatropha curcas and evaluate their growth-promoting activities towards maize seedling vigor. Heterotrophic bacteria were isolated from the phylloplane of J. curcas and their 16S rRNA genes were sequenced. Sequences of the 16S rRNA gene were very similar to those of species belonging to the classes Bacillales (50%), Gammaproteobacteria (21.8%), Betaproteobacteria (15.6%), and Alphaproteobacteria (12.5%). The phylloplane bacteria preferred to utilize alcohol rather than monosaccharides and polysaccharides as a carbon source. Isolates exhibited ACC (1-aminocyclopropane-1-carboxylic acid) deaminase, phosphatase, potassium solubilization, and indole acetic acid (IAA) production activities. The phosphate-solubilizing capacity (mg of PO₄ solubilized by 10⁸ cells) varied from 0.04 to 0.21. The IAA production potential (μg IAA produced by 10⁸ cells in 48 h) of the isolates varied from 0.41 to 9.29. Inoculation of the isolates to maize seed significantly increased shoot and root lengths of maize seedlings. A linear regression model of the plant-growth-promoting activities significantly correlated (p < 0.01) with the growth parameters. Similarly, a correspondence analysis categorized ACC deaminase and IAA production as the major factors contributing 41% and 13.8% variation, respectively, to the growth of maize seedlings.
Evaluation of polymorphisms in pbp4 gene and genetic diversity in penicillin-resistant, ampicillin-susceptible Enterococcus faecalis from hospitals in different states in Brazil.

PubMed

Infante, Victor Hugo Pacagnelli; Conceição, Natália; de Oliveira, Adriana Gonçalves; Darini, Ana Lúcia da Costa

2016-04-01

The aim of the present study was to verify whether penicillin-resistant, ampicillin-susceptible Enterococcus faecalis (PRASEF) occurred in Brazil prior to the beginning of the 21st century, and to verify whether ampicillin susceptibility can predict susceptibility to other β-lactams in E. faecalis with this inconsistent phenotype. The presence of polymorphisms in the pbp4 gene and genetic diversity among the isolates were investigated. Of 21 PRASEF analyzed, 5 (23.8%) and 4 (19.0%) were imipenem and piperacillin resistant simultaneously by disk diffusion and broth dilution respectively, contradicting the current internationally accepted standards of susceptibility testing. Sequencing of pbp4 gene revealed an amino acid substitution (Asp-573→Glu) in all PRASEF isolates but not in the penicillin-susceptible, ampicillin-susceptible E. faecalis. Most PRASEF (90.5%) had related pulsed-field gel electrophoresis profiles, but were different from other PRASEF described to date. Results demonstrate that penicillin-resistant, ampicillin-susceptible phenotype was already a reality in the 1990s in E. faecalis isolates in different Brazilian states, and some of these isolates were also imipenem- and piperacillin-resistant; therefore, internationally accepted susceptibility criteria cannot be applied to these isolates. According to pbp4 gene sequencing, this study suggests that a specific amino acid substitution in pbp4 gene found in all PRASEF analyzed is associated with penicillin resistance. © FEMS 2016. All rights reserved. For permissions, please e-mail: journals.permissions@oup.com.
Genetic analysis of paramyxovirus isolates from pacific salmon reveals two independently co-circulating lineages

USGS Publications Warehouse

Batts, W.N.; Falk, K.; Winton, J.R.

2008-01-01

Viruses with the morphological and biochemical characteristics of the family Paramyxoviridae (paramyxoviruses) have been isolated from adult salmon returning to rivers along the Pacific coast of North America since 1982. These Pacific salmon paramyxoviruses (PSPV), which have mainly been isolated from Chinook salmon Oncorhynchus tshawytscha, grow slowly in established fish cell lines and have not been associated with disease. Genetic analysis of a 505-base-pair region of the polymerase gene from 47 PSPV isolates produced 17 nucleotide sequence types that could be grouped into two major sublineages, designated A and B. The two independently co-circulating sublineages differed by 12.1-13.9% at the nucleotide level but by only 1.2% at the amino acid level. Isolates of PSPV from adult Pacific salmon returning to rivers from Alaska to California over a 25-year period showed little evidence of geographic or temporal grouping. Phylogenetic analyses revealed that these paramyxoviruses of Pacific salmon were most closely related to the Atlantic salmon paramyxovirus (ASPV) from Norway, having a maximum nucleotide diversity of 26.1% and an amino acid diversity of 19.0%. When compared with homologous sequences of other paramyxoviruses, PSPV and ASPV were sufficiently distinct to suggest that they are not clearly members of any of the established genera in the family Paramyxoviridae. In the course of this study, a polymerase chain reaction assay was developed that can be used for confirmatory identification of PSPV. © Copyright by the American Fisheries Society 2008.
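The gap between nucleotide-level and amino-acid-level divergence reported in this record arises because many nucleotide changes are synonymous. The following minimal Python sketch illustrates that arithmetic on two short toy sequences. The sequences and function names are hypothetical and are not taken from the study; the sketch assumes the two sequences are already aligned, in frame, and free of gaps, and it uses the standard genetic code.

```python
from itertools import product

# Standard genetic code, built from the conventional TCAG ordering.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def translate(dna):
    """Translate an in-frame DNA string; trailing partial codons are ignored."""
    usable = len(dna) - len(dna) % 3
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, usable, 3))


def percent_difference(a, b):
    """Percentage of aligned positions at which two equal-length strings differ."""
    if len(a) != len(b):
        raise ValueError("sequences must be aligned to the same length")
    diffs = sum(x != y for x, y in zip(a, b))
    return 100.0 * diffs / len(a)


# Toy example (hypothetical data): three third-position changes, all synonymous.
seq_a = "GCTCTGAAAGGT"  # Ala Leu Lys Gly
seq_b = "GCCCTAAAGGGT"  # Ala Leu Lys Gly

nt_diff = percent_difference(seq_a, seq_b)
aa_diff = percent_difference(translate(seq_a), translate(seq_b))
print(f"nucleotide difference: {nt_diff:.1f}%")
print(f"amino acid difference: {aa_diff:.1f}%")
```

In the toy pair every substitution falls at a third codon position and leaves the encoded residue unchanged, so the nucleotide difference is large while the amino acid difference is zero.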
Analysis of nucleotide diversity among alleles of the major bacterial blight resistance gene Xa27 in cultivars of rice (Oryza sativa) and its wild relatives.

PubMed

Bimolata, Waikhom; Kumar, Anirudh; Sundaram, Raman Meenakshi; Laha, Gouri Shankar; Qureshi, Insaf Ahmed; Reddy, Gajjala Ashok; Ghazi, Irfan Ahmad

2013-08-01

Xa27 is one of the important R-genes, effective against bacterial blight disease of rice caused by Xanthomonas oryzae pv. oryzae (Xoo). Using natural population of Oryza, we analyzed the sequence variation in the functionally important domains of Xa27 across the Oryza species. DNA sequences of Xa27 alleles from 27 rice accessions revealed higher nucleotide diversity among the reported R-genes of rice. Sequence polymorphism analysis revealed synonymous and non-synonymous mutations in addition to a number of InDels in non-coding regions of the gene. High sequence variation was observed in the promoter region including the 5'UTR with 'π' value 0.00916 and 'θw' = 0.01785. Comparative analysis of the identified Xa27 alleles with that of IRBB27 and IR24 indicated the operation of both positive selection (Ka/Ks > 1) and neutral selection (Ka/Ks ≈ 0). The genetic distances of alleles of the gene from Oryza nivara were nearer to IRBB27 as compared to IR24. We also found the presence of conserved and null UPT (upregulated by transcriptional activator) box in the isolated alleles. Considerable amino acid polymorphism was localized in the trans-membrane domain for which the functional significance is yet to be elucidated. However, the absence of functional UPT box in all the alleles except IRBB27 suggests the maintenance of single resistant allele throughout the natural population.
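The two diversity statistics quoted in this record, nucleotide diversity (π) and Watterson's estimator (θw), have standard per-site definitions: π is the mean number of pairwise differences divided by the alignment length, and θw is the number of segregating sites divided by the harmonic number a_n = 1 + 1/2 + ... + 1/(n-1) and by the alignment length. The sketch below computes both on a toy alignment. The data and function names are hypothetical, and the sketch assumes an ungapped alignment of equal-length sequences; it is not the software the authors used.

```python
from itertools import combinations


def _check_alignment(seqs):
    """Require at least two sequences of identical length."""
    if len(seqs) < 2:
        raise ValueError("need at least two sequences")
    if len({len(s) for s in seqs}) != 1:
        raise ValueError("sequences must be aligned to the same length")


def nucleotide_diversity(seqs):
    """Per-site pi: mean pairwise differences divided by alignment length."""
    _check_alignment(seqs)
    length = len(seqs[0])
    pairs = list(combinations(seqs, 2))
    total = sum(sum(x != y for x, y in zip(a, b)) for a, b in pairs)
    return total / (len(pairs) * length)


def watterson_theta(seqs):
    """Per-site theta_w: segregating sites / (a_n * alignment length)."""
    _check_alignment(seqs)
    n = len(seqs)
    length = len(seqs[0])
    segregating = sum(len(set(column)) > 1 for column in zip(*seqs))
    a_n = sum(1.0 / i for i in range(1, n))
    return segregating / (a_n * length)


# Toy alignment (hypothetical data): three sequences, four sites.
alignment = ["AAAA", "AAAT", "AATT"]
pi = nucleotide_diversity(alignment)
theta_w = watterson_theta(alignment)
print(f"pi = {pi:.4f}, theta_w = {theta_w:.4f}")
```

When θw exceeds π, as in the promoter values reported above, the sample contains more segregating sites than the average pairwise difference would suggest, which typically reflects an excess of low-frequency variants.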
Comparison of intrinsic dynamics of cytochrome p450 proteins using normal mode analysis

PubMed Central

Dorner, Mariah E; McMunn, Ryan D; Bartholow, Thomas G; Calhoon, Brecken E; Conlon, Michelle R; Dulli, Jessica M; Fehling, Samuel C; Fisher, Cody R; Hodgson, Shane W; Keenan, Shawn W; Kruger, Alyssa N; Mabin, Justin W; Mazula, Daniel L; Monte, Christopher A; Olthafer, Augustus; Sexton, Ashley E; Soderholm, Beatrice R; Strom, Alexander M; Hati, Sanchita

2015-01-01

Cytochrome P450 enzymes are hemeproteins that catalyze the monooxygenation of a wide-range of structurally diverse substrates of endogenous and exogenous origin. These heme monooxygenases receive electrons from NADH/NADPH via electron transfer proteins. The cytochrome P450 enzymes, which constitute a diverse superfamily of more than 8,700 proteins, share a common tertiary fold but < 25% sequence identity. Based on their electron transfer protein partner, cytochrome P450 proteins are classified into six broad classes. Traditional methods of pro are based on the canonical paradigm that attributes proteins' function to their three-dimensional structure, which is determined by their primary structure that is the amino acid sequence. It is increasingly recognized that protein dynamics play an important role in molecular recognition and catalytic activity. As the mobility of a protein is an intrinsic property that is encrypted in its primary structure, we examined if different classes of cytochrome P450 enzymes display any unique patterns of intrinsic mobility. Normal mode analysis was performed to characterize the intrinsic dynamics of five classes of cytochrome P450 proteins. The present study revealed that cytochrome P450 enzymes share a strong dynamic similarity (root mean squared inner product > 55% and Bhattacharyya coefficient > 80%), despite the low sequence identity (< 25%) and sequence similarity (< 50%) across the cytochrome P450 superfamily. Noticeable differences in Cα atom fluctuations of structural elements responsible for substrate binding were noticed. These differences in residue fluctuations might be crucial for substrate selectivity in these enzymes. PMID:26130403
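The root mean squared inner product (RMSIP) mentioned in this record is a standard measure of overlap between two sets of normal modes: for two sets of k orthonormal mode vectors, it is the square root of the summed squared dot products divided by k. The sketch below shows that formula in plain Python. The function names and toy vectors are hypothetical, the mode vectors are assumed to be orthonormal and of equal dimension, and the Bhattacharyya coefficient (which compares covariance matrices) is not covered here.

```python
import math


def dot(u, v):
    """Dot product of two equal-length vectors."""
    if len(u) != len(v):
        raise ValueError("vectors must have the same dimension")
    return sum(a * b for a, b in zip(u, v))


def rmsip(modes_a, modes_b):
    """Root mean squared inner product of two equally sized mode sets.

    Each argument is a list of orthonormal mode vectors. The result is 1.0
    when the two sets span the same subspace and 0.0 when they are orthogonal.
    """
    if len(modes_a) != len(modes_b) or not modes_a:
        raise ValueError("both sets must contain the same, non-zero number of modes")
    k = len(modes_a)
    total = sum(dot(u, v) ** 2 for u in modes_a for v in modes_b)
    return math.sqrt(total / k)


# Toy three-dimensional "modes" (hypothetical data).
x, y, z = [1.0, 0.0, 0.0], [0.0, 1.0, 0.0], [0.0, 0.0, 1.0]
print(rmsip([x, y], [x, y]))  # identical subspaces
print(rmsip([x, y], [y, z]))  # one shared direction out of two
print(rmsip([x], [z]))        # orthogonal subspaces
```

In practice the mode vectors would come from a normal mode calculation on each protein structure, restricted to a common set of aligned residues so that the vectors have matching dimensions.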
37 CFR 1.822 - Symbols and format to be used for nucleotide and/or amino acid sequence data.

Code of Federal Regulations, 2011 CFR

2011-07-01

... for nucleotide and/or amino acid sequence data. 1.822 Section 1.822 Patents, Trademarks, and... Amino Acid Sequences § 1.822 Symbols and format to be used for nucleotide and/or amino acid sequence data. (a) The symbols and format to be used for nucleotide and/or amino acid sequence data shall...
Flow cytometric monitoring of bacterioplankton phenotypic diversity predicts high population-specific feeding rates by invasive dreissenid mussels.

PubMed

Props, Ruben; Schmidt, Marian L; Heyse, Jasmine; Vanderploeg, Henry A; Boon, Nico; Denef, Vincent J

2018-02-01

Species invasion is an important disturbance to ecosystems worldwide, yet knowledge about the impacts of invasive species on bacterial communities remains sparse. Using a novel approach, we simultaneously detected phenotypic and derived taxonomic change in a natural bacterioplankton community when subjected to feeding pressure by quagga mussels, a widespread aquatic invasive species. We detected a significant decrease in diversity within 1 h of feeding and a total diversity loss of 11.6 ± 4.1% after 3 h. This loss of microbial diversity was caused by the selective removal of high nucleic acid populations (29 ± 5% after 3 h). We were able to track the community diversity at high temporal resolution by calculating phenotypic diversity estimates from flow cytometry (FCM) data of minute amounts of sample. Through parallel FCM and 16S rRNA gene amplicon sequencing analysis of environments spanning a broad diversity range, we showed that the two approaches resulted in highly correlated diversity measures and captured the same seasonal and lake-specific patterns in community composition. Based on our results, we predict that selective feeding by invasive dreissenid mussels directly impacts the microbial component of the carbon cycle, as it may drive bacterioplankton communities toward less diverse and potentially less productive states. © 2017 Society for Applied Microbiology and John Wiley & Sons Ltd.

Metagenomic approaches for direct and cell culture evaluation of the virological quality of wastewater

DOE Office of Scientific and Technical Information (OSTI.GOV)

Aw, Tiong Gim; Howe, Adina; Rose, Joan B.

2014-12-01

Genomic-based molecular techniques are emerging as powerful tools that allow a comprehensive characterization of water and wastewater microbiomes. Most recently, next generation sequencing (NGS) technologies which produce large amounts of sequence data are beginning to impact the field of environmental virology. In this study, NGS and bioinformatics have been employed for the direct detection and characterization of viruses in wastewater and of viruses isolated after cell culture. Viral particles were concentrated and purified from sewage samples by polyethylene glycol precipitation. Viral nucleic acid was extracted and randomly amplified prior to sequencing using Illumina technology, yielding a total of 18 million sequence reads. Most of the viral sequences detected could not be characterized, indicating the great viral diversity that is yet to be discovered. This sewage virome was dominated by bacteriophages and contained sequences related to known human pathogenic viruses such as adenoviruses (species B, C and F), polyomaviruses JC and BK and enteroviruses (type B). An array of other animal viruses was also found, suggesting unknown zoonotic viruses. This study demonstrated the feasibility of metagenomic approaches to characterize viruses in complex environmental water samples.
Adaptive molecular evolution of the two-pore channel 1 gene TPC1 in the karst-adapted genus Primulina (Gesneriaceae)

PubMed Central

Tao, Junjie; Feng, Chao; Ai, Bin; Kang, Ming

2016-01-01

Background and Aims Limestone karst areas possess high floral diversity and endemism. The genus Primulina, which contributes to the unique calcicole flora, has high species richness and exhibit specific soil-based habitat associations that are mainly distributed on calcareous karst soils. The adaptive molecular evolutionary mechanism of the genus to karst calcium-rich environments is still not well understood. The Ca2+-permeable channel TPC1 was used in this study to test whether its gene is involved in the local adaptation of Primulina to karst high-calcium soil environments. Methods Specific amplification and sequencing primers were designed and used to amplify the full-length coding sequences of TPC1 from cDNA of 76 Primulina species. The sequence alignment without recombination and the corresponding reconstructed phylogeny tree were used in molecular evolutionary analyses at the nucleic acid level and amino acid level, respectively. Finally, the identified sites under positive selection were labelled on the predicted secondary structure of TPC1. Key Results Seventy-six full-length coding sequences of Primulina TPC1 were obtained. The length of the sequences varied between 2220 and 2286 bp and the insertion/deletion was located at the 5′ end of the sequences. No signal of substitution saturation was detected in the sequences, while significant recombination breakpoints were detected. The molecular evolutionary analyses showed that TPC1 was dominated by purifying selection and the selective pressures were not significantly different among species lineages. However, significant signals of positive selection were detected at both TPC1 codon level and amino acid level, and five sites under positive selective pressure were identified by at least three different methods. Conclusions The Ca2+-permeable channel TPC1 may be involved in the local adaptation of Primulina to karst Ca2+-rich environments. 
Different species lineages suffered similar selective pressure associated with calcium in karst environments, and episodic diversifying selection at a few sites may play a major role in the molecular evolution of Primulina TPC1. PMID:27582362
Electrostatic study of Alanine mutational effects on transcription: application to GATA-3:DNA interaction complex.

PubMed

El-Assaad, Atlal; Dawy, Zaher; Nemer, Georges

2015-01-01

Protein-DNA interaction is of fundamental importance in molecular biology, playing roles in functions as diverse as DNA transcription, DNA structure formation, and DNA repair. Protein-DNA association is also important in medicine; understanding Protein-DNA binding kinetics can assist in identifying disease root causes which can contribute to drug development. In this perspective, this work focuses on the transcription process by the GATA Transcription Factor (TF). GATA TF binds to DNA promoter region represented by 'G,A,T,A' nucleotide sequence, and initiates transcription of target genes. When proper regulation fails due to some mutations on the GATA TF protein sequence or on the DNA promoter sequence (weak promoter), deregulation of the target genes might lead to various disorders. In this study, we aim to understand the electrostatic mechanism behind GATA TF and DNA promoter interactions, in order to predict Protein-DNA binding in the presence of mutations, while elaborating on non-covalent binding kinetics. To generate a family of mutants for the GATA:DNA complex, we replaced every charged amino acid, one at a time, with a neutral amino acid like Alanine (Ala). We then applied Poisson-Boltzmann electrostatic calculations feeding into free energy calculations, for each mutation. These calculations delineate the contribution to binding from each Ala-replaced amino acid in the GATA:DNA interaction. After analyzing the obtained data in view of a two-step model, we are able to identify potential key amino acids in binding. Finally, we applied the model to GATA-3:DNA (crystal structure with PDB-ID: 3DFV) binding complex and validated it against experimental results from the literature.
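The mutant-generation step described in this record, replacing each charged residue with alanine one at a time, can be sketched at the sequence level as follows. This is only an illustration under stated assumptions: the function name and the toy peptide are hypothetical, the charged set is taken to be Asp, Glu, Lys and Arg (whether histidine counts as charged depends on the protonation model and is left as a parameter), and the structural modeling and Poisson-Boltzmann calculations that the study applies to each mutant are not reproduced here.

```python
# Residues treated as charged in this sketch (assumption; His is excluded by default).
CHARGED = frozenset("DEKR")


def alanine_scan(sequence, charged=CHARGED):
    """Return (label, mutant_sequence) pairs, one per charged residue.

    Labels use the conventional form <original residue><1-based position>A,
    for example "K2A".
    """
    mutants = []
    for index, residue in enumerate(sequence):
        if residue in charged:
            label = f"{residue}{index + 1}A"
            mutant = sequence[:index] + "A" + sequence[index + 1:]
            mutants.append((label, mutant))
    return mutants


# Toy peptide (hypothetical data): two charged residues, so two single mutants.
for label, mutant in alanine_scan("MKDA"):
    print(label, mutant)
```

Each returned sequence differs from the wild type at exactly one position, which is what allows the binding contribution of each charged residue to be assessed separately.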
Genetic diversity and phylogenetic analysis of Aleutian mink disease virus isolates in north-east China.

PubMed

Leng, Xue; Liu, Dongxu; Li, Jianming; Shi, Kun; Zeng, Fanli; Zong, Ying; Liu, Yi; Sun, Zhibo; Zhang, Shanshan; Liu, Yadong; Du, Rui

2018-05-01

Aleutian mink disease is the most important disease in the mink-farming industry worldwide. So far, few large-scale molecular epidemiological studies of AMDV, based on the NS1 and VP2 genes, have been conducted in China. Here, eight new Chinese isolates of AMDV from three provinces in north-east China were analyzed to clarify the molecular epidemiology of AMDV. The seroprevalence of AMDV in north-east China was 41.8% according to counterimmuno-electrophoresis. Genetic variation analysis of the eight isolates showed significant non-synonymous substitutions in the NS1 and VP2 genes, especially in the NS1 gene. All eight isolates included the caspase-recognition sequence NS1:285 (DQTD↓S), but not the caspase recognition sequence NS1:227 (INTD↓S). The LN1 and LN2 strains had a new 10-amino-acid deletion in-between amino acids 28-37, while the JL3 strain had a one-amino-acid deletion at position 28 in the VP2 protein, compared with the AMDV-G strain. Phylogenetic analysis based on most of NS1 (1755 bp) and complete VP2 showed that the AMDV genotypes did not cluster according to their pathogenicity or geographic origin. Local and imported AMDV species are all prevalent in mink-farming populations in the north-east of China. This is the first study to report the molecular epidemiology of AMDV in north-east China based on most of NS1 and the complete VP2, and further provides information about polyG deletions and new variations in the amino acid sequences of NS1 and VP2 proteins. This report is a good foundation for further study of AMDV in China.
Reductionist Approach in Peptide-Based Nanotechnology.

PubMed

Gazit, Ehud

2018-06-20

The formation of ordered nanostructures by molecular self-assembly of proteins and peptides represents one of the principal directions in nanotechnology. Indeed, polyamides provide superior features as materials with diverse physical properties. A reductionist approach allowed the identification of extremely short peptide sequences, as short as dipeptides, which could form well-ordered amyloid-like β-sheet-rich assemblies comparable to supramolecular structures made of much larger proteins. Some of the peptide assemblies show remarkable mechanical, optical, and electrical characteristics. Another direction of reductionism utilized a natural noncoded amino acid, α-aminoisobutyric acid, to form short superhelical assemblies. The use of this exceptional helix inducer motif allowed the fabrication of single heptad repeats used in various biointerfaces, including their use as surfactants and DNA-binding agents. Two additional directions of the reductionist approach include the use of peptide nucleic acids (PNAs) and coassembly techniques. The diversified accomplishments of the reductionist approach, as well as the exciting future advances it bears, are discussed.
Identification of a novel nidovirus in an outbreak of fatal respiratory disease in ball pythons (Python regius).

PubMed

Uccellini, Lorenzo; Ossiboff, Robert J; de Matos, Ricardo E C; Morrisey, James K; Petrosov, Alexandra; Navarrete-Macias, Isamara; Jain, Komal; Hicks, Allison L; Buckles, Elizabeth L; Tokarz, Rafal; McAloose, Denise; Lipkin, Walter Ian

2014-08-08

Respiratory infections are important causes of morbidity and mortality in reptiles; however, the causative agents are only infrequently identified. Pneumonia, tracheitis and esophagitis were reported in a collection of ball pythons (Python regius). Eight of 12 snakes had evidence of bacterial pneumonia. High-throughput sequencing of total extracted nucleic acids from lung, esophagus and spleen revealed a novel nidovirus. PCR indicated the presence of viral RNA in lung, trachea, esophagus, liver, and spleen. In situ hybridization confirmed the presence of intracellular, intracytoplasmic viral nucleic acids in the lungs of infected snakes. Phylogenetic analysis based on a 1,136 amino acid segment of the polyprotein suggests that this virus may represent a new species in the subfamily Torovirinae. This report of a novel nidovirus in ball pythons may provide insight into the pathogenesis of respiratory disease in this species and enhances our knowledge of the diversity of nidoviruses.
Multistep divergent synthesis of benzimidazole linked benzoxazole/benzothiazole via copper catalyzed domino annulation.

PubMed

Liao, Jen-Yu; Selvaraju, Manikandan; Chen, Chih-Hau; Sun, Chung-Ming

2013-04-21

An efficient, facile synthesis of structurally diverse benzimidazole integrated benzoxazole and benzothiazoles has been developed. In a multi-step synthetic sequence, 4-fluoro-3-nitrobenzoic acid was converted into benzimidazole bis-heterocycles, via the intermediacy of benzimidazole linked ortho-chloro amines. The amphiphilic reactivity of this intermediate was designed to achieve the title compounds by the reaction of various acid chlorides and isothiocyanates in a single step through the in situ formation of ortho-chloro anilides and thioureas under microwave irradiation. A versatile one pot domino annulation reaction was developed to involve the reaction of benzimidazole linked ortho-chloro amines with acid chlorides and isothiocyanates. The initial acylation and urea formation followed by copper catalyzed intramolecular C-O and C-S cross coupling reactions furnished the angularly oriented bis-heterocycles which bear a close resemblance to the Streptomyces antibiotic UK-1.
Improved Modeling of Side-Chain–Base Interactions and Plasticity in Protein–DNA Interface Design

PubMed Central

Thyme, Summer B.; Baker, David; Bradley, Philip

2012-01-01

Combinatorial sequence optimization for protein design requires libraries of discrete side-chain conformations. The discreteness of these libraries is problematic, particularly for long, polar side chains, since favorable interactions can be missed. Previously, an approach to loop remodeling where protein backbone movement is directed by side-chain rotamers predicted to form interactions previously observed in native complexes (termed “motifs”) was described. Here, we show how such motif libraries can be incorporated into combinatorial sequence optimization protocols and improve native complex recapitulation. Guided by the motif rotamer searches, we made improvements to the underlying energy function, increasing recapitulation of native interactions. To further test the methods, we carried out a comprehensive experimental scan of amino acid preferences in the I-AniI protein–DNA interface and found that many positions tolerated multiple amino acids. This sequence plasticity is not observed in the computational results because of the fixed-backbone approximation of the model. We improved modeling of this diversity by introducing DNA flexibility and reducing the convergence of the simulated annealing algorithm that drives the design process. In addition to serving as a benchmark, this extensive experimental data set provides insight into the types of interactions essential to maintain the function of this potential gene therapy reagent. PMID:22426128
Improved modeling of side-chain--base interactions and plasticity in protein--DNA interface design.

PubMed

Thyme, Summer B; Baker, David; Bradley, Philip

2012-06-08

Combinatorial sequence optimization for protein design requires libraries of discrete side-chain conformations. The discreteness of these libraries is problematic, particularly for long, polar side chains, since favorable interactions can be missed. Previously, an approach to loop remodeling where protein backbone movement is directed by side-chain rotamers predicted to form interactions previously observed in native complexes (termed "motifs") was described. Here, we show how such motif libraries can be incorporated into combinatorial sequence optimization protocols and improve native complex recapitulation. Guided by the motif rotamer searches, we made improvements to the underlying energy function, increasing recapitulation of native interactions. To further test the methods, we carried out a comprehensive experimental scan of amino acid preferences in the I-AniI protein-DNA interface and found that many positions tolerated multiple amino acids. This sequence plasticity is not observed in the computational results because of the fixed-backbone approximation of the model. We improved modeling of this diversity by introducing DNA flexibility and reducing the convergence of the simulated annealing algorithm that drives the design process. In addition to serving as a benchmark, this extensive experimental data set provides insight into the types of interactions essential to maintain the function of this potential gene therapy reagent. Published by Elsevier Ltd.
Transcriptomic analysis of rice aleurone cells identified a novel abscisic acid response element.

PubMed

Watanabe, Kenneth A; Homayouni, Arielle; Gu, Lingkun; Huang, Kuan-Ying; Ho, Tuan-Hua David; Shen, Qingxi J

2017-09-01

Seeds serve as a great model to study plant responses to drought stress, which is largely mediated by abscisic acid (ABA). The ABA responsive element (ABRE) is a key cis-regulatory element in ABA signalling. However, its consensus sequence (ACGTG(G/T)C) is present in the promoters of only about 40% of ABA-induced genes in rice aleurone cells, suggesting other ABREs may exist. To identify novel ABREs, RNA sequencing was performed on aleurone cells of rice seeds treated with 20 μM ABA. Gibbs sampling was used to identify enriched elements, and particle bombardment-mediated transient expression studies were performed to verify the function. Gene ontology analysis was performed to predict the roles of genes containing the novel ABREs. This study revealed 2443 ABA-inducible genes and a novel ABRE, designated as ABREN, which was experimentally verified to mediate ABA signalling in rice aleurone cells. Many of the ABREN-containing genes are predicted to be involved in stress responses and transcription. Analysis of other species suggests that the ABREN may be monocot specific. This study also revealed interesting expression patterns of genes involved in ABA metabolism and signalling. Collectively, this study advanced our understanding of diverse cis-regulatory sequences and the transcriptomes underlying ABA responses in rice aleurone cells. © 2017 John Wiley & Sons Ltd.
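The ABRE consensus quoted in this record, ACGTG(G/T)C, maps directly onto a regular expression, so checking whether a promoter carries the element can be sketched as a two-strand motif scan. The code below is an illustration only: the function names and test sequences are hypothetical, the input is assumed to contain only A, C, G and T, and this simple exact-match scan is not the Gibbs sampling procedure the authors used to discover the novel element.

```python
import re

# ACGTG(G/T)C as a regex; the lookahead lets overlapping matches be reported.
ABRE = re.compile(r"(?=(ACGTG[GT]C))")

_COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(seq):
    """Reverse complement of an uppercase A/C/G/T string."""
    return seq.translate(_COMPLEMENT)[::-1]


def find_abre(promoter):
    """Return (strand, start, match) tuples for consensus ABRE hits.

    Start positions are 0-based. For the "-" strand they refer to the
    reverse-complemented sequence, not to the original orientation.
    """
    promoter = promoter.upper()
    hits = []
    for strand, seq in (("+", promoter), ("-", reverse_complement(promoter))):
        for match in ABRE.finditer(seq):
            hits.append((strand, match.start(), match.group(1)))
    return hits


# Toy promoter fragments (hypothetical data).
print(find_abre("TTACGTGTCAA"))  # forward-strand hit
print(find_abre("GCCACGT"))      # hit only on the reverse strand
print(find_abre("AAAA"))         # no hit
```

Applied across the promoters of all ABA-induced genes, a scan of this kind gives the fraction of promoters containing the consensus element, which is the sort of figure the abstract cites as about 40%.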
Illumina sequencing-based analyses of bacterial communities during short-chain fatty-acid production from food waste and sewage sludge fermentation at different pH values.

PubMed

Cheng, Weixiao; Chen, Hong; Yan, ShuHai; Su, Jianqiang

2014-09-01

Short-chain fatty acids (SCFAs) can be produced by primary and waste activated sludge anaerobic fermentation. The yield and product spectrum distribution of SCFAs can be significantly affected by different initial pH values. However, most studies have focused on the physical and chemical aspects of SCFA production by waste activated sludge fermentation at different pH values. Information on the bacterial community structures during acidogenic fermentation is limited. In this study, comparisons of the bacterial communities during the co-substrate fermentation of food wastes and sewage sludge at different pH values were performed using the barcoded Illumina paired-end sequencing method. The results showed that different pH environments harbored a characteristic bacterial community, including sequences related to Lactobacillus, Prevotella, Mitsuokella, Treponema, Clostridium, and Ureibacillus. The most abundant bacterial operational taxonomic units in the different pH environments were those related to carbohydrate-degrading bacteria, which are associated with constituents of co-substrate fermentation. Further analyses showed that during organic matter fermentation, a core microbiota composed of Firmicutes, Proteobacteria, and Bacteroidetes existed. Comparison analyses revealed that the bacterial community during fermentation was significantly affected by the pH, and that the diverse product distribution was related to the shift in bacterial communities.
Insights from the Metagenome of an Acid Salt Lake: The Role of Biology in an Extreme Depositional Environment

PubMed Central

Johnson, Sarah Stewart; Chevrette, Marc Gerard; Ehlmann, Bethany L.; Benison, Kathleen Counter

2015-01-01

The extremely acidic brine lakes of the Yilgarn Craton of Western Australia are home to some of the most biologically challenging waters on Earth. In this study, we employed metagenomic shotgun sequencing to generate a microbial profile of the depositional environment associated with the sulfur-rich sediments of one such lake. Of the 1.5 M high-quality reads generated, 0.25 M were mapped to protein features, which in turn provide new insights into the metabolic function of this community. In particular, 45 diverse genes associated with sulfur metabolism were identified, the majority of which were linked to either the conversion of sulfate to adenylylsulfate and the subsequent production of sulfide from sulfite or the oxidation of sulfide, elemental sulfur, and thiosulfate via the sulfur oxidation (Sox) system. This is the first metagenomic study of an acidic, hypersaline depositional environment, and we present evidence for a surprisingly high level of microbial diversity. Our findings also illuminate the possibility that we may be meaningfully underestimating the effects of biology on the chemistry of these sulfur-rich sediments, thereby influencing our understanding of past geobiological conditions that may have been present on Earth as well as early Mars. PMID:25923206
Solid phase sequencing of double-stranded nucleic acids

DOEpatents

Fu, Dong-Jing; Cantor, Charles R.; Koster, Hubert; Smith, Cassandra L.

2002-01-01

This invention relates to methods for detecting and sequencing target double-stranded nucleic acid sequences, to nucleic acid probes and arrays of probes useful in these methods, and to kits and systems which contain these probes. Useful methods involve hybridizing the nucleic acids or nucleic acids which represent complementary or homologous sequences of the target to an array of nucleic acid probes. These probes comprise a single-stranded portion, an optional double-stranded portion and a variable sequence within the single-stranded portion. The molecular weights of the hybridized nucleic acids of the set can be determined by mass spectroscopy, and the sequence of the target determined from the molecular weights of the fragments. Nucleic acids whose sequences can be determined include nucleic acids in biological samples such as patient biopsies and environmental samples. Probes may be fixed to a solid support such as a hybridization chip to facilitate automated determination of molecular weights and identification of the target sequence.
The LANL hemorrhagic fever virus database, a new platform for analyzing biothreat viruses

PubMed Central

Kuiken, Carla; Thurmond, Jim; Dimitrijevic, Mira; Yoon, Hyejin

2012-01-01

Hemorrhagic fever viruses (HFVs) are a diverse set of over 80 viral species, found in 10 different genera comprising five different families: arena-, bunya-, flavi-, filo- and togaviridae. All these viruses are highly variable and evolve rapidly, making them elusive targets for the immune system and for vaccine and drug design. About 55 000 HFV sequences exist in the public domain today. A central website that provides annotated sequences and analysis tools will be helpful to HFV researchers worldwide. The HFV sequence database collects and stores sequence data and provides a user-friendly search interface and a large number of sequence analysis tools, following the model of the highly regarded and widely used Los Alamos HIV database [Kuiken, C., B. Korber, and R.W. Shafer, HIV sequence databases. AIDS Rev, 2003. 5: p. 52–61]. The database uses an algorithm that aligns each sequence to a species-wide reference sequence. The NCBI RefSeq database [Sayers et al. (2011) Database resources of the National Center for Biotechnology Information. Nucleic Acids Res., 39, D38–D51.] is used for this; if a reference sequence is not available, a Blast search finds the best candidate. Using this method, sequences in each genus can be retrieved pre-aligned. The HFV website can be accessed via http://hfv.lanl.gov. PMID:22064861
Fatty acid-oxidizing consortia along a nutrient gradient in the Florida Everglades.

PubMed

Chauhan, Ashvini; Ogram, Andrew

2006-04-01

The Florida Everglades is one of the largest freshwater marshes in North America and has been subject to eutrophication for decades. A gradient in P concentrations extends for several kilometers into the interior of the northern regions of the marsh, and the structure and function of soil microbial communities vary along the gradient. In this study, stable isotope probing was employed to investigate the fate of carbon from the fermentation products propionate and butyrate in soils from three sites along the nutrient gradient. For propionate microcosms, 16S rRNA gene clone libraries from eutrophic and transition sites were dominated by sequences related to previously described propionate oxidizers, such as Pelotomaculum spp. and Syntrophobacter spp. Significant representation was also observed for sequences related to Smithella propionica, which dismutates propionate to butyrate. Sequences of dominant phylotypes from oligotrophic samples did not cluster with known syntrophs but with sulfate-reducing prokaryotes (SRP) and Pelobacter spp. In butyrate microcosms, sequences clustering with Syntrophospora spp. and Syntrophomonas spp. dominated eutrophic microcosms, and sequences related to Pelospora dominated the transition microcosm. Sequences related to Pelospora spp. and SRP dominated clone libraries from oligotrophic microcosms. Sequences from diverse bacterial phyla and primary fermenters were also present in most libraries. Archaeal sequences from eutrophic microcosms included sequences characteristic of Methanomicrobiaceae, Methanospirillaceae, and Methanosaetaceae. Oligotrophic microcosms were dominated by acetotrophs, including sequences related to Methanosarcina, suggesting accumulation of acetate.
Alteration of Rumen Bacteria and Protozoa Through Grazing Regime as a Tool to Enhance the Bioactive Fatty Acid Content of Bovine Milk

PubMed Central

Bainbridge, Melissa L.; Saldinger, Laurel K.; Barlow, John W.; Alvez, Juan P.; Roman, Joe; Kraft, Jana

2018-01-01

Rumen microorganisms are the origin of many bioactive fatty acids (FA) found in ruminant-derived food products. Differences in plant leaf anatomy and chemical composition between cool- and warm-season pastures may alter rumen microorganisms, potentially enhancing the quantity/profile of bioactive FA available for incorporation into milk. The objective of this study was to identify rumen bacteria and protozoa and their cellular FA when cows grazed a warm-season annual, pearl millet (PM), in comparison to a diverse cool-season pasture (CSP). Individual rumen digesta samples were obtained from five Holstein cows in a repeated measures design with 28-day periods. The treatment sequence was PM, CSP, then PM. Microbial DNA was extracted from rumen digesta and sequence reads were produced with Illumina MiSeq. Fatty acids (FA) were identified in rumen bacteria and protozoa using gas-liquid chromatography/mass spectroscopy. Microbial communities shifted in response to grazing regime. Bacteria of the phylum Bacteroidetes were more abundant during PM than CSP (P < 0.05), while protozoa of the genus Eudiplodinium were more abundant during CSP than PM (P < 0.05). Microbial cellular FA profiles differed between treatments. Bacteria and protozoa from cows grazing CSP contained more n-3 FA (P < 0.001) and vaccenic acid (P < 0.01), but lower proportions of branched-chain FA (P < 0.05). Microbial FA correlated with microbial taxa and levels of vaccenic acid, rumenic acid, and α-linolenic acid in milk. In conclusion, grazing regime can potentially be used to alter microbial communities shifting the FA profile of microbial cells, and subsequently, alter the milk FA profile. PMID:29867815
Alteration of Rumen Bacteria and Protozoa Through Grazing Regime as a Tool to Enhance the Bioactive Fatty Acid Content of Bovine Milk.

PubMed

Bainbridge, Melissa L; Saldinger, Laurel K; Barlow, John W; Alvez, Juan P; Roman, Joe; Kraft, Jana

2018-01-01

Rumen microorganisms are the origin of many bioactive fatty acids (FA) found in ruminant-derived food products. Differences in plant leaf anatomy and chemical composition between cool- and warm-season pastures may alter rumen microorganisms, potentially enhancing the quantity/profile of bioactive FA available for incorporation into milk. The objective of this study was to identify rumen bacteria and protozoa and their cellular FA when cows grazed a warm-season annual, pearl millet (PM), in comparison to a diverse cool-season pasture (CSP). Individual rumen digesta samples were obtained from five Holstein cows in a repeated measures design with 28-day periods. The treatment sequence was PM, CSP, then PM. Microbial DNA was extracted from rumen digesta and sequence reads were produced with Illumina MiSeq. Fatty acids (FA) were identified in rumen bacteria and protozoa using gas-liquid chromatography/mass spectroscopy. Microbial communities shifted in response to grazing regime. Bacteria of the phylum Bacteroidetes were more abundant during PM than CSP (P < 0.05), while protozoa of the genus Eudiplodinium were more abundant during CSP than PM (P < 0.05). Microbial cellular FA profiles differed between treatments. Bacteria and protozoa from cows grazing CSP contained more n-3 FA (P < 0.001) and vaccenic acid (P < 0.01), but lower proportions of branched-chain FA (P < 0.05). Microbial FA correlated with microbial taxa and levels of vaccenic acid, rumenic acid, and α-linolenic acid in milk. In conclusion, grazing regime can potentially be used to alter microbial communities shifting the FA profile of microbial cells, and subsequently, alter the milk FA profile.
Identification and characterization of tandem repeats in exon III of dopamine receptor D4 (DRD4) genes from different mammalian species.

PubMed

Larsen, Svend Arild; Mogensen, Line; Dietz, Rune; Baagøe, Hans Jørgen; Andersen, Mogens; Werge, Thomas; Rasmussen, Henrik Berg

2005-12-01

In this study we have identified and characterized dopamine receptor D4 (DRD4) exon III tandem repeats in 33 publicly available nucleotide sequences from different mammalian species. We found that the tandem repeat in canids could be described in a novel and simple way, namely, as a structure composed of 15- and 12-bp modules. Tandem repeats composed of 18-bp modules were found in sequences from the horse, zebra, onager, donkey, Asiatic bear, polar bear, common raccoon, dolphin, harbor porpoise, and domestic cat. Several of these sequences have been analyzed previously without a tandem repeat being found. In the domestic cow and gray seal we identified tandem repeats composed of 36-bp modules, each consisting of two closely related 18-bp basic units. A tandem repeat consisting of 9-bp modules was identified in sequences from mink and ferret. In the European otter we detected an 18-bp tandem repeat, while a tandem repeat consisting of 27-bp modules was identified in a sequence from European badger. Both these tandem repeats were composed of 9-bp basic units, which were closely related to the 9-bp repeat modules identified in the mink and ferret. Tandem repeats could not be identified in sequences from rodents. All tandem repeats possessed a high GC content with a strong bias for C. On phylogenetic analysis of the tandem repeats, evolutionarily related species clustered into the same groups. The degree of conservation of the tandem repeats varied significantly between species. The deduced amino acid sequences of most of the tandem repeats exhibited a high propensity for disorder. This was also the case with an amino acid sequence of the human DRD4 exon III tandem repeat, which was included in the study for comparative purposes. We identified proline-containing motifs for SH3 and WW domain binding proteins, potential phosphorylation sites, PDZ domain binding motifs, and FHA domain binding motifs in the amino acid sequences of the tandem repeats. The numbers of potential functional sites varied markedly between species. Our observations provide a platform for future studies of the architecture and evolution of the DRD4 exon III tandem repeat, and they suggest that differences in the structure of this tandem repeat contribute to specialization and generation of diversity in receptor function.
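To make the idea of a tandem repeat built from fixed-length modules concrete, here is a minimal Python sketch that reports runs in which a module of a given length is repeated back to back. It finds exact copies only, whereas the repeats described above are made of closely related (imperfect) modules, so this is a simplification and not the authors' method. The 9-bp unit and the flanking bases are hypothetical.

```python
import re


def exact_tandem_repeats(seq, unit_len, min_copies=2):
    """Find runs where a unit of unit_len bases recurs back to back.

    Returns (start, unit, copies) tuples with 0-based start positions.
    Only exact copies are detected; diverged modules are not handled.
    """
    # One captured unit followed by at least (min_copies - 1) identical copies.
    pattern = re.compile(r"([ACGT]{%d})\1{%d,}" % (unit_len, min_copies - 1))
    return [
        (m.start(), m.group(1), len(m.group(0)) // unit_len)
        for m in pattern.finditer(seq.upper())
    ]


# Hypothetical GC-rich 9-bp module repeated three times between short flanks.
unit = "CCGCCCGCG"
seq = "AT" + unit * 3 + "TA"

print(exact_tandem_repeats(seq, 9))
```

Detecting the imperfect modules reported in the abstract would require an approach that tolerates mismatches, for example alignment of candidate units against each other.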
Diversity and Gene Expression of Phosphatase Genes Provide Insight into Soil Phosphorus Dynamics in a New Zealand Managed Grassland

NASA Astrophysics Data System (ADS)

Dunfield, K. E.; Gaiero, J. R.; Condron, L.

2017-12-01

Healthy and diverse communities of soil organisms influence key soil ecosystem services such as carbon sequestration, water quality protection, climate regulation and nutrient cycling. Microbially driven mineralization of organic phosphorus is an important contributor to plant available inorganic orthophosphates. In acidic soils, microbes produce non-specific acid phosphatases (NSAPs) which act on common forms of organic phosphorus (P). Our current understanding of P turnover in soils has been limited by lack of research tools capable of targeting these genes. Thus, we developed a set of oligonucleotide PCR primers that targeted bacteria with the genetic potential for acid phosphatase production. A long-term randomized-block pasture trial was sampled following 22 years of continued aerial biomass removal and retention. Primers were used to target genes encoding alkaline phosphatase (phoD) and the three classes (CAAP, CBAP, CCAP) of non-specific acid phosphatases. PCR amplicons targeting total genes and gene transcripts were sequenced using Illumina MiSeq to understand the diversity of the bacterial phosphatase-producing communities. In general, the majority of operational taxonomic units (OTUs) were shared across both treatments and across metagenomes and transcriptomes. However, analysis of DNA OTUs revealed significantly different communities driven by treatment differences (P < 0.05). Transcript expression was highest in the removed biomass treatment, which corresponded to the reduced Olsen P levels (15 vs. 36 mg kg⁻¹ in the retained treatment). Acid phosphatase activity was measured in all samples and found to be highest in the biomass retained treatment (16.8 vs. 11.4 µmol g⁻¹ dry soil h⁻¹), likely elevated due to plant-derived enzymes; however, it was still correlated with bacterial gene abundances. Overall, the phosphatase-producing microbial communities responded to the effect of consistent P limitation as expected, through alteration in the composition of the community structure and through increased levels of gene expression of the phosphatase genes.
Genetic diversity in the C-terminus of merozoite surface protein 1 among Plasmodium knowlesi isolates from Selangor and Sabah Borneo, Malaysia.

PubMed

Yap, Nan Jiun; Goh, Xiang Ting; Koehler, Anson V; William, Timothy; Yeo, Tsin Wen; Vythilingam, Indra; Gasser, Robin B; Lim, Yvonne A L

2017-10-01

Plasmodium knowlesi, a malaria parasite of macaques, has emerged as an important parasite of humans. Despite the significance of P. knowlesi malaria in parts of Southeast Asia, very little is known about the genetic variation in this parasite. Our aim here was to explore sequence variation in a molecule called the 42 kDa merozoite surface protein-1 (MSP-1), which is found on the surface of blood stages of Plasmodium spp. and plays a key role in erythrocyte invasion. Several studies of P. falciparum have reported that the C-terminus (a 42 kDa fragment) of merozoite surface protein-1 (MSP-1₄₂; consisting of MSP-1₁₉ and MSP-1₃₃) is a potential candidate for a malaria vaccine. However, to date, no study has investigated the sequence diversity of the gene encoding P. knowlesi MSP-1₄₂ (comprising Pk-msp-1₁₉ and Pk-msp-1₃₃) among isolates in Malaysia. The present study explored this aspect. Twelve P. knowlesi isolates were collected from patients from hospitals in Selangor and Sabah Borneo, Malaysia, between 2012 and 2014. The Pk-msp-1₄₂ gene was amplified by PCR and directly sequenced. Haplotype diversity (Hd) and nucleotide diversity (π) were studied among the isolates. There was relatively high genetic variation among P. knowlesi isolates; overall Hd and π were 1±0.034 and 0.01132±0.00124, respectively. A total of nine different haplotypes were related to amino acid alterations at 13 positions, and the Pk-MSP-1₁₉ sequence was found to be more conserved than Pk-msp-1₃₃. We have found evidence for negative selection in Pk-msp-1₄₂ as well as the 33 kDa and 19 kDa fragments by comparing the rate of non-synonymous versus synonymous substitutions. Future investigations should study large numbers of samples from disparate geographical locations to critically assess whether this molecule might be a potential vaccine target for P. knowlesi. Copyright © 2017 Elsevier B.V. All rights reserved.
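For readers unfamiliar with the two summary statistics reported above, the following minimal Python sketch computes haplotype diversity (Hd) and nucleotide diversity (π) from a set of aligned sequences, using Nei's standard estimators: Hd = n/(n−1) · (1 − Σ pᵢ²), and π as the mean number of pairwise differences per site. The four toy sequences are hypothetical and are not data from the study; the abstract does not state which software or estimator the authors used, so this choice is an assumption.

```python
from collections import Counter
from itertools import combinations


def haplotype_diversity(seqs):
    """Nei's haplotype diversity: n/(n-1) * (1 - sum of squared haplotype frequencies)."""
    n = len(seqs)
    freqs = [count / n for count in Counter(seqs).values()]
    return n / (n - 1) * (1 - sum(f * f for f in freqs))


def nucleotide_diversity(seqs):
    """Mean number of differences per site over all pairs of aligned sequences."""
    length = len(seqs[0])
    pairs = list(combinations(seqs, 2))
    diffs = sum(sum(a != b for a, b in zip(s, t)) for s, t in pairs)
    return diffs / (len(pairs) * length)


# Hypothetical aligned sequences of equal length (toy data, not from the study).
seqs = [
    "ACGTACGTAC",
    "ACGTACGTAT",
    "ACGAACGTAC",
    "ACGAACGTAT",
]

print("Hd =", haplotype_diversity(seqs))
print("pi =", nucleotide_diversity(seqs))
```

In this toy set every sequence is a distinct haplotype, so Hd reaches its maximum of 1, while π stays well below 1 because the haplotypes differ at only a few sites.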
