hypervariable tag sequencing: Topics by Science.gov

Sample records for hypervariable tag sequencing

Compression-based distance (CBD): a simple, rapid, and accurate method for microbiota composition comparison

PubMed Central

2013-01-01

Background Perturbations in intestinal microbiota composition have been associated with a variety of gastrointestinal tract-related diseases. The alleviation of symptoms has been achieved using treatments that alter the gastrointestinal tract microbiota toward that of healthy individuals. Identifying differences in microbiota composition through the use of 16S rRNA gene hypervariable tag sequencing has profound health implications. Current computational methods for comparing microbial communities are usually based on multiple alignments and phylogenetic inference, making them time consuming and requiring exceptional expertise and computational resources. As sequencing data rapidly grows in size, simpler analysis methods are needed to meet the growing computational burdens of microbiota comparisons. Thus, we have developed a simple, rapid, and accurate method, independent of multiple alignments and phylogenetic inference, to support microbiota comparisons. Results We create a metric, called compression-based distance (CBD) for quantifying the degree of similarity between microbial communities. CBD uses the repetitive nature of hypervariable tag datasets and well-established compression algorithms to approximate the total information shared between two datasets. Three published microbiota datasets were used as test cases for CBD as an applicable tool. Our study revealed that CBD recaptured 100% of the statistically significant conclusions reported in the previous studies, while achieving a decrease in computational time required when compared to similar tools without expert user intervention. Conclusion CBD provides a simple, rapid, and accurate method for assessing distances between gastrointestinal tract microbiota 16S hypervariable tag datasets. PMID:23617892
Compression-based distance (CBD): a simple, rapid, and accurate method for microbiota composition comparison.

PubMed

Yang, Fang; Chia, Nicholas; White, Bryan A; Schook, Lawrence B

2013-04-23

Perturbations in intestinal microbiota composition have been associated with a variety of gastrointestinal tract-related diseases. The alleviation of symptoms has been achieved using treatments that alter the gastrointestinal tract microbiota toward that of healthy individuals. Identifying differences in microbiota composition through the use of 16S rRNA gene hypervariable tag sequencing has profound health implications. Current computational methods for comparing microbial communities are usually based on multiple alignments and phylogenetic inference, making them time consuming and requiring exceptional expertise and computational resources. As sequencing data rapidly grows in size, simpler analysis methods are needed to meet the growing computational burdens of microbiota comparisons. Thus, we have developed a simple, rapid, and accurate method, independent of multiple alignments and phylogenetic inference, to support microbiota comparisons. We create a metric, called compression-based distance (CBD) for quantifying the degree of similarity between microbial communities. CBD uses the repetitive nature of hypervariable tag datasets and well-established compression algorithms to approximate the total information shared between two datasets. Three published microbiota datasets were used as test cases for CBD as an applicable tool. Our study revealed that CBD recaptured 100% of the statistically significant conclusions reported in the previous studies, while achieving a decrease in computational time required when compared to similar tools without expert user intervention. CBD provides a simple, rapid, and accurate method for assessing distances between gastrointestinal tract microbiota 16S hypervariable tag datasets.
Skin Microbiome Surveys Are Strongly Influenced by Experimental Design.

PubMed

Meisel, Jacquelyn S; Hannigan, Geoffrey D; Tyldsley, Amanda S; SanMiguel, Adam J; Hodkinson, Brendan P; Zheng, Qi; Grice, Elizabeth A

2016-05-01

Culture-independent studies to characterize skin microbiota are increasingly common, due in part to affordable and accessible sequencing and analysis platforms. Compared to culture-based techniques, DNA sequencing of the bacterial 16S ribosomal RNA (rRNA) gene or whole metagenome shotgun (WMS) sequencing provides more precise microbial community characterizations. Most widely used protocols were developed to characterize microbiota of other habitats (i.e., gastrointestinal) and have not been systematically compared for their utility in skin microbiome surveys. Here we establish a resource for the cutaneous research community to guide experimental design in characterizing skin microbiota. We compare two widely sequenced regions of the 16S rRNA gene to WMS sequencing for recapitulating skin microbiome community composition, diversity, and genetic functional enrichment. We show that WMS sequencing most accurately recapitulates microbial communities, but sequencing of hypervariable regions 1-3 of the 16S rRNA gene provides highly similar results. Sequencing of hypervariable region 4 poorly captures skin commensal microbiota, especially Propionibacterium. WMS sequencing, which is resource and cost intensive, provides evidence of a community's functional potential; however, metagenome predictions based on 16S rRNA sequence tags closely approximate WMS genetic functional profiles. This study highlights the importance of experimental design for downstream results in skin microbiome surveys. Copyright © 2016 The Authors. Published by Elsevier Inc. All rights reserved.
Skin microbiome surveys are strongly influenced by experimental design

PubMed Central

Meisel, Jacquelyn S.; Hannigan, Geoffrey D.; Tyldsley, Amanda S.; SanMiguel, Adam J.; Hodkinson, Brendan P.; Zheng, Qi; Grice, Elizabeth A.

2016-01-01

Culture-independent studies to characterize skin microbiota are increasingly common, due in part to affordable and accessible sequencing and analysis platforms. Compared to culture-based techniques, DNA sequencing of the bacterial 16S ribosomal RNA (rRNA) gene or whole metagenome shotgun (WMS) sequencing provide more precise microbial community characterizations. Most widely used protocols were developed to characterize microbiota of other habitats (i.e. gastrointestinal), and have not been systematically compared for their utility in skin microbiome surveys. Here we establish a resource for the cutaneous research community to guide experimental design in characterizing skin microbiota. We compare two widely sequenced regions of the 16S rRNA gene to WMS sequencing for recapitulating skin microbiome community composition, diversity, and genetic functional enrichment. We show that WMS sequencing most accurately recapitulates microbial communities, but sequencing of hypervariable regions 1-3 of the 16S rRNA gene provides highly similar results. Sequencing of hypervariable region 4 poorly captures skin commensal microbiota, especially Propionibacterium. WMS sequencing, which is resource- and cost-intensive, provides evidence of a community’s functional potential; however, metagenome predictions based on 16S rRNA sequence tags closely approximate WMS genetic functional profiles. This work highlights the importance of experimental design for downstream results in skin microbiome surveys. PMID:26829039
Development, characterization and cross species amplification of polymorphic microsatellite markers from expressed sequence tags of turmeric (Curcuma longa L.).

PubMed

Siju, S; Dhanya, K; Syamkumar, S; Sasikumar, B; Sheeja, T E; Bhat, A I; Parthasarathy, V A

2010-02-01

Expressed sequence tags (ESTs) from turmeric (Curcuma longa L.) were used for the screening of type and frequency of Class I (hypervariable) simple sequence repeats (SSRs). A total of 231 microsatellite repeats were detected from 12,593 EST sequences of turmeric after redundancy elimination. The average density of Class I SSRs accounts to one SSR per 17.96 kb of EST. Mononucleotides were the most abundant class of microsatellite repeat in turmeric ESTs followed by trinucleotides. A robust set of 17 polymorphic EST-SSRs were developed and used for evaluating 20 turmeric accessions. The number of alleles detected ranged from 3 to 8 per loci. The developed markers were also evaluated in 13 related species of C. longa confirming high rate (100%) of cross species transferability. The polymorphic microsatellite markers generated from this study could be used for genetic diversity analysis and resolving the taxonomic confusion prevailing in the genus.
Improved serial analysis of V1 ribosomal sequence tags (SARST-V1) provides a rapid, comprehensive, sequence-based characterization of bacterial diversity and community composition.

PubMed

Yu, Zhongtang; Yu, Marie; Morrison, Mark

2006-04-01

Serial analysis of ribosomal sequence tags (SARST) is a recently developed technology that can generate large 16S rRNA gene (rrs) sequence data sets from microbiomes, but there are numerous enzymatic and purification steps required to construct the ribosomal sequence tag (RST) clone libraries. We report here an improved SARST method, which still targets the V1 hypervariable region of rrs genes, but reduces the number of enzymes, oligonucleotides, reagents, and technical steps needed to produce the RST clone libraries. The new method, hereafter referred to as SARST-V1, was used to examine the eubacterial diversity present in community DNA recovered from the microbiome resident in the ovine rumen. The 190 sequenced clones contained 1055 RSTs and no less than 236 unique phylotypes (based on > or = 95% sequence identity) that were assigned to eight different eubacterial phyla. Rarefaction and monomolecular curve analyses predicted that the complete RST clone library contains 99% of the 353 unique phylotypes predicted to exist in this microbiome. When compared with ribosomal intergenic spacer analysis (RISA) of the same community DNA sample, as well as a compilation of nine previously published conventional rrs clone libraries prepared from the same type of samples, the RST clone library provided a more comprehensive characterization of the eubacterial diversity present in rumen microbiomes. As such, SARST-V1 should be a useful tool applicable to comprehensive examination of diversity and composition in microbiomes and offers an affordable, sequence-based method for diversity analysis.
IM-TORNADO: a tool for comparison of 16S reads from paired-end libraries.

PubMed

Jeraldo, Patricio; Kalari, Krishna; Chen, Xianfeng; Bhavsar, Jaysheel; Mangalam, Ashutosh; White, Bryan; Nelson, Heidi; Kocher, Jean-Pierre; Chia, Nicholas

2014-01-01

16S rDNA hypervariable tag sequencing has become the de facto method for accessing microbial diversity. Illumina paired-end sequencing, which produces two separate reads for each DNA fragment, has become the platform of choice for this application. However, when the two reads do not overlap, existing computational pipelines analyze data from read separately and underutilize the information contained in the paired-end reads. We created a workflow known as Illinois Mayo Taxon Organization from RNA Dataset Operations (IM-TORNADO) for processing non-overlapping reads while retaining maximal information content. Using synthetic mock datasets, we show that the use of both reads produced answers with greater correlation to those from full length 16S rDNA when looking at taxonomy, phylogeny, and beta-diversity. IM-TORNADO is freely available at http://sourceforge.net/projects/imtornado and produces BIOM format output for cross compatibility with other pipelines such as QIIME, mothur, and phyloseq.
Coastal bacterioplankton community diversity along a latitudinal gradient in Latin America by means of V6 tag pyrosequencing.

PubMed

Thompson, Fabiano L; Bruce, Thiago; Gonzalez, Alessandra; Cardoso, Alexander; Clementino, Maysa; Costagliola, Marcela; Hozbor, Constanza; Otero, Ernesto; Piccini, Claudia; Peressutti, Silvia; Schmieder, Robert; Edwards, Robert; Smith, Mathew; Takiyama, Luis Roberto; Vieira, Ricardo; Paranhos, Rodolfo; Artigas, Luis Felipe

2011-02-01

The bacterioplankton diversity of coastal waters along a latitudinal gradient between Puerto Rico and Argentina was analyzed using a total of 134,197 high-quality sequences from the V6 hypervariable region of the small-subunit ribosomal RNA gene (16S rRNA) (mean length of 60 nt). Most of the OTUs were identified into Proteobacteria, Bacteriodetes, Cyanobacteria, and Actinobacteria, corresponding to approx. 80% of the total number of sequences. The number of OTUs corresponding to species varied between 937 and 1946 in the seven locations. Proteobacteria appeared at high frequency in the seven locations. An enrichment of Cyanobacteria was observed in Puerto Rico, whereas an enrichment of Bacteroidetes was detected in the Argentinian shelf and Uruguayan coastal lagoons. The highest number of sequences of Actinobacteria and Acidobacteria were obtained in the Amazon estuary mouth. The rarefaction curves and Good coverage estimator for species diversity suggested a significant coverage, with values ranging between 92 and 97% for Good coverage. Conserved taxa corresponded to aprox. 52% of all sequences. This study suggests that human-contaminated environments may influence bacterioplankton diversity.
IM-TORNADO: A Tool for Comparison of 16S Reads from Paired-End Libraries

PubMed Central

Jeraldo, Patricio; Kalari, Krishna; Chen, Xianfeng; Bhavsar, Jaysheel; Mangalam, Ashutosh; White, Bryan; Nelson, Heidi; Kocher, Jean-Pierre; Chia, Nicholas

2014-01-01

Motivation 16S rDNA hypervariable tag sequencing has become the de facto method for accessing microbial diversity. Illumina paired-end sequencing, which produces two separate reads for each DNA fragment, has become the platform of choice for this application. However, when the two reads do not overlap, existing computational pipelines analyze data from read separately and underutilize the information contained in the paired-end reads. Results We created a workflow known as Illinois Mayo Taxon Organization from RNA Dataset Operations (IM-TORNADO) for processing non-overlapping reads while retaining maximal information content. Using synthetic mock datasets, we show that the use of both reads produced answers with greater correlation to those from full length 16S rDNA when looking at taxonomy, phylogeny, and beta-diversity. Availability and Implementation IM-TORNADO is freely available at http://sourceforge.net/projects/imtornado and produces BIOM format output for cross compatibility with other pipelines such as QIIME, mothur, and phyloseq. PMID:25506826
Conserved patterns hidden within group A Streptococcus M protein hypervariability recognize human C4b-binding protein

DOE Office of Scientific and Technical Information (OSTI.GOV)

Buffalo, Cosmo Z.; Bahn-Suh, Adrian J.; Hirakis, Sophia P.

No vaccine exists against group A Streptococcus (GAS), a leading cause of worldwide morbidity and mortality. A severe hurdle is the hypervariability of its major antigen, the M protein, with >200 different M types known. Neutralizing antibodies typically recognize M protein hypervariable regions (HVRs) and confer narrow protection. In stark contrast, human C4b-binding protein (C4BP), which is recruited to the GAS surface to block phagocytic killing, interacts with a remarkably large number of M protein HVRs (apparently ~90%). Such broad recognition is rare, and we discovered a unique mechanism for this through the structure determination of four sequence-diverse M proteinsmore » in complexes with C4BP. The structures revealed a uniform and tolerant ‘reading head’ in C4BP, which detected conserved sequence patterns hidden within hypervariability. Our results open up possibilities for rational therapies that target the M–C4BP interaction, and also inform a path towards vaccine design.« less
Small subunit ribosomal RNA genes of tabanids and hippoboscids (Diptera: Brachycera): evolutionary relationships and comparison with other Diptera.

PubMed

Carreno, R A; Barta, J R

1998-11-01

The small subunit ribosomal RNA (SSU rRNA) genes of hippoboscid (Ornithoica vicina Walker) and tabanid (Chrysops niger Macquart) Diptera were sequenced to determine their phylogenetic position within the order and to determine whether or not extensive hypervariable regions in this gene are widespread in the Diptera. A parsimony analysis of an alignment containing 8 dipteran sequences produced a single most parsimonious tree that placed O. vicina as sister group to Drosophila melanogaster Meigen. The tabanid Chrysops niger was sister group to the asilomorphan taxa, and the sister group to the Brachycera was a Tipula sp. although this relationship was not supported by bootstrap analysis. The hippoboscid and tabanid sequences contain extensive hypervariable regions in the V2, V4, V6, and V7 regions as do other Diptera. When these regions of the alignment were excluded from the phylogenetic analysis, a single most parsimonious tree was found. This tree had an identical overall topology to the tree obtained from the total data set. The hypervariable regions in parts of the dipteran SSU rRNA genes were more extensive in the nematocerous dipteran sequences used in this study than in the other dipteran representatives; these hypervariable regions may be of more utility in inferring relationship among species and subspecies than at the suprageneric level.
Revisiting the phylogeny of Zoanthidea (Cnidaria: Anthozoa): Staggered alignment of hypervariable sequences improves species tree inference.

PubMed

Swain, Timothy D

2018-01-01

The recent rapid proliferation of novel taxon identification in the Zoanthidea has been accompanied by a parallel propagation of gene trees as a tool of species discovery, but not a corresponding increase in our understanding of phylogeny. This disparity is caused by the trade-off between the capabilities of automated DNA sequence alignment and data content of genes applied to phylogenetic inference in this group. Conserved genes or segments are easily aligned across the order, but produce poorly resolved trees; hypervariable genes or segments contain the evolutionary signal necessary for resolution and robust support, but sequence alignment is daunting. Staggered alignments are a form of phylogeny-informed sequence alignment composed of a mosaic of local and universal regions that allow phylogenetic inference to be applied to all nucleotides from both hypervariable and conserved gene segments. Comparisons between species tree phylogenies inferred from all data (staggered alignment) and hypervariable-excluded data (standard alignment) demonstrate improved confidence and greater topological agreement with other sources of data for the complete-data tree. This novel phylogeny is the most comprehensive to date (in terms of taxa and data) and can serve as an expandable tool for evolutionary hypothesis testing in the Zoanthidea. Spanish language abstract available in Text S1. Translation by L. O. Swain, DePaul University, Chicago, Illinois, 60604, USA. Copyright © 2017 Elsevier Inc. All rights reserved.
Development of an Analysis Pipeline Characterizing Multiple Hypervariable Regions of 16S rRNA Using Mock Samples.

PubMed

Barb, Jennifer J; Oler, Andrew J; Kim, Hyung-Suk; Chalmers, Natalia; Wallen, Gwenyth R; Cashion, Ann; Munson, Peter J; Ames, Nancy J

2016-01-01

There is much speculation on which hypervariable region provides the highest bacterial specificity in 16S rRNA sequencing. The optimum solution to prevent bias and to obtain a comprehensive view of complex bacterial communities would be to sequence the entire 16S rRNA gene; however, this is not possible with second generation standard library design and short-read next-generation sequencing technology. This paper examines a new process using seven hypervariable or V regions of the 16S rRNA (six amplicons: V2, V3, V4, V6-7, V8, and V9) processed simultaneously on the Ion Torrent Personal Genome Machine (Life Technologies, Grand Island, NY). Four mock samples were amplified using the 16S Ion Metagenomics Kit™ (Life Technologies) and their sequencing data is subjected to a novel analytical pipeline. Results are presented at family and genus level. The Kullback-Leibler divergence (DKL), a measure of the departure of the computed from the nominal bacterial distribution in the mock samples, was used to infer which region performed best at the family and genus levels. Three different hypervariable regions, V2, V4, and V6-7, produced the lowest divergence compared to the known mock sample. The V9 region gave the highest (worst) average DKL while the V4 gave the lowest (best) average DKL. In addition to having a high DKL, the V9 region in both the forward and reverse directions performed the worst finding only 17% and 53% of the known family level and 12% and 47% of the genus level bacteria, while results from the forward and reverse V4 region identified all 17 family level bacteria. The results of our analysis have shown that our sequencing methods using 6 hypervariable regions of the 16S rRNA and subsequent analysis is valid. This method also allowed for the assessment of how well each of the variable regions might perform simultaneously. Our findings will provide the basis for future work intended to assess microbial abundance at different time points throughout a clinical protocol.
Diversity in the 18S SSU rRNA V4 hyper-variable region of Theileria spp. in Cape buffalo (Syncerus caffer) and cattle from southern Africa.

PubMed

Mans, Ben J; Pienaar, Ronel; Latif, Abdalla A; Potgieter, Fred T

2011-05-01

Sequence variation within the 18S SSU rRNA V4 hyper-variable region can affect the accuracy of real-time hybridization probe-based diagnostics for the detection of Theileria spp. infections. This is relevant for assays that use non-specific primers, such as the real-time hybridization assay for T. parva (Sibeko et al. 2008). To assess the effect of sequence variation on this test, the Theileria 18S gene from 62 buffalo and 49 cattle samples was cloned and ∼1000 clones sequenced. Twenty-six genotypes were detected which included known and novel genotypes for the T. buffeli, T. mutans, T. taurotragi and T. velifera clades. A novel genotype related to T. sp. (sable) was also detected in 1 bovine sample. Theileria genotypic diversity was higher in buffalo compared to cattle. Polymorphism within the T. parva hyper-variable region was confirmed by aberrant real-time melting peaks and supported by sequencing of the S5 ribosomal gene. Analysis of the S5 gene suggests that this gene can be a marker for species differentiation. T. parva, T. sp. (buffalo) and T. sp. (bougasvlei) remain the only genotypes amplified by the primer set of the hybridization assay. Therefore, the 18S sequence diversity observed does not seem to affect the current real-time hybridization assay for T. parva.
Molecular characterization of the canine mitochondrial DNA control region for forensic applications.

PubMed

Eichmann, Cordula; Parson, Walther

2007-09-01

The canine mitochondrial DNA (mtDNA) control region of 133 dogs living in the area around Innsbruck, Austria was sequenced. A total of 40 polymorphic sites were observed in the first hypervariable segment and 15 in the second, which resulted in the differentiation of 40 distinct haplotypes. We observed five nucleotide positions that were highly polymorphic within different haplogroups, and they represent good candidates for mtDNA screening. We found five point heteroplasmic positions; all located in HVS-I and a polythymine region in HVS-II, the latter often being associated with length heteroplasmy. In contrast to human mtDNA, the canine control region contains a hypervariable 10 nucleotide repeat region, which is located between the two hypervariable regions. In our population sample, we observed eight different repeat types, which we characterized by direct sequencing and fragment length analysis. The discrimination power of the canine mtDNA control region was 0.93, not taking the polymorphic repeat region into consideration.
Foot-and-mouth disease virus (FMDV) with a stable FLAG epitope in the VP1 G-H loop as a new tool for studying FMDV pathogenesis

USDA-ARS?s Scientific Manuscript database

In this study, we generated a recombinant foot-and-mouth disease virus (FMDV) particle derived from A24 Cruzeiro with a FLAG tag (DYKDDDDK) substitution in the hypervariable antigenic site of the G-H loop of the VP1 capsid protein in an effort to expand the immunogenicity of the virus particle and t...
Biological properties of purified recombinant HCV particles with an epitope-tagged envelope

DOE Office of Scientific and Technical Information (OSTI.GOV)

Takahashi, Hitoshi; Akazawa, Daisuke; Toray Industries, Inc., Kanagawa

2010-05-14

To establish a simple system for purification of recombinant infectious hepatitis C virus (HCV) particles, we designed a chimeric J6/JFH-1 virus with a FLAG (FL)-epitope-tagged sequence at the N-terminal region of the E2 hypervariable region-1 (HVR1) gene (J6/JFH-1/1FL). We found that introduction of an adaptive mutation at the potential N-glycosylation site (E2N151K) leads to efficient production of the chimeric virus. This finding suggests the involvement of glycosylation at Asn within the envelope protein(s) in HCV morphogenesis. To further analyze the biological properties of the purified recombinant HCV particles, we developed a strategy for large-scale production and purification of recombinant J6/JFH-1/1FL/E2N151K.more » Infectious particles were purified from the culture medium of J6/JFH-1/1FL/E2N151K-infected Huh-7 cells using anti-FLAG affinity chromatography in combination with ultrafiltration. Electron microscopy of the purified particles using negative staining showed spherical particle structures with a diameter of 40-60 nm and spike-like projections. Purified HCV particle-immunization induced both an anti-E2 and an anti-FLAG antibody response in immunized mice. This strategy may contribute to future detailed analysis of HCV particle structure and to HCV vaccine development.« less
Identifying protist consumers of photosynthetic picoeukaryotes in the surface ocean using stable isotope probing.

PubMed

Orsi, William D; Wilken, Susanne; Del Campo, Javier; Heger, Thierry; James, Erick; Richards, Thomas A; Keeling, Patrick J; Worden, Alexandra Z; Santoro, Alyson E

2018-02-01

Photosynthetic picoeukaryotes contribute a significant fraction of primary production in the upper ocean. Micromonas pusilla is an ecologically relevant photosynthetic picoeukaryote, abundantly and widely distributed in marine waters. Grazing by protists may control the abundance of picoeukaryotes such as M. pusilla, but the diversity of the responsible grazers is poorly understood. To identify protists consuming photosynthetic picoeukaryotes in a productive North Pacific Ocean region, we amended seawater with living 15 N, 13 C-labelled M. pusilla cells in a 24-h replicated bottle experiment. DNA stable isotope probing, combined with high-throughput sequencing of V4 hypervariable regions from 18S rRNA gene amplicons (Tag-SIP), identified 19 operational taxonomic units (OTUs) of microbial eukaryotes that consumed M. pusilla. These OTUs were distantly related to cultured taxa within the dinoflagellates, ciliates, stramenopiles (MAST-1C and MAST-3 clades) and Telonema flagellates, thus, far known only from their environmental 18S rRNA gene sequences. Our discovery of eukaryotic prey consumption by MAST cells confirms that their trophic role in marine microbial food webs includes grazing upon picoeukaryotes. Our study provides new experimental evidence directly linking the genetic identity of diverse uncultivated microbial eukaryotes to the consumption of picoeukaryotic phytoplankton in the upper ocean. © 2017 Society for Applied Microbiology and John Wiley & Sons Ltd.
Sequence polymorphism data of the hypervariable regions of mitochondrial DNA in the Yadav population of Haryana.

PubMed

Verma, Kapil; Sharma, Sapna; Sharma, Arun; Dalal, Jyoti; Bhardwaj, Tapeshwar

2018-06-01

Genetic variations among humans occur both within and among populations and range from single nucleotide changes to multiple-nucleotide variants. These multiple-nucleotide variants are useful for studying the relationships among individuals or various population groups. The study of human genetic variations can help scientists understand how different population groups are biologically related to one another. Sequence analysis of hypervariable regions of human mitochondrial DNA (mtDNA) has been successfully used for the genetic characterization of different population groups for forensic purposes. It is well established that different ethnic or population groups differ significantly in their mtDNA distributions. In the last decade, very little research has been conducted on mtDNA variations in the Indian population, although such data would be useful for elucidating the history of human population expansion across the world. Moreover, forensic studies on mtDNA variations in the Indian subcontinent are also scarce, particularly in the northern part of India. In this report, variations in the hypervariable regions of mtDNA were analyzed in the Yadav population of Haryana. Different molecular diversity indices were computed. Further, the obtained haplotypes were classified into different haplogroups and the phylogenetic relationship between different haplogroups was inferred.
Multiplexed Microsphere Suspension-Array Assay for Urine Mitochondrial DNA Typing by C-Stretch Length in Hypervariable Regions.

PubMed

Aoki, Kimiko; Tanaka, Hiroyuki; Kawahara, Takashi

2018-07-01

The standard method for personal identification and verification of urine samples in doping control is short tandem repeat (STR) analysis using nuclear DNA (nDNA). The DNA concentration of urine is very low and decreases under most conditions used for sample storage; therefore, the amount of DNA from cryopreserved urine samples may be insufficient for STR analysis. We aimed to establish a multiplexed assay for urine mitochondrial DNA typing containing only trace amounts of DNA, particularly for Japanese populations. A multiplexed suspension-array assay using oligo-tagged microspheres (Luminex MagPlex-TAG) was developed to measure C-stretch length in hypervariable region 1 (HV1) and 2 (HV2), five single nucleotide polymorphisms (SNPs), and one polymorphic indel. Based on these SNPs and the indel, the Japanese population can be classified into five major haplogroups (D4, B, M7a, A, D5). The assay was applied to DNA samples from urine cryopreserved for 1 - 1.5 years (n = 63) and fresh blood (n = 150). The assay with blood DNA enabled Japanese subjects to be categorized into 62 types, exhibiting a discriminatory power of 0.960. The detection limit for cryopreserved urine was 0.005 ng of nDNA. Profiling of blood and urine pairs revealed that 5 of 63 pairs showed different C-stretch patterns in HV1 or HV2. The assay described here yields valuable information in terms of the verification of urine sample sources employing only trace amounts of recovered DNA. However, blood cannot be used as a reference sample.

Ecology of the microbiome of the infected root canal system: a comparison between apical and coronal root segments

PubMed Central

Özok, A.R.; Persoon, I.F.; Huse, S.M.; Keijser, B.J.F.; Wesselink, P.R.; Crielaard, W.; Zaura, E.

2016-01-01

Aim To evaluate the microbial ecology of the coronal and apical segments of infected root canal systems using a complete sampling technique and next-generation sequencing. Methodology The roots of 23 extracted teeth with apical periodontitis were sectioned in half, horizontally, and cryo-pulverized. Bacterial communities were profiled using tagged 454 pyrosequencing of the 16S rDNA hypervariable V5–V6 region. Results The sequences were classified into 606 taxa (species or higher taxon), representing 24 bacterial phyla or candidate divisions and one archaeal phylum. Proteobacteria were more abundant in the apical samples (p<0.05), while Actinobacteria were in significantly higher proportions in the coronal samples. The apical samples harbored statistically significantly more taxa than the coronal samples (p=0.01), and showed a higher microbial diversity. Several taxa belonging to fastidious obligate anaerobes were significantly more abundant in the apical segments of the roots compared to their coronal counterparts. Conclusions Endodontic infections are more complex than reported previously. The apical part of the root canal system drives the selection of a more diverse and more anaerobe community than the coronal part. The presence of a distinct ecological niche in the apical region explains the difficulty of eradication of the infection, and emphasizes the need that new treatment approaches should be developed. PMID:22251411
Cadaver Thanatomicrobiome Signatures: The Ubiquitous Nature of Clostridium Species in Human Decomposition.

PubMed

Javan, Gulnaz T; Finley, Sheree J; Smith, Tasia; Miller, Joselyn; Wilkinson, Jeremy E

2017-01-01

Human thanatomicrobiome studies have established that an abundant number of putrefactive bacteria within internal organs of decaying bodies are obligate anaerobes, Clostridium spp. These microorganisms have been implicated as etiological agents in potentially life-threatening infections; notwithstanding, the scale and trajectory of these microbes after death have not been elucidated. We performed phylogenetic surveys of thanatomicrobiome signatures of cadavers' internal organs to compare the microbial diversity between the 16S rRNA gene V4 hypervariable region and V3-4 conjoined regions from livers and spleens of 45 cadavers undergoing forensic microbiological studies. Phylogenetic analyses of 16S rRNA gene sequences revealed that the V4 region had a significantly higher mean Chao1 richness within the total microbiome data. Permutational multivariate analysis of variance statistical tests, based on unweighted UniFrac distances, demonstrated that taxa compositions were significantly different between V4 and V3-4 hypervariable regions ( p < 0.001). Of note, we present the first study, using the largest cohort of criminal cases to date, that two hypervariable regions show discriminatory power for human postmortem microbial diversity. In conclusion, here we propose the impact of hypervariable region selection for the 16S rRNA gene in differentiating thanatomicrobiomic profiles to provide empirical data to explain a unique concept, the Postmortem Clostridium Effect.
Multiplex analysis of DNA

DOEpatents

Church, George M.; Kieffer-Higgins, Stephen

1992-01-01

This invention features vectors and a method for sequencing DNA. The method includes the steps of: a) ligating the DNA into a vector comprising a tag sequence, the tag sequence includes at least 15 bases, wherein the tag sequence will not hybridize to the DNA under stringent hybridization conditions and is unique in the vector, to form a hybrid vector, b) treating the hybrid vector in a plurality of vessels to produce fragments comprising the tag sequence, wherein the fragments differ in length and terminate at a fixed known base or bases, wherein the fixed known base or bases differs in each vessel, c) separating the fragments from each vessel according to their size, d) hybridizing the fragments with an oligonucleotide able to hybridize specifically with the tag sequence, and e) detecting the pattern of hybridization of the tag sequence, wherein the pattern reflects the nucleotide sequence of the DNA.
Highly diverse microbiota in dental root canals in cases of apical periodontitis (data of illumina sequencing).

PubMed

Vengerfeldt, Veiko; Špilka, Katerina; Saag, Mare; Preem, Jens-Konrad; Oopkaup, Kristjan; Truu, Jaak; Mändar, Reet

2014-11-01

Chronic apical periodontitis (CAP) is a frequent condition that has a considerable effect on a patient's quality of life. We aimed to reveal root canal microbial communities in antibiotic-naive patients by applying Illumina sequencing (Illumina Inc, San Diego, CA). Samples were collected under strict aseptic conditions from 12 teeth (5 with primary CAP, 3 with secondary CAP, and 4 with a periapical abscess [PA]) and characterized by profiling the microbial community on the basis of the V6 hypervariable region of the 16S ribosomal RNA gene by using Illumina HiSeq2000 sequencing combinatorial sequence-tagged polymerase chain reaction products. Root canal specimens displayed highly polymicrobial communities in all 3 patient groups. One sample contained 5-8 (mean = 6.5) phyla of bacteria. The most numerous were Firmicutes and Bacteroidetes, but Actinobacteria, Fusobacteria, Proteobacteria, Spirochaetes, Tenericutes, and Synergistetes were also present in most of the patients. One sample contained 30-70 different operational taxonomic units; the mean (± standard deviation) was lower in the primary CAP group (36 ± 4) than in the PA (45 ± 4) and secondary CAP (43 ± 13) groups (P < .05). The communities were individually different, but anaerobic bacteria predominated as the rule. Enterococcus faecalis was found only in patients with secondary CAP. One PA sample displayed a significantly high proportion (47%) of Proteobacteria, mainly at the expense of Janthinobacterium lividum. This study provided an in-depth characterization of the microbiota of periapical tissues, revealing highly polymicrobial communities and minor differences between the study groups. A full understanding of the etiology of periodontal disease will only be possible through further in-depth systems-level analyses of the host-microbiome interaction. Copyright © 2014 American Association of Endodontists. Published by Elsevier Inc. All rights reserved.
Maize Endophytic Bacterial Diversity as Affected by Soil Cultivation History.

PubMed

Correa-Galeote, David; Bedmar, Eulogio J; Arone, Gregorio J

2018-01-01

The bacterial endophytic communities residing within roots of maize ( Zea mays L.) plants cultivated by a sustainable management in soils from the Quechua maize belt (Peruvian Andes) were examined using tags pyrosequencing spanning the V4 and V5 hypervariable regions of the 16S rRNA. Across four replicate libraries, two corresponding to sequences of endophytic bacteria from long time maize-cultivated soils and the other two obtained from fallow soils, 793 bacterial sequences were found that grouped into 188 bacterial operational taxonomic units (OTUs, 97% genetic similarity). The numbers of OTUs in the libraries from the maize-cultivated soils were significantly higher than those found in the libraries from fallow soils. A mean of 30 genera were found in the fallow soil libraries and 47 were in those from the maize-cultivated soils. Both alpha and beta diversity indexes showed clear differences between bacterial endophytic populations from plants with different soil cultivation history and that the soils cultivated for long time requires a higher diversity of endophytes. The number of sequences corresponding to main genera Sphingomonas, Herbaspirillum, Bradyrhizobium and Methylophilus in the maize-cultivated libraries were statistically more abundant than those from the fallow soils. Sequences of genera Dyella and Sreptococcus were significantly more abundant in the libraries from the fallow soils. Relative abundance of genera Burkholderia, candidatus Glomeribacter, Staphylococcus, Variovorax, Bacillus and Chitinophaga were similar among libraries. A canonical correspondence analysis of the relative abundance of the main genera showed that the four libraries distributed in two clearly separated groups. Our results suggest that cultivation history is an important driver of endophytic colonization of maize and that after a long time of cultivation of the soil the maize plants need to increase the richness of the bacterial endophytes communities.
Maize Endophytic Bacterial Diversity as Affected by Soil Cultivation History

PubMed Central

Correa-Galeote, David; Bedmar, Eulogio J.; Arone, Gregorio J.

2018-01-01

The bacterial endophytic communities residing within roots of maize (Zea mays L.) plants cultivated by a sustainable management in soils from the Quechua maize belt (Peruvian Andes) were examined using tags pyrosequencing spanning the V4 and V5 hypervariable regions of the 16S rRNA. Across four replicate libraries, two corresponding to sequences of endophytic bacteria from long time maize-cultivated soils and the other two obtained from fallow soils, 793 bacterial sequences were found that grouped into 188 bacterial operational taxonomic units (OTUs, 97% genetic similarity). The numbers of OTUs in the libraries from the maize-cultivated soils were significantly higher than those found in the libraries from fallow soils. A mean of 30 genera were found in the fallow soil libraries and 47 were in those from the maize-cultivated soils. Both alpha and beta diversity indexes showed clear differences between bacterial endophytic populations from plants with different soil cultivation history and that the soils cultivated for long time requires a higher diversity of endophytes. The number of sequences corresponding to main genera Sphingomonas, Herbaspirillum, Bradyrhizobium and Methylophilus in the maize-cultivated libraries were statistically more abundant than those from the fallow soils. Sequences of genera Dyella and Sreptococcus were significantly more abundant in the libraries from the fallow soils. Relative abundance of genera Burkholderia, candidatus Glomeribacter, Staphylococcus, Variovorax, Bacillus and Chitinophaga were similar among libraries. A canonical correspondence analysis of the relative abundance of the main genera showed that the four libraries distributed in two clearly separated groups. Our results suggest that cultivation history is an important driver of endophytic colonization of maize and that after a long time of cultivation of the soil the maize plants need to increase the richness of the bacterial endophytes communities. PMID:29662471
Application safety evaluation of the radio frequency identification tag under magnetic resonance imaging.

PubMed

Fei, Xiaolu; Li, Shanshan; Gao, Shan; Wei, Lan; Wang, Lihong

2014-09-04

Radio Frequency Identification(RFID) has been widely used in healthcare facilities, but it has been paid little attention whether RFID applications are safe enough under healthcare environment. The purpose of this study is to assess the effects of RFID tags on Magnetic Resonance (MR) imaging in a typical electromagnetic environment in hospitals, and to evaluate the safety of their applications. A Magphan phantom was used to simulate the imaging objects, while active RFID tags were placed at different distances (0, 4, 8, 10 cm) from the phantom border. The phantom was scanned by using three typical sequences including spin-echo (SE) sequence, gradient-echo (GRE) sequence and inversion-recovery (IR) sequence. The quality of the image was quantitatively evaluated by using signal-to-noise ratio (SNR), uniformity, high-contrast resolution, and geometric distortion. RFID tags were read by an RFID reader to calculate their usable rate. RFID tags can be read properly after being placed in high magnetic field for up to 30 minutes. SNR: There were no differences between the group with RFID tags and the group without RFID tags using SE and IR sequence, but it was lower when using GRE sequence.Uniformity: There was a significant difference between the group with RFID tags and the group without RFID tags using SE and GRE sequence. Geometric distortion and high-contrast resolution: There were no obvious differences found. Active RFID tags can affect MR imaging quality, especially using the GRE sequence. Increasing the distance from the RFID tags to the imaging objects can reduce that influence. When the distance was longer than 8 cm, MR imaging quality were almost unaffected. However, the Gradient Echo related sequence is not recommended when patients wear a RFID wristband.
The assembly of the plant urease activation complex and the essential role of the urease accessory protein G (UreG) in delivery of nickel to urease.

PubMed

Myrach, Till; Zhu, Anting; Witte, Claus-Peter

2017-09-01

Urease is a ubiquitous nickel metalloenzyme. In plants, its activation requires three urease accessory proteins (UAPs), UreD, UreF, and UreG. In bacteria, the UAPs interact with urease and facilitate activation, which involves the channeling of two nickel ions into the active site. So far this process has not been investigated in eukaryotes. Using affinity pulldowns of Strep-tagged UAPs from Arabidopsis and rice transiently expressed in planta , we demonstrate that a urease-UreD-UreF-UreG complex exists in plants and show its stepwise assembly. UreG is crucial for nickel delivery because UreG-dependent urease activation in vitro was observed only with UreG obtained from nickel-sufficient plants. This activation competence could not be generated in vitro by incubation of UreG with nickel, bicarbonate, and GTP. Compared with their bacterial orthologs, plant UreGs possess an N-terminal extension containing a His- and Asp/Glu-rich hypervariable region followed by a highly conserved sequence comprising two potential H X H metal-binding sites. Complementing the ureG-1 mutant of Arabidopsis with N-terminal deletion variants of UreG demonstrated that the hypervariable region has a minor impact on activation efficiency, whereas the conserved region up to the first H X H motif is highly beneficial and up to the second H X H motif strictly required for activation. We also show that urease reaches its full activity several days after nickel becomes available in the leaves, indicating that urease activation is limited by nickel accessibility in vivo Our data uncover the crucial role of UreG for nickel delivery during eukaryotic urease activation, inciting further investigations of the details of this process. © 2017 by The American Society for Biochemistry and Molecular Biology, Inc.
Phylogenetic analysis of Sicilian goats reveals a new mtDNA lineage.

PubMed

Sardina, M T; Ballester, M; Marmi, J; Finocchiaro, R; van Kaam, J B C H M; Portolano, B; Folch, J M

2006-08-01

The mitochondrial hypervariable region 1 (HVR1) sequence of 67 goats belonging to the Girgentana, Maltese and Derivata di Siria breeds was partially sequenced in order to present the first phylogenetic characterization of Sicilian goat breeds. These sequences were compared with published sequences of Indian and Pakistani domestic goats and wild goats. Mitochondrial lineage A was observed in most of the Sicilian goats. However, three Girgentana haplotypes were highly divergent from the Capra hircus clade, indicating that a new mtDNA lineage in domestic goats was found.
Bacterial diversity and community composition from seasurface to subseafloor.

PubMed

Walsh, Emily A; Kirkpatrick, John B; Rutherford, Scott D; Smith, David C; Sogin, Mitchell; D'Hondt, Steven

2016-04-01

We investigated compositional relationships between bacterial communities in the water column and those in deep-sea sediment at three environmentally distinct Pacific sites (two in the Equatorial Pacific and one in the North Pacific Gyre). Through pyrosequencing of the v4-v6 hypervariable regions of the 16S ribosomal RNA gene, we characterized 450,104 pyrotags representing 29,814 operational taxonomic units (OTUs, 97% similarity). Hierarchical clustering and non-metric multidimensional scaling partition the samples into four broad groups, regardless of geographic location: a photic-zone community, a subphotic community, a shallow sedimentary community and a subseafloor sedimentary community (⩾1.5 meters below seafloor). Abundance-weighted community compositions of water-column samples exhibit a similar trend with depth at all sites, with successive epipelagic, mesopelagic, bathypelagic and abyssopelagic communities. Taxonomic richness is generally highest in the water-column O2 minimum zone and lowest in the subseafloor sediment. OTUs represented by abundant tags in the subseafloor sediment are often present but represented by few tags in the water column, and represented by moderately abundant tags in the shallow sediment. In contrast, OTUs represented by abundant tags in the water are generally absent from the subseafloor sediment. These results are consistent with (i) dispersal of marine sedimentary bacteria via the ocean, and (ii) selection of the subseafloor sedimentary community from within the community present in shallow sediment.
Structural insights into the evolution of a sexy protein: novel topology and restricted backbone flexibility in a hypervariable pheromone from the red-legged salamander, Plethodon shermani.

PubMed

Wilburn, Damien B; Bowen, Kathleen E; Doty, Kari A; Arumugam, Sengodagounder; Lane, Andrew N; Feldhoff, Pamela W; Feldhoff, Richard C

2014-01-01

In response to pervasive sexual selection, protein sex pheromones often display rapid mutation and accelerated evolution of corresponding gene sequences. For proteins, the general dogma is that structure is maintained even as sequence or function may rapidly change. This phenomenon is well exemplified by the three-finger protein (TFP) superfamily: a diverse class of vertebrate proteins co-opted for many biological functions - such as components of snake venoms, regulators of the complement system, and coordinators of amphibian limb regeneration. All of the >200 structurally characterized TFPs adopt the namesake "three-finger" topology. In male red-legged salamanders, the TFP pheromone Plethodontid Modulating Factor (PMF) is a hypervariable protein such that, through extensive gene duplication and pervasive sexual selection, individual male salamanders express more than 30 unique isoforms. However, it remained unclear how this accelerated evolution affected the protein structure of PMF. Using LC/MS-MS and multidimensional NMR, we report the 3D structure of the most abundant PMF isoform, PMF-G. The high resolution structural ensemble revealed a highly modified TFP structure, including a unique disulfide bonding pattern and loss of secondary structure, that define a novel protein topology with greater backbone flexibility in the third peptide finger. Sequence comparison, models of molecular evolution, and homology modeling together support that this flexible third finger is the most rapidly evolving segment of PMF. Combined with PMF sequence hypervariability, this structural flexibility may enhance the plasticity of PMF as a chemical signal by permitting potentially thousands of structural conformers. We propose that the flexible third finger plays a critical role in PMF:receptor interactions. As female receptors co-evolve, this flexibility may allow PMF to still bind its receptor(s) without the immediate need for complementary mutations. Consequently, this unique adaptation may establish new paradigms for how receptor:ligand pairs co-evolve, in particular with respect to sexual conflict.
No evidence of radiation effect on mutation rates at hypervariable minisatellite loci in the germ cells of atomic bomb survivors.

PubMed

Kodaira, Mieko; Izumi, Shizue; Takahashi, Norio; Nakamura, Nori

2004-10-01

Human minisatellites consist of tandem arrays of short repeat sequences, and some are highly polymorphic in numbers of repeats among individuals. Since these loci mutate much more frequently than coding sequences, they make attractive markers for screening populations for genetic effects of mutagenic agents. Here we report the results of our analysis of mutations at eight hypervariable minisatellite loci in the offspring (61 from exposed families in 60 of which only one parent was exposed, and 58 from unexposed parents) of atomic bomb survivors with mean doses of >1 Sv. We found 44 mutations in paternal alleles and eight mutations in maternal alleles with no indication that the high doses of acutely applied radiation had caused significant genetic effects. Our finding contrasts with those of some other studies in which much lower radiation doses, applied chronically, caused significantly increased mutation rates. Possible reasons for this discrepancy are discussed.
Variable Copy Number, Intra-Genomic Heterogeneities and Lateral Transfers of the 16S rRNA Gene in Pseudomonas

PubMed Central

Bodilis, Josselin; Nsigue-Meilo, Sandrine; Besaury, Ludovic; Quillet, Laurent

2012-01-01

Even though the 16S rRNA gene is the most commonly used taxonomic marker in microbial ecology, its poor resolution is still not fully understood at the intra-genus level. In this work, the number of rRNA gene operons, intra-genomic heterogeneities and lateral transfers were investigated at a fine-scale resolution, throughout the Pseudomonas genus. In addition to nineteen sequenced Pseudomonas strains, we determined the 16S rRNA copy number in four other Pseudomonas strains by Southern hybridization and Pulsed-Field Gel Electrophoresis, and studied the intra-genomic heterogeneities by Denaturing Gradient Gel Electrophoresis and sequencing. Although the variable copy number (from four to seven) seems to be correlated with the evolutionary distance, some close strains in the P. fluorescens lineage showed a different number of 16S rRNA genes, whereas all the strains in the P. aeruginosa lineage displayed the same number of genes (four copies). Further study of the intra-genomic heterogeneities revealed that most of the Pseudomonas strains (15 out of 19 strains) had at least two different 16S rRNA alleles. A great difference (5 or 19 nucleotides, essentially grouped near the V1 hypervariable region) was observed only in two sequenced strains. In one of our strains studied (MFY30 strain), we found a difference of 12 nucleotides (grouped in the V3 hypervariable region) between copies of the 16S rRNA gene. Finally, occurrence of partial lateral transfers of the 16S rRNA gene was further investigated in 1803 full-length sequences of Pseudomonas available in the databases. Remarkably, we found that the two most variable regions (the V1 and V3 hypervariable regions) had probably been laterally transferred from another evolutionary distant Pseudomonas strain for at least 48.3 and 41.6% of the 16S rRNA sequences, respectively. In conclusion, we strongly recommend removing these regions of the 16S rRNA gene during the intra-genus diversity studies. PMID:22545126
Pyrosequencing-Derived Bacterial, Archaeal, and Fungal Diversity of Spacecraft Hardware Destined for Mars

PubMed Central

Vaishampayan, Parag; Nilsson, Henrik R.; Torok, Tamas; Venkateswaran, Kasthuri

2012-01-01

Spacecraft hardware and assembly cleanroom surfaces (233 m2 in total) were sampled, total genomic DNA was extracted, hypervariable regions of the 16S rRNA gene (bacteria and archaea) and ribosomal internal transcribed spacer (ITS) region (fungi) were subjected to 454 tag-encoded pyrosequencing PCR amplification, and 203,852 resulting high-quality sequences were analyzed. Bioinformatic analyses revealed correlations between operational taxonomic unit (OTU) abundance and certain sample characteristics, such as source (cleanroom floor, ground support equipment [GSE], or spacecraft hardware), cleaning regimen applied, and location about the facility or spacecraft. National Aeronautics and Space Administration (NASA) cleanroom floor and GSE surfaces gave rise to a larger number of diverse bacterial communities (619 OTU; 20 m2) than colocated spacecraft hardware (187 OTU; 162 m2). In contrast to the results of bacterial pyrosequencing, where at least some sequences were generated from each of the 31 sample sets examined, only 13 and 18 of these sample sets gave rise to archaeal and fungal sequences, respectively. As was the case for bacteria, the abundance of fungal OTU in the GSE surface samples dramatically diminished (9× less) once cleaning protocols had been applied. The presence of OTU representative of actinobacteria, deinococci, acidobacteria, firmicutes, and proteobacteria on spacecraft surfaces suggests that certain bacterial lineages persist even following rigorous quality control and cleaning practices. The majority of bacterial OTU observed as being recurrent belonged to actinobacteria and alphaproteobacteria, supporting the hypothesis that the measures of cleanliness exerted in spacecraft assembly cleanrooms (SAC) inadvertently select for the organisms which are the most fit to survive long journeys in space. PMID:22729532
Diversity and population structure of sewage derived microorganisms in wastewater treatment plant influent

PubMed Central

McLellan, S.L.; Huse, S.M.; Mueller-Spitz, S.R.; Andreishcheva, E.N.; Sogin, M.L.

2009-01-01

The release of untreated sewage introduces non-indigenous microbial populations of uncertain composition into surface waters. We used massively parallel 454 sequencing of hypervariable regions in rRNA genes to profile microbial communities from eight untreated sewage influent samples of two wastewater treatment plants (WWTP) in metropolitan Milwaukee. The sewage profiles included a discernable human fecal signature made up of several taxonomic groups including multiple Bifidobacteriaceae, Coriobacteriaceae, Bacteroidaceae, Lachnospiraceae, and Ruminococcaceae genera. The fecal signature made up a small fraction of the taxa present in sewage but the relative abundance of these sequence tags mirrored the population structures of human fecal samples. These genera were much more prevalent in the sewage influent than standard indicators species. High-abundance sequences from taxonomic groups within the Beta- and Gammaproteobacteria dominated the sewage samples but occurred at very low levels in fecal and surface water samples, suggesting that these organisms proliferate within the sewer system. Samples from Jones Island (JI – servicing residential plus a combined sewer system) and South Shore (SS – servicing a residential area) WWTPs had very consistent community profiles, with greater similarity between WWTPs on a given collection day than the same plant collected on different days. Rainfall increased influent flows at SS and JI WWTPs, and this corresponded to greater diversity in the community at both plants. Overall, the sewer system appears to be a defined environment with both infiltration of rainwater and stormwater inputs modulating community composition. Microbial sewage communities represent a combination of inputs from human fecal microbes and enrichment of specific microbes from the environment to form a unique population structure. PMID:19840106
Real-time single-molecule electronic DNA sequencing by synthesis using polymer-tagged nucleotides on a nanopore array

PubMed Central

Fuller, Carl W.; Kumar, Shiv; Porel, Mintu; Chien, Minchen; Bibillo, Arek; Stranges, P. Benjamin; Dorwart, Michael; Tao, Chuanjuan; Li, Zengmin; Guo, Wenjing; Shi, Shundi; Korenblum, Daniel; Trans, Andrew; Aguirre, Anne; Liu, Edward; Harada, Eric T.; Pollard, James; Bhat, Ashwini; Cech, Cynthia; Yang, Alexander; Arnold, Cleoma; Palla, Mirkó; Hovis, Jennifer; Chen, Roger; Morozova, Irina; Kalachikov, Sergey; Russo, James J.; Kasianowicz, John J.; Davis, Randy; Roever, Stefan; Church, George M.; Ju, Jingyue

2016-01-01

DNA sequencing by synthesis (SBS) offers a robust platform to decipher nucleic acid sequences. Recently, we reported a single-molecule nanopore-based SBS strategy that accurately distinguishes four bases by electronically detecting and differentiating four different polymer tags attached to the 5′-phosphate of the nucleotides during their incorporation into a growing DNA strand catalyzed by DNA polymerase. Further developing this approach, we report here the use of nucleotides tagged at the terminal phosphate with oligonucleotide-based polymers to perform nanopore SBS on an α-hemolysin nanopore array platform. We designed and synthesized several polymer-tagged nucleotides using tags that produce different electrical current blockade levels and verified they are active substrates for DNA polymerase. A highly processive DNA polymerase was conjugated to the nanopore, and the conjugates were complexed with primer/template DNA and inserted into lipid bilayers over individually addressable electrodes of the nanopore chip. When an incoming complementary-tagged nucleotide forms a tight ternary complex with the primer/template and polymerase, the tag enters the pore, and the current blockade level is measured. The levels displayed by the four nucleotides tagged with four different polymers captured in the nanopore in such ternary complexes were clearly distinguishable and sequence-specific, enabling continuous sequence determination during the polymerase reaction. Thus, real-time single-molecule electronic DNA sequencing data with single-base resolution were obtained. The use of these polymer-tagged nucleotides, combined with polymerase tethering to nanopores and multiplexed nanopore sensors, should lead to new high-throughput sequencing methods. PMID:27091962
DAMe: a toolkit for the initial processing of datasets with PCR replicates of double-tagged amplicons for DNA metabarcoding analyses.

PubMed

Zepeda-Mendoza, Marie Lisandra; Bohmann, Kristine; Carmona Baez, Aldo; Gilbert, M Thomas P

2016-05-03

DNA metabarcoding is an approach for identifying multiple taxa in an environmental sample using specific genetic loci and taxa-specific primers. When combined with high-throughput sequencing it enables the taxonomic characterization of large numbers of samples in a relatively time- and cost-efficient manner. One recent laboratory development is the addition of 5'-nucleotide tags to both primers producing double-tagged amplicons and the use of multiple PCR replicates to filter erroneous sequences. However, there is currently no available toolkit for the straightforward analysis of datasets produced in this way. We present DAMe, a toolkit for the processing of datasets generated by double-tagged amplicons from multiple PCR replicates derived from an unlimited number of samples. Specifically, DAMe can be used to (i) sort amplicons by tag combination, (ii) evaluate PCR replicates dissimilarity, and (iii) filter sequences derived from sequencing/PCR errors, chimeras, and contamination. This is attained by calculating the following parameters: (i) sequence content similarity between the PCR replicates from each sample, (ii) reproducibility of each unique sequence across the PCR replicates, and (iii) copy number of the unique sequences in each PCR replicate. We showcase the insights that can be obtained using DAMe prior to taxonomic assignment, by applying it to two real datasets that vary in their complexity regarding number of samples, sequencing libraries, PCR replicates, and used tag combinations. Finally, we use a third mock dataset to demonstrate the impact and importance of filtering the sequences with DAMe. DAMe allows the user-friendly manipulation of amplicons derived from multiple samples with PCR replicates built in a single or multiple sequencing libraries. It allows the user to: (i) collapse amplicons into unique sequences and sort them by tag combination while retaining the sample identifier and copy number information, (ii) identify sequences carrying unused tag combinations, (iii) evaluate the comparability of PCR replicates of the same sample, and (iv) filter tagged amplicons from a number of PCR replicates using parameters of minimum length, copy number, and reproducibility across the PCR replicates. This enables an efficient analysis of complex datasets, and ultimately increases the ease of handling datasets from large-scale studies.
Positive selection of digestive Cys proteases in herbivorous Coleoptera.

PubMed

Vorster, Juan; Rasoolizadeh, Asieh; Goulet, Marie-Claire; Cloutier, Conrad; Sainsbury, Frank; Michaud, Dominique

2015-10-01

Positive selection is thought to contribute to the functional diversification of insect-inducible protease inhibitors in plants in response to selective pressures exerted by the digestive proteases of their herbivorous enemies. Here we assessed whether a reciprocal evolutionary process takes place on the insect side, and whether ingestion of a positively selected plant inhibitor may translate into a measurable rebalancing of midgut proteases in vivo. Midgut Cys proteases of herbivorous Coleoptera, including the major pest Colorado potato beetle (Leptinotarsa decemlineata), were first compared using a codon-based evolutionary model to look for the occurrence of hypervariable, positively selected amino acid sites among the tested sequences. Hypervariable sites were found, distributed within -or close to- amino acid regions interacting with Cys-type inhibitors of the plant cystatin protein family. A close examination of L. decemlineata sequences indicated a link between their assignment to protease functional families and amino acid identity at positively selected sites. A function-diversifying role for positive selection was further suggested empirically by in vitro protease assays and a shotgun proteomic analysis of L. decemlineata Cys proteases showing a differential rebalancing of protease functional family complements in larvae fed single variants of a model cystatin mutated at positively selected amino acid sites. These data confirm overall the occurrence of hypervariable, positively selected amino acid sites in herbivorous Coleoptera digestive Cys proteases. They also support the idea of an adaptive role for positive selection, useful to generate functionally diverse proteases in insect herbivores ingesting functionally diverse, rapidly evolving dietary cystatins. Copyright © 2015 Elsevier Ltd. All rights reserved.
The hypervariable region 1 protein of hepatitis C virus broadly reactive with sera of patients with chronic hepatitis C has a similar amino acid sequence with the consensus sequence.

PubMed

Watanabe, K; Yoshioka, K; Ito, H; Ishigami, M; Takagi, K; Utsunomiya, S; Kobayashi, M; Kishimoto, H; Yano, M; Kakumu, S

1999-11-10

Hypervariable region 1 (HVR1) proteins of hepatitis C virus (HCV) have been reported to react broadly with sera of patients with HCV infection. However, the variability of the broad reactivity of individual HVR1 proteins has not been elucidated. We assessed the reactivity of 25 different HVR1 proteins (genotype 1b) with sera of 81 patients with HCV infection (genotype 1b) by Western blot. HVR1 proteins reacted with 2-60 sera. The number of sera reactive with each HVR1 protein significantly correlated with the number of amino acid residues identical to the consensus sequence defined by Puntoriero et al. (G. Puntoriero, A. Lahm, S. Zucchelli, B. B. Ercole, R. Tafi, M. Penzzanera, M. U. Mondelli, R. Cortese, A. Tramontano, G. Galfre', and A. Nicosia. 1998. EMBO J. 17, 3521-3533. ) (r = 0.561, P < 0.005). The most widely reactive HVR1 protein, 12-22, had a sequence similar to the consensus sequence. The peptide with C-terminal 13-amino-acids sequence of HVR1 protein 12-22 (NH2-CSFTSLFTPGPSQK) was injected into rabbits as an immunogen. The rabbit immune sera reacted with 9 of 25 HVR1 proteins of genotype 1b including HVR1 protein 12-22 and with 3 of 12 proteins of genotype 2a. These results indicate that the HVR1 protein broadly reactive with patients' sera has a sequence similar to the consensus sequence, can induce broadly reactive sera, and could be one of the candidate immunogens in a prophylactic vaccine against HCV. Copyright 1999 Academic Press.
Pyrosequencing as a tool for the identification of common isolates of Mycobacterium sp.

PubMed

Tuohy, Marion J; Hall, Gerri S; Sholtis, Mary; Procop, Gary W

2005-04-01

Pyrosequencing technology, sequencing by addition, was evaluated for categorization of mycobacterial isolates. One hundred and eighty-nine isolates, including 18 ATCC and Trudeau Mycobacterial Culture Collection (TMC) strains, were studied. There were 38 Mycobacterium tuberculosis complex, 27 M. kansasii, 27 MAI complex, 21 M. marinum, 14 M. gordonae, 20 M. chelonae-abscessus group, 10 M. fortuitum, 5 M. xenopi, 3 M. celatum, 2 M. terrae complex, 20 M. mucogenicum, and 2 M. scrofulaceum. Nucleic acid extracts were prepared from solid media or MGIT broth. Traditional PCR was performed with one of the primers biotinylated; the assay targeted a portion of the 16S rRNA gene that contains a hypervariable region, which has been previously shown to be useful for the identification of mycobacteria. The PSQ Sample Preparation Kit was used, and the biotinylated PCR product was processed to a single-stranded DNA template. The sequencing primer was hybridized to the DNA template in a PSQ96 plate. Incorporation of the complementary nucleotides resulted in light generation peaks, forming a pyrogram, which was evaluated by the instrument software. Thirty basepairs were used for isolate categorization. Manual interpretation of the sequences was performed if the quality of the 30-bp sequence was in doubt or if more than 4 bp homopolymers were recognized. Sequences with more than 5 bp of bad quality were deemed unacceptable. When blasted against GenBank, 179 of 189 sequences (94.7%) assigned isolates to the correct molecular genus or group. Ten M. gordonae isolates had more than 5 bp of bad quality sequence and were not accepted. Pyrosequencing of this hypervariable region afforded rapid and acceptable characterization of common, routinely isolated clinical Mycobacterium sp. Algorithms are recommended for further differentiation with an additional sequencing primer or additional biochemicals.

Role of Sequencing the Measles Virus Hemagglutinin Gene and Hypervariable Region in the Measles Outbreak Investigations in Sweden During 2013-2014.

PubMed

Harvala, Heli; Wiman, Åsa; Wallensten, Anders; Zakikhany, Katherina; Englund, Hélène; Brytting, Maria

2016-02-15

It is increasingly difficult to differentiate measles viruses (MeVs) relating to certain outbreaks on the basis of the nucleoprotein (N) gene sequence only, as the diversity of circulating MeV strains has decreased. We studied genomic regions that could provide better molecular discrimination between epidemiologically linked and unlinked MeV variants identified in Sweden during 2013-2014. The hemagglutinin (H) gene and hypervariable region between the fusion and matrix genes (MF-HVR) from 53 MeV-positive samples were amplified and sequenced. Data on phylogenetic clustering of MeVs on the basis of N, H, and MF-HVR sequences were compared to epidemiological data. MeVs were genotyped: 27 were B3, and 26 were D8. One genotype B3 cluster based on the N gene sequence contained epidemiologically unrelated viruses from 4 outbreaks, whereas analysis of H and MF-HVR sequences separated them into phylogenetic clusters consistent with the epidemiological data. Similarly, the single cluster of viruses with a genotype D8 N gene could be divided into the 5 outbreak groups on the basis of the phylogeny of MF-HVR sequences. A detailed picture of MeV circulation with more-defined links between outbreaks was obtained by sequencing the H gene and MF-HVR. Further identification and better genetic characterization of MeVs internationally is essential in identifying sources and routes of MeV spread within and beyond Europe in the elimination end game. © The Author 2015. Published by Oxford University Press for the Infectious Diseases Society of America. All rights reserved. For permissions, e-mail journals.permissions@oup.com.
Hypervariable minisatellites: recombinators or innocent bystanders?

PubMed

Jarman, A P; Wells, R A

1989-11-01

It has become apparent in recent years that unexpectedly large numbers of minisatellites exist within the eukaryotic genome. Their use in genetics is well known, but as with any new class of sequence, there is also much speculation about their involvement in a range of biological processes. How much is known of their biology?
A polymorphic and hypervariable locus in the pseudoautosomal region of the CBA/H mouse sex chromosomes

DOE Office of Scientific and Technical Information (OSTI.GOV)

Fennelly, J.; Laval, S.; Wright, E.

1996-04-01

We have identified a genomic locus (DXYH1) that is polymorphic and hypervariable within the CBA/H colony. Using a panel of C57BL/6 x Mus spretus backcross offspring, it was mapped to the distal end of the X chromosome. Pseudoautosomal inheritance was demonstrated through three generations of CBA/H x CBA/H and CBA/H x C57BL/6 crosses and confirmed through linkage to the Sxr locus in X/Y Sxr x 3H1 crosses. Meiotic recombination frequencies place DXYH1 {approximately}28% into the pseudoautosomal region from the boundary. The de novo generation of CBA/H variant DXYH1 restriction fragment length polymorphisms during spermatogenesis is suggestive of the germline instabilitymore » associated with hypermutable human minisatellites. The absence of DXY1-related sequences in Mus spretus provides DNA sequence evidence to support the observed failure of X-Y pairing during meiosis and consequent hybrid infertility in C57BL/6 x Mus spretus male F1 offspring. 19 refs., 4 figs.« less
Phylogenetic Diversity of Koala Retrovirus within a Wild Koala Population.

PubMed

Chappell, K J; Brealey, J C; Amarilla, A A; Watterson, D; Hulse, L; Palmieri, C; Johnston, S D; Holmes, E C; Meers, J; Young, P R

2017-02-01

Koala populations are in serious decline across many areas of mainland Australia, with infectious disease a contributing factor. Koala retrovirus (KoRV) is a gammaretrovirus present in most wild koala populations and captive colonies. Five subtypes of KoRV (A to E) have been identified based on amino acid sequence divergence in a hypervariable region of the receptor binding domain of the envelope protein. However, analysis of viral genetic diversity has been conducted primarily on KoRV in captive koalas housed in zoos in Japan, the United States, and Germany. Wild koalas within Australia have not been comparably assessed. Here we report a detailed analysis of KoRV genetic diversity in samples collected from 18 wild koalas from southeast Queensland. By employing deep sequencing we identified 108 novel KoRV envelope sequences and determined their phylogenetic diversity. Genetic diversity in KoRV was abundant and fell into three major groups; two comprised the previously identified subtypes A and B, while the third contained the remaining hypervariable region subtypes (C, D, and E) as well as four hypervariable region subtypes that we newly define here (F, G, H, and I). In addition to the ubiquitous presence of KoRV-A, which may represent an exclusively endogenous variant, subtypes B, D, and F were found to be at high prevalence, while subtypes G, H, and I were present in a smaller number of animals. Koala retrovirus (KoRV) is thought to be a significant contributor to koala disease and population decline across mainland Australia. This study is the first to determine KoRV subtype prevalence among a wild koala population, and it significantly expands the total number of KoRV sequences available, providing a more precise picture of genetic diversity. This understanding of KoRV subtype prevalence and genetic diversity will be important for conservation efforts attempting to limit the spread of KoRV. Furthermore, KoRV is one of the only retroviruses shown to exist in both endogenous (transmitted vertically to offspring in the germ line DNA) and exogenous (horizontally transmitted between infected individuals) forms, a division of fundamental evolutionary importance. Copyright © 2017 American Society for Microbiology.
The rapidly evolving centromere-specific histone has stringent functional requirements in Arabidopsis thaliana.

PubMed

Ravi, Maruthachalam; Kwong, Pak N; Menorca, Ron M G; Valencia, Joel T; Ramahi, Joseph S; Stewart, Jodi L; Tran, Robert K; Sundaresan, Venkatesan; Comai, Luca; Chan, Simon W-L

2010-10-01

Centromeres control chromosome inheritance in eukaryotes, yet their DNA structure and primary sequence are hypervariable. Most animals and plants have megabases of tandem repeats at their centromeres, unlike yeast with unique centromere sequences. Centromere function requires the centromere-specific histone CENH3 (CENP-A in human), which replaces histone H3 in centromeric nucleosomes. CENH3 evolves rapidly, particularly in its N-terminal tail domain. A portion of the CENH3 histone-fold domain, the CENP-A targeting domain (CATD), has been previously shown to confer kinetochore localization and centromere function when swapped into human H3. Furthermore, CENP-A in human cells can be functionally replaced by CENH3 from distantly related organisms including Saccharomyces cerevisiae. We have used cenh3-1 (a null mutant in Arabidopsis thaliana) to replace endogenous CENH3 with GFP-tagged variants. A H3.3 tail domain-CENH3 histone-fold domain chimera rescued viability of cenh3-1, but CENH3's lacking a tail domain were nonfunctional. In contrast to human results, H3 containing the A. thaliana CATD cannot complement cenh3-1. GFP-CENH3 from the sister species A. arenosa functionally replaces A. thaliana CENH3. GFP-CENH3 from the close relative Brassica rapa was targeted to centromeres, but did not complement cenh3-1, indicating that kinetochore localization and centromere function can be uncoupled. We conclude that CENH3 function in A. thaliana, an organism with large tandem repeat centromeres, has stringent requirements for functional complementation in mitosis.
TagDust2: a generic method to extract reads from sequencing data.

PubMed

Lassmann, Timo

2015-01-28

Arguably the most basic step in the analysis of next generation sequencing data (NGS) involves the extraction of mappable reads from the raw reads produced by sequencing instruments. The presence of barcodes, adaptors and artifacts subject to sequencing errors makes this step non-trivial. Here I present TagDust2, a generic approach utilizing a library of hidden Markov models (HMM) to accurately extract reads from a wide array of possible read architectures. TagDust2 extracts more reads of higher quality compared to other approaches. Processing of multiplexed single, paired end and libraries containing unique molecular identifiers is fully supported. Two additional post processing steps are included to exclude known contaminants and filter out low complexity sequences. Finally, TagDust2 can automatically detect the library type of sequenced data from a predefined selection. Taken together TagDust2 is a feature rich, flexible and adaptive solution to go from raw to mappable NGS reads in a single step. The ability to recognize and record the contents of raw reads will help to automate and demystify the initial, and often poorly documented, steps in NGS data analysis pipelines. TagDust2 is freely available at: http://tagdust.sourceforge.net .
The use of coded PCR primers enables high-throughput sequencing of multiple homolog amplification products by 454 parallel sequencing.

PubMed

Binladen, Jonas; Gilbert, M Thomas P; Bollback, Jonathan P; Panitz, Frank; Bendixen, Christian; Nielsen, Rasmus; Willerslev, Eske

2007-02-14

The invention of the Genome Sequence 20 DNA Sequencing System (454 parallel sequencing platform) has enabled the rapid and high-volume production of sequence data. Until now, however, individual emulsion PCR (emPCR) reactions and subsequent sequencing runs have been unable to combine template DNA from multiple individuals, as homologous sequences cannot be subsequently assigned to their original sources. We use conventional PCR with 5'-nucleotide tagged primers to generate homologous DNA amplification products from multiple specimens, followed by sequencing through the high-throughput Genome Sequence 20 DNA Sequencing System (GS20, Roche/454 Life Sciences). Each DNA sequence is subsequently traced back to its individual source through 5'tag-analysis. We demonstrate that this new approach enables the assignment of virtually all the generated DNA sequences to the correct source once sequencing anomalies are accounted for (miss-assignment rate<0.4%). Therefore, the method enables accurate sequencing and assignment of homologous DNA sequences from multiple sources in single high-throughput GS20 run. We observe a bias in the distribution of the differently tagged primers that is dependent on the 5' nucleotide of the tag. In particular, primers 5' labelled with a cytosine are heavily overrepresented among the final sequences, while those 5' labelled with a thymine are strongly underrepresented. A weaker bias also exists with regards to the distribution of the sequences as sorted by the second nucleotide of the dinucleotide tags. As the results are based on a single GS20 run, the general applicability of the approach requires confirmation. However, our experiments demonstrate that 5'primer tagging is a useful method in which the sequencing power of the GS20 can be applied to PCR-based assays of multiple homologous PCR products. The new approach will be of value to a broad range of research areas, such as those of comparative genomics, complete mitochondrial analyses, population genetics, and phylogenetics.
High-resolution phylogenetic microbial community profiling

DOE Office of Scientific and Technical Information (OSTI.GOV)

Singer, Esther; Bushnell, Brian; Coleman-Derr, Devin

Over the past decade, high-throughput short-read 16S rRNA gene amplicon sequencing has eclipsed clone-dependent long-read Sanger sequencing for microbial community profiling. The transition to new technologies has provided more quantitative information at the expense of taxonomic resolution with implications for inferring metabolic traits in various ecosystems. We applied single-molecule real-time sequencing for microbial community profiling, generating full-length 16S rRNA gene sequences at high throughput, which we propose to name PhyloTags. We benchmarked and validated this approach using a defined microbial community. When further applied to samples from the water column of meromictic Sakinaw Lake, we show that while community structuresmore » at the phylum level are comparable between PhyloTags and Illumina V4 16S rRNA gene sequences (iTags), variance increases with community complexity at greater water depths. PhyloTags moreover allowed less ambiguous classification. Last, a platform-independent comparison of PhyloTags and in silico generated partial 16S rRNA gene sequences demonstrated significant differences in community structure and phylogenetic resolution across multiple taxonomic levels, including a severe underestimation in the abundance of specific microbial genera involved in nitrogen and methane cycling across the Lake's water column. Thus, PhyloTags provide a reliable adjunct or alternative to cost-effective iTags, enabling more accurate phylogenetic resolution of microbial communities and predictions on their metabolic potential.« less
High-resolution phylogenetic microbial community profiling

DOE PAGES

Singer, Esther; Bushnell, Brian; Coleman-Derr, Devin; ...

2016-02-09

Over the past decade, high-throughput short-read 16S rRNA gene amplicon sequencing has eclipsed clone-dependent long-read Sanger sequencing for microbial community profiling. The transition to new technologies has provided more quantitative information at the expense of taxonomic resolution with implications for inferring metabolic traits in various ecosystems. We applied single-molecule real-time sequencing for microbial community profiling, generating full-length 16S rRNA gene sequences at high throughput, which we propose to name PhyloTags. We benchmarked and validated this approach using a defined microbial community. When further applied to samples from the water column of meromictic Sakinaw Lake, we show that while community structuresmore » at the phylum level are comparable between PhyloTags and Illumina V4 16S rRNA gene sequences (iTags), variance increases with community complexity at greater water depths. PhyloTags moreover allowed less ambiguous classification. Last, a platform-independent comparison of PhyloTags and in silico generated partial 16S rRNA gene sequences demonstrated significant differences in community structure and phylogenetic resolution across multiple taxonomic levels, including a severe underestimation in the abundance of specific microbial genera involved in nitrogen and methane cycling across the Lake's water column. Thus, PhyloTags provide a reliable adjunct or alternative to cost-effective iTags, enabling more accurate phylogenetic resolution of microbial communities and predictions on their metabolic potential.« less
Identification of Forensic Samples via Mitochondrial DNA in the Undergraduate Biochemistry Laboratory

NASA Astrophysics Data System (ADS)

Millard, Julie T.; Pilon, André M.

2003-04-01

A recent forensic approach for identification of unknown biological samples is mitochondrial DNA (mtDNA) sequencing. We describe a laboratory exercise suitable for an undergraduate biochemistry course in which the polymerase chain reaction is used to amplify a 440 base pair hypervariable region of human mtDNA from a variety of "crime scene" samples (e.g., teeth, hair, nails, cigarettes, envelope flaps, toothbrushes, and chewing gum). Amplification is verified via agarose gel electrophoresis and then samples are subjected to cycle sequencing. Sequence alignments are made via the program CLUSTAL W, allowing students to compare samples and solve the "crime."
Sequences from the hypervariable V3-V5 regions of the 16S rRNA gene amplified from the porcine proximal colon

USDA-ARS?s Scientific Manuscript database

Helminths, including GI nematodes, colonize > 1/3 of the world’s population and have evolved with humans and their microbiome. Parasites inherently regulate the host immune response to ensure their survival through mechanisms that dampen host inflammation. These unique properties of nematodes have b...
Exploring the Presence of microDNAs in Prostate Cancer Cell Lines, Tissue, and Sera of Prostate Cancer Patients and its Possible Application as Biomarker

DTIC Science & Technology

2016-04-01

Sequence tags were mapped on the human reference genome using the Novoalign software. Only those...ends of the linear islands to create a novel junctional sequence that does not exist in the genome . Thus the PE- sequence of a fragment that breaks at... genome (Fig. 3b). Those PE-tags where one tag maps uniquely to an island and the other remains unmapped, but passes the sequence quality filter,
Evaluation of anonymous and expressed sequence tag derived polymorphic microsatellite markers in the tobacco budworm Heliothis virescens (Lepidoptera: noctuidae)

USDA-ARS?s Scientific Manuscript database

Polymorphic genetic markers were identified and characterized using a partial genomic library of Heliothis virescens enriched for simple sequence repeats (SSR) and nucleotide sequences of expressed sequence tags (EST). Nucleotide sequences of 192 clones from the partial genomic library yielded 147 u...
Deep sequencing of hepatitis C virus hypervariable region 1 reveals no correlation between genetic heterogeneity and antiviral treatment outcome

PubMed Central

2014-01-01

Background Hypervariable region 1 (HVR1) contained within envelope protein 2 (E2) gene is the most variable part of HCV genome and its translation product is a major target for the host immune response. Variability within HVR1 may facilitate evasion of the immune response and could affect treatment outcome. The aim of the study was to analyze the impact of HVR1 heterogeneity employing sensitive ultra-deep sequencing, on the outcome of PEG-IFN-α (pegylated interferon α) and ribavirin treatment. Methods HVR1 sequences were amplified from pretreatment serum samples of 25 patients infected with genotype 1b HCV (12 responders and 13 non-responders) and were subjected to pyrosequencing (GS Junior, 454/Roche). Reads were corrected for sequencing error using ShoRAH software, while population reconstruction was done using three different minimal variant frequency cut-offs of 1%, 2% and 5%. Statistical analysis was done using Mann–Whitney and Fisher’s exact tests. Results Complexity, Shannon entropy, nucleotide diversity per site, genetic distance and the number of genetic substitutions were not significantly different between responders and non-responders, when analyzing viral populations at any of the three frequencies (≥1%, ≥2% and ≥5%). When clonal sample was used to determine pyrosequencing error, 4% of reads were found to be incorrect and the most abundant variant was present at a frequency of 1.48%. Use of ShoRAH reduced the sequencing error to 1%, with the most abundant erroneous variant present at frequency of 0.5%. Conclusions While deep sequencing revealed complex genetic heterogeneity of HVR1 in chronic hepatitis C patients, there was no correlation between treatment outcome and any of the analyzed quasispecies parameters. PMID:25016390
Differential gene expression in the siphonophore Nanomia bijuga (Cnidaria) assessed with multiple next-generation sequencing workflows.

PubMed

Siebert, Stefan; Robinson, Mark D; Tintori, Sophia C; Goetz, Freya; Helm, Rebecca R; Smith, Stephen A; Shaner, Nathan; Haddock, Steven H D; Dunn, Casey W

2011-01-01

We investigated differential gene expression between functionally specialized feeding polyps and swimming medusae in the siphonophore Nanomia bijuga (Cnidaria) with a hybrid long-read/short-read sequencing strategy. We assembled a set of partial gene reference sequences from long-read data (Roche 454), and generated short-read sequences from replicated tissue samples that were mapped to the references to quantify expression. We collected and compared expression data with three short-read expression workflows that differ in sample preparation, sequencing technology, and mapping tools. These workflows were Illumina mRNA-Seq, which generates sequence reads from random locations along each transcript, and two tag-based approaches, SOLiD SAGE and Helicos DGE, which generate reads from particular tag sites. Differences in expression results across workflows were mostly due to the differential impact of missing data in the partial reference sequences. When all 454-derived gene reference sequences were considered, Illumina mRNA-Seq detected more than twice as many differentially expressed (DE) reference sequences as the tag-based workflows. This discrepancy was largely due to missing tag sites in the partial reference that led to false negatives in the tag-based workflows. When only the subset of reference sequences that unambiguously have tag sites was considered, we found broad congruence across workflows, and they all identified a similar set of DE sequences. Our results are promising in several regards for gene expression studies in non-model organisms. First, we demonstrate that a hybrid long-read/short-read sequencing strategy is an effective way to collect gene expression data when an annotated genome sequence is not available. Second, our replicated sampling indicates that expression profiles are highly consistent across field-collected animals in this case. Third, the impacts of partial reference sequences on the ability to detect DE can be mitigated through workflow choice and deeper reference sequencing.
Differential Gene Expression in the Siphonophore Nanomia bijuga (Cnidaria) Assessed with Multiple Next-Generation Sequencing Workflows

PubMed Central

Siebert, Stefan; Robinson, Mark D.; Tintori, Sophia C.; Goetz, Freya; Helm, Rebecca R.; Smith, Stephen A.; Shaner, Nathan; Haddock, Steven H. D.; Dunn, Casey W.

2011-01-01

We investigated differential gene expression between functionally specialized feeding polyps and swimming medusae in the siphonophore Nanomia bijuga (Cnidaria) with a hybrid long-read/short-read sequencing strategy. We assembled a set of partial gene reference sequences from long-read data (Roche 454), and generated short-read sequences from replicated tissue samples that were mapped to the references to quantify expression. We collected and compared expression data with three short-read expression workflows that differ in sample preparation, sequencing technology, and mapping tools. These workflows were Illumina mRNA-Seq, which generates sequence reads from random locations along each transcript, and two tag-based approaches, SOLiD SAGE and Helicos DGE, which generate reads from particular tag sites. Differences in expression results across workflows were mostly due to the differential impact of missing data in the partial reference sequences. When all 454-derived gene reference sequences were considered, Illumina mRNA-Seq detected more than twice as many differentially expressed (DE) reference sequences as the tag-based workflows. This discrepancy was largely due to missing tag sites in the partial reference that led to false negatives in the tag-based workflows. When only the subset of reference sequences that unambiguously have tag sites was considered, we found broad congruence across workflows, and they all identified a similar set of DE sequences. Our results are promising in several regards for gene expression studies in non-model organisms. First, we demonstrate that a hybrid long-read/short-read sequencing strategy is an effective way to collect gene expression data when an annotated genome sequence is not available. Second, our replicated sampling indicates that expression profiles are highly consistent across field-collected animals in this case. Third, the impacts of partial reference sequences on the ability to detect DE can be mitigated through workflow choice and deeper reference sequencing. PMID:21829563
Phylogenetic analysis of Tibetan mastiffs based on mitochondrial hypervariable region I.

PubMed

Ren, Zhanjun; Chen, Huiling; Yang, Xuejiao; Zhang, Chengdong

2017-03-01

Recently, the number of Tibetan mastiffs, which is a precious germplasm resource and cultural heritage, is decreasing sharply. Therefore, the genetic diversity of Tibetan mastiffs needs to be studied to clarify its phylogenetics relationships and lay the foundation for resource protection, rational development and utilization of Tibetan mastiffs. We sequenced hypervariable region I of mitochondrial DNA (mtDNA) of 110 individuals from Tibet region and Gansu province. A total of 12 polymorphic sites were identified which defined eight haplotypes of which H4 and H8 were unique to Tibetan population with H8 being identified first. The haplotype diversity (Hd: 0.808), nucleotide diversity (Pi: 0.603%), the average number of nucleotide difference (K: 3.917) of Tibetan mastiffs from Gansu were higher than those from Tibet region (Hd: 0.794; Pi: 0.589%; K: 3.831), which revealed higher genetic diversity in Gansu. In terms of total population, the genetic variation was low. The median-joining network and phylogenetic tree based on the mtDNA hypervariable region I showed that Tibetan mastiffs originated from grey wolves, as the other domestic dogs and had different history of maternal origin. The mismatch distribution analysis and neutrality tests indicated that Tibetan mastiffs were in genetic equilibrium or in a population decline.
Investigating the diversity of the 18S SSU rRNA hyper-variable region of Theileria in cattle and Cape buffalo (Syncerus caffer) from southern Africa using a next generation sequencing approach.

PubMed

Mans, Ben J; Pienaar, Ronel; Ratabane, John; Pule, Boitumelo; Latif, Abdalla A

2016-07-01

Molecular classification and systematics of the Theileria is based on the analysis of the 18S rRNA gene. Reverse line blot or conventional sequencing approaches have disadvantages in the study of 18S rRNA diversity and a next-generation 454 sequencing approach was investigated. The 18S rRNA gene was amplified using RLB primers coupled to 96 unique sequence identifiers (MIDs). Theileria positive samples from African buffalo (672) and cattle (480) from southern Africa were combined in batches of 96 and sequenced using the GS Junior 454 sequencer to produce 825711 informative sequences. Sequences were extracted based on MIDs and analysed to identify Theileria genotypes. Genotypes observed in buffalo and cattle were confirmed in the current study, while no new genotypes were discovered. Genotypes showed specific geographic distributions, most probably linked with vector distributions. Host specificity of buffalo and cattle specific genotypes were confirmed and prevalence data as well as relative parasitemia trends indicate preference for different hosts. Mixed infections are common with African buffalo carrying more genotypes compared to cattle. Associative or exclusion co-infection profiles were observed between genotypes that may have implications for speciation and systematics: specifically that more Theileria species may exist in cattle and buffalo than currently recognized. Analysis of primers used for Theileria parva diagnostics indicate that no new genotypes will be amplified by the current primer sets confirming their specificity. T. parva SNP variants that occur in the 18S rRNA hypervariable region were confirmed. A next generation sequencing approach is useful in obtaining comprehensive knowledge regarding 18S rRNA diversity and prevalence for the Theileria, allowing for the assessment of systematics and diagnostic assays based on the 18S gene. Copyright © 2016 Elsevier GmbH. All rights reserved.
High-Resolution Melting (HRM) of Hypervariable Mitochondrial DNA Regions for Forensic Science.

PubMed

Dos Santos Rocha, Alípio; de Amorim, Isis Salviano Soares; Simão, Tatiana de Almeida; da Fonseca, Adenilson de Souza; Garrido, Rodrigo Grazinoli; Mencalha, Andre Luiz

2018-03-01

Forensic strategies commonly are proceeding by analysis of short tandem repeats (STRs); however, new additional strategies have been proposed for forensic science. Thus, this article standardized the high-resolution melting (HRM) of DNA for forensic analyzes. For HRM, mitochondrial DNA (mtDNA) from eight individuals were extracted from mucosa swabs by DNAzol reagent, samples were amplified by PCR and submitted to HRM analysis to identify differences in hypervariable (HV) regions I and II. To confirm HRM, all PCR products were DNA sequencing. The data suggest that is possible discriminate DNA from different samples by HRM curves. Also, uncommon dual-dissociation was identified in a single PCR product, increasing HRM analyzes by evaluation of melting peaks. Thus, HRM is accurate and useful to screening small differences in HVI and HVII regions from mtDNA and increase the efficiency of laboratory routines based on forensic genetics. © 2017 American Academy of Forensic Sciences.
An integrated PCR colony hybridization approach to screen cDNA libraries for full-length coding sequences.

PubMed

Pollier, Jacob; González-Guzmán, Miguel; Ardiles-Diaz, Wilson; Geelen, Danny; Goossens, Alain

2011-01-01

cDNA-Amplified Fragment Length Polymorphism (cDNA-AFLP) is a commonly used technique for genome-wide expression analysis that does not require prior sequence knowledge. Typically, quantitative expression data and sequence information are obtained for a large number of differentially expressed gene tags. However, most of the gene tags do not correspond to full-length (FL) coding sequences, which is a prerequisite for subsequent functional analysis. A medium-throughput screening strategy, based on integration of polymerase chain reaction (PCR) and colony hybridization, was developed that allows in parallel screening of a cDNA library for FL clones corresponding to incomplete cDNAs. The method was applied to screen for the FL open reading frames of a selection of 163 cDNA-AFLP tags from three different medicinal plants, leading to the identification of 109 (67%) FL clones. Furthermore, the protocol allows for the use of multiple probes in a single hybridization event, thus significantly increasing the throughput when screening for rare transcripts. The presented strategy offers an efficient method for the conversion of incomplete expressed sequence tags (ESTs), such as cDNA-AFLP tags, to FL-coding sequences.

Serial analysis of gene expression (SAGE) in normal human trabecular meshwork.

PubMed

Liu, Yutao; Munro, Drew; Layfield, David; Dellinger, Andrew; Walter, Jeffrey; Peterson, Katherine; Rickman, Catherine Bowes; Allingham, R Rand; Hauser, Michael A

2011-04-08

To identify the genes expressed in normal human trabecular meshwork tissue, a tissue critical to the pathogenesis of glaucoma. Total RNA was extracted from human trabecular meshwork (HTM) harvested from 3 different donors. Extracted RNA was used to synthesize individual SAGE (serial analysis of gene expression) libraries using the I-SAGE Long kit from Invitrogen. Libraries were analyzed using SAGE 2000 software to extract the 17 base pair sequence tags. The extracted sequence tags were mapped to the genome using SAGE Genie map. A total of 298,834 SAGE tags were identified from all HTM libraries (96,842, 88,126, and 113,866 tags, respectively). Collectively, there were 107,325 unique tags. There were 10,329 unique tags with a minimum of 2 counts from a single library. These tags were mapped to known unique Unigene clusters. Approximately 29% of the tags (orphan tags) did not map to a known Unigene cluster. Thirteen percent of the tags mapped to at least 2 Unigene clusters. Sequence tags from many glaucoma-related genes, including myocilin, optineurin, and WD repeat domain 36, were identified. This is the first time SAGE analysis has been used to characterize the gene expression profile in normal HTM. SAGE analysis provides an unbiased sampling of gene expression of the target tissue. These data will provide new and valuable information to improve understanding of the biology of human aqueous outflow.
The skin microbiome in psoriatic arthritis: methodology development and pilot data.

PubMed

Castelino, Madhura; Eyre, Stephen; Moat, John; Fox, Graeme; Martin, Paul; Ijaz, Umer; Quince, Christopher; Ho, Pauline; Upton, Mathew; Barton, Anne

2015-02-26

Skin microbiota are likely to be important in the development of conditions such as psoriatic arthritis. Profiling the bacterial community in the psosriatic plaques will contribute to our understanding of the role of the skin microbiome in these conditions. The aim of this work was to determine the optimum study design for work on the skin microbiome with use of the MiSeq platform. The objectives were to compare data generated from two platforms for two primer pairs in a low density mock bacterial community. DNA was obtained from two low density mock communities of 11 diverse bacterial strains (with and without human DNA supplementation) and from swabs taken from the skin of four healthy volunteers. The DNA was amplified with primer pairs covering hypervariable regions of the 16S rRNA gene: primers 63F and 519R (V1-V3), and 347F and 803R (V3-V4). The resultant libraries were indexed for the MiSeq and Roche454 platforms and sequenced. Both datasets were de-noised, cleaned of chimeras, and analysed by use of QIIME software (version 1.8.0). No significant difference in the diversity indices at the phylum and the genus level between the platforms was seen. Comparison of the diversity indices for the mock community data for the two primer pairs demonstrated that the V3-V4 hypervariable region had significantly better capture of bacterial diversity than did the V1-V3 region. Amplification with the same primer pairs showed strong concordance within each platform (98·9-99·8%), with negligible effect of spiked human DNA contamination. Comparison at the family level classification between samples processed on the MiSeq and Roche454 platforms using the V3-V4 hypervariable region also showed a high level of concordance (87%), although less so for the V1-V3 primers (10%). The pilot data from healthy volunteers were similar. Results obtained from the V3-V4 16S rRNA hypervariable region, sequencing on the MiSeq and Roche454 platforms, were concordant between replicates, and between each other. These findings suggest that the MiSeq platform, and these primers, is a comparable method for determining skin microbiota to the widely used Roche454 methodology. NIHR Manchester Musculoskeletal Biomedical Research Unit. Copyright © 2015 Elsevier Ltd. All rights reserved.
Identification of differentially expressed genes in cucumber (Cucumis sativus L.) root under waterlogging stress by digital gene expression profile.

PubMed

Qi, Xiao-Hua; Xu, Xue-Wen; Lin, Xiao-Jian; Zhang, Wen-Jie; Chen, Xue-Hao

2012-03-01

High-throughput tag-sequencing (Tag-seq) analysis based on the Solexa Genome Analyzer platform was applied to analyze the gene expression profiling of cucumber plant at 5 time points over a 24h period of waterlogging treatment. Approximately 5.8 million total clean sequence tags per library were obtained with 143013 distinct clean tag sequences. Approximately 23.69%-29.61% of the distinct clean tags were mapped unambiguously to the unigene database, and 53.78%-60.66% of the distinct clean tags were mapped to the cucumber genome database. Analysis of the differentially expressed genes revealed that most of the genes were down-regulated in the waterlogging stages, and the differentially expressed genes mainly linked to carbon metabolism, photosynthesis, reactive oxygen species generation/scavenging, and hormone synthesis/signaling. Finally, quantitative real-time polymerase chain reaction using nine genes independently verified the tag-mapped results. This present study reveals the comprehensive mechanisms of waterlogging-responsive transcription in cucumber. Copyright Â© 2011 Elsevier Inc. All rights reserved.
First report of bacterial community from a Bat Guano using Illumina next-generation sequencing.

PubMed

De Mandal, Surajit; Zothansanga; Panda, Amritha Kumari; Bisht, Satpal Singh; Senthil Kumar, Nachimuthu

2015-06-01

V4 hypervariable region of 16S rDNA was analyzed for identifying the bacterial communities present in Bat Guano from the unexplored cave - Pnahkyndeng, Meghalaya, Northeast India. Metagenome comprised of 585,434 raw Illumina sequences with a 59.59% G+C content. A total of 416,490 preprocessed reads were clustered into 1282 OTUs (operational taxonomical units) comprising of 18 bacterial phyla. The taxonomic profile showed that the guano bacterial community is dominated by Chloroflexi, Actinobacteria and Crenarchaeota which account for 70.73% of all sequence reads and 43.83% of all OTUs. Metagenome sequence data are available at NCBI under the accession no. SRP051094. This study is the first to characterize Bat Guano bacterial community using next-generation sequencing approach.
First report of bacterial community from a Bat Guano using Illumina next-generation sequencing

PubMed Central

De Mandal, Surajit; Zothansanga; Panda, Amritha Kumari; Bisht, Satpal Singh; Senthil Kumar, Nachimuthu

2015-01-01

V4 hypervariable region of 16S rDNA was analyzed for identifying the bacterial communities present in Bat Guano from the unexplored cave — Pnahkyndeng, Meghalaya, Northeast India. Metagenome comprised of 585,434 raw Illumina sequences with a 59.59% G+C content. A total of 416,490 preprocessed reads were clustered into 1282 OTUs (operational taxonomical units) comprising of 18 bacterial phyla. The taxonomic profile showed that the guano bacterial community is dominated by Chloroflexi, Actinobacteria and Crenarchaeota which account for 70.73% of all sequence reads and 43.83% of all OTUs. Metagenome sequence data are available at NCBI under the accession no. SRP051094. This study is the first to characterize Bat Guano bacterial community using next-generation sequencing approach. PMID:26484190
OSIRIS-REx Touch-And-Go (TAG) Navigation Performance

NASA Technical Reports Server (NTRS)

Berry, Kevin; Antreasian, Peter; Moreau, Michael C.; May, Alex; Sutter, Brian

2015-01-01

The Origins Spectral Interpretation Resource identification Security Regolith Explorer (OSIRIS-REx) mission is a NASA New Frontiers mission launching in 2016 to rendezvous with the near-Earth asteroid (101955) Bennu in late 2018. Following an extensive campaign of proximity operations activities to characterize the properties of Bennu and select a suitable sample site, OSIRIES-REx will fly a Touch-And-Go (TAG) trajectory to the asteroid's surface to obtain a regolith sample. The paper summarizes the mission design of the TAG sequence, the propulsive required to achieve the trajectory, and the sequence of events leading up to the TAG event. The paper will summarize the Monte-Carlo simulation of the TAG sequence and present analysis results that demonstrate the ability to conduct the TAG within 25 meters of the selected sample site and +-2 cms of the targeted contact velocity. The paper will describe some of the challenges associated with conducting precision navigation operations and ultimately contacting a very small asteroid.
OSIRI-REx Touch and Go (TAG) Navigation Performance

NASA Technical Reports Server (NTRS)

Berry, Kevin; Antreasian, Peter; Moreau, Michael C.; May, Alex; Sutter, Brian

2015-01-01

The Origins Spectral Interpretation Resource Identification Security Regolith Explorer (OSIRIS-REx) mission is a NASA New Frontiers mission launching in 2016 to rendezvous with the near-Earth asteroid (101955) Bennu in late 2018. Following an extensive campaign of proximity operations activities to characterize the properties of Bennu and select a suitable sample site, OSIRIS-REx will fly a Touch-And-Go (TAG) trajectory to the asteroid's surface to obtain a regolith sample. The paper summarizes the mission design of the TAG sequence, the propulsive maneuvers required to achieve the trajectory, and the sequence of events leading up to the TAG event. The paper also summarizes the Monte-Carlo simulation of the TAG sequence and presents analysis results that demonstrate the ability to conduct the TAG within 25 meters of the selected sample site and 2 cm/s of the targeted contact velocity. The paper describes some of the challenges associated with conducting precision navigation operations and ultimately contacting a very small asteroid.
Genome-wide characterization and selection of expressed sequence tag simple sequence repeat primers for optimized marker distribution and reliability in peach

USDA-ARS?s Scientific Manuscript database

Expressed sequence tag (EST) simple sequence repeats (SSRs) in Prunus were mined, and flanking primers designed and used for genome-wide characterization and selection of primers to optimize marker distribution and reliability. A total of 12,618 contigs were assembled from 84,727 ESTs, along with 34...
Characterization of the HLA-DRβ1 third hypervariable region amino acid sequence according to charge and parental inheritance in systemic sclerosis.

PubMed

Gentil, Coline A; Gammill, Hilary S; Luu, Christine T; Mayes, Maureen D; Furst, Dan E; Nelson, J Lee

2017-03-07

Specific HLA class II alleles are associated with systemic sclerosis (SSc) risk, clinical characteristics, and autoantibodies. HLA nomenclature initially developed with antibodies as typing reagents defining DRB1 allele groups. However, alleles from different DRB1 allele groups encode the same third hypervariable region (3rd HVR) sequence, the primary T-cell recognition site, and 3rd HVR charge differences can affect interactions with T cells. We considered 3rd HVR sequences (amino acids 67-74) irrespective of the allele group and analyzed parental inheritance considered according to the 3rd HVR charge, comparing SSc patients with controls. In total, 306 families (121 SSc and 185 controls) were HLA genotyped and parental HLA-haplotype origin was determined. Analysis was conducted according to DRβ1 3rd HVR sequence, charge, and parental inheritance. The distribution of 3rd HVR sequences differed in SSc patients versus controls (p = 0.007), primarily due to an increase of specific DRB1*11 alleles, in accord with previous observations. The 3rd HVR sequences were next analyzed according to charge and parental inheritance. Paternal transmission of DRB1 alleles encoding a +2 charge 3rd HVR was significantly reduced in SSc patients compared with maternal transmission (p = 0.0003, corrected for analysis of four charge categories p = 0.001). To a lesser extent, paternal transmission was increased when charge was 0 (p = 0.021, corrected for multiple comparisons p = 0.084). In contrast, paternal versus maternal inheritance was similar in controls. SSc patients differed from controls when DRB1 alleles were categorized according to 3rd HVR sequences. Skewed parental inheritance was observed in SSc patients but not in controls when the DRβ1 3rd HVR was considered according to charge. These observations suggest that epigenetic modulation of HLA merits investigation in SSc.
Sex-related differences in the thanatomicrobiome in postmortem heart samples using bacterial gene regions V1-2 and V4.

PubMed

Bell, Courtnee R; Wilkinson, Jeremy E; Robertson, Boakai K; Javan, Gulnaz T

2018-05-10

Recent studies have revealed distinct thanatomicrobiome (microbiome of death) signatures in human body sites after death. Thanatomicrobiome studies suggest that microbial succession after death may have the potential to reveal important postmortem biomarkers for the identification of time of death. We surveyed the postmortem microbiomes of cardiac tissues from ten corpses with varying times of death (6-58 h) using amplicon-based sequencing of the 16S rRNA gene' V1-2 and V4 hypervariable regions. The results demonstrated that amplicons had statistically significant (p <0.05) sex-dependent changes. Clostridium sp., Pseudomonas sp., Pantoea sp., and Streptococcus sp. had the highest enrichment for both V1-2 and V4 regions. Interestingly, the results also show that V4 amplicons had higher abundance of Clostridium sp. and Pseudomonas sp. in female hearts compared to males. Additionally, Streptococcus sp. was solely found in male heart samples. The distinction between sexes was further supported by Principle Coordinate Analysis, which revealed microbes in female hearts formed a distinctive cluster separate from male cadavers for both hypervariable regions. This study provides data that demonstrates that two hypervariable regions show discriminatory power for sex differences in postmortem heart samples. This article is protected by copyright. All rights reserved. This article is protected by copyright. All rights reserved.
Methyl-CpG island-associated genome signature tags

DOEpatents

Dunn, John J

2014-05-20

Disclosed is a method for analyzing the organismic complexity of a sample through analysis of the nucleic acid in the sample. In the disclosed method, through a series of steps, including digestion with a type II restriction enzyme, ligation of capture adapters and linkers and digestion with a type IIS restriction enzyme, genome signature tags are produced. The sequences of a statistically significant number of the signature tags are determined and the sequences are used to identify and quantify the organisms in the sample. Various embodiments of the invention described herein include methods for using single point genome signature tags to analyze the related families present in a sample, methods for analyzing sequences associated with hyper- and hypo-methylated CpG islands, methods for visualizing organismic complexity change in a sampling location over time and methods for generating the genome signature tag profile of a sample of fragmented DNA.
An efficient annotation and gene-expression derivation tool for Illumina Solexa datasets.

PubMed

Hosseini, Parsa; Tremblay, Arianne; Matthews, Benjamin F; Alkharouf, Nadim W

2010-07-02

The data produced by an Illumina flow cell with all eight lanes occupied, produces well over a terabyte worth of images with gigabytes of reads following sequence alignment. The ability to translate such reads into meaningful annotation is therefore of great concern and importance. Very easily, one can get flooded with such a great volume of textual, unannotated data irrespective of read quality or size. CASAVA, a optional analysis tool for Illumina sequencing experiments, enables the ability to understand INDEL detection, SNP information, and allele calling. To not only extract from such analysis, a measure of gene expression in the form of tag-counts, but furthermore to annotate such reads is therefore of significant value. We developed TASE (Tag counting and Analysis of Solexa Experiments), a rapid tag-counting and annotation software tool specifically designed for Illumina CASAVA sequencing datasets. Developed in Java and deployed using jTDS JDBC driver and a SQL Server backend, TASE provides an extremely fast means of calculating gene expression through tag-counts while annotating sequenced reads with the gene's presumed function, from any given CASAVA-build. Such a build is generated for both DNA and RNA sequencing. Analysis is broken into two distinct components: DNA sequence or read concatenation, followed by tag-counting and annotation. The end result produces output containing the homology-based functional annotation and respective gene expression measure signifying how many times sequenced reads were found within the genomic ranges of functional annotations. TASE is a powerful tool to facilitate the process of annotating a given Illumina Solexa sequencing dataset. Our results indicate that both homology-based annotation and tag-count analysis are achieved in very efficient times, providing researchers to delve deep in a given CASAVA-build and maximize information extraction from a sequencing dataset. TASE is specially designed to translate sequence data in a CASAVA-build into functional annotations while producing corresponding gene expression measurements. Achieving such analysis is executed in an ultrafast and highly efficient manner, whether the analysis be a single-read or paired-end sequencing experiment. TASE is a user-friendly and freely available application, allowing rapid analysis and annotation of any given Illumina Solexa sequencing dataset with ease.
Structure-Related Roles for the Conservation of the HIV-1 Fusion Peptide Sequence Revealed by Nuclear Magnetic Resonance.

PubMed

Serrano, Soraya; Huarte, Nerea; Rujas, Edurne; Andreu, David; Nieva, José L; Jiménez, María Angeles

2017-10-17

Despite extensive characterization of the human immunodeficiency virus type 1 (HIV-1) hydrophobic fusion peptide (FP), the structure-function relationships underlying its extraordinary degree of conservation remain poorly understood. Specifically, the fact that the tandem repeat of the FLGFLG tripeptide is absolutely conserved suggests that high hydrophobicity may not suffice to unleash FP function. Here, we have compared the nuclear magnetic resonance (NMR) structures adopted in nonpolar media by two FP surrogates, wtFP-tag and scrFP-tag, which had equal hydrophobicity but contained wild-type and scrambled core sequences LFLGFLG and FGLLGFL, respectively. In addition, these peptides were tagged at their C-termini with an epitope sequence that folded independently, thereby allowing Western blot detection without interfering with FP structure. We observed similar α-helical FP conformations for both specimens dissolved in the low-polarity medium 25% (v/v) 1,1,1,3,3,3-hexafluoro-2-propanol (HFIP), but important differences in contact with micelles of the membrane mimetic dodecylphosphocholine (DPC). Thus, whereas wtFP-tag preserved a helix displaying a Gly-rich ridge, the scrambled sequence lost in great part the helical structure upon being solubilized in DPC. Western blot analyses further revealed the capacity of wtFP-tag to assemble trimers in membranes, whereas membrane oligomers were not observed in the case of the scrFP-tag sequence. We conclude that, beyond hydrophobicity, preserving sequence order is an important feature for defining the secondary structures and oligomeric states adopted by the HIV FP in membranes.
Genetically encoded fluorescent tags

PubMed Central

Thorn, Kurt

2017-01-01

Genetically encoded fluorescent tags are protein sequences that can be fused to a protein of interest to render it fluorescent. These tags have revolutionized cell biology by allowing nearly any protein to be imaged by light microscopy at submicrometer spatial resolution and subsecond time resolution in a live cell or organism. They can also be used to measure protein abundance in thousands to millions of cells using flow cytometry. Here I provide an introduction to the different genetic tags available, including both intrinsically fluorescent proteins and proteins that derive their fluorescence from binding of either endogenous or exogenous fluorophores. I discuss their optical and biological properties and guidelines for choosing appropriate tags for an experiment. Tools for tagging nucleic acid sequences and reporter molecules that detect the presence of different biomolecules are also briefly discussed. PMID:28360214
Chloroplast Genome Differences between Asian and American Equisetum arvense (Equisetaceae) and the Origin of the Hypervariable trnY-trnE Intergenic Spacer

PubMed Central

Kim, Hyoung Tae; Kim, Ki-Joong

2014-01-01

Comparative analyses of complete chloroplast (cp) DNA sequences within a species may provide clues to understand the population dynamics and colonization histories of plant species. Equisetum arvense (Equisetaceae) is a widely distributed fern species in northeastern Asia, Europe, and North America. The complete cp DNA sequences from Asian and American E. arvense individuals were compared in this study. The Asian E. arvense cp genome was 583 bp shorter than that of the American E. arvense. In total, 159 indels were observed between two individuals, most of which were concentrated on the hypervariable trnY-trnE intergenic spacer (IGS) in the large single-copy (LSC) region of the cp genome. This IGS region held a series of 19 bp repeating units. The numbers of the 19 bp repeat unit were responsible for 78% of the total length difference between the two cp genomes. Furthermore, only other closely related species of Equisetum also show the hypervariable nature of the trnY-trnE IGS. By contrast, only a single indel was observed in the gene coding regions: the ycf1 gene showed 24 bp differences between the two continental individuals due to a single tandem-repeat indel. A total of 165 single-nucleotide polymorphisms (SNPs) were recorded between the two cp genomes. Of these, 52 SNPs (31.5%) were distributed in coding regions, 13 SNPs (7.9%) were in introns, and 100 SNPs (60.6%) were in intergenic spacers (IGS). The overall difference between the Asian and American E. arvense cp genomes was 0.12%. Despite the relatively high genetic diversity between Asian and American E. arvense, the two populations are recognized as a single species based on their high morphological similarity. This indicated that the two regional populations have been in morphological stasis. PMID:25157804
Generation of non-genomic oligonucleotide tag sequences for RNA template-specific PCR

PubMed Central

Pinto, Fernando Lopes; Svensson, Håkan; Lindblad, Peter

2006-01-01

Background In order to overcome genomic DNA contamination in transcriptional studies, reverse template-specific polymerase chain reaction, a modification of reverse transcriptase polymerase chain reaction, is used. The possibility of using tags whose sequences are not found in the genome further improves reverse specific polymerase chain reaction experiments. Given the absence of software available to produce genome suitable tags, a simple tool to fulfill such need was developed. Results The program was developed in Perl, with separate use of the basic local alignment search tool, making the tool platform independent (known to run on Windows XP and Linux). In order to test the performance of the generated tags, several molecular experiments were performed. The results show that Tagenerator is capable of generating tags with good priming properties, which will deliberately not result in PCR amplification of genomic DNA. Conclusion The program Tagenerator is capable of generating tag sequences that combine genome absence with good priming properties for RT-PCR based experiments, circumventing the effects of genomic DNA contamination in an RNA sample. PMID:16820068
Digital transcriptome analysis of putative sex-determination genes in papaya (Carica papaya).

PubMed

Urasaki, Naoya; Tarora, Kazuhiko; Shudo, Ayano; Ueno, Hiroki; Tamaki, Moritoshi; Miyagi, Norimichi; Adaniya, Shinichi; Matsumura, Hideo

2012-01-01

Papaya (Carica papaya) is a trioecious plant species that has male, female and hermaphrodite flowers on different plants. The primitive sex chromosomes genetically determine the sex of the papaya. Although draft sequences of the papaya genome are already available, the genes for sex determination have not been identified, likely due to the complicated structure of its sex-chromosome sequences. To identify the candidate genes for sex determination, we conducted a transcriptome analysis of flower samples from male, female and hermaphrodite plants using high-throughput SuperSAGE for digital gene expression analysis. Among the short sequence tags obtained from the transcripts, 312 unique tags were specifically mapped to the primitive sex chromosome (X or Y(h)) sequences. An annotation analysis revealed that retroelements are the most abundant sequences observed in the genes corresponding to these tags. The majority of tags on the sex chromosomes were located on the X chromosome, and only 30 tags were commonly mapped to both the X and Y(h) chromosome, implying a loss of many genes on the Y(h) chromosome. Nevertheless, candidate Y(h) chromosome-specific female determination genes, including a MADS-box gene, were identified. Information on these sex chromosome-specific expressed genes will help elucidating sex determination in the papaya.
Digital Transcriptome Analysis of Putative Sex-Determination Genes in Papaya (Carica papaya)

PubMed Central

Urasaki, Naoya; Tarora, Kazuhiko; Shudo, Ayano; Ueno, Hiroki; Tamaki, Moritoshi; Miyagi, Norimichi; Adaniya, Shinichi; Matsumura, Hideo

2012-01-01

Papaya (Carica papaya) is a trioecious plant species that has male, female and hermaphrodite flowers on different plants. The primitive sex chromosomes genetically determine the sex of the papaya. Although draft sequences of the papaya genome are already available, the genes for sex determination have not been identified, likely due to the complicated structure of its sex-chromosome sequences. To identify the candidate genes for sex determination, we conducted a transcriptome analysis of flower samples from male, female and hermaphrodite plants using high-throughput SuperSAGE for digital gene expression analysis. Among the short sequence tags obtained from the transcripts, 312 unique tags were specifically mapped to the primitive sex chromosome (X or Yh) sequences. An annotation analysis revealed that retroelements are the most abundant sequences observed in the genes corresponding to these tags. The majority of tags on the sex chromosomes were located on the X chromosome, and only 30 tags were commonly mapped to both the X and Yh chromosome, implying a loss of many genes on the Yh chromosome. Nevertheless, candidate Yh chromosome-specific female determination genes, including a MADS-box gene, were identified. Information on these sex chromosome-specific expressed genes will help elucidating sex determination in the papaya. PMID:22815863
Cyanine-based probe\\tag-peptide pair fluorescence protein imaging and fluorescence protein imaging methods

DOEpatents

Mayer-Cumblidge, M. Uljana; Cao, Haishi

2013-01-15

A molecular probe comprises two arsenic atoms and at least one cyanine based moiety. A method of producing a molecular probe includes providing a molecule having a first formula, treating the molecule with HgOAc, and subsequently transmetallizing with AsCl.sub.3. The As is liganded to ethanedithiol to produce a probe having a second formula. A method of labeling a peptide includes providing a peptide comprising a tag sequence and contacting the peptide with a biarsenical molecular probe. A complex is formed comprising the tag sequence and the molecular probe. A method of studying a peptide includes providing a mixture containing a peptide comprising a peptide tag sequence, adding a biarsenical probe to the mixture, and monitoring the fluorescence of the mixture.
Cyanine-based probe\\tag-peptide pair for fluorescence protein imaging and fluorescence protein imaging methods

DOEpatents

Mayer-Cumblidge, M Uljana [Richland, WA; Cao, Haishi [Richland, WA

2010-08-17

A molecular probe comprises two arsenic atoms and at least one cyanine based moiety. A method of producing a molecular probe includes providing a molecule having a first formula, treating the molecule with HgOAc, and subsequently transmetallizing with AsCl.sub.3. The As is liganded to ethanedithiol to produce a probe having a second formula. A method of labeling a peptide includes providing a peptide comprising a tag sequence and contacting the peptide with a biarsenical molecular probe. A complex is formed comprising the tag sequence and the molecular probe. A method of studying a peptide includes providing a mixture containing a peptide comprising a peptide tag sequence, adding a biarsenical probe to the mixture, and monitoring the fluorescence of the mixture.

Sequence analysis of the canine mitochondrial DNA control region from shed hair samples in criminal investigations.

PubMed

Berger, C; Berger, B; Parson, W

2012-01-01

In recent years, evidence from domestic dogs has increasingly been analyzed by forensic DNA testing. Especially, canine hairs have proved most suitable and practical due to the high rate of hair transfer occurring between dogs and humans. Starting with the description of a contamination-free sample handling procedure, we give a detailed workflow for sequencing hypervariable segments (HVS) of the mtDNA control region from canine evidence. After the hair material is lysed and the DNA extracted by Phenol/Chloroform, the amplification and sequencing strategy comprises the HVS I and II of the canine control region and is optimized for DNA of medium-to-low quality and quantity. The sequencing procedure is based on the Sanger Big-dye deoxy-terminator method and the separation of the sequencing reaction products is performed on a conventional multicolor fluorescence detection capillary electrophoresis platform. Finally, software-aided base calling and sequence interpretation are addressed exemplarily.
Modular probes for enriching and detecting complex nucleic acid sequences

NASA Astrophysics Data System (ADS)

Wang, Juexiao Sherry; Yan, Yan Helen; Zhang, David Yu

2017-12-01

Complex DNA sequences are difficult to detect and profile, but are important contributors to human health and disease. Existing hybridization probes lack the capability to selectively bind and enrich hypervariable, long or repetitive sequences. Here, we present a generalized strategy for constructing modular hybridization probes (M-Probes) that overcomes these challenges. We demonstrate that M-Probes can tolerate sequence variations of up to 7 nt at prescribed positions while maintaining single nucleotide sensitivity at other positions. M-Probes are also shown to be capable of sequence-selectively binding a continuous DNA sequence of more than 500 nt. Furthermore, we show that M-Probes can detect genes with triplet repeats exceeding a programmed threshold. As a demonstration of this technology, we have developed a hybrid capture method to determine the exact triplet repeat expansion number in the Huntington's gene of genomic DNA using quantitative PCR.
Spatiotemporal Phylogenetic Analysis and Molecular Characterisation of Infectious Bursal Disease Viruses Based on the VP2 Hyper-Variable Region

PubMed Central

Dolz, Roser; Valle, Rosa; Perera, Carmen L.; Bertran, Kateri; Frías, Maria T.; Majó, Natàlia; Ganges, Llilianne; Pérez, Lester J.

2013-01-01

Background Infectious bursal disease is a highly contagious and acute viral disease caused by the infectious bursal disease virus (IBDV); it affects all major poultry producing areas of the world. The current study was designed to rigorously measure the global phylogeographic dynamics of IBDV strains to gain insight into viral population expansion as well as the emergence, spread and pattern of the geographical structure of very virulent IBDV (vvIBDV) strains. Methodology/Principal Findings Sequences of the hyper-variable region of the VP2 (HVR-VP2) gene from IBDV strains isolated from diverse geographic locations were obtained from the GenBank database; Cuban sequences were obtained in the current work. All sequences were analysed by Bayesian phylogeographic analysis, implemented in the Bayesian Evolutionary Analysis Sampling Trees (BEAST), Bayesian Tip-association Significance testing (BaTS) and Spatial Phylogenetic Reconstruction of Evolutionary Dynamics (SPREAD) software packages. Selection pressure on the HVR-VP2 was also assessed. The phylogeographic association-trait analysis showed that viruses sampled from individual countries tend to cluster together, suggesting a geographic pattern for IBDV strains. Spatial analysis from this study revealed that strains carrying sequences that were linked to increased virulence of IBDV appeared in Iran in 1981 and spread to Western Europe (Belgium) in 1987, Africa (Egypt) around 1990, East Asia (China and Japan) in 1993, the Caribbean Region (Cuba) by 1995 and South America (Brazil) around 2000. Selection pressure analysis showed that several codons in the HVR-VP2 region were under purifying selection. Conclusions/Significance To our knowledge, this work is the first study applying the Bayesian phylogeographic reconstruction approach to analyse the emergence and spread of vvIBDV strains worldwide. PMID:23805195
Spatiotemporal Phylogenetic Analysis and Molecular Characterisation of Infectious Bursal Disease Viruses Based on the VP2 Hyper-Variable Region.

PubMed

Alfonso-Morales, Abdulahi; Martínez-Pérez, Orlando; Dolz, Roser; Valle, Rosa; Perera, Carmen L; Bertran, Kateri; Frías, Maria T; Majó, Natàlia; Ganges, Llilianne; Pérez, Lester J

2013-01-01

Infectious bursal disease is a highly contagious and acute viral disease caused by the infectious bursal disease virus (IBDV); it affects all major poultry producing areas of the world. The current study was designed to rigorously measure the global phylogeographic dynamics of IBDV strains to gain insight into viral population expansion as well as the emergence, spread and pattern of the geographical structure of very virulent IBDV (vvIBDV) strains. Sequences of the hyper-variable region of the VP2 (HVR-VP2) gene from IBDV strains isolated from diverse geographic locations were obtained from the GenBank database; Cuban sequences were obtained in the current work. All sequences were analysed by Bayesian phylogeographic analysis, implemented in the Bayesian Evolutionary Analysis Sampling Trees (BEAST), Bayesian Tip-association Significance testing (BaTS) and Spatial Phylogenetic Reconstruction of Evolutionary Dynamics (SPREAD) software packages. Selection pressure on the HVR-VP2 was also assessed. The phylogeographic association-trait analysis showed that viruses sampled from individual countries tend to cluster together, suggesting a geographic pattern for IBDV strains. Spatial analysis from this study revealed that strains carrying sequences that were linked to increased virulence of IBDV appeared in Iran in 1981 and spread to Western Europe (Belgium) in 1987, Africa (Egypt) around 1990, East Asia (China and Japan) in 1993, the Caribbean Region (Cuba) by 1995 and South America (Brazil) around 2000. Selection pressure analysis showed that several codons in the HVR-VP2 region were under purifying selection. To our knowledge, this work is the first study applying the Bayesian phylogeographic reconstruction approach to analyse the emergence and spread of vvIBDV strains worldwide.
Peptides derivatized with bicyclic quaternary ammonium ionization tags. Sequencing via tandem mass spectrometry.

PubMed

Setner, Bartosz; Rudowska, Magdalena; Klem, Ewelina; Cebrat, Marek; Szewczuk, Zbigniew

2014-10-01

Improving the sensitivity of detection and fragmentation of peptides to provide reliable sequencing of peptides is an important goal of mass spectrometric analysis. Peptides derivatized by bicyclic quaternary ammonium ionization tags: 1-azabicyclo[2.2.2]octane (ABCO) or 1,4-diazabicyclo[2.2.2]octane (DABCO), are characterized by an increased detection sensitivity in electrospray ionization mass spectrometry (ESI-MS) and longer retention times on the reverse-phase (RP) chromatography columns. The improvement of the detection limit was observed even for peptides dissolved in 10 mM NaCl. Collision-induced dissociation tandem mass spectrometry of quaternary ammonium salts derivatives of peptides showed dominant a- and b-type ions, allowing facile sequencing of peptides. The bicyclic ionization tags are stable in collision-induced dissociation experiments, and the resulted fragmentation pattern is not significantly influenced by either acidic or basic amino acid residues in the peptide sequence. Obtained results indicate the general usefulness of the bicyclic quaternary ammonium ionization tags for ESI-MS/MS sequencing of peptides. Copyright © 2014 John Wiley & Sons, Ltd.
Application of Cydia pomonella expressed sequence tags: identification and expression of three general odorant binding proteins in codling moth

USDA-ARS?s Scientific Manuscript database

The codling moth, Cydia pomonella, is one of the most important pests of pome fruits in the world, yet the molecular genetics and physiology of this insect remains poorly understood. A combined assembly of 8340 expressed sequence tags (ESTs) was generated from Roche 454 GS-FLX sequencing of 8 tissu...
Preparation of next-generation sequencing libraries using Nextera™ technology: simultaneous DNA fragmentation and adaptor tagging by in vitro transposition.

PubMed

Caruccio, Nicholas

2011-01-01

DNA library preparation is a common entry point and bottleneck for next-generation sequencing. Current methods generally consist of distinct steps that often involve significant sample loss and hands-on time: DNA fragmentation, end-polishing, and adaptor-ligation. In vitro transposition with Nextera™ Transposomes simultaneously fragments and covalently tags the target DNA, thereby combining these three distinct steps into a single reaction. Platform-specific sequencing adaptors can be added, and the sample can be enriched and bar-coded using limited-cycle PCR to prepare di-tagged DNA fragment libraries. Nextera technology offers a streamlined, efficient, and high-throughput method for generating bar-coded libraries compatible with multiple next-generation sequencing platforms.
Single nucleotide polymorphisms from Theobroma cacao expressed sequence tags associated with witches' broom disease in cacao.

PubMed

Lima, L S; Gramacho, K P; Carels, N; Novais, R; Gaiotto, F A; Lopes, U V; Gesteira, A S; Zaidan, H A; Cascardo, J C M; Pires, J L; Micheli, F

2009-07-14

In order to increase the efficiency of cacao tree resistance to witches' broom disease, which is caused by Moniliophthora perniciosa (Tricholomataceae), we looked for molecular markers that could help in the selection of resistant cacao genotypes. Among the different markers useful for developing marker-assisted selection, single nucleotide polymorphisms (SNPs) constitute the most common type of sequence difference between alleles and can be easily detected by in silico analysis from expressed sequence tag libraries. We report the first detection and analysis of SNPs from cacao-M. perniciosa interaction expressed sequence tags, using bioinformatics. Selection based on analysis of these SNPs should be useful for developing cacao varieties resistant to this devastating disease.
SPlinted Ligation Adapter Tagging (SPLAT), a novel library preparation method for whole genome bisulphite sequencing

PubMed Central

Manlig, Erika; Wahlberg, Per

2017-01-01

Abstract Sodium bisulphite treatment of DNA combined with next generation sequencing (NGS) is a powerful combination for the interrogation of genome-wide DNA methylation profiles. Library preparation for whole genome bisulphite sequencing (WGBS) is challenging due to side effects of the bisulphite treatment, which leads to extensive DNA damage. Recently, a new generation of methods for bisulphite sequencing library preparation have been devised. They are based on initial bisulphite treatment of the DNA, followed by adaptor tagging of single stranded DNA fragments, and enable WGBS using low quantities of input DNA. In this study, we present a novel approach for quick and cost effective WGBS library preparation that is based on splinted adaptor tagging (SPLAT) of bisulphite-converted single-stranded DNA. Moreover, we validate SPLAT against three commercially available WGBS library preparation techniques, two of which are based on bisulphite treatment prior to adaptor tagging and one is a conventional WGBS method. PMID:27899585
An expressed sequence tag (EST) data mining strategy succeeding in the discovery of new G-protein coupled receptors.

PubMed

Wittenberger, T; Schaller, H C; Hellebrand, S

2001-03-30

We have developed a comprehensive expressed sequence tag database search method and used it for the identification of new members of the G-protein coupled receptor superfamily. Our approach proved to be especially useful for the detection of expressed sequence tag sequences that do not encode conserved parts of a protein, making it an ideal tool for the identification of members of divergent protein families or of protein parts without conserved domain structures in the expressed sequence tag database. At least 14 of the expressed sequence tags found with this strategy are promising candidates for new putative G-protein coupled receptors. Here, we describe the sequence and expression analysis of five new members of this receptor superfamily, namely GPR84, GPR86, GPR87, GPR90 and GPR91. We also studied the genomic structure and chromosomal localization of the respective genes applying in silico methods. A cluster of six closely related G-protein coupled receptors was found on the human chromosome 3q24-3q25. It consists of four orphan receptors (GPR86, GPR87, GPR91, and H963), the purinergic receptor P2Y1, and the uridine 5'-diphosphoglucose receptor KIAA0001. It seems likely that these receptors evolved from a common ancestor and therefore might have related ligands. In conclusion, we describe a data mining procedure that proved to be useful for the identification and first characterization of new genes and is well applicable for other gene families. Copyright 2001 Academic Press.
An efficient annotation and gene-expression derivation tool for Illumina Solexa datasets

PubMed Central

2010-01-01

Background The data produced by an Illumina flow cell with all eight lanes occupied, produces well over a terabyte worth of images with gigabytes of reads following sequence alignment. The ability to translate such reads into meaningful annotation is therefore of great concern and importance. Very easily, one can get flooded with such a great volume of textual, unannotated data irrespective of read quality or size. CASAVA, a optional analysis tool for Illumina sequencing experiments, enables the ability to understand INDEL detection, SNP information, and allele calling. To not only extract from such analysis, a measure of gene expression in the form of tag-counts, but furthermore to annotate such reads is therefore of significant value. Findings We developed TASE (Tag counting and Analysis of Solexa Experiments), a rapid tag-counting and annotation software tool specifically designed for Illumina CASAVA sequencing datasets. Developed in Java and deployed using jTDS JDBC driver and a SQL Server backend, TASE provides an extremely fast means of calculating gene expression through tag-counts while annotating sequenced reads with the gene's presumed function, from any given CASAVA-build. Such a build is generated for both DNA and RNA sequencing. Analysis is broken into two distinct components: DNA sequence or read concatenation, followed by tag-counting and annotation. The end result produces output containing the homology-based functional annotation and respective gene expression measure signifying how many times sequenced reads were found within the genomic ranges of functional annotations. Conclusions TASE is a powerful tool to facilitate the process of annotating a given Illumina Solexa sequencing dataset. Our results indicate that both homology-based annotation and tag-count analysis are achieved in very efficient times, providing researchers to delve deep in a given CASAVA-build and maximize information extraction from a sequencing dataset. TASE is specially designed to translate sequence data in a CASAVA-build into functional annotations while producing corresponding gene expression measurements. Achieving such analysis is executed in an ultrafast and highly efficient manner, whether the analysis be a single-read or paired-end sequencing experiment. TASE is a user-friendly and freely available application, allowing rapid analysis and annotation of any given Illumina Solexa sequencing dataset with ease. PMID:20598141
Sequence tagging reveals unexpected modifications in toxicoproteomics

PubMed Central

Dasari, Surendra; Chambers, Matthew C.; Codreanu, Simona G.; Liebler, Daniel C.; Collins, Ben C.; Pennington, Stephen R.; Gallagher, William M.; Tabb, David L.

2010-01-01

Toxicoproteomic samples are rich in posttranslational modifications (PTMs) of proteins. Identifying these modifications via standard database searching can incur significant performance penalties. Here we describe the latest developments in TagRecon, an algorithm that leverages inferred sequence tags to identify modified peptides in toxicoproteomic data sets. TagRecon identifies known modifications more effectively than the MyriMatch database search engine. TagRecon outperformed state of the art software in recognizing unanticipated modifications from LTQ, Orbitrap, and QTOF data sets. We developed user-friendly software for detecting persistent mass shifts from samples. We follow a three-step strategy for detecting unanticipated PTMs in samples. First, we identify the proteins present in the sample with a standard database search. Next, identified proteins are interrogated for unexpected PTMs with a sequence tag-based search. Finally, additional evidence is gathered for the detected mass shifts with a refinement search. Application of this technology on toxicoproteomic data sets revealed unintended cross-reactions between proteins and sample processing reagents. Twenty five proteins in rat liver showed signs of oxidative stress when exposed to potentially toxic drugs. These results demonstrate the value of mining toxicoproteomic data sets for modifications. PMID:21214251
Linear reduction method for predictive and informative tag SNP selection.

PubMed

He, Jingwu; Westbrooks, Kelly; Zelikovsky, Alexander

2005-01-01

Constructing a complete human haplotype map is helpful when associating complex diseases with their related SNPs. Unfortunately, the number of SNPs is very large and it is costly to sequence many individuals. Therefore, it is desirable to reduce the number of SNPs that should be sequenced to a small number of informative representatives called tag SNPs. In this paper, we propose a new linear algebra-based method for selecting and using tag SNPs. We measure the quality of our tag SNP selection algorithm by comparing actual SNPs with SNPs predicted from selected linearly independent tag SNPs. Our experiments show that for sufficiently long haplotypes, knowing only 0.4% of all SNPs the proposed linear reduction method predicts an unknown haplotype with the error rate below 2% based on 10% of the population.
Comparison of base composition analysis and Sanger sequencing of mitochondrial DNA for four U.S. population groups.

PubMed

Kiesler, Kevin M; Coble, Michael D; Hall, Thomas A; Vallone, Peter M

2014-01-01

A set of 711 samples from four U.S. population groups was analyzed using a novel mass spectrometry based method for mitochondrial DNA (mtDNA) base composition profiling. Comparison of the mass spectrometry results with Sanger sequencing derived data yielded a concordance rate of 99.97%. Length heteroplasmy was identified in 46% of samples and point heteroplasmy was observed in 6.6% of samples in the combined mass spectral and Sanger data set. Using discrimination capacity as a metric, Sanger sequencing of the full control region had the highest discriminatory power, followed by the mass spectrometry base composition method, which was more discriminating than Sanger sequencing of just the hypervariable regions. This trend is in agreement with the number of nucleotides covered by each of the three assays. Published by Elsevier Ireland Ltd.
Analysis and Functional Annotation of an Expressed Sequence Tag Collection for Tropical Crop Sugarcane

PubMed Central

Vettore, André L.; da Silva, Felipe R.; Kemper, Edson L.; Souza, Glaucia M.; da Silva, Aline M.; Ferro, Maria Inês T.; Henrique-Silva, Flavio; Giglioti, Éder A.; Lemos, Manoel V.F.; Coutinho, Luiz L.; Nobrega, Marina P.; Carrer, Helaine; França, Suzelei C.; Bacci, Maurício; Goldman, Maria Helena S.; Gomes, Suely L.; Nunes, Luiz R.; Camargo, Luis E.A.; Siqueira, Walter J.; Van Sluys, Marie-Anne; Thiemann, Otavio H.; Kuramae, Eiko E.; Santelli, Roberto V.; Marino, Celso L.; Targon, Maria L.P.N.; Ferro, Jesus A.; Silveira, Henrique C.S.; Marini, Danyelle C.; Lemos, Eliana G.M.; Monteiro-Vitorello, Claudia B.; Tambor, José H.M.; Carraro, Dirce M.; Roberto, Patrícia G.; Martins, Vanderlei G.; Goldman, Gustavo H.; de Oliveira, Regina C.; Truffi, Daniela; Colombo, Carlos A.; Rossi, Magdalena; de Araujo, Paula G.; Sculaccio, Susana A.; Angella, Aline; Lima, Marleide M.A.; de Rosa, Vicente E.; Siviero, Fábio; Coscrato, Virginia E.; Machado, Marcos A.; Grivet, Laurent; Di Mauro, Sonia M.Z.; Nobrega, Francisco G.; Menck, Carlos F.M.; Braga, Marilia D.V.; Telles, Guilherme P.; Cara, Frank A.A.; Pedrosa, Guilherme; Meidanis, João; Arruda, Paulo

2003-01-01

To contribute to our understanding of the genome complexity of sugarcane, we undertook a large-scale expressed sequence tag (EST) program. More than 260,000 cDNA clones were partially sequenced from 26 standard cDNA libraries generated from different sugarcane tissues. After the processing of the sequences, 237,954 high-quality ESTs were identified. These ESTs were assembled into 43,141 putative transcripts. Of the assembled sequences, 35.6% presented no matches with existing sequences in public databases. A global analysis of the whole SUCEST data set indicated that 14,409 assembled sequences (33% of the total) contained at least one cDNA clone with a full-length insert. Annotation of the 43,141 assembled sequences associated almost 50% of the putative identified sugarcane genes with protein metabolism, cellular communication/signal transduction, bioenergetics, and stress responses. Inspection of the translated assembled sequences for conserved protein domains revealed 40,821 amino acid sequences with 1415 Pfam domains. Reassembling the consensus sequences of the 43,141 transcripts revealed a 22% redundancy in the first assembling. This indicated that possibly 33,620 unique genes had been identified and indicated that >90% of the sugarcane expressed genes were tagged. PMID:14613979
Exploring the Presence of microDNAs in Prostate Cancer Cell Lines, Tissue, and Sera of Prostate Cancer Patients and its Possible Application as Biomarker

DTIC Science & Technology

2015-08-01

Sequence tags were mapped on the human reference genome using the Novoalign software. Only those tags... the linear islands to create a novel junctional sequence that does not exist in the genome . Thus the PE- sequence of a fragment that breaks at or...identified in cancer cell lines. (b) Median percent GC content of microDNAs and the genomic sequences up- or downstream of the source loci are
Evolution of ribonuclease in relation to polypeptide folding mechanisms.

NASA Technical Reports Server (NTRS)

Barnard, E. A.; Cohen, M. S.; Gold, M. H.; Kim, J.-K.

1972-01-01

Comparisons of the N-terminal region of pancreatic RNAase in seven species are presented, taking into account cow, bison, deer, rat, pig, kangaroo, and turtle. The available limited evidence on hypervariable regions indicates that there is still an evolutionary constraint on them. It is proposed that there is a selection pressure acting on all regions of a protein sequence in evolution. Mutations that tend to obstruct the folding process can lead to various intensities of selection pressure.
Mitochondrial control-region sequence variation in aboriginal Australians.

PubMed Central

van Holst Pellekaan, S; Frommer, M; Sved, J; Boettcher, B

1998-01-01

The mitochondrial D-loop hypervariable segment 1 (mt HVS1) between nucleotides 15997 and 16377 has been examined in aboriginal Australian people from the Darling River region of New South Wales (riverine) and from Yuendumu in central Australia (desert). Forty-seven unique HVS1 types were identified, varying at 49 nucleotide positions. Pairwise analysis by calculation of BEPPI (between population proportion index) reveals statistically significant structure in the populations, although some identical HVS1 types are seen in the two contrasting regions. mt HVS1 types may reflect more-ancient distributions than do linguistic diversity and other culturally distinguishing attributes. Comparison with sequences from five published global studies reveals that these Australians demonstrate greatest divergence from some Africans, least from Papua New Guinea highlanders, and only slightly more from some Pacific groups (Indonesian, Asian, Samoan, and coastal Papua New Guinea), although the HVS1 types vary at different nucleotide sites. Construction of a median network, displaying three main groups, suggests that several hypervariable nucleotide sites within the HVS1 are likely to have undergone mutation independently, making phylogenetic comparison with global samples by conventional methods difficult. Specific nucleotide-site variants are major separators in median networks constructed from Australian HVS1 types alone and for one global selection. The distribution of these, requiring extended study, suggests that they may be signatures of different groups of prehistoric colonizers into Australia, for which the time of colonization remains elusive. PMID:9463317
Design and characterization of a nanopore-coupled polymerase for single-molecule DNA sequencing by synthesis on an electrode array

PubMed Central

Stranges, P. Benjamin; Palla, Mirkó; Kalachikov, Sergey; Nivala, Jeff; Dorwart, Michael; Trans, Andrew; Kumar, Shiv; Porel, Mintu; Chien, Minchen; Tao, Chuanjuan; Morozova, Irina; Li, Zengmin; Shi, Shundi; Aberra, Aman; Arnold, Cleoma; Yang, Alexander; Aguirre, Anne; Harada, Eric T.; Korenblum, Daniel; Pollard, James; Bhat, Ashwini; Gremyachinskiy, Dmitriy; Bibillo, Arek; Chen, Roger; Davis, Randy; Russo, James J.; Fuller, Carl W.; Roever, Stefan; Ju, Jingyue; Church, George M.

2016-01-01

Scalable, high-throughput DNA sequencing is a prerequisite for precision medicine and biomedical research. Recently, we presented a nanopore-based sequencing-by-synthesis (Nanopore-SBS) approach, which used a set of nucleotides with polymer tags that allow discrimination of the nucleotides in a biological nanopore. Here, we designed and covalently coupled a DNA polymerase to an α-hemolysin (αHL) heptamer using the SpyCatcher/SpyTag conjugation approach. These porin–polymerase conjugates were inserted into lipid bilayers on a complementary metal oxide semiconductor (CMOS)-based electrode array for high-throughput electrical recording of DNA synthesis. The designed nanopore construct successfully detected the capture of tagged nucleotides complementary to a DNA base on a provided template. We measured over 200 tagged-nucleotide signals for each of the four bases and developed a classification method to uniquely distinguish them from each other and background signals. The probability of falsely identifying a background event as a true capture event was less than 1.2%. In the presence of all four tagged nucleotides, we observed sequential additions in real time during polymerase-catalyzed DNA synthesis. Single-polymerase coupling to a nanopore, in combination with the Nanopore-SBS approach, can provide the foundation for a low-cost, single-molecule, electronic DNA-sequencing platform. PMID:27729524
Mutation of CD2AP and SH3KBP1 Binding Motif in Alphavirus nsP3 Hypervariable Domain Results in Attenuated Virus.

PubMed

Mutso, Margit; Morro, Ainhoa Moliner; Smedberg, Cecilia; Kasvandik, Sergo; Aquilimeba, Muriel; Teppor, Mona; Tarve, Liisi; Lulla, Aleksei; Lulla, Valeria; Saul, Sirle; Thaa, Bastian; McInerney, Gerald M; Merits, Andres; Varjak, Margus

2018-04-27

Infection by Chikungunya virus (CHIKV) of the Old World alphaviruses (family Togaviridae) in humans can cause arthritis and arthralgia. The virus encodes four non-structural proteins (nsP) (nsP1, nsp2, nsP3 and nsP4) that act as subunits of the virus replicase. These proteins also interact with numerous host proteins and some crucial interactions are mediated by the unstructured C-terminal hypervariable domain (HVD) of nsP3. In this study, a human cell line expressing EGFP tagged with CHIKV nsP3 HVD was established. Using quantitative proteomics, it was found that CHIKV nsP3 HVD can bind cytoskeletal proteins, including CD2AP, SH3KBP1, CAPZA1, CAPZA2 and CAPZB. The interaction with CD2AP was found to be most evident; its binding site was mapped to the second SH3 ligand-like element in nsP3 HVD. Further assessment indicated that CD2AP can bind to nsP3 HVDs of many different New and Old World alphaviruses. Mutation of the short binding element hampered the ability of the virus to establish infection. The mutation also abolished ability of CD2AP to co-localise with nsP3 and replication complexes of CHIKV; the same was observed for Semliki Forest virus (SFV) harbouring a similar mutation. Similar to CD2AP, its homolog SH3KBP1 also bound the identified motif in CHIKV and SFV nsP3.

Decision Tree Algorithm-Generated Single-Nucleotide Polymorphism Barcodes of rbcL Genes for 38 Brassicaceae Species Tagging.

PubMed

Yang, Cheng-Hong; Wu, Kuo-Chuan; Chuang, Li-Yeh; Chang, Hsueh-Wei

2018-01-01

DNA barcode sequences are accumulating in large data sets. A barcode is generally a sequence larger than 1000 base pairs and generates a computational burden. Although the DNA barcode was originally envisioned as straightforward species tags, the identification usage of barcode sequences is rarely emphasized currently. Single-nucleotide polymorphism (SNP) association studies provide us an idea that the SNPs may be the ideal target of feature selection to discriminate between different species. We hypothesize that SNP-based barcodes may be more effective than the full length of DNA barcode sequences for species discrimination. To address this issue, we tested a r ibulose diphosphate carboxylase ( rbcL ) S NP b arcoding (RSB) strategy using a decision tree algorithm. After alignment and trimming, 31 SNPs were discovered in the rbcL sequences from 38 Brassicaceae plant species. In the decision tree construction, these SNPs were computed to set up the decision rule to assign the sequences into 2 groups level by level. After algorithm processing, 37 nodes and 31 loci were required for discriminating 38 species. Finally, the sequence tags consisting of 31 rbcL SNP barcodes were identified for discriminating 38 Brassicaceae species based on the decision tree-selected SNP pattern using RSB method. Taken together, this study provides the rational that the SNP aspect of DNA barcode for rbcL gene is a useful and effective sequence for tagging 38 Brassicaceae species.
TagDigger: user-friendly extraction of read counts from GBS and RAD-seq data.

PubMed

Clark, Lindsay V; Sacks, Erik J

2016-01-01

In genotyping-by-sequencing (GBS) and restriction site-associated DNA sequencing (RAD-seq), read depth is important for assessing the quality of genotype calls and estimating allele dosage in polyploids. However, existing pipelines for GBS and RAD-seq do not provide read counts in formats that are both accurate and easy to access. Additionally, although existing pipelines allow previously-mined SNPs to be genotyped on new samples, they do not allow the user to manually specify a subset of loci to examine. Pipelines that do not use a reference genome assign arbitrary names to SNPs, making meta-analysis across projects difficult. We created the software TagDigger, which includes three programs for analyzing GBS and RAD-seq data. The first script, tagdigger_interactive.py, rapidly extracts read counts and genotypes from FASTQ files using user-supplied sets of barcodes and tags. Input and output is in CSV format so that it can be opened by spreadsheet software. Tag sequences can also be imported from the Stacks, TASSEL-GBSv2, TASSEL-UNEAK, or pyRAD pipelines, and a separate file can be imported listing the names of markers to retain. A second script, tag_manager.py, consolidates marker names and sequences across multiple projects. A third script, barcode_splitter.py, assists with preparing FASTQ data for deposit in a public archive by splitting FASTQ files by barcode and generating MD5 checksums for the resulting files. TagDigger is open-source and freely available software written in Python 3. It uses a scalable, rapid search algorithm that can process over 100 million FASTQ reads per hour. TagDigger will run on a laptop with any operating system, does not consume hard drive space with intermediate files, and does not require programming skill to use.
Microbial Diversity in Deep-sea Methane Seep Sediments Presented by SSU rRNA Gene Tag Sequencing

PubMed Central

Nunoura, Takuro; Takaki, Yoshihiro; Kazama, Hiromi; Hirai, Miho; Ashi, Juichiro; Imachi, Hiroyuki; Takai, Ken

2012-01-01

Microbial community structures in methane seep sediments in the Nankai Trough were analyzed by tag-sequencing analysis for the small subunit (SSU) rRNA gene using a newly developed primer set. The dominant members of Archaea were Deep-sea Hydrothermal Vent Euryarchaeotic Group 6 (DHVEG 6), Marine Group I (MGI) and Deep Sea Archaeal Group (DSAG), and those in Bacteria were Alpha-, Gamma-, Delta- and Epsilonproteobacteria, Chloroflexi, Bacteroidetes, Planctomycetes and Acidobacteria. Diversity and richness were examined by 8,709 and 7,690 tag-sequences from sediments at 5 and 25 cm below the seafloor (cmbsf), respectively. The estimated diversity and richness in the methane seep sediment are as high as those in soil and deep-sea hydrothermal environments, although the tag-sequences obtained in this study were not sufficient to show whole microbial diversity in this analysis. We also compared the diversity and richness of each taxon/division between the sediments from the two depths, and found that the diversity and richness of some taxa/divisions varied significantly along with the depth. PMID:22510646
Sequence analysis of diacylglycerol acyltransferases

USDA-ARS?s Scientific Manuscript database

Diacylglycerol acyltransferases (DGATs) catalyze the final step of triacylglycerol (TAG) biosynthesis in eukaryotes. DGATs esterify sn-1,2-diacylglycerol with a long-chain fatty acyl-CoA. Plants and animals deficient in DGATs accumulate less TAG and over-expression of DGATs increases TAG. DGAT knock...
Characterization of a prototype strain of hepatitis E virus.

PubMed

Tsarev, S A; Emerson, S U; Reyes, G R; Tsareva, T S; Legters, L J; Malik, I A; Iqbal, M; Purcell, R H

1992-01-15

A strain of hepatitis E virus (SAR-55) implicated in an epidemic of enterically transmitted non-A, non-B hepatitis, now called hepatitis E, was characterized extensively. Six cynomolgus monkeys (Macaca fascicularis) were infected with a strain of hepatitis E virus from Pakistan. Reverse transcription-polymerase chain reaction was used to determine the pattern of virus shedding in feces, bile, and serum relative to hepatitis and induction of specific antibodies. Virtually the entire genome of SAR-55 (7195 nucleotides) was sequenced. Comparison of the sequence of SAR-55 with that of a Burmese strain revealed a high level of homology except for one region encoding 100 amino acids of a putative nonstructural polyprotein. Identification of this region as hypervariable was obtained by partial sequencing of a third isolate of hepatitis E virus from Kirgizia.
Improved microarray methods for profiling the yeast knockout strain collection

PubMed Central

Yuan, Daniel S.; Pan, Xuewen; Ooi, Siew Loon; Peyser, Brian D.; Spencer, Forrest A.; Irizarry, Rafael A.; Boeke, Jef D.

2005-01-01

A remarkable feature of the Yeast Knockout strain collection is the presence of two unique 20mer TAG sequences in almost every strain. In principle, the relative abundances of strains in a complex mixture can be profiled swiftly and quantitatively by amplifying these sequences and hybridizing them to microarrays, but TAG microarrays have not been widely used. Here, we introduce a TAG microarray design with sophisticated controls and describe a robust method for hybridizing high concentrations of dye-labeled TAGs in single-stranded form. We also highlight the importance of avoiding PCR contamination and provide procedures for detection and eradication. Validation experiments using these methods yielded false positive (FP) and false negative (FN) rates for individual TAG detection of 3–6% and 15–18%, respectively. Analysis demonstrated that cross-hybridization was the chief source of FPs, while TAG amplification defects were the main cause of FNs. The materials, protocols, data and associated software described here comprise a suite of experimental resources that should facilitate the use of TAG microarrays for a wide variety of genetic screens. PMID:15994458
Hypervariable and highly divergent intron-exon organizations in the chordate Oikopleura dioica.

PubMed

Edvardsen, Rolf B; Lerat, Emmanuelle; Maeland, Anne Dorthea; Flåt, Mette; Tewari, Rita; Jensen, Marit F; Lehrach, Hans; Reinhardt, Richard; Seo, Hee-Chan; Chourrout, Daniel

2004-10-01

Oikopleura dioica is a pelagic tunicate with a very small genome and a very short life cycle. In order to investigate the intron-exon organizations in Oikopleura, we have isolated and characterized ribosomal protein EF-1alpha, Hox, and alpha-tubulin genes. Their intron positions have been compared with those of the same genes from various invertebrates and vertebrates, including four species with entirely sequenced genomes. Oikopleura genes, like Caenorhabditis genes, have introns at a large number of nonconserved positions, which must originate from late insertions or intron sliding of ancient insertions. Both species exhibit hypervariable intron-exon organization within their alpha-tubulin gene family. This is due to localization of most nonconserved intron positions in single members of this gene family. The hypervariability and divergence of intron positions in Oikopleura and Caenorhabditis may be related to the predominance of short introns, the processing of which is not very dependent upon the exonic environment compared to large introns. Also, both species have an undermethylated genome, and the control of methylation-induced point mutations imposes a control on exon size, at least in vertebrate genes. That introns placed at such variable positions in Oikopleura or C. elegans may serve a specific purpose is not easy to infer from our current knowledge and hypotheses on intron functions. We propose that new introns are retained in species with very short life cycles, because illegitimate exchanges including gene conversion are repressed. We also speculate that introns placed at gene-specific positions may contribute to suppressing these exchanges and thereby favor their own persistence.
Mitochondrial DNA typing from human axillary, pubic and head hair shafts - success rates and sequence comparisons.

PubMed

Pfeiffer, H; Hühne, J; Ortmann, C; Waterkamp, K; Brinkmann, B

1999-01-01

The analysis of mitochondrial DNA (mtDNA) from shed hairs has gained high importance in forensic casework since telogen hairs are one of the most common types of evidence left at the crime scene. In this systematic study of hair shafts from 20 individuals, the correlation of mtDNA recovery with hair morphology (length, diameter, volume, colour), with sex, and with body localisation (head, armpit, pubis) was investigated. The highest average success rate of hypervariable region 1 (HV 1) sequencing was found in head hair shafts (75%) followed by pubic (66%) and axillary hair shafts (52%). No statistically significant correlation between morphological parameters or sex and the success rate of sequencing was found. MtDNA sequences of buccal cells, head, pubic and axillary hair shafts did not show intraindividual differences. Heteroplasmic base positions were observed neither in the hair shafts nor in control samples of buccal cells.
Haplotag: Software for Haplotype-Based Genotyping-by-Sequencing Analysis

PubMed Central

Tinker, Nicholas A.; Bekele, Wubishet A.; Hattori, Jiro

2016-01-01

Genotyping-by-sequencing (GBS), and related methods, are based on high-throughput short-read sequencing of genomic complexity reductions followed by discovery of single nucleotide polymorphisms (SNPs) within sequence tags. This provides a powerful and economical approach to whole-genome genotyping, facilitating applications in genomics, diversity analysis, and molecular breeding. However, due to the complexity of analyzing large data sets, applications of GBS may require substantial time, expertise, and computational resources. Haplotag, the novel GBS software described here, is freely available, and operates with minimal user-investment on widely available computer platforms. Haplotag is unique in fulfilling the following set of criteria: (1) operates without a reference genome; (2) can be used in a polyploid species; (3) provides a discovery mode, and a production mode; (4) discovers polymorphisms based on a model of tag-level haplotypes within sequenced tags; (5) reports SNPs as well as haplotype-based genotypes; and (6) provides an intuitive visual “passport” for each inferred locus. Haplotag is optimized for use in a self-pollinating plant species. PMID:26818073
Molecular characterization of infectious bursal disease virus isolates from Nepal based on hypervariable region of VP2 gene.

PubMed

Sharma, K; Hair-Bejo, M; Omar, A R; Aini, I

2005-01-01

Two Infectious bursal disease virus (IBDV) isolates, NP1SSH and NP2K were obtained from a severe infectious bursal disease (IBD) outbreak in Nepal in 2002. The hypervariable (HV) region of VP2 gene (1326 bp) of the isolates was generated by RT-PCR and sequenced. The obtained nucleotide sequences were compared with those of twenty other IBDV isolates/strains. Phylogenetic analysis based on this comparison revealed that NP1SSH and NP2K clustered with very virulent (vv) IBDV strains of serotype 1. In contrast, classical, Australian classical and attenuated strains of serotype 1 and avirulent IBDV strains of serotype 2 formed a different cluster. The deduced amino acid sequences of the two isolates showed a 98.3% identity with each other and 97.1% and 98.3% identities, respectively with very virulent IBDV (vvIBDV) isolates/strains. Three amino acids substitutions at positions 300 (E-->A), 308 (I-->F) and 334 (A-->P) within the HV region were common for both the isolates. The amino acids substitutions at positions 27 (S-->T), 28 (I-->T), 31 (D-->A), 36 (H-->Y), 135 (E-->G), 223 (G-->S), 225 (V-->I), 351 (L-->I), 352 (V-->E) and 399 (I-->S) for NP1SSH and at position 438 (I-->S) for NP2K were unique and differed from other IBDV isolates/strains. NP1SSH and NP2K showed highest similarity (97.8%) with the BD399 strain from Bangladesh as compared with other vvIBDV isolates/strains. We conclude that the NP1SSH and NP2K isolates of IBDV from Nepal represent vvIBDV of serotype 1.
A combination of LongSAGE with Solexa sequencing is well suited to explore the depth and the complexity of transcriptome

PubMed Central

Hanriot, Lucie; Keime, Céline; Gay, Nadine; Faure, Claudine; Dossat, Carole; Wincker, Patrick; Scoté-Blachon, Céline; Peyron, Christelle; Gandrillon, Olivier

2008-01-01

Background "Open" transcriptome analysis methods allow to study gene expression without a priori knowledge of the transcript sequences. As of now, SAGE (Serial Analysis of Gene Expression), LongSAGE and MPSS (Massively Parallel Signature Sequencing) are the mostly used methods for "open" transcriptome analysis. Both LongSAGE and MPSS rely on the isolation of 21 pb tag sequences from each transcript. In contrast to LongSAGE, the high throughput sequencing method used in MPSS enables the rapid sequencing of very large libraries containing several millions of tags, allowing deep transcriptome analysis. However, a bias in the complexity of the transcriptome representation obtained by MPSS was recently uncovered. Results In order to make a deep analysis of mouse hypothalamus transcriptome avoiding the limitation introduced by MPSS, we combined LongSAGE with the Solexa sequencing technology and obtained a library of more than 11 millions of tags. We then compared it to a LongSAGE library of mouse hypothalamus sequenced with the Sanger method. Conclusion We found that Solexa sequencing technology combined with LongSAGE is perfectly suited for deep transcriptome analysis. In contrast to MPSS, it gives a complex representation of transcriptome as reliable as a LongSAGE library sequenced by the Sanger method. PMID:18796152
Genome-Wide Mutagenesis in Borrelia burgdorferi.

PubMed

Lin, Tao; Gao, Lihui

2018-01-01

Signature-tagged mutagenesis (STM) is a functional genomics approach to identify bacterial virulence determinants and virulence factors by simultaneously screening multiple mutants in a single host animal, and has been utilized extensively for the study of bacterial pathogenesis, host-pathogen interactions, and spirochete and tick biology. The signature-tagged transposon mutagenesis has been developed to investigate virulence determinants and pathogenesis of Borrelia burgdorferi. Mutants in genes important in virulence are identified by negative selection in which the mutants fail to colonize or disseminate in the animal host and tick vector. STM procedure combined with Luminex Flex ® Map™ technology and next-generation sequencing (e.g., Tn-seq) are the powerful high-throughput tools for the determination of Borrelia burgdorferi virulence determinants. The assessment of multiple tissue sites and two DNA resources at two different time points using Luminex Flex ® Map™ technology provides a robust data set. B. burgdorferi transposon mutant screening indicates that a high proportion of genes are the novel virulence determinants that are required for mouse and tick infection. In this protocol, an effective signature-tagged Himar1-based transposon suicide vector was developed and used to generate a sequence-defined library of nearly 4800 mutants in the infectious B. burgdorferi B31 clone. In STM, signature-tagged suicide vectors are constructed by inserting unique DNA sequences (tags) into the transposable elements. The signature-tagged transposon mutants are generated when transposon suicide vectors are transformed into an infectious B. burgdorferi clone, and the transposable element is transposed into the 5'-TA-3' sequence in the B. burgdorferi genome with the signature tag. The transposon library is created and consists of many sub-libraries, each sub-library has several hundreds of mutants with same tags. A group of mice or ticks are infected with a mixed population of mutants with different tags, after recovered from different tissues of infected mice and ticks, mutants from output pool and input pool are detected using high-throughput, semi-quantitative Luminex ® FLEXMAP™ or next-generation sequencing (Tn-seq) technologies. Thus far, we have created a high-density, sequence-defined transposon library of over 6600 STM mutants for the efficient genome-wide investigation of genes and gene products required for wild-type pathogenesis, host-pathogen interactions, in vitro growth, in vivo survival, physiology, morphology, chemotaxis, motility, structure, metabolism, gene regulation, plasmid maintenance and replication, etc. The insertion sites of 4480 transposon mutants have been determined. About 800 predicted protein-encoding genes in the genome were disrupted in the STM transposon library. The infectivity and some functions of 800 mutants in 500 genes have been determined. Analysis of these transposon mutants has yielded valuable information regarding the genes and gene products important in the pathogenesis and biology of B. burgdorferi and its tick vectors.
Increasing ecological inference from high throughput sequencing of fungi in the environment through a tagging approach

Treesearch

D. Lee Taylor; Michael G. Booth; Jack W. McFarland; Ian C. Herriott; Niall J. Lennon; Chad Nusbaum; Thomas G. Marr

2008-01-01

High throughput sequencing methods are widely used in analyses of microbial diversity but are generally applied to small numbers of samples, which precludes charaterization of patterns of microbial diversity across space and time. We have designed a primer-tagging approach that allows pooling and subsequent sorting of numerous samples, which is directed to...
SapTrap, a Toolkit for High-Throughput CRISPR/Cas9 Gene Modification in Caenorhabditis elegans.

PubMed

Schwartz, Matthew L; Jorgensen, Erik M

2016-04-01

In principle, clustered regularly interspaced short palindromic repeats (CRISPR)/Cas9 allows genetic tags to be inserted at any locus. However, throughput is limited by the laborious construction of repair templates and guide RNA constructs and by the identification of modified strains. We have developed a reagent toolkit and plasmid assembly pipeline, called "SapTrap," that streamlines the production of targeting vectors for tag insertion, as well as the selection of modified Caenorhabditis elegans strains. SapTrap is a high-efficiency modular plasmid assembly pipeline that produces single plasmid targeting vectors, each of which encodes both a guide RNA transcript and a repair template for a particular tagging event. The plasmid is generated in a single tube by cutting modular components with the restriction enzyme SapI, which are then "trapped" in a fixed order by ligation to generate the targeting vector. A library of donor plasmids supplies a variety of protein tags, a selectable marker, and regulatory sequences that allow cell-specific tagging at either the N or the C termini. All site-specific sequences, such as guide RNA targeting sequences and homology arms, are supplied as annealed synthetic oligonucleotides, eliminating the need for PCR or molecular cloning during plasmid assembly. Each tag includes an embedded Cbr-unc-119 selectable marker that is positioned to allow concurrent expression of both the tag and the marker. We demonstrate that SapTrap targeting vectors direct insertion of 3- to 4-kb tags at six different loci in 10-37% of injected animals. Thus SapTrap vectors introduce the possibility for high-throughput generation of CRISPR/Cas9 genome modifications. Copyright © 2016 by the Genetics Society of America.
Long-Term Evolution of the Hypervariable Region of Hepatitis C Virus in a Common-Source-Infected Cohort

PubMed Central

McAllister, Jane; Casino, Carmela; Davidson, Fiona; Power, Joan; Lawlor, Emer; Yap, Peng Lee; Simmonds, Peter; Smith, Donald B.

1998-01-01

The long-term evolution of the hepatitis C virus hypervariable region (HVR) and flanking regions of the E1 and E2 envelope proteins have been studied in a cohort of women infected from a common source of anti-D immunoglobulin. Whereas virus sequences in the infectious source were relatively homogeneous, distinct HVR variants were observed in each anti-D recipient, indicating that this region can evolve in multiple directions from the same point. Where HVR variants with dissimilar sequences were present in a single individual, the frequency of synonymous substitution in the flanking regions suggested that the lineages diverged more than a decade previously. Even where a single major HVR variant was present in an infected individual, this lineage was usually several years old. Multiple lineages can therefore coexist during long periods of chronic infection without replacement. The characteristics of amino acid substitution in the HVR were not consistent with the random accumulation of mutations and imply that amino acid replacement in the HVR was strongly constrained. Another variable region of E2 centered on codon 60 shows similar constraints, while HVR2 was relatively unconstrained. Several of these features are difficult to explain if a neutralizing immune response against the HVR is the only selective force operating on E2. The impact of PCR artifacts such as nucleotide misincorporation and the shuffling of dissimilar templates is discussed. PMID:9573256
A molecular epidemiology study based on VP2 gene sequences reveals that a new genotype of infectious bursal disease virus is dominantly prevalent in Italy.

PubMed

Lupini, Caterina; Giovanardi, Davide; Pesente, Patrizia; Bonci, Michela; Felice, Viviana; Rossi, Giulia; Morandini, Emilio; Cecchinato, Mattia; Catelli, Elena

2016-08-01

A distinctive infectious bursal disease (IBD) virus genotype (ITA) was detected in IBD-live vaccinated broilers in Italy without clinical signs of IBD. It was isolated in specific-pathogen-free eggs and molecularly characterized in the hypervariable region of the virus protein (VP) 2. Phylogenetic analysis showed that ITA strains clustered separately from other homologous reference sequences of IBDVs, either classical or very virulent, retrieved from GenBank or previously reported in Italy, and from vaccine strains. The new genotype shows peculiar molecular characteristics in key positions of the VP2 hypervariable region, which affect charged or potentially glycosylated amino acids virtually associated with important changes in virus properties. Characterization of 41 IBDV strains detected in Italy between 2013 and 2014 showed that ITA is emergent in densely populated poultry areas of Italy, being 68% of the IBDV detections made during routine diagnostic activity over a two-year period, in spite of the immunity induced by large-scale vaccination. Four very virulent strains (DV86) and one classical strain (HPR2), together with eight vaccine strains, were also detected. The currently available epidemiological and clinical data do not allow the degree of pathogenicity of the ITA genotype to be defined. Only in vivo experimental pathogenicity studies conducted in secure isolation conditions, through the evaluation of clinical signs and macro/microscopic lesions, will clarify conclusively the virulence of the new Italian genotype.
Characterization of a prototype strain of hepatitis E virus.

PubMed Central

Tsarev, S A; Emerson, S U; Reyes, G R; Tsareva, T S; Legters, L J; Malik, I A; Iqbal, M; Purcell, R H

1992-01-01

A strain of hepatitis E virus (SAR-55) implicated in an epidemic of enterically transmitted non-A, non-B hepatitis, now called hepatitis E, was characterized extensively. Six cynomolgus monkeys (Macaca fascicularis) were infected with a strain of hepatitis E virus from Pakistan. Reverse transcription-polymerase chain reaction was used to determine the pattern of virus shedding in feces, bile, and serum relative to hepatitis and induction of specific antibodies. Virtually the entire genome of SAR-55 (7195 nucleotides) was sequenced. Comparison of the sequence of SAR-55 with that of a Burmese strain revealed a high level of homology except for one region encoding 100 amino acids of a putative nonstructural polyprotein. Identification of this region as hypervariable was obtained by partial sequencing of a third isolate of hepatitis E virus from Kirgizia. Images PMID:1731327
The structure of the SBP-Tag–streptavidin complex reveals a novel helical scaffold bridging binding pockets on separate subunits

DOE Office of Scientific and Technical Information (OSTI.GOV)

Barrette-Ng, Isabelle H.; Wu, Sau-Ching; Tjia, Wai-Mui

2013-05-01

The structure of the SBP-Tag–streptavidin complex reveals a novel mode of peptide recognition in which a single peptide binds simultaneously to biotin-binding pockets from adjacent subunits of streptavidin. The molecular details of peptide recognition suggest how the SBP-Tag can be further modified to become an even more useful tag for a wider range of biotechnological applications. The 38-residue SBP-Tag binds to streptavidin more tightly (K{sub d} ≃ 2.5–4.9 nM) than most if not all other known peptide sequences. Crystallographic analysis at 1.75 Å resolution shows that the SBP-Tag binds to streptavidin in an unprecedented manner by simultaneously interacting with biotin-bindingmore » pockets from two separate subunits. An N-terminal HVV peptide sequence (residues 12–14) and a C-terminal HPQ sequence (residues 31–33) form the bulk of the direct interactions between the SBP-Tag and the two biotin-binding pockets. Surprisingly, most of the peptide spanning these two sites (residues 17–28) adopts a regular α-helical structure that projects three leucine side chains into a groove formed at the interface between two streptavidin protomers. The crystal structure shows that residues 1–10 and 35–38 of the original SBP-Tag identified through in vitro selection and deletion analysis do not appear to contact streptavidin and thus may not be important for binding. A 25-residue peptide comprising residues 11–34 (SBP-Tag2) was synthesized and shown using surface plasmon resonance to bind streptavidin with very similar affinity and kinetics when compared with the SBP-Tag. The SBP-Tag2 was also added to the C-terminus of β-lactamase and was shown to be just as effective as the full-length SBP-Tag in affinity purification. These results validate the molecular structure of the SBP-Tag–streptavidin complex and establish a minimal bivalent streptavidin-binding tag from which further rational design and optimization can proceed.« less
Application of Stochastic Labeling with Random-Sequence Barcodes for Simultaneous Quantification and Sequencing of Environmental 16S rRNA Genes.

PubMed

Hoshino, Tatsuhiko; Inagaki, Fumio

2017-01-01

Next-generation sequencing (NGS) is a powerful tool for analyzing environmental DNA and provides the comprehensive molecular view of microbial communities. For obtaining the copy number of particular sequences in the NGS library, however, additional quantitative analysis as quantitative PCR (qPCR) or digital PCR (dPCR) is required. Furthermore, number of sequences in a sequence library does not always reflect the original copy number of a target gene because of biases caused by PCR amplification, making it difficult to convert the proportion of particular sequences in the NGS library to the copy number using the mass of input DNA. To address this issue, we applied stochastic labeling approach with random-tag sequences and developed a NGS-based quantification protocol, which enables simultaneous sequencing and quantification of the targeted DNA. This quantitative sequencing (qSeq) is initiated from single-primer extension (SPE) using a primer with random tag adjacent to the 5' end of target-specific sequence. During SPE, each DNA molecule is stochastically labeled with the random tag. Subsequently, first-round PCR is conducted, specifically targeting the SPE product, followed by second-round PCR to index for NGS. The number of random tags is only determined during the SPE step and is therefore not affected by the two rounds of PCR that may introduce amplification biases. In the case of 16S rRNA genes, after NGS sequencing and taxonomic classification, the absolute number of target phylotypes 16S rRNA gene can be estimated by Poisson statistics by counting random tags incorporated at the end of sequence. To test the feasibility of this approach, the 16S rRNA gene of Sulfolobus tokodaii was subjected to qSeq, which resulted in accurate quantification of 5.0 × 103 to 5.0 × 104 copies of the 16S rRNA gene. Furthermore, qSeq was applied to mock microbial communities and environmental samples, and the results were comparable to those obtained using digital PCR and relative abundance based on a standard sequence library. We demonstrated that the qSeq protocol proposed here is advantageous for providing less-biased absolute copy numbers of each target DNA with NGS sequencing at one time. By this new experiment scheme in microbial ecology, microbial community compositions can be explored in more quantitative manner, thus expanding our knowledge of microbial ecosystems in natural environments.
Highly conserved D-loop-like nuclear mitochondrial sequences (Numts) in tiger (Panthera tigris).

PubMed

Zhang, Wenping; Zhang, Zhihe; Shen, Fujun; Hou, Rong; Lv, Xiaoping; Yue, Bisong

2006-08-01

Using oligonucleotide primers designed to match hypervariable segments I (HVS-1) of Panthera tigris mitochondrial DNA (mtDNA), we amplified two different PCR products (500 bp and 287 bp) in the tiger (Panthera tigris), but got only one PCR product (287 bp) in the leopard (Panthera pardus). Sequence analyses indicated that the sequence of 287 bp was a D-loop-like nuclear mitochondrial sequence (Numts), indicating a nuclear transfer that occurred approximately 4.8-17 million years ago in the tiger and 4.6-16 million years ago in the leopard. Although the mtDNA D-loop sequence has a rapid rate of evolution, the 287-bp Numts are highly conserved; they are nearly identical in tiger subspecies and only 1.742% different between tiger and leopard. Thus, such sequences represent molecular 'fossils' that can shed light on evolution of the mitochondrial genome and may be the most appropriate outgroup for phylogenetic analysis. This is also proved by comparing the phylogenetic trees reconstructed using the D-loop sequence of snow leopard and the 287-bp Numts as outgroup.

Distinctive archaebacterial species associated with anaerobic rumen protozoan Entodinium caudatum.

PubMed

Tóthová, T; Piknová, M; Kisidayová, S; Javorský, P; Pristas, P

2008-01-01

The diversity of archaebacteria associated with anaerobic rumen protozoan Entodinium caudatum in long term in vitro culture was investigated by denaturing gradient gel electrophoresis (DGGE) analysis of hypervariable V3 region of archaebacterial 16S rRNA gene. PCR was accomplished directly from DNA extracted from a single protozoal cell and from total community genomic DNA and the obtained fingerprints were compared. The analysis indicated the presence of a solitary intensive band present in Entodinium caudatum single cell DNA, which had no counterparts in the profile from total DNA. The identity of archaebacterium represented by this band was determined by sequence analysis which showed that the sequence fell to the cluster of ciliate symbiotic methanogens identified recently by 16S gene library approach.
Linear reduction methods for tag SNP selection.

PubMed

He, Jingwu; Zelikovsky, Alex

2004-01-01

It is widely hoped that constructing a complete human haplotype map will help to associate complex diseases with certain SNP's. Unfortunately, the number of SNP's is huge and it is very costly to sequence many individuals. Therefore, it is desirable to reduce the number of SNP's that should be sequenced to considerably small number of informative representatives, so called tag SNP's. In this paper, we propose a new linear algebra based method for selecting and using tag SNP's. Our method is purely combinatorial and can be combined with linkage disequilibrium (LD) and block based methods. We measure the quality of our tag SNP selection algorithm by comparing actual SNP's with SNP's linearly predicted from linearly chosen tag SNP's. We obtain an extremely good compression and prediction rates. For example, for long haplotypes (>25000 SNP's), knowing only 0.4% of all SNP's we predict the entire unknown haplotype with 2% accuracy while the prediction method is based on a 10% sample of the population.
Selection of mRNA 5'-untranslated region sequence with high translation efficiency through ribosome display

DOE Office of Scientific and Technical Information (OSTI.GOV)

Mie, Masayasu; Shimizu, Shun; Takahashi, Fumio

2008-08-15

The 5'-untranslated region (5'-UTR) of mRNAs functions as a translation enhancer, promoting translation efficiency. Many in vitro translation systems exhibit a reduced efficiency in protein translation due to decreased translation initiation. The use of a 5'-UTR sequence with high translation efficiency greatly enhances protein production in these systems. In this study, we have developed an in vitro selection system that favors 5'-UTRs with high translation efficiency using a ribosome display technique. A 5'-UTR random library, comprised of 5'-UTRs tagged with a His-tag and Renilla luciferase (R-luc) fusion, were in vitro translated in rabbit reticulocytes. By limiting the translation period, onlymore » mRNAs with high translation efficiency were translated. During translation, mRNA, ribosome and translated R-luc with His-tag formed ternary complexes. They were collected with translated His-tag using Ni-particles. Extracted mRNA from ternary complex was amplified using RT-PCR and sequenced. Finally, 5'-UTR with high translation efficiency was obtained from random 5'-UTR library.« less
Re-evaluating microglia expression profiles using RiboTag and cell isolation strategies.

PubMed

Haimon, Zhana; Volaski, Alon; Orthgiess, Johannes; Boura-Halfon, Sigalit; Varol, Diana; Shemer, Anat; Yona, Simon; Zuckerman, Binyamin; David, Eyal; Chappell-Maor, Louise; Bechmann, Ingo; Gericke, Martin; Ulitsky, Igor; Jung, Steffen

2018-06-01

Transcriptome profiling is widely used to infer functional states of specific cell types, as well as their responses to stimuli, to define contributions to physiology and pathophysiology. Focusing on microglia, the brain's macrophages, we report here a side-by-side comparison of classical cell-sorting-based transcriptome sequencing and the 'RiboTag' method, which avoids cell retrieval from tissue context and yields translatome sequencing information. Conventional whole-cell microglial transcriptomes were found to be significantly tainted by artifacts introduced by tissue dissociation, cargo contamination and transcripts sequestered from ribosomes. Conversely, our data highlight the added value of RiboTag profiling for assessing the lineage accuracy of Cre recombinase expression in transgenic mice. Collectively, this study indicates method-based biases, reveals observer effects and establishes RiboTag-based translatome profiling as a valuable complement to standard sorting-based profiling strategies.
[Experimental study of candidate vaccines against variable or quasi-species pathogenes: multiepitopic synthetic peptide antigenes and new receptor-guiding adjuvants].

PubMed

Ignat'eva, G A; Maksiutov, A Z; L'vov, V L; Kolobov, A A; Ignat'ev, T I

2011-01-01

The short multiepitopic synthetic peptides from the sequences of hypervariable area of V3-loope of gp120 of HIV don't induce anti-peptides antibodies production in mice themselves. We prepared the potent immunogen by noncovalent conjugations of the multitude peptides with pure peptidoglycans from cell wall of Salmonella typhi. The sera from immunized mice have the anti-peptides antibody titers (3-5) x 10(5) in ELISA, as high as Freund's adjuvant is of use.
Genetic diversity of Streptococcus equi subsp. zooepidemicus and doxycycline resistance in kennelled dogs.

PubMed

Chalker, Victoria J; Waller, Andrew; Webb, Katy; Spearing, Emma; Crosse, Patricia; Brownlie, Joe; Erles, Kerstin

2012-06-01

The genetic diversity and antibiotic resistance profiles of 38 Streptococcus equi subsp. zooepidemicus isolates were determined from a kennelled canine population during two outbreaks of hemorrhagic pneumonia (1999 to 2002 and 2007 to 2010). Analysis of the szp gene hypervariable region and the 16S-23S rRNA intergenic spacer region and multilocus sequence typing (MLST) indicated a predominant tetO-positive, doxycycline-resistant ST-10 strain during 1999 to 2002 and a predominant tetM-positive doxycycline-resistant ST-62 strain during 2007 to 2010.
Genetic Diversity of Streptococcus equi subsp. zooepidemicus and Doxycycline Resistance in Kennelled Dogs

PubMed Central

Chalker, Victoria J.; Waller, Andrew; Webb, Katy; Spearing, Emma; Crosse, Patricia; Brownlie, Joe

2012-01-01

The genetic diversity and antibiotic resistance profiles of 38 Streptococcus equi subsp. zooepidemicus isolates were determined from a kennelled canine population during two outbreaks of hemorrhagic pneumonia (1999 to 2002 and 2007 to 2010). Analysis of the szp gene hypervariable region and the 16S-23S rRNA intergenic spacer region and multilocus sequence typing (MLST) indicated a predominant tetO-positive, doxycycline-resistant ST-10 strain during 1999 to 2002 and a predominant tetM-positive doxycycline-resistant ST-62 strain during 2007 to 2010. PMID:22495558
Interactive Block Games for Assessing Children's Cognitive Skills: Design and Preliminary Evaluation.

PubMed

Lee, Kiju; Jeong, Donghwa; Schindler, Rachael C; Hlavaty, Laura E; Gross, Susan I; Short, Elizabeth J

2018-01-01

Background: This paper presents design and results from preliminary evaluation of Tangible Geometric Games (TAG-Games) for cognitive assessment in young children. The TAG-Games technology employs a set of sensor-integrated cube blocks, called SIG-Blocks, and graphical user interfaces for test administration and real-time performance monitoring. TAG-Games were administered to children from 4 to 8 years of age for evaluating preliminary efficacy of this new technology-based approach. Methods: Five different sets of SIG-Blocks comprised of geometric shapes, segmented human faces, segmented animal faces, emoticons, and colors, were used for three types of TAG-Games, including Assembly, Shape Matching, and Sequence Memory. Computational task difficulty measures were defined for each game and used to generate items with varying difficulty. For preliminary evaluation, TAG-Games were tested on 40 children. To explore the clinical utility of the information assessed by TAG-Games, three subtests of the age-appropriate Wechsler tests (i.e., Block Design, Matrix Reasoning, and Picture Concept) were also administered. Results: Internal consistency of TAG-Games was evaluated by the split-half reliability test. Weak to moderate correlations between Assembly and Block Design, Shape Matching and Matrix Reasoning, and Sequence Memory and Picture Concept were found. The computational measure of task complexity for each TAG-Game showed a significant correlation with participants' performance. In addition, age-correlations on TAG-Game scores were found, implying its potential use for assessing children's cognitive skills autonomously.
Interactive Block Games for Assessing Children's Cognitive Skills: Design and Preliminary Evaluation

PubMed Central

Lee, Kiju; Jeong, Donghwa; Schindler, Rachael C.; Hlavaty, Laura E.; Gross, Susan I.; Short, Elizabeth J.

2018-01-01

Background: This paper presents design and results from preliminary evaluation of Tangible Geometric Games (TAG-Games) for cognitive assessment in young children. The TAG-Games technology employs a set of sensor-integrated cube blocks, called SIG-Blocks, and graphical user interfaces for test administration and real-time performance monitoring. TAG-Games were administered to children from 4 to 8 years of age for evaluating preliminary efficacy of this new technology-based approach. Methods: Five different sets of SIG-Blocks comprised of geometric shapes, segmented human faces, segmented animal faces, emoticons, and colors, were used for three types of TAG-Games, including Assembly, Shape Matching, and Sequence Memory. Computational task difficulty measures were defined for each game and used to generate items with varying difficulty. For preliminary evaluation, TAG-Games were tested on 40 children. To explore the clinical utility of the information assessed by TAG-Games, three subtests of the age-appropriate Wechsler tests (i.e., Block Design, Matrix Reasoning, and Picture Concept) were also administered. Results: Internal consistency of TAG-Games was evaluated by the split-half reliability test. Weak to moderate correlations between Assembly and Block Design, Shape Matching and Matrix Reasoning, and Sequence Memory and Picture Concept were found. The computational measure of task complexity for each TAG-Game showed a significant correlation with participants' performance. In addition, age-correlations on TAG-Game scores were found, implying its potential use for assessing children's cognitive skills autonomously. PMID:29868520
Optimal use of tandem biotin and V5 tags in ChIP assays

PubMed Central

Kolodziej, Katarzyna E; Pourfarzad, Farzin; de Boer, Ernie; Krpic, Sanja; Grosveld, Frank; Strouboulis, John

2009-01-01

Background Chromatin immunoprecipitation (ChIP) assays coupled to genome arrays (Chip-on-chip) or massive parallel sequencing (ChIP-seq) lead to the genome wide identification of binding sites of chromatin associated proteins. However, the highly variable quality of antibodies and the availability of epitopes in crosslinked chromatin can compromise genomic ChIP outcomes. Epitope tags have often been used as more reliable alternatives. In addition, we have employed protein in vivo biotinylation tagging as a very high affinity alternative to antibodies. In this paper we describe the optimization of biotinylation tagging for ChIP and its coupling to a known epitope tag in providing a reliable and efficient alternative to antibodies. Results Using the biotin tagged erythroid transcription factor GATA-1 as example, we describe several optimization steps for the application of the high affinity biotin streptavidin system in ChIP. We find that the omission of SDS during sonication, the use of fish skin gelatin as blocking agent and choice of streptavidin beads can lead to significantly improved ChIP enrichments and lower background compared to antibodies. We also show that the V5 epitope tag performs equally well under the conditions worked out for streptavidin ChIP and that it may suffer less from the effects of formaldehyde crosslinking. Conclusion The combined use of the very high affinity biotin tag with the less sensitive to crosslinking V5 tag provides for a flexible ChIP platform with potential implications in ChIP sequencing outcomes. PMID:19196479
Analysis of expressed sequence tags from a single wheat cultivar facilitates interpretation of tandem mass spectrometry data and discrimination of gamma gliadin proteins that may play different functional roles in flour

USDA-ARS?s Scientific Manuscript database

The complement of gamma gliadin genes expressed in the wheat cultivar Butte 86 was evaluated by analyzing publicly available expressed sequence tag (EST) data. Eleven contigs were assembled from 153 Butte 86 ESTs. Nine of the contigs encoded full-length proteins and four of the proteins contained an...
Cloning, expression and phylogenetic analysis of Hemolin, from the Chinese oak silkmoth, Antheraea pernyi.

PubMed

Li, Wenli; Terenius, Olle; Hirai, Makoto; Nilsson, Anders S; Faye, Ingrid

2005-01-01

The Chinese oak silk moth Antheraea pernyi is an important silk producer. To understand microbial resistance of this moth, we cloned Hemolin, encoding a multifunctional immune protein belonging to the immunoglobulin superfamily, and examined the expression in gonads and fat body. The ApHemolin amino acid sequence was compared to other Hemolin sequences in order to predict functional sites. Several sites were conserved; among them a phosphate binding site, which according to 3D structure modelling does not appear in neuroglian, the phylogenetically closest related protein. In addition, two conserved KDG sequences in the C-C' loop of immunoglobulin domains 1 and 3, give rise to gamma-turns, which is a common motif in the C'-C'' loop of the hypervariable region L2 in vertebrate immunoglobulins. The comparisons also show variable regions of specific interest for future studies of hemolin and its interaction with microbial entities.
Historically low mitochondrial DNA diversity in koalas (Phascolarctos cinereus)

PubMed Central

2012-01-01

Background The koala (Phascolarctos cinereus) is an arboreal marsupial that was historically widespread across eastern Australia until the end of the 19th century when it suffered a steep population decline. Hunting for the fur trade, habitat conversion, and disease contributed to a precipitous reduction in koala population size during the late 1800s and early 1900s. To examine the effects of these reductions in population size on koala genetic diversity, we sequenced part of the hypervariable region of mitochondrial DNA (mtDNA) in koala museum specimens collected in the 19th and 20th centuries, hypothesizing that the historical samples would exhibit greater genetic diversity. Results The mtDNA haplotypes present in historical museum samples were identical to haplotypes found in modern koala populations, and no novel haplotypes were detected. Rarefaction analyses suggested that the mtDNA genetic diversity present in the museum samples was similar to that of modern koalas. Conclusions Low mtDNA diversity may have been present in koala populations prior to recent population declines. When considering management strategies, low genetic diversity of the mtDNA hypervariable region may not indicate recent inbreeding or founder events but may reflect an older historical pattern for koalas. PMID:23095716
Expressed sequence tags from Atta laevigata and identification of candidate genes for the control of pest leaf-cutting ants.

PubMed

Rodovalho, Cynara M; Ferro, Milene; Fonseca, Fernando Pp; Antonio, Erik A; Guilherme, Ivan R; Henrique-Silva, Flávio; Bacci, Maurício

2011-06-17

Leafcutters are the highest evolved within Neotropical ants in the tribe Attini and model systems for studying caste formation, labor division and symbiosis with microorganisms. Some species of leafcutters are agricultural pests controlled by chemicals which affect other animals and accumulate in the environment. Aiming to provide genetic basis for the study of leafcutters and for the development of more specific and environmentally friendly methods for the control of pest leafcutters, we generated expressed sequence tag data from Atta laevigata, one of the pest ants with broad geographic distribution in South America. The analysis of the expressed sequence tags allowed us to characterize 2,006 unique sequences in Atta laevigata. Sixteen of these genes had a high number of transcripts and are likely positively selected for high level of gene expression, being responsible for three basic biological functions: energy conservation through redox reactions in mitochondria; cytoskeleton and muscle structuring; regulation of gene expression and metabolism. Based on leafcutters lifestyle and reports of genes involved in key processes of other social insects, we identified 146 sequences potential targets for controlling pest leafcutters. The targets are responsible for antixenobiosis, development and longevity, immunity, resistance to pathogens, pheromone function, cell signaling, behavior, polysaccharide metabolism and arginine kynase activity. The generation and analysis of expressed sequence tags from Atta laevigata have provided important genetic basis for future studies on the biology of leaf-cutting ants and may contribute to the development of a more specific and environmentally friendly method for the control of agricultural pest leafcutters.
Expressed sequence tags from Atta laevigata and identification of candidate genes for the control of pest leaf-cutting ants

PubMed Central

2011-01-01

Background Leafcutters are the highest evolved within Neotropical ants in the tribe Attini and model systems for studying caste formation, labor division and symbiosis with microorganisms. Some species of leafcutters are agricultural pests controlled by chemicals which affect other animals and accumulate in the environment. Aiming to provide genetic basis for the study of leafcutters and for the development of more specific and environmentally friendly methods for the control of pest leafcutters, we generated expressed sequence tag data from Atta laevigata, one of the pest ants with broad geographic distribution in South America. Results The analysis of the expressed sequence tags allowed us to characterize 2,006 unique sequences in Atta laevigata. Sixteen of these genes had a high number of transcripts and are likely positively selected for high level of gene expression, being responsible for three basic biological functions: energy conservation through redox reactions in mitochondria; cytoskeleton and muscle structuring; regulation of gene expression and metabolism. Based on leafcutters lifestyle and reports of genes involved in key processes of other social insects, we identified 146 sequences potential targets for controlling pest leafcutters. The targets are responsible for antixenobiosis, development and longevity, immunity, resistance to pathogens, pheromone function, cell signaling, behavior, polysaccharide metabolism and arginine kynase activity. Conclusion The generation and analysis of expressed sequence tags from Atta laevigata have provided important genetic basis for future studies on the biology of leaf-cutting ants and may contribute to the development of a more specific and environmentally friendly method for the control of agricultural pest leafcutters. PMID:21682882
Separation efficiency of free-solution conjugated electrophoresis with drag-tags incorporating a synthetic amino acid.

PubMed

Seo, Kyung-Ho; Chu, Hun-Su; Yoo, Tae Hyeon; Lee, Sun-Gu; Won, Jong-In

2016-03-01

DNA sequencing or separation by conventional capillary electrophoresis with a polymer matrix has some inherent drawbacks, such as the expense of polymer matrix and limitations in sequencing read length. As DNA fragments have a linear charge-to-friction ratio in free solution, DNA fragments cannot be separated by size. However, size-based separation of DNA is possible in free-solution conjugate electrophoresis (FSCE) if a "drag-tag" is attached to DNA fragments because the tag breaks the linear charge-to-friction scaling. Although several previous studies have demonstrated the feasibility of DNA separation by free-solution conjugated electrophoresis, generation of a monodisperse drag-tag and identification of a strong, site-specific conjugation method between a DNA fragment and a drag-tag are challenges that still remain. In this study, we demonstrate an efficient FSCE method by conjugating a biologically synthesized elastin-like polypeptide (ELP) and green fluorescent protein (GFP) to DNA fragments. In addition, to produce strong and site-specific conjugation, a methionine residue in drag-tags is replaced with homopropargylglycine (Hpg), which can be conjugated specifically to a DNA fragment with an azide site. © 2016 WILEY-VCH Verlag GmbH & Co. KGaA, Weinheim.
Microbial Diversity of Acidic Hot Spring (Kawah Hujan B) in Geothermal Field of Kamojang Area, West Java-Indonesia

PubMed Central

Aditiawati, Pingkan; Yohandini, Heni; Madayanti, Fida; Akhmaloka

2009-01-01

Microbial communities in an acidic hot spring, namely Kawah Hujan B, at Kamojang geothermal field, West Java-Indonesia was examined using culture dependent and culture independent strategies. Chemical analysis of the hot spring water showed a characteristic of acidic-sulfate geothermal activity that contained high sulfate concentrations and low pH values (pH 1.8 to 1.9). Microbial community present in the spring was characterized by 16S rRNA gene combined with denaturing gradient gel electrophoresis (DGGE) analysis. The majority of the sequences recovered from culture-independent method were closely related to Crenarchaeota and Proteobacteria phyla. However, detail comparison among the member of Crenarchaeota showing some sequences variation compared to that the published data especially on the hypervariable and variable regions. In addition, the sequences did not belong to certain genus. Meanwhile, the 16S Rdna sequences from culture-dependent samples revealed mostly close to Firmicute and gamma Proteobacteria. PMID:19440252
Ultrasensitive electrochemical biosensor for detection of DNA from Bacillus subtilis by coupling target-induced strand displacement and nicking endonuclease signal amplification.

PubMed

Hu, Yuhua; Xu, Xueqin; Liu, Qionghua; Wang, Ling; Lin, Zhenyu; Chen, Guonan

2014-09-02

A simple, ultrasensitive, and specific electrochemical biosensor was designed to determine the given DNA sequence of Bacillus subtilis by coupling target-induced strand displacement and nicking endonuclease signal amplification. The target DNA (TD, the DNA sequence from the hypervarient region of 16S rDNA of Bacillus subtilis) could be detected by the differential pulse voltammetry (DPV) in a range from 0.1 fM to 20 fM with the detection limit down to 0.08 fM at the 3s(blank) level. This electrochemical biosensor exhibits high distinction ability to single-base mismatch, double-bases mismatch, and noncomplementary DNA sequence, which may be expected to detect single-base mismatch and single nucleotide polymorphisms (SNPs). Moreover, the applicability of the designed biosensor for detecting the given DNA sequence from Bacillus subtilis was investigated. The result obtained by electrochemical method is approximately consistent with that by a real-time quantitative polymerase chain reaction detecting system (QPCR) with SYBR Green.
Hepatitis C Virus Antigenic Convergence

PubMed Central

Campo, David S.; Dimitrova, Zoya; Yokosawa, Jonny; Hoang, Duc; Perez, Nestor O.; Ramachandran, Sumathi; Khudyakov, Yury

2012-01-01

Vaccine development against hepatitis C virus (HCV) is hindered by poor understanding of factors defining cross-immunoreactivity among heterogeneous epitopes. Using synthetic peptides and mouse immunization as a model, we conducted a quantitative analysis of cross-immunoreactivity among variants of the HCV hypervariable region 1 (HVR1). Analysis of 26,883 immunological reactions among pairs of peptides showed that the distribution of cross-immunoreactivity among HVR1 variants was skewed, with antibodies against a few variants reacting with all tested peptides. The HVR1 cross-immunoreactivity was accurately modeled based on amino acid sequence alone. The tested peptides were mapped in the HVR1 sequence space, which was visualized as a network of 11,319 sequences. The HVR1 variants with a greater network centrality showed a broader cross-immunoreactivity. The entire sequence space is explored by each HCV genotype and subtype. These findings indicate that HVR1 antigenic diversity is extensively convergent and effectively limited, suggesting significant implications for vaccine development. PMID:22355779
Microbial diversity of acidic hot spring (kawah hujan B) in geothermal field of kamojang area, west java-indonesia.

PubMed

Aditiawati, Pingkan; Yohandini, Heni; Madayanti, Fida; Akhmaloka

2009-01-01

Microbial communities in an acidic hot spring, namely Kawah Hujan B, at Kamojang geothermal field, West Java-Indonesia was examined using culture dependent and culture independent strategies. Chemical analysis of the hot spring water showed a characteristic of acidic-sulfate geothermal activity that contained high sulfate concentrations and low pH values (pH 1.8 to 1.9). Microbial community present in the spring was characterized by 16S rRNA gene combined with denaturing gradient gel electrophoresis (DGGE) analysis. The majority of the sequences recovered from culture-independent method were closely related to Crenarchaeota and Proteobacteria phyla. However, detail comparison among the member of Crenarchaeota showing some sequences variation compared to that the published data especially on the hypervariable and variable regions. In addition, the sequences did not belong to certain genus. Meanwhile, the 16S Rdna sequences from culture-dependent samples revealed mostly close to Firmicute and gamma Proteobacteria.

Abseq: Ultrahigh-throughput single cell protein profiling with droplet microfluidic barcoding.

PubMed

Shahi, Payam; Kim, Samuel C; Haliburton, John R; Gartner, Zev J; Abate, Adam R

2017-03-14

Proteins are the primary effectors of cellular function, including cellular metabolism, structural dynamics, and information processing. However, quantitative characterization of proteins at the single-cell level is challenging due to the tiny amount of protein available. Here, we present Abseq, a method to detect and quantitate proteins in single cells at ultrahigh throughput. Like flow and mass cytometry, Abseq uses specific antibodies to detect epitopes of interest; however, unlike these methods, antibodies are labeled with sequence tags that can be read out with microfluidic barcoding and DNA sequencing. We demonstrate this novel approach by characterizing surface proteins of different cell types at the single-cell level and distinguishing between the cells by their protein expression profiles. DNA-tagged antibodies provide multiple advantages for profiling proteins in single cells, including the ability to amplify low-abundance tags to make them detectable with sequencing, to use molecular indices for quantitative results, and essentially limitless multiplexing.
Abseq: Ultrahigh-throughput single cell protein profiling with droplet microfluidic barcoding

NASA Astrophysics Data System (ADS)

Shahi, Payam; Kim, Samuel C.; Haliburton, John R.; Gartner, Zev J.; Abate, Adam R.

2017-03-01

Proteins are the primary effectors of cellular function, including cellular metabolism, structural dynamics, and information processing. However, quantitative characterization of proteins at the single-cell level is challenging due to the tiny amount of protein available. Here, we present Abseq, a method to detect and quantitate proteins in single cells at ultrahigh throughput. Like flow and mass cytometry, Abseq uses specific antibodies to detect epitopes of interest; however, unlike these methods, antibodies are labeled with sequence tags that can be read out with microfluidic barcoding and DNA sequencing. We demonstrate this novel approach by characterizing surface proteins of different cell types at the single-cell level and distinguishing between the cells by their protein expression profiles. DNA-tagged antibodies provide multiple advantages for profiling proteins in single cells, including the ability to amplify low-abundance tags to make them detectable with sequencing, to use molecular indices for quantitative results, and essentially limitless multiplexing.
Abseq: Ultrahigh-throughput single cell protein profiling with droplet microfluidic barcoding

PubMed Central

Shahi, Payam; Kim, Samuel C.; Haliburton, John R.; Gartner, Zev J.; Abate, Adam R.

2017-01-01

Proteins are the primary effectors of cellular function, including cellular metabolism, structural dynamics, and information processing. However, quantitative characterization of proteins at the single-cell level is challenging due to the tiny amount of protein available. Here, we present Abseq, a method to detect and quantitate proteins in single cells at ultrahigh throughput. Like flow and mass cytometry, Abseq uses specific antibodies to detect epitopes of interest; however, unlike these methods, antibodies are labeled with sequence tags that can be read out with microfluidic barcoding and DNA sequencing. We demonstrate this novel approach by characterizing surface proteins of different cell types at the single-cell level and distinguishing between the cells by their protein expression profiles. DNA-tagged antibodies provide multiple advantages for profiling proteins in single cells, including the ability to amplify low-abundance tags to make them detectable with sequencing, to use molecular indices for quantitative results, and essentially limitless multiplexing. PMID:28290550
A novel expression system for intracellular production and purification of recombinant affinity-tagged proteins in Aspergillus niger.

PubMed

Roth, Andreas H F J; Dersch, Petra

2010-03-01

A set of different integrative expression vectors for the intracellular production of recombinant proteins with or without affinity tag in Aspergillus niger was developed. Target genes can be expressed under the control of the highly efficient, constitutive pkiA promoter or the novel sucrose-inducible promoter of the beta-fructofuranosidase (sucA) gene of A. niger in the presence or absence of alternative carbon sources. All expression plasmids contain an identical multiple cloning sequence that allows parallel construction of N- or C-terminally His6- and StrepII-tagged versions of the target proteins. Production of two heterologous model proteins, the green fluorescence protein and the Thermobifida fusca hydrolase, proved the functionality of the vector system. Efficient production and easy detection of the target proteins as well as their fast purification by a one-step affinity chromatography, using the His6- or StrepII-tag sequence, was demonstrated.
A statistical method for assessing peptide identification confidence in accurate mass and time tag proteomics

PubMed Central

Stanley, Jeffrey R.; Adkins, Joshua N.; Slysz, Gordon W.; Monroe, Matthew E.; Purvine, Samuel O.; Karpievitch, Yuliya V.; Anderson, Gordon A.; Smith, Richard D.; Dabney, Alan R.

2011-01-01

Current algorithms for quantifying peptide identification confidence in the accurate mass and time (AMT) tag approach assume that the AMT tags themselves have been correctly identified. However, there is uncertainty in the identification of AMT tags, as this is based on matching LC-MS/MS fragmentation spectra to peptide sequences. In this paper, we incorporate confidence measures for the AMT tag identifications into the calculation of probabilities for correct matches to an AMT tag database, resulting in a more accurate overall measure of identification confidence for the AMT tag approach. The method is referred to as Statistical Tools for AMT tag Confidence (STAC). STAC additionally provides a Uniqueness Probability (UP) to help distinguish between multiple matches to an AMT tag and a method to calculate an overall false discovery rate (FDR). STAC is freely available for download as both a command line and a Windows graphical application. PMID:21692516
Phylogeography of Barbary macaques (Macaca sylvanus) and the origin of the Gibraltar colony.

PubMed

Modolo, Lara; Salzburger, Walter; Martin, Robert D

2005-05-17

The Barbary macaque (Macaca sylvanus) is the earliest offshoot of the genus Macaca and the only extant African representative, all other species being Asiatic. Once distributed throughout North Africa, M. sylvanus is now restricted to isolated forest fragments in Algeria and Morocco. The species is threatened; the maximum total wild population size is estimated at 10,000 individuals. Relationships among surviving wild subpopulations in Algeria (96 samples) and Morocco (116 samples) were examined by using 468-bp sequences from hypervariable region I of the mitochondrial DNA control region. Twenty-four different haplotypes were identified, differing by 1-26 mutational steps (0.2-5.6%) and 1 insertion. With one exception (attributable to secondary introduction in coastal Morocco), Algerian and Moroccan haplotypes are clearly distinct. However, whereas Moroccan subpopulations show little divergence in hypervariable region I sequences and little correspondence with geographical distribution, there is a deep division between two main subpopulations in Algeria and one marked secondary division, with haplotypes generally matching geographical distribution. Accepting an origin of the genus Macaca of 5.5 million years ago, the Moroccan population and the two main Algerian subpopulations diverged approximately 1.6 million years ago. Distinction between Moroccan and Algerian haplotypes permitted analysis of the origin of the Gibraltar colony of Barbary macaques (68 samples; 30% of the population). It is generally held that the present Gibraltar population descended from a dozen individuals imported during World War II. However, the Gibraltar sample was found to include Algerian and Moroccan haplotypes separated by at least 16 mutational steps, revealing a dual origin of the founding females.
Phylogeography of Barbary macaques (Macaca sylvanus) and the origin of the Gibraltar colony

PubMed Central

Modolo, Lara; Salzburger, Walter; Martin, Robert D.

2005-01-01

The Barbary macaque (Macaca sylvanus) is the earliest offshoot of the genus Macaca and the only extant African representative, all other species being Asiatic. Once distributed throughout North Africa, M. sylvanus is now restricted to isolated forest fragments in Algeria and Morocco. The species is threatened; the maximum total wild population size is estimated at 10,000 individuals. Relationships among surviving wild subpopulations in Algeria (96 samples) and Morocco (116 samples) were examined by using 468-bp sequences from hypervariable region I of the mitochondrial DNA control region. Twenty-four different haplotypes were identified, differing by 1-26 mutational steps (0.2-5.6%) and 1 insertion. With one exception (attributable to secondary introduction in coastal Morocco), Algerian and Moroccan haplotypes are clearly distinct. However, whereas Moroccan subpopulations show little divergence in hypervariable region I sequences and little correspondence with geographical distribution, there is a deep division between two main subpopulations in Algeria and one marked secondary division, with haplotypes generally matching geographical distribution. Accepting an origin of the genus Macaca of 5.5 million years ago, the Moroccan population and the two main Algerian subpopulations diverged ≈1.6 million years ago. Distinction between Moroccan and Algerian haplotypes permitted analysis of the origin of the Gibraltar colony of Barbary macaques (68 samples; 30% of the population). It is generally held that the present Gibraltar population descended from a dozen individuals imported during World War II. However, the Gibraltar sample was found to include Algerian and Moroccan haplotypes separated by at least 16 mutational steps, revealing a dual origin of the founding females. PMID:15870193
Receptor-like genes in the major resistance locus of lettuce are subject to divergent selection.

PubMed Central

Meyers, B C; Shen, K A; Rohani, P; Gaut, B S; Michelmore, R W

1998-01-01

Disease resistance genes in plants are often found in complex multigene families. The largest known cluster of disease resistance specificities in lettuce contains the RGC2 family of genes. We compared the sequences of nine full-length genomic copies of RGC2 representing the diversity in the cluster to determine the structure of genes within this family and to examine the evolution of its members. The transcribed regions range from at least 7.0 to 13.1 kb, and the cDNAs contain deduced open reading frames of approximately 5. 5 kb. The predicted RGC2 proteins contain a nucleotide binding site and irregular leucine-rich repeats (LRRs) that are characteristic of resistance genes cloned from other species. Unique features of the RGC2 gene products include a bipartite LRR region with >40 repeats. At least eight members of this family are transcribed. The level of sequence diversity between family members varied in different regions of the gene. The ratio of nonsynonymous (Ka) to synonymous (Ks) nucleotide substitutions was lowest in the region encoding the nucleotide binding site, which is the presumed effector domain of the protein. The LRR-encoding region showed an alternating pattern of conservation and hypervariability. This alternating pattern of variation was also found in all comparisons within families of resistance genes cloned from other species. The Ka /Ks ratios indicate that diversifying selection has resulted in increased variation at these codons. The patterns of variation support the predicted structure of LRR regions with solvent-exposed hypervariable residues that are potentially involved in binding pathogen-derived ligands. PMID:9811792
Magnetic Resonance Arterial Spin Tagging for Non-Invasive Pharmacokinetic Analysis of Breast Cancer

DTIC Science & Technology

2000-10-01

sequence software that we had developed for this project. In addition, we revised the pulse sequences to utilize the high performance gradients (40 mT/ m ...peak, 150 mT/ m /ms rise) of the system. We believe these revised sequences will provide better arterial spin tagged data for perfusion measurement. All...U.... ...... ... -- v p I _1 i-:F~ ----- ! - .Ag Jig. H aI .. M e fI6lo 3 ~ ~ 2 0’,~- A.11. I 1 1 9 - HP ~ ~ IM I 15 L 1 1 8 = NIAt I C J1 5
The transcription factor GCN4 regulates PHM8 and alters triacylglycerol metabolism in Saccharomyces cerevisiae.

PubMed

Yadav, Kamlesh Kumar; Rajasekharan, Ram

2016-11-01

PHM8 is a very important enzyme in nonpolar lipid metabolism because of its role in triacylglycerol (TAG) biosynthesis under phosphate stress conditions. It is positively regulated by the PHO4 transcription factor under low phosphate conditions; however, its regulation has not been explored under normal physiological conditions. General control nonderepressible (GCN4), a basic leucine-zipper transcription factor activates the transcription of amino acids, purine biosynthesis genes and many stress response genes under various stress conditions. In this study, we demonstrate that the level of TAG is regulated by the transcription factor GCN4. GCN4 directly binds to its consensus recognition sequence (TGACTC) in the PHM8 promoter and controls its expression. The analysis of cells expressing the P PHM8 -lacZ reporter gene showed that mutations (TGACTC-GGGCCC) in the GCN4-binding sequence caused a significant increase in β-galactosidase activity. Mutation in the GCN4 binding sequence causes an increase in PHM8 expression, lysophosphatidic acid phosphatase activity and TAG level. PHM8, in conjunction with DGA1, a mono- and diacylglycerol transferase, controls the level of TAG. These results revealed that GCN4 negatively regulates PHM8 and that deletion of GCN4 causes de-repression of PHM8, which is responsible for the increased TAG content in gcn4∆ cells.
Probing the potential of CnaB-type domains for the design of tag/catcher systems

PubMed Central

Pröschel, Marlene; Kraner, Max E.; Horn, Anselm H. C.; Schäfer, Lena; Sonnewald, Uwe

2017-01-01

Building proteins into larger, post-translational assemblies in a defined and stable way is still a challenging task. A promising approach relies on so-called tag/catcher systems that are fused to the proteins of interest and allow a durable linkage via covalent intermolecular bonds. Tags and catchers are generated by splitting protein domains that contain intramolecular isopeptide or ester bonds that form autocatalytically under physiological conditions. There are already numerous biotechnological and medical applications that demonstrate the usefulness of covalent linkages mediated by these systems. Additional covalent tag/catcher systems would allow creating more complex and ultra-stable protein architectures and networks. Two of the presently available tag/catcher systems were derived from closely related CnaB-domains of Streptococcus pyogenes and Streptococcus dysgalactiae proteins. However, it is unclear whether domain splitting is generally tolerated within the CnaB-family or only by a small subset of these domains. To address this point, we have selected a set of four CnaB domains of low sequence similarity and characterized the resulting tag/catcher systems by computational and experimental methods. Experimental testing for intermolecular isopeptide bond formation demonstrated two of the four systems to be functional. For these two systems length and sequence variations of the peptide tags were investigated revealing only a relatively small effect on the efficiency of the reaction. Our study suggests that splitting into tag and catcher moieties is tolerated by a significant portion of the naturally occurring CnaB-domains, thus providing a large reservoir for the design of novel tag/catcher systems. PMID:28654665
Characterization of the patterns of polymorphism in a [open quotes]cryptic repeat[close quotes] reveals a novel type of hypervariable sequence

DOE Office of Scientific and Technical Information (OSTI.GOV)

Jacobson, D.P.; Schmeling, P.; Sommer, S.S.

Alternating purine and pyrimidine repeats (RY(i)) are an abundant source of polymorphism. The subset with long tandem repeats of GT or AC (GT(i)) have been studied extensively, but cryptic RY(i) (i.e., no single tandem repeat predominates) have received little attention. The factor IX gene has a polymorphic cryptic RY(i) of 142-216 bp. Previously, there were four known polymorphic alleles, of the form AB, A[sub 2]B, A[sub 2]B[sub 2], and A[sub 3]B[sub 2], where A = (GT)(AC)[sub 3](AT)[sub 3](GT)(AT)[sub 4] and B = A with an additional 3' AT dinucleotide. To further characterize this locus, the authors examined more than 1,700more » additional human chromosomes and determined the sequences of the homologous sites in orangutans and chimpanzees. The novel alleles found in humans expand the repertoire of A/B alleles to A[sub 0-4]B[sub 1] and A[sub 1-3]B[sub 2]. The A[sub n]B[sub 2] series are abundant in Caucasians but are absent in blacks and Asians. Conversely, the A[sub 0]B[sub 1] allele is common in blacks but is not found in more than 1,700 Caucasian chromosomes. The data are compatible with a model in which recombination is more frequent than polymerase slippage at this locus. In orangutans, the RY(i) is present, but the sequence is markedly different. An A/B-type of pattern was discerned in which B differs from A by an additional six (AT) dinucleotides at the 3' end. In chimpanzees, the size of the RY(i) locus was greatly expanded, and the sequence showed a novel pattern of hypervariability in which there are many tandem repeats of the form (GT)[sub n](AC)[sub 0](AT)[sub p](GT)[sub q](AT)[sub s], where n, o, p, q, and s are different integers. The sequences of the factor IX intron 1 cryptic RY(i) in three primates provide perspective on the range of possible patterns of polymorphism. Analysis of the patterns suggests how the RY(i) can be conserved during evolution, while the precise sequence varies. 25 refs., 5 figs., 3 tabs.« less
Development of expressed sequence tag-simple sequence repeat markers for genetic characterization and population structure analysis of Praxelis clematidea (Asteraceae).

PubMed

Wang, Q Z; Huang, M; Downie, S R; Chen, Z X

2016-05-23

Invasive plants tend to spread aggressively in new habitats and an understanding of their genetic diversity and population structure is useful for their management. In this study, expressed sequence tag-simple sequence repeat (EST-SSR) markers were developed for the invasive plant species Praxelis clematidea (Asteraceae) from 5548 Stevia rebaudiana (Asteraceae) expressed sequence tags (ESTs). A total of 133 microsatellite-containing ESTs (2.4%) were identified, of which 56 (42.1%) were hexanucleotide repeat motifs and 50 (37.6%) were trinucleotide repeat motifs. Of the 24 primer pairs designed from these 133 ESTs, 7 (29.2%) resulted in significant polymorphisms. The number of alleles per locus ranged from 5 to 9. The relatively high genetic diversity (H = 0.2667, I = 0.4212, and P = 100%) of P. clematidea was related to high gene flow (Nm = 1.4996) among populations. The coefficient of population differentiation (GST = 0.2500) indicated that most genetic variation occurred within populations. A Mantel test suggested that there was significant correlation between genetic distance and geographical distribution (r = 0.3192, P = 0.012). These results further support the transferability of EST-SSR markers between closely related genera of the same family.
Design and Evaluation of Illumina MiSeq-Compatible, 18S rRNA Gene-Specific Primers for Improved Characterization of Mixed Phototrophic Communities.

PubMed

Bradley, Ian M; Pinto, Ameet J; Guest, Jeremy S

2016-10-01

The use of high-throughput sequencing technologies with the 16S rRNA gene for characterization of bacterial and archaeal communities has become routine. However, the adoption of sequencing methods for eukaryotes has been slow, despite their significance to natural and engineered systems. There are large variations among the target genes used for amplicon sequencing, and for the 18S rRNA gene, there is no consensus on which hypervariable region provides the most suitable representation of diversity. Additionally, it is unclear how much PCR/sequencing bias affects the depiction of community structure using current primers. The present study amplified the V4 and V8-V9 regions from seven microalgal mock communities as well as eukaryotic communities from freshwater, coastal, and wastewater samples to examine the effect of PCR/sequencing bias on community structure and membership. We found that degeneracies on the 3' end of the current V4-specific primers impact read length and mean relative abundance. Furthermore, the PCR/sequencing error is markedly higher for GC-rich members than for communities with balanced GC content. Importantly, the V4 region failed to reliably capture 2 of the 12 mock community members, and the V8-V9 hypervariable region more accurately represents mean relative abundance and alpha and beta diversity. Overall, the V4 and V8-V9 regions show similar community representations over freshwater, coastal, and wastewater environments, but specific samples show markedly different communities. These results indicate that multiple primer sets may be advantageous for gaining a more complete understanding of community structure and highlight the importance of including mock communities composed of species of interest. The quantification of error associated with community representation by amplicon sequencing is a critical challenge that is often ignored. When target genes are amplified using currently available primers, differential amplification efficiencies result in inaccurate estimates of community structure. The extent to which amplification bias affects community representation and the accuracy with which different gene targets represent community structure are not known. As a result, there is no consensus on which region provides the most suitable representation of diversity for eukaryotes. This study determined the accuracy with which commonly used 18S rRNA gene primer sets represent community structure and identified particular biases related to PCR amplification and Illumina MiSeq sequencing in order to more accurately study eukaryotic microbial communities. Copyright © 2016, American Society for Microbiology. All Rights Reserved.
Phylogenetic Characterization of Fecal Microbial Communities of Dogs Fed Diets with or without Supplemental Dietary Fiber Using 454 Pyrosequencing

PubMed Central

Middelbos, Ingmar S.; Vester Boler, Brittany M.; Qu, Ani; White, Bryan A.; Swanson, Kelly S.; Fahey, George C.

2010-01-01

Background Dogs suffer from many of the same maladies as humans that may be affected by the gut microbiome, but knowledge of the canine microbiome is incomplete. This work aimed to use 16S rDNA tag pyrosequencing to phylogenetically characterize hindgut microbiome in dogs and determine how consumption of dietary fiber affects community structure. Principal Findings Six healthy adult dogs were used in a crossover design. A control diet without supplemental fiber and a beet pulp-supplemented (7.5%) diet were fed. Fecal DNA was extracted and the V3 hypervariable region of the microbial 16S rDNA gene amplified using primers suitable for 454-pyrosequencing. Microbial diversity was assessed on random 2000-sequence subsamples of individual and pooled DNA samples by diet. Our dataset comprised 77,771 reads with an average length of 141 nt. Individual samples contained approximately 129 OTU, with Fusobacteria (23 – 40% of reads), Firmicutes (14 – 28% of reads) and Bacteroidetes (31 – 34% of reads) being co-dominant phyla. Feeding dietary fiber generally decreased Fusobacteria and increased Firmicutes, but these changes were not equally apparent in all dogs. UniFrac analysis revealed that structure of the gut microbiome was affected by diet and Firmicutes appeared to play a strong role in by-diet clustering. Conclusions Our data suggest three co-dominant bacterial phyla in the canine hindgut. Furthermore, a relatively small amount of dietary fiber changed the structure of the gut microbiome detectably. Our data are among the first to characterize the healthy canine gut microbiome using pyrosequencing and provide a basis for studies focused on devising dietary interventions for microbiome-associated diseases. PMID:20339542
Genetic characterization of brown bears of the Kodiak Archipelago

USGS Publications Warehouse

Talbot, Sandra L.; Gust, Judy R.; Sage, George K.; Fischbach, Anthony S.; Amstrup, Kristin S.; Leacock, William; Van Daele, Larry

2006-01-01

Here we examine genetic characteristics of brown bears of Kodiak and Afognak islands, using 14 variable nuclear microsatellite loci and nucleotide sequence information including the hypervariable domain I of the mtDNA control region (Wakely 1993). Because these markers, or a subset of them, have been used to characterize brown bears of the Kenai Peninsula (Jackson et al. 2005), Katmai National Park, Seward Peninsula, and nine other populations in Alaska (Talbot, unpublished data), we compared levels of genetic diversity and relationships among populations when possible. In addition, we obtained preliminary comparative information from class II DQA and DQB genes of the brown bear MHC, to examine levels of variation at this important immunology-mediating supergene. These data were used to answer the following questions: 1) are earlier findings of extremely low levels of variability at nuclear (biparentallyinherited) microsatellite loci from a small geographic area (Paetkau et al. 1998b) representative of Kodiak Archipelago populations as a whole? 2) Is the level and type of variation at the maternally-inherited mtDNA lower, or similar to, levels found in other populations in Alaska? 3) Is there concordance between low levels of genetic variation observed at neutral markers with levels of variation observed at functional genes? 4) Is there population substructuring within Kodiak and Afognak islands? 5) What is the connectivity between populations on Afognak Island and Kodiak Island? 6) What are the phylogeographic relationships between bears of the Kodiak Archipelago with brown bears on mainland Alaskan and other western Beringian populations? We also test whether these markers will provide an appropriate baseline for designing genetic tagging studies for use in future research and management activities, such as mark-recapture efforts, on the Refuge.
Diet and exercise orthogonally alter the gut microbiome and reveal independent associations with anxiety and cognition

PubMed Central

2014-01-01

Background The ingestion of a high-fat diet (HFD) and the resulting obese state can exert a multitude of stressors on the individual including anxiety and cognitive dysfunction. Though many studies have shown that exercise can alleviate the negative consequences of a HFD using metabolic readouts such as insulin and glucose, a paucity of well-controlled rodent studies have been published on HFD and exercise interactions with regard to behavioral outcomes. This is a critical issue since some individuals assume that HFD-induced behavioral problems such as anxiety and cognitive dysfunction can simply be exercised away. To investigate this, we analyzed mice fed a normal diet (ND), ND with exercise, HFD diet, or HFD with exercise. Results We found that mice on a HFD had robust anxiety phenotypes but this was not rescued by exercise. Conversely, exercise increased cognitive abilities but this was not impacted by the HFD. Given the importance of the gut microbiome in shaping the host state, we used 16S rRNA hypervariable tag sequencing to profile our cohorts and found that HFD massively reshaped the gut microbial community in agreement with numerous published studies. However, exercise alone also caused massive shifts in the gut microbiome at nearly the same magnitude as diet but these changes were surprisingly orthogonal. Additionally, specific bacterial abundances were directly proportional to measures of anxiety or cognition. Conclusions Thus, behavioral domains and the gut microbiome are both impacted by diet and exercise but in unrelated ways. These data have important implications for obesity research aimed at modifications of the gut microbiome and suggest that specific gut microbes could be used as a biomarker for anxiety or cognition or perhaps even targeted for therapy. PMID:25217888
Diet and exercise orthogonally alter the gut microbiome and reveal independent associations with anxiety and cognition.

PubMed

Kang, Silvia S; Jeraldo, Patricio R; Kurti, Aishe; Miller, Margret E Berg; Cook, Marc D; Whitlock, Keith; Goldenfeld, Nigel; Woods, Jeffrey A; White, Bryan A; Chia, Nicholas; Fryer, John D

2014-09-13

The ingestion of a high-fat diet (HFD) and the resulting obese state can exert a multitude of stressors on the individual including anxiety and cognitive dysfunction. Though many studies have shown that exercise can alleviate the negative consequences of a HFD using metabolic readouts such as insulin and glucose, a paucity of well-controlled rodent studies have been published on HFD and exercise interactions with regard to behavioral outcomes. This is a critical issue since some individuals assume that HFD-induced behavioral problems such as anxiety and cognitive dysfunction can simply be exercised away. To investigate this, we analyzed mice fed a normal diet (ND), ND with exercise, HFD diet, or HFD with exercise. We found that mice on a HFD had robust anxiety phenotypes but this was not rescued by exercise. Conversely, exercise increased cognitive abilities but this was not impacted by the HFD. Given the importance of the gut microbiome in shaping the host state, we used 16S rRNA hypervariable tag sequencing to profile our cohorts and found that HFD massively reshaped the gut microbial community in agreement with numerous published studies. However, exercise alone also caused massive shifts in the gut microbiome at nearly the same magnitude as diet but these changes were surprisingly orthogonal. Additionally, specific bacterial abundances were directly proportional to measures of anxiety or cognition. Thus, behavioral domains and the gut microbiome are both impacted by diet and exercise but in unrelated ways. These data have important implications for obesity research aimed at modifications of the gut microbiome and suggest that specific gut microbes could be used as a biomarker for anxiety or cognition or perhaps even targeted for therapy.
Gapped Spectral Dictionaries and Their Applications for Database Searches of Tandem Mass Spectra*

PubMed Central

Jeong, Kyowon; Kim, Sangtae; Bandeira, Nuno; Pevzner, Pavel A.

2011-01-01

Generating all plausible de novo interpretations of a peptide tandem mass (MS/MS) spectrum (Spectral Dictionary) and quickly matching them against the database represent a recently emerged alternative approach to peptide identification. However, the sizes of the Spectral Dictionaries quickly grow with the peptide length making their generation impractical for long peptides. We introduce Gapped Spectral Dictionaries (all plausible de novo interpretations with gaps) that can be easily generated for any peptide length thus addressing the limitation of the Spectral Dictionary approach. We show that Gapped Spectral Dictionaries are small thus opening a possibility of using them to speed-up MS/MS searches. Our MS-GappedDictionary algorithm (based on Gapped Spectral Dictionaries) enables proteogenomics applications (such as searches in the six-frame translation of the human genome) that are prohibitively time consuming with existing approaches. MS-GappedDictionary generates gapped peptides that occupy a niche between accurate but short peptide sequence tags and long but inaccurate full length peptide reconstructions. We show that, contrary to conventional wisdom, some high-quality spectra do not have good peptide sequence tags and introduce gapped tags that have advantages over the conventional peptide sequence tags in MS/MS database searches. PMID:21444829
Detection and molecular characterization of infectious bronchitis virus isolated from recent outbreaks in broiler flocks in Thailand.

PubMed

Pohuang, Tawatchai; Chansiripornchai, Niwat; Tawatsin, Achara; Sasipreeyajan, Jiroj

2009-09-01

Thirteen field isolates of infectious bronchitis virus (IBV) were isolated from broiler flocks in Thailand between January and June 2008. The 878-bp of the S1 gene covering a hypervariable region was amplified and sequenced. Phylogenetic analysis based on that region revealed that these viruses were separated into two groups (I and II). IBV isolates in group I were not related to other IBV strains published in the GenBank database. Group 1 nucleotide sequence identities were less than 85% and amino acid sequence identities less than 84% in common with IBVs published in the GenBank database. This group likely represents the strains indigenous to Thailand. The isolates in group II showed a close relationship with Chinese IBVs. They had nucleotide sequence identities of 97-98% and amino acid sequence identities 96-98% in common with Chinese IBVs (strain A2, SH and QXIBV). This finding indicated that the recent Thai IBVs evolved separately and at least two groups of viruses are circulating in Thailand.

The complementarity-determining region sequences in IgY antivenom hypervariable regions.

PubMed

da Rocha, David Gitirana; Fernandez, Jorge Hernandez; de Almeida, Claudia Maria Costa; da Silva, Claudia Letícia; Magnoli, Fabio Carlos; da Silva, Osmair Élder; da Silva, Wilmar Dias

2017-08-01

The data presented in this article are related to the research article entitled "Development of IgY antibodies against anti-snake toxins endowed with highly lethal neutralizing activity" (da Rocha et al., 2017) [1]. Complementarity-determining region (CDR) sequences are variable antibody (Ab) sequences that respond with specificity, duration and strength to identify and bind to antigen (Ag) epitopes. B lymphocytes isolated from hens immunized with Bitis arietans (Ba) and anti- Crotalus durissus terrificus (Cdt) venoms and expressing high specificity, affinity and toxicity neutralizing antibody titers were used as DNA sources. The VLF1, CDR1, CDR2, VLR1 and CDR3 sequences were validated by BLASTp, and values corresponding to IgY V L and V H anti-Ba or anti-Cdt venoms were identified, registered [ Gallus gallus IgY Fv Light chain (GU815099)/ Gallus gallus IgY Fv Heavy chain (GU815098)] and used for molecular modeling of IgY scFv anti-Ba. The resulting CDR1, CDR2 and CDR3 sequences were combined to construct the three - dimensional structure of the Ab paratope.
High-resolution mapping of the 11q13 amplicon and identification of a gene, TAOS1, that is amplified and overexpressed in oral cancer cells

PubMed Central

Huang, Xin; Gollin, Susanne M.; Raja, Siva; Godfrey, Tony E.

2002-01-01

Amplification of chromosomal band 11q13 is a common event in human cancer. It has been reported in about 45% of head and neck carcinomas and in other cancers including esophageal, breast, liver, lung, and bladder cancer. To understand the mechanism of 11q13 amplification and to identify the potential oncogene(s) driving it, we have fine-mapped the structure of the amplicon in oral squamous cell carcinoma cell lines and localized the proximal and distal breakpoints. A 5-Mb physical map of the region has been prepared from which sequence is available. We quantified copy number of sequence-tagged site markers at 42–550 kb intervals along the length of the amplicon and defined the amplicon core and breakpoints by using TaqMan-based quantitative microsatellite analysis. The core of the amplicon maps to a 1.5-Mb region. The proximal breakpoint localizes to two intervals between sequence-tagged site markers, 550 kb and 160 kb in size, and the distal breakpoint maps to a 250 kb interval. The cyclin D1 gene maps to the amplicon core, as do two new expressed sequence tag clusters. We have analyzed one of these expressed sequence tag clusters and now report that it contains a previously uncharacterized gene, TAOS1 (tumor amplified and overexpressed sequence 1), which is both amplified and overexpressed in oral cancer cells. The data suggest that TAOS1 may be an amplification-dependent candidate oncogene with a role in the development and/or progression of human tumors, including oral squamous cell carcinomas. The approach described here should be useful for characterizing amplified genomic regions in a wide variety of tumors. PMID:12172009
Evaluating information content of SNPs for sample-tagging in re-sequencing projects.

PubMed

Hu, Hao; Liu, Xiang; Jin, Wenfei; Hilger Ropers, H; Wienker, Thomas F

2015-05-15

Sample-tagging is designed for identification of accidental sample mix-up, which is a major issue in re-sequencing studies. In this work, we develop a model to measure the information content of SNPs, so that we can optimize a panel of SNPs that approach the maximal information for discrimination. The analysis shows that as low as 60 optimized SNPs can differentiate the individuals in a population as large as the present world, and only 30 optimized SNPs are in practice sufficient in labeling up to 100 thousand individuals. In the simulated populations of 100 thousand individuals, the average Hamming distances, generated by the optimized set of 30 SNPs are larger than 18, and the duality frequency, is lower than 1 in 10 thousand. This strategy of sample discrimination is proved robust in large sample size and different datasets. The optimized sets of SNPs are designed for Whole Exome Sequencing, and a program is provided for SNP selection, allowing for customized SNP numbers and interested genes. The sample-tagging plan based on this framework will improve re-sequencing projects in terms of reliability and cost-effectiveness.
Isolation of centromeric-tandem repetitive DNA sequences by chromatin affinity purification using a HaloTag7-fused centromere-specific histone H3 in tobacco.

PubMed

Nagaki, Kiyotaka; Shibata, Fukashi; Kanatani, Asaka; Kashihara, Kazunari; Murata, Minoru

2012-04-01

The centromere is a multi-functional complex comprising centromeric DNA and a number of proteins. To isolate unidentified centromeric DNA sequences, centromere-specific histone H3 variants (CENH3) and chromatin immunoprecipitation (ChIP) have been utilized in some plant species. However, anti-CENH3 antibody for ChIP must be raised in each species because of its species specificity. Production of the antibodies is time-consuming and costly, and it is not easy to produce ChIP-grade antibodies. In this study, we applied a HaloTag7-based chromatin affinity purification system to isolate centromeric DNA sequences in tobacco. This system required no specific antibody, and made it possible to apply a highly stringent wash to remove contaminated DNA. As a result, we succeeded in isolating five tandem repetitive DNA sequences in addition to the centromeric retrotransposons that were previously identified by ChIP. Three of the tandem repeats were centromere-specific sequences located on different chromosomes. These results confirm the validity of the HaloTag7-based chromatin affinity purification system as an alternative method to ChIP for isolating unknown centromeric DNA sequences. The discovery of more than two chromosome-specific centromeric DNA sequences indicates the mosaic structure of tobacco centromeres. © Springer-Verlag 2011
Structure of a human monoclonal antibody Fab fragment against gp41 of human immunodeficiency virus type

NASA Technical Reports Server (NTRS)

He, X. M.; Ruker, F.; Casale, E.; Carter, D. C.

1992-01-01

The three-dimensional structure of a human monoclonal antibody (Fab), which binds specifically to a major epitope of the transmembrane protein gp41 of the human immunodeficiency virus type 1, has been determined by crystallographic methods to a resolution of 2.7 A. It has been previously determined that this antibody recognizes the epitope SGKLICTTAVPWNAS, belongs to the subclass IgG1 (kappa), and exhibits antibody-dependent cellular cytotoxicity. The quaternary structure of the Fab is in an extended conformation with an elbow bend angle between the constant and variable domains of 175 degrees. Structurally, four of the hypervariable loops can be classified according to previously recognized canonical structures. The third hypervariable loops of the heavy (H3) and light chain (L3) are structurally distinct. Hypervariable loop H3, residues 102H-109H, is unusually extended from the surface. The complementarity-determining region forms a hydrophobic binding pocket that is created primarily from hypervariable loops L3, H3, and H2.
Structure of a human monoclonal antibody Fab fragment against gp41 of human immunodeficiency virus type 1

NASA Technical Reports Server (NTRS)

He, Xiao M.; Rueker, Florian; Casale, Elena; Carter, Daniel C.

1992-01-01

The three-dimensional structure of a human monoclonal antibody (Fab), which binds specifically to a major epitope of the transmembrane protein gp41 of the human immunodeficiency virus type 1, has been determined by crystallographic methods to a resolution of 2.7 A. It has been previously determined that this antibody recognizes the epitope SGKLICTTAVPWNAS, belongs to the subclass IgG1 (kappa), and exhibits antibody-dependent cellular cytotoxicity. The quaternary structure of the Fab is in an extended conformation with an elbow bend angle between the constant and variable domains of 175 deg. Structurally, four of the hypervariable loops can be classified according to previously recognized canonical structures. The third hypervariable loops of the heavy (H3) and light chain (L3) are structurally distinct. Hypervariable loop H3, residues 102H-109H, is unusually extended from the surface. The complementarity-determining region forms a hydrophobic binding pocket that is created primarily from hypervariable loops L3, H3, and H2.
Genome-Wide Analysis of Oleosin Gene Family in 22 Tree Species: An Accelerator for Metabolic Engineering of BioFuel Crops and Agrigenomics Industrial Applications?

PubMed Central

2015-01-01

Abstract Trees contribute to enormous plant oil reserves because many trees contain 50%–80% of oil (triacylglycerols, TAGs) in the fruits and kernels. TAGs accumulate in subcellular structures called oil bodies/droplets, in which TAGs are covered by low-molecular-mass hydrophobic proteins called oleosins (OLEs). The OLEs/TAGs ratio determines the size and shape of intracellular oil bodies. There is a lack of comprehensive sequence analysis and structural information of OLEs among diverse trees. The objectives of this study were to identify OLEs from 22 tree species (e.g., tung tree, tea-oil tree, castor bean), perform genome-wide analysis of OLEs, classify OLEs, identify conserved sequence motifs and amino acid residues, and predict secondary and three-dimensional structures in tree OLEs and OLE subfamilies. Data mining identified 65 OLEs with perfect conservation of the “proline knot” motif (PX5SPX3P) from 19 trees. These OLEs contained >40% hydrophobic amino acid residues. They displayed similar properties and amino acid composition. Genome-wide phylogenetic analysis and multiple sequence alignment demonstrated that these proteins could be classified into five OLE subfamilies. There were distinct patterns of sequence conservation among the OLE subfamilies and within individual tree species. Computational modeling indicated that OLEs were composed of at least three α-helixes connected with short coils without any β-strand and that they exhibited distinct 3D structures and ligand binding sites. These analyses provide fundamental information in the similarity and specificity of diverse OLE isoforms within the same subfamily and among the different species, which should facilitate studying the structure-function relationship and identify critical amino acid residues in OLEs for metabolic engineering of tree TAGs. PMID:26258573
Genome-Wide Analysis of Oleosin Gene Family in 22 Tree Species: An Accelerator for Metabolic Engineering of BioFuel Crops and Agrigenomics Industrial Applications?

PubMed

Cao, Heping

2015-09-01

Trees contribute to enormous plant oil reserves because many trees contain 50%-80% of oil (triacylglycerols, TAGs) in the fruits and kernels. TAGs accumulate in subcellular structures called oil bodies/droplets, in which TAGs are covered by low-molecular-mass hydrophobic proteins called oleosins (OLEs). The OLEs/TAGs ratio determines the size and shape of intracellular oil bodies. There is a lack of comprehensive sequence analysis and structural information of OLEs among diverse trees. The objectives of this study were to identify OLEs from 22 tree species (e.g., tung tree, tea-oil tree, castor bean), perform genome-wide analysis of OLEs, classify OLEs, identify conserved sequence motifs and amino acid residues, and predict secondary and three-dimensional structures in tree OLEs and OLE subfamilies. Data mining identified 65 OLEs with perfect conservation of the "proline knot" motif (PX5SPX3P) from 19 trees. These OLEs contained >40% hydrophobic amino acid residues. They displayed similar properties and amino acid composition. Genome-wide phylogenetic analysis and multiple sequence alignment demonstrated that these proteins could be classified into five OLE subfamilies. There were distinct patterns of sequence conservation among the OLE subfamilies and within individual tree species. Computational modeling indicated that OLEs were composed of at least three α-helixes connected with short coils without any β-strand and that they exhibited distinct 3D structures and ligand binding sites. These analyses provide fundamental information in the similarity and specificity of diverse OLE isoforms within the same subfamily and among the different species, which should facilitate studying the structure-function relationship and identify critical amino acid residues in OLEs for metabolic engineering of tree TAGs.
Application of an E. coli signal sequence as a versatile inclusion body tag.

PubMed

Jong, Wouter S P; Vikström, David; Houben, Diane; van den Berg van Saparoea, H Bart; de Gier, Jan-Willem; Luirink, Joen

2017-03-21

Heterologous protein production in Escherichia coli often suffers from bottlenecks such as proteolytic degradation, complex purification procedures and toxicity towards the expression host. Production of proteins in an insoluble form in inclusion bodies (IBs) can alleviate these problems. Unfortunately, the propensity of heterologous proteins to form IBs is variable and difficult to predict. Hence, fusing the target protein to an aggregation prone polypeptide or IB-tag is a useful strategy to produce difficult-to-express proteins in an insoluble form. When screening for signal sequences that mediate optimal targeting of heterologous proteins to the periplasmic space of E. coli, we observed that fusion to the 39 amino acid signal sequence of E. coli TorA (ssTorA) did not promote targeting but rather directed high-level expression of the human proteins hEGF, Pla2 and IL-3 in IBs. Further analysis revealed that ssTorA even mediated IB formation of the highly soluble endogenous E. coli proteins TrxA and MBP. The ssTorA also induced aggregation when fused to the C-terminus of target proteins and appeared functional as IB-tag in E. coli K-12 as well as B strains. An additive effect on IB-formation was observed upon fusion of multiple ssTorA sequences in tandem, provoking almost complete aggregation of TrxA and MBP. The ssTorA-moiety was successfully used to produce the intrinsically unstable hEGF and the toxic fusion partner SymE, demonstrating its applicability as an IB-tag for difficult-to-express and toxic proteins. We present proof-of-concept for the use of ssTorA as a small, versatile tag for robust E. coli-based expression of heterologous proteins in IBs.
Diversity and Plasticity of the Intracellular Plant Pathogen and Insect Symbiont “Candidatus Liberibacter asiaticus” as Revealed by Hypervariable Prophage Genes with Intragenic Tandem Repeats ▿ †

PubMed Central

Zhou, Lijuan; Powell, Charles A.; Hoffman, Michele T.; Li, Wenbin; Fan, Guocheng; Liu, Bo; Lin, Hong; Duan, Yongping

2011-01-01

“Candidatus Liberibacter asiaticus” is a psyllid-transmitted, phloem-limited alphaproteobacterium and the most prevalent species of “Ca. Liberibacter” associated with a devastating worldwide citrus disease known as huanglongbing (HLB). Two related and hypervariable genes (hyvI and hyvII) were identified in the prophage regions of the Psy62 “Ca. Liberibacter asiaticus” genome. Sequence analyses of the hyvI and hyvII genes in 35 “Ca. Liberibacter asiaticus” DNA isolates collected globally revealed that the hyvI gene contains up to 12 nearly identical tandem repeats (NITRs, 132 bp) and 4 partial repeats, while hyvII contains up to 2 NITRs and 4 partial repeats and shares homology with hyvI. Frequent deletions or insertions of these repeats within the hyvI and hyvII genes were observed, none of which disrupted the open reading frames. Sequence conservation within the individual repeats but an extensive variation in repeat numbers, rearrangement, and the sequences flanking the repeat region indicate the diversity and plasticity of “Ca. Liberibacter asiaticus” bacterial populations in the world. These differences were found not only in samples of distinct geographical origins but also in samples from a single origin and even from a single “Ca. Liberibacter asiaticus”-infected sample. This is the first evidence of different “Ca. Liberibacter asiaticus” populations coexisting in a single HLB-affected sample. The Florida “Ca. Liberibacter asiaticus” isolates contain both hyvI and hyvII, while all other global “Ca. Liberibacter asiaticus” isolates contain either one or the other. Interclade assignments of the putative HyvI and HyvII proteins from Florida isolates with other global isolates in phylogenetic trees imply multiple “Ca. Liberibacter asiaticus” populations in the world and a multisource introduction of the “Ca. Liberibacter asiaticus” bacterium into Florida. PMID:21784907
A family of cellular proteins related to snake venom disintegrins.

PubMed

Weskamp, G; Blobel, C P

1994-03-29

Disintegrins are short soluble integrin ligands that were initially identified in snake venom. A previously recognized cellular protein with a disintegrin domain was the guinea pig sperm protein PH-30, a protein implicated in sperm-egg membrane binding and fusion. Here we present peptide sequences that are characteristic for several cellular disintegrin-domain proteins. These peptide sequences were deduced from cDNA sequence tags that were generated by polymerase chain reaction from various mouse tissue and a mouse muscle cell line. Northern blot analysis with four sequence tags revealed distinct mRNA expression patterns. Evidently, cellular proteins containing a disintegrin domain define a superfamily of potential integrin ligands that are likely to function in important cell-cell and cell-matrix interactions.
Characterization of the two intracellular lipases of Y. lipolytica encoded by TGL3 and TGL4 genes: new insights into the role of intracellular lipases and lipid body organisation.

PubMed

Dulermo, Thierry; Tréton, Brigitte; Beopoulos, Athanasios; Kabran Gnankon, Affoué Philomène; Haddouche, Ramdane; Nicaud, Jean-Marc

2013-09-01

Eukaryotes store lipids in a specialised organelle, the lipid body (LB), mainly as triglycerides (TAGs). Both the rates of synthesis and degradation contribute to the control of the accumulation of TAGs. The synthesis of TAGs in yeasts has been well documented, especially in the model yeast Saccharomyces cerevisiae and in the oleaginous yeast Yarrowia lipolytica. However, descriptions of the processes involved in TAG degradation are more scarce and mostly for S. cerevisiae. Here, we report the characterisation of two Y. lipolytica genes, YlTGL3 and YlTGL4, encoding intracellular lipases involved in TAG degradation. The two proteins are localised in lipid bodies, and YlTgl4 was mainly found at the interface between LBs. Surprisingly, the spatial organisation of YlTgl3 and YlTgl4 depends on the culture medium and on the physiological phase of the cell. Inactivation of one or both genes doubles the lipid accumulation capacity of Y. lipolytica, increasing the cell's capacity to accumulate TAGs. The amino acid sequence of YlTgl4 contains the consensus sequence motif (G/A)XSXG, typical of serine hydrolases, whereas YlTgl3 does not. Single and double mutants are unable to degrade TAGs, and higher expression of YlTgl4 correlates with TAG degradation. Therefore, we propose that YlTgl4 is the main lipase responsible for TAG degradation and that YlTgl3 may act as a positive regulator of YlTgl4 rather than a functional lipase. Thus, contrary to S. cerevisiae, Y. lipolytica possesses two intracellular lipases with distinct roles and with distinct localisations in the LB. © 2013. Published by Elsevier B.V. All rights reserved.
A normalization strategy for comparing tag count data

PubMed Central

2012-01-01

Background High-throughput sequencing, such as ribonucleic acid sequencing (RNA-seq) and chromatin immunoprecipitation sequencing (ChIP-seq) analyses, enables various features of organisms to be compared through tag counts. Recent studies have demonstrated that the normalization step for RNA-seq data is critical for a more accurate subsequent analysis of differential gene expression. Development of a more robust normalization method is desirable for identifying the true difference in tag count data. Results We describe a strategy for normalizing tag count data, focusing on RNA-seq. The key concept is to remove data assigned as potential differentially expressed genes (DEGs) before calculating the normalization factor. Several R packages for identifying DEGs are currently available, and each package uses its own normalization method and gene ranking algorithm. We compared a total of eight package combinations: four R packages (edgeR, DESeq, baySeq, and NBPSeq) with their default normalization settings and with our normalization strategy. Many synthetic datasets under various scenarios were evaluated on the basis of the area under the curve (AUC) as a measure for both sensitivity and specificity. We found that packages using our strategy in the data normalization step overall performed well. This result was also observed for a real experimental dataset. Conclusion Our results showed that the elimination of potential DEGs is essential for more accurate normalization of RNA-seq data. The concept of this normalization strategy can widely be applied to other types of tag count data and to microarray data. PMID:22475125
Mass spectrometry-based protein identification by integrating de novo sequencing with database searching.

PubMed

Wang, Penghao; Wilson, Susan R

2013-01-01

Mass spectrometry-based protein identification is a very challenging task. The main identification approaches include de novo sequencing and database searching. Both approaches have shortcomings, so an integrative approach has been developed. The integrative approach firstly infers partial peptide sequences, known as tags, directly from tandem spectra through de novo sequencing, and then puts these sequences into a database search to see if a close peptide match can be found. However the current implementation of this integrative approach has several limitations. Firstly, simplistic de novo sequencing is applied and only very short sequence tags are used. Secondly, most integrative methods apply an algorithm similar to BLAST to search for exact sequence matches and do not accommodate sequence errors well. Thirdly, by applying these methods the integrated de novo sequencing makes a limited contribution to the scoring model which is still largely based on database searching. We have developed a new integrative protein identification method which can integrate de novo sequencing more efficiently into database searching. Evaluated on large real datasets, our method outperforms popular identification methods.
A tag-based approach for high-throughput analysis of CCWGG methylation.

PubMed

Denisova, Oksana V; Chernov, Andrei V; Koledachkina, Tatyana Y; Matvienko, Nicholas I

2007-10-15

Non-CpG methylation occurring in the context of CNG sequences is found in plants at a large number of genomic loci. However, there is still little information available about non-CpG methylation in mammals. Efficient methods that would allow detection of scarcely localized methylated sites in small quantities of DNA are required to elucidate the biological role of non-CpG methylation in both plants and animals. In this study, we tested a new whole genome approach to identify sites of CCWGG methylation (W is A or T), a particular case of CNG methylation, in genomic DNA. This technique is based on digestion of DNAs with methylation-sensitive restriction endonucleases EcoRII-C and AjnI. Short DNAs flanking methylated CCWGG sites (tags) are selectively purified and assembled in tandem arrays of up to nine tags. This allows high-throughput sequencing of tags, identification of flanking regions, and their exact positions in the genome. In this study, we tested specificity and efficiency of the approach.
Short range spread-spectrum radiolocation system and method

DOEpatents

Smith, Stephen F.

2003-04-29

A short range radiolocation system and associated methods that allow the location of an item, such as equipment, containers, pallets, vehicles, or personnel, within a defined area. A small, battery powered, self-contained tag is provided to an item to be located. The tag includes a spread-spectrum transmitter that transmits a spread-spectrum code and identification information. A plurality of receivers positioned about the area receive signals from a transmitting tag. The position of the tag, and hence the item, is located by triangulation. The system employs three different ranging techniques for providing coarse, intermediate, and fine spatial position resolution. Coarse positioning information is provided by use of direct-sequence code phase transmitted as a spread-spectrum signal. Intermediate positioning information is provided by the use of a difference signal transmitted with the direct-sequence spread-spectrum code. Fine positioning information is provided by use of carrier phase measurements. An algorithm is employed to combine the three data sets to provide accurate location measurements.
Expanding the versatility of phage display I: efficient display of peptide-tags on protein VII of the filamentous phage.

PubMed

Løset, Geir Åge; Bogen, Bjarne; Sandlie, Inger

2011-02-24

Phage display is a platform for selection of specific binding molecules and this is a clear-cut motivation for increasing its performance. Polypeptides are normally displayed as fusions to the major coat protein VIII (pVIII), or the minor coat protein III (pIII). Display on other coat proteins such as pVII allows for display of heterologous peptide sequences on the virions in addition to those displayed on pIII and pVIII. In addition, pVII display is an alternative to pIII or pVIII display. Here we demonstrate how standard pIII or pVIII display phagemids are complemented with a helper phage which supports production of virions that are tagged with octa FLAG, HIS(6) or AviTag on pVII. The periplasmic signal sequence required for pIII and pVIII display, and which has been added to pVII in earlier studies, is omitted altogether. Tagging on pVII is an important and very useful add-on feature to standard pIII and pVII display. Any phagemid bearing a protein of interest on either pIII or pVIII can be tagged with any of the tags depending simply on choice of helper phage. We show in this paper how such tags may be utilized for immobilization and separation as well as purification and detection of monoclonal and polyclonal phage populations.
The use of sequence-based SSR mining for the development of a vast collection of microsatellites in Aquilegia Formosa

Treesearch

Brandon Schlautman; Vera Pfeiffer; Juan Zalapa; Johanne Brunet

2014-01-01

Numerous microsatellite markers were developed for Aquilegia formosafrom sequences deposited within the Expressed Sequence Tag (EST), Genomic Survey Sequence (GSS), and Nucleotide databases in NCBI. Microsatellites (SSRs) were identified and primers were designed for 9 SSR containing sequences in the Nucleotide database, 3803 sequences in the EST...
A 28,000 Years Old Cro-Magnon mtDNA Sequence Differs from All Potentially Contaminating Modern Sequences

PubMed Central

Caramelli, David; Milani, Lucio; Vai, Stefania; Modi, Alessandra; Pecchioli, Elena; Girardi, Matteo; Pilli, Elena; Lari, Martina; Lippi, Barbara; Ronchitelli, Annamaria; Mallegni, Francesco; Casoli, Antonella; Bertorelle, Giorgio; Barbujani, Guido

2008-01-01

Background DNA sequences from ancient speciments may in fact result from undetected contamination of the ancient specimens by modern DNA, and the problem is particularly challenging in studies of human fossils. Doubts on the authenticity of the available sequences have so far hampered genetic comparisons between anatomically archaic (Neandertal) and early modern (Cro-Magnoid) Europeans. Methodology/Principal Findings We typed the mitochondrial DNA (mtDNA) hypervariable region I in a 28,000 years old Cro-Magnoid individual from the Paglicci cave, in Italy (Paglicci 23) and in all the people who had contact with the sample since its discovery in 2003. The Paglicci 23 sequence, determined through the analysis of 152 clones, is the Cambridge reference sequence, and cannot possibly reflect contamination because it differs from all potentially contaminating modern sequences. Conclusions/Significance: The Paglicci 23 individual carried a mtDNA sequence that is still common in Europe, and which radically differs from those of the almost contemporary Neandertals, demonstrating a genealogical continuity across 28,000 years, from Cro-Magnoid to modern Europeans. Because all potential sources of modern DNA contamination are known, the Paglicci 23 sample will offer a unique opportunity to get insight for the first time into the nuclear genes of early modern Europeans. PMID:18628960
Characterisation of caecum and crop microbiota of Indian indigenous chicken targeting multiple hypervariable regions within 16S rRNA gene.

PubMed

Saxena, S; Saxena, V K; Tomar, S; Sapcota, D; Gonmei, G

2016-06-01

A comparative analysis of caecum and crop microbiota of chick, grower and adult stages of Indian indigenous chickens was conducted to investigate the role of the microbiota of the gastrointestinal tract, which play an important role in host performance, health and immunity. High-throughput Illumina sequencing was performed for V3, V4 and V4-V6 hypervariable regions of the 16S rRNA gene. M5RNA and M5NR databases under MG-RAST were used for metagenomic datasets annotation. In the crop, Firmicutes (~78%) and Proteobacteria (~16%) were the predominant phyla whereas in the caecum, Firmicutes (~50%), Bacteroidetes (~29%) and Actinobacteria (~10%) were predominant. The Shannon-Wiener diversity index suggested that sample richness and diversity increased as the chicken aged. For the first time, the presence of Lactobacillus species such as L. frumenti, L. antri, L. mucosae in the chicken crop along with Kineococcus radiotolerans, Desulfohalobium retbaense and L. jensenii in the caecum are reported. Many of these bacterial species have been found to be involved in immune response modulation and disease prevention in pigs and humans. The gut microbiome of the indigenous chicken was enriched with microbes having probiotic potential which might be essential for their adaptability.

Bacterial RecA Protein Promotes Adenoviral Recombination during In Vitro Infection

PubMed Central

Lee, Jeong Yoon; Lee, Ji Sun; Materne, Emma C.; Rajala, Rahul; Ismail, Ashrafali M.; Seto, Donald; Dyer, David W.

2018-01-01

ABSTRACT Adenovirus infections in humans are common and sometimes lethal. Adenovirus-derived vectors are also commonly chosen for gene therapy in human clinical trials. We have shown in previous work that homologous recombination between adenoviral genomes of human adenovirus species D (HAdV-D), the largest and fastest growing HAdV species, is responsible for the rapid evolution of this species. Because adenovirus infection initiates in mucosal epithelia, particularly at the gastrointestinal, respiratory, genitourinary, and ocular surfaces, we sought to determine a possible role for mucosal microbiota in adenovirus genome diversity. By analysis of known recombination hot spots across 38 human adenovirus genomes in species D (HAdV-D), we identified nucleotide sequence motifs similar to bacterial Chi sequences, which facilitate homologous recombination in the presence of bacterial Rec enzymes. These motifs, referred to here as ChiAD, were identified immediately 5′ to the sequence encoding penton base hypervariable loop 2, which expresses the arginine-glycine-aspartate moiety critical to adenoviral cellular entry. Coinfection with two HAdV-Ds in the presence of an Escherichia coli lysate increased recombination; this was blocked in a RecA mutant strain, E. coli DH5α, or upon RecA depletion. Recombination increased in the presence of E. coli lysate despite a general reduction in viral replication. RecA colocalized with viral DNA in HAdV-D-infected cell nuclei and was shown to bind specifically to ChiAD sequences. These results indicate that adenoviruses may repurpose bacterial recombination machinery, a sharing of evolutionary mechanisms across a diverse microbiota, and unique example of viral commensalism. IMPORTANCE Adenoviruses are common human mucosal pathogens of the gastrointestinal, respiratory, and genitourinary tracts and ocular surface. Here, we report finding Chi-like sequences in adenovirus recombination hot spots. Adenovirus coinfection in the presence of bacterial RecA protein facilitated homologous recombination between viruses. Genetic recombination led to evolution of an important external feature on the adenoviral capsid, namely, the penton base protein hypervariable loop 2, which contains the arginine-glycine-aspartic acid motif critical to viral internalization. We speculate that free Rec proteins present in gastrointestinal secretions upon bacterial cell death facilitate the evolution of human adenoviruses through homologous recombination, an example of viral commensalism and the complexity of virus-host interactions, including regional microbiota. PMID:29925671
Structure-function analysis of diacylglycerol acyltransferase sequences for metabolic engineering and drug discovery

USDA-ARS?s Scientific Manuscript database

Diacylglycerol acyltransferase families (DGATs) catalyze the final and rate-limiting step of triacylglycerol (TAG) biosynthesis in eukaryotic organisms. DGAT knockout mice are resistant to diet-induced obesity and lack milk secretion. Over-expression of DGATs increases TAG in plants. Therefore, unde...
Transcriptome profile analysis of young floral buds of fertile and sterile plants from the self-pollinated offspring of the hybrid between novel restorer line NR1 and Nsa CMS line in Brassica napus

PubMed Central

2013-01-01

Background The fertile and sterile plants were derived from the self-pollinated offspring of the F1 hybrid between the novel restorer line NR1 and the Nsa CMS line in Brassica napus. To elucidate gene expression and regulation caused by the A and C subgenomes of B. napus, as well as the alien chromosome and cytoplasm from Sinapis arvensis during the development of young floral buds, we performed a genome-wide high-throughput transcriptomic sequencing for young floral buds of sterile and fertile plants. Results In this study, equal amounts of total RNAs taken from young floral buds of sterile and fertile plants were sequenced using the Illumina/Solexa platform. After filtered out low quality data, a total of 2,760,574 and 2,714,441 clean tags were remained in the two libraries, from which 242,163 (Ste) and 253,507 (Fer) distinct tags were obtained. All distinct sequencing tags were annotated using all possible CATG+17-nt sequences of the genome and transcriptome of Brassica rapa and those of Brassica oleracea as the reference sequences, respectively. In total, 3231 genes of B. rapa and 3371 genes of B. oleracea were detected with significant differential expression levels. GO and pathway-based analyses were performed to determine and further to understand the biological functions of those differentially expressed genes (DEGs). In addition, there were 1089 specially expressed unknown tags in Fer, which were neither mapped to B. oleracea nor to B. rapa, and these unique tags were presumed to arise basically from the added alien chromosome of S. arvensis. Fifteen genes were randomly selected and their expression levels were confirmed by quantitative RT-PCR, and fourteen of them showed consistent expression patterns with the digital gene expression (DGE) data. Conclusions A number of genes were differentially expressed between the young floral buds of sterile and fertile plants. Some of these genes may be candidates for future research on CMS in Nsa line, fertility restoration and improved agronomic traits in NR1 line. Further study of the unknown tags which were specifically expressed in Fer will help to explore desirable agronomic traits from wild species. PMID:23324545
Transcriptome profile analysis of young floral buds of fertile and sterile plants from the self-pollinated offspring of the hybrid between novel restorer line NR1 and Nsa CMS line in Brassica napus.

PubMed

Yan, Xiaohong; Dong, Caihua; Yu, Jingyin; Liu, Wanghui; Jiang, Chenghong; Liu, Jia; Hu, Qiong; Fang, Xiaoping; Wei, Wenhui

2013-01-16

The fertile and sterile plants were derived from the self-pollinated offspring of the F1 hybrid between the novel restorer line NR1 and the Nsa CMS line in Brassica napus. To elucidate gene expression and regulation caused by the A and C subgenomes of B. napus, as well as the alien chromosome and cytoplasm from Sinapis arvensis during the development of young floral buds, we performed a genome-wide high-throughput transcriptomic sequencing for young floral buds of sterile and fertile plants. In this study, equal amounts of total RNAs taken from young floral buds of sterile and fertile plants were sequenced using the Illumina/Solexa platform. After filtered out low quality data, a total of 2,760,574 and 2,714,441 clean tags were remained in the two libraries, from which 242,163 (Ste) and 253,507 (Fer) distinct tags were obtained. All distinct sequencing tags were annotated using all possible CATG+17-nt sequences of the genome and transcriptome of Brassica rapa and those of Brassica oleracea as the reference sequences, respectively. In total, 3231 genes of B. rapa and 3371 genes of B. oleracea were detected with significant differential expression levels. GO and pathway-based analyses were performed to determine and further to understand the biological functions of those differentially expressed genes (DEGs). In addition, there were 1089 specially expressed unknown tags in Fer, which were neither mapped to B. oleracea nor to B. rapa, and these unique tags were presumed to arise basically from the added alien chromosome of S. arvensis. Fifteen genes were randomly selected and their expression levels were confirmed by quantitative RT-PCR, and fourteen of them showed consistent expression patterns with the digital gene expression (DGE) data. A number of genes were differentially expressed between the young floral buds of sterile and fertile plants. Some of these genes may be candidates for future research on CMS in Nsa line, fertility restoration and improved agronomic traits in NR1 line. Further study of the unknown tags which were specifically expressed in Fer will help to explore desirable agronomic traits from wild species.
Functional hypervariability and gene diversity of cardioactive neuropeptides.

PubMed

Möller, Carolina; Melaun, Christian; Castillo, Cecilia; Díaz, Mary E; Renzelman, Chad M; Estrada, Omar; Kuch, Ulrich; Lokey, Scott; Marí, Frank

2010-12-24

Crustacean cardioactive peptide (CCAP) and related peptides are multifunctional regulatory neurohormones found in invertebrates. We isolated a CCAP-related peptide (conoCAP-a, for cone snail CardioActive Peptide) and cloned the cDNA of its precursor from venom of Conus villepinii. The precursor of conoCAP-a encodes for two additional CCAP-like peptides: conoCAP-b and conoCAP-c. This multi-peptide precursor organization is analogous to recently predicted molluscan CCAP-like preprohormones, and suggests a mechanism for the generation of biological diversification without gene amplification. While arthropod CCAP is a cardio-accelerator, we found that conoCAP-a decreases the heart frequency in Drosophila larvae, demonstrating that conoCAP-a and CCAP have opposite effects. Intravenous injection of conoCAP-a in rats caused decreased heart frequency and blood pressure in contrast to the injection of CCAP, which did not elicit any cardiac effect. Perfusion of rat ventricular cardiac myocytes with conoCAP-a decreased systolic calcium, indicating that conoCAP-a cardiac negative inotropic effects might be mediated via impairment of intracellular calcium trafficking. The contrasting cardiac effects of conoCAP-a and CCAP indicate that molluscan CCAP-like peptides have functions that differ from those of their arthropod counterparts. Molluscan CCAP-like peptides sequences, while homologous, differ between taxa and have unique sequences within a species. This relates to the functional hypervariability of these peptides as structure activity relationship studies demonstrate that single amino acids variations strongly affect cardiac activity. The discovery of conoCAPs in cone snail venom emphasizes the significance of their gene plasticity to have mutations as an adaptive evolution in terms of structure, cellular site of expression, and physiological functions.
Functional Hypervariability and Gene Diversity of Cardioactive Neuropeptides*

PubMed Central

Möller, Carolina; Melaun, Christian; Castillo, Cecilia; Díaz, Mary E.; Renzelman, Chad M.; Estrada, Omar; Kuch, Ulrich; Lokey, Scott; Marí, Frank

2010-01-01

Crustacean cardioactive peptide (CCAP) and related peptides are multifunctional regulatory neurohormones found in invertebrates. We isolated a CCAP-related peptide (conoCAP-a, for cone snail CardioActive Peptide) and cloned the cDNA of its precursor from venom of Conus villepinii. The precursor of conoCAP-a encodes for two additional CCAP-like peptides: conoCAP-b and conoCAP-c. This multi-peptide precursor organization is analogous to recently predicted molluscan CCAP-like preprohormones, and suggests a mechanism for the generation of biological diversification without gene amplification. While arthropod CCAP is a cardio-accelerator, we found that conoCAP-a decreases the heart frequency in Drosophila larvae, demonstrating that conoCAP-a and CCAP have opposite effects. Intravenous injection of conoCAP-a in rats caused decreased heart frequency and blood pressure in contrast to the injection of CCAP, which did not elicit any cardiac effect. Perfusion of rat ventricular cardiac myocytes with conoCAP-a decreased systolic calcium, indicating that conoCAP-a cardiac negative inotropic effects might be mediated via impairment of intracellular calcium trafficking. The contrasting cardiac effects of conoCAP-a and CCAP indicate that molluscan CCAP-like peptides have functions that differ from those of their arthropod counterparts. Molluscan CCAP-like peptides sequences, while homologous, differ between taxa and have unique sequences within a species. This relates to the functional hypervariability of these peptides as structure activity relationship studies demonstrate that single amino acids variations strongly affect cardiac activity. The discovery of conoCAPs in cone snail venom emphasizes the significance of their gene plasticity to have mutations as an adaptive evolution in terms of structure, cellular site of expression, and physiological functions. PMID:20923766
Mitochondrial DNA history of Sri Lankan ethnic people: their relations within the island and with the Indian subcontinental populations.

PubMed

Ranaweera, Lanka; Kaewsutthi, Supannee; Win Tun, Aung; Boonyarit, Hathaichanoke; Poolsuwan, Samerchai; Lertrit, Patcharee

2014-01-01

Located only a short distance off the southernmost shore of the Greater Indian subcontinent, the island of Sri Lanka has long been inhabited by various ethnic populations. Mainly comprising the Vedda, Sinhalese (Up- and Low-country) and Tamil (Sri Lankan and Indian); their history of settlements on the island and the biological relationships among them have remained obscure. It has been hypothesized that the Vedda was probably the earliest inhabitants of the area, followed by Sinhalese and Tamil from the Indian mainland. This study, in which 271 individuals, representing the Sri Lankan ethnic populations mentioned, were typed for their mitochondrial DNA (mtDNA) hypervariable segment 1 (HVS-1) and part of hypervariable segment 2 (HVS-2), provides implications for their settlement history on the island. From the phylogenetic, principal coordinate and analysis of molecular variance results, the Vedda occupied a position separated from all other ethnic people of the island, who formed relatively close affiliations among themselves, suggesting a separate origin of the former. The haplotypes and analysis of molecular variance revealed that Vedda people's mitochondrial sequences are more related to the Sinhalese and Sri Lankan Tamils' than the Indian Tamils' sequences. MtDNA haplogroup analysis revealed that several West Eurasian haplogroups as well as Indian-specific mtDNA clades were found amongst the Sri Lankan populations. Through a comparison with the mtDNA HVS-1 and part of HVS-2 of Indian database, both Tamils and Sinhalese clusters were affiliated with Indian subcontinent populations than Vedda people who are believed to be the native population of the island of Sri Lanka.
Use of a Molecular Decoy to Segregate Transport from Antigenicity in the FrpB Iron Transporter from Neisseria meningitidis

PubMed Central

Saleem, Muhammad; Prince, Stephen M.; Rigby, Stephen E. J.; Imran, Muhammad; Patel, Hema; Chan, Hannah; Sanders, Holly; Maiden, Martin C. J.; Feavers, Ian M.; Derrick, Jeremy P.

2013-01-01

FrpB is an outer membrane transporter from Neisseria meningitidis, the causative agent of meningococcal meningitis. It is a member of the TonB-dependent transporter (TBDT) family and is responsible for iron uptake into the periplasm. FrpB is subject to a high degree of antigenic variation, principally through a region of hypervariable sequence exposed at the cell surface. From the crystal structures of two FrpB antigenic variants, we identify a bound ferric ion within the structure which induces structural changes on binding which are consistent with it being the transported substrate. Binding experiments, followed by elemental analysis, verified that FrpB binds Fe3+ with high affinity. EPR spectra of the bound Fe3+ ion confirmed that its chemical environment was consistent with that observed in the crystal structure. Fe3+ binding was reduced or abolished on mutation of the Fe3+-chelating residues. FrpB orthologs were identified in other Gram-negative bacteria which showed absolute conservation of the coordinating residues, suggesting the existence of a specific TBDT sub-family dedicated to the transport of Fe3+. The region of antigenic hypervariability lies in a separate, external sub-domain, whose structure is conserved in both the F3-3 and F5-1 variants, despite their sequence divergence. We conclude that the antigenic sub-domain has arisen separately as a result of immune selection pressure to distract the immune response from the primary transport function. This would enable FrpB to function as a transporter independently of antibody binding, by using the antigenic sub-domain as a ‘molecular decoy’ to distract immune surveillance. PMID:23457610
Use of a molecular decoy to segregate transport from antigenicity in the FrpB iron transporter from Neisseria meningitidis.

PubMed

Saleem, Muhammad; Prince, Stephen M; Rigby, Stephen E J; Imran, Muhammad; Patel, Hema; Chan, Hannah; Sanders, Holly; Maiden, Martin C J; Feavers, Ian M; Derrick, Jeremy P

2013-01-01

FrpB is an outer membrane transporter from Neisseria meningitidis, the causative agent of meningococcal meningitis. It is a member of the TonB-dependent transporter (TBDT) family and is responsible for iron uptake into the periplasm. FrpB is subject to a high degree of antigenic variation, principally through a region of hypervariable sequence exposed at the cell surface. From the crystal structures of two FrpB antigenic variants, we identify a bound ferric ion within the structure which induces structural changes on binding which are consistent with it being the transported substrate. Binding experiments, followed by elemental analysis, verified that FrpB binds Fe(3+) with high affinity. EPR spectra of the bound Fe(3+) ion confirmed that its chemical environment was consistent with that observed in the crystal structure. Fe(3+) binding was reduced or abolished on mutation of the Fe(3+)-chelating residues. FrpB orthologs were identified in other Gram-negative bacteria which showed absolute conservation of the coordinating residues, suggesting the existence of a specific TBDT sub-family dedicated to the transport of Fe(3+). The region of antigenic hypervariability lies in a separate, external sub-domain, whose structure is conserved in both the F3-3 and F5-1 variants, despite their sequence divergence. We conclude that the antigenic sub-domain has arisen separately as a result of immune selection pressure to distract the immune response from the primary transport function. This would enable FrpB to function as a transporter independently of antibody binding, by using the antigenic sub-domain as a 'molecular decoy' to distract immune surveillance.
RAD tag sequencing as a source of SNP markers in Cynara cardunculus L

PubMed Central

2012-01-01

Background The globe artichoke (Cynara cardunculus L. var. scolymus) genome is relatively poorly explored, especially compared to those of the other major Asteraceae crops sunflower and lettuce. No SNP markers are in the public domain. We have combined the recently developed restriction-site associated DNA (RAD) approach with the Illumina DNA sequencing platform to effect the rapid and mass discovery of SNP markers for C. cardunculus. Results RAD tags were sequenced from the genomic DNA of three C. cardunculus mapping population parents, generating 9.7 million reads, corresponding to ~1 Gbp of sequence. An assembly based on paired ends produced ~6.0 Mbp of genomic sequence, separated into ~19,000 contigs (mean length 312 bp), of which ~21% were fragments of putative coding sequence. The shared sequences allowed for the discovery of ~34,000 SNPs and nearly 800 indels, equivalent to a SNP frequency of 5.6 per 1,000 nt, and an indel frequency of 0.2 per 1,000 nt. A sample of heterozygous SNP loci was mapped by CAPS assays and this exercise provided validation of our mining criteria. The repetitive fraction of the genome had a high representation of retrotransposon sequence, followed by simple repeats, AT-low complexity regions and mobile DNA elements. The genomic k-mers distribution and CpG rate of C. cardunculus, compared with data derived from three whole genome-sequenced dicots species, provided a further evidence of the random representation of the C. cardunculus genome generated by RAD sampling. Conclusion The RAD tag sequencing approach is a cost-effective and rapid method to develop SNP markers in a highly heterozygous species. Our approach permitted to generate a large and robust SNP datasets by the adoption of optimized filtering criteria. PMID:22214349
Tilting the balance between canonical and noncanonical conformations for the H1 hypervariable loop of a llama VHH through point mutations.

PubMed

Mahajan, Sai Pooja; Velez-Vega, Camilo; Escobedo, Fernando A

2013-01-10

Nanobodies are single-domain antibodies found in camelids. These are the smallest naturally occurring binding domains and derive functionality via three hypervariable loops (H1-H3) that form the binding surface. They are excellent candidates for antibody engineering because of their favorable characteristics like small size, high solubility, and stability. To rationally engineer antibodies with affinity for a specific target, the hypervariable loops can be tailored to obtain the desired binding surface. As a first step toward such a goal, we consider the design of loops with a desired conformation. In this study, we focus on the H1 loop of the anti-hCG llama nanobody that exhibits a noncanonical conformation. We aim to "tilt" the stability of the H1 loop structure from a noncanonical conformation to a (humanized) type 1 canonical conformation by studying the effect of selected mutations to the amino acid sequence of the H1, H2, and proximal residues. We use all-atomistic, explicit-solvent, biased molecular dynamic simulations to simulate the wild-type and mutant loops in a prefolded framework. We thus find mutants with increasing propensity to form a stable type 1 canonical conformation of the H1 loop. Free energy landscapes reveal the existence of conformational isomers of the canonical conformation that may play a role in binding different antigenic surfaces. We also elucidate the approximate mechanism and kinetics of transitions between such conformational isomers by using a Markovian model. We find that a particular three-point mutant has the strongest thermodynamic propensity to form the H1 type 1 canonical structure but also to exhibit transitions between conformational isomers, while a different, more rigid three-point mutant has the strongest propensity to be kinetically trapped in such a canonical structure.
A SSR-based genetic linkage map of cultivated peanut (Arachis hypogaea L.)

USDA-ARS?s Scientific Manuscript database

The objective of this study was to construct a molecular linkage map of cultivated tetraploid peanut using simple sequence repeat (SSR) markers derived primarily from peanut genomic sequences, expressed sequence tags (ESTs), and by "data mining" sequences released in GenBank. Three recombinant inbre...
Sequence analysis reveals genomic factors affecting EST-SSR primer performance and polymorphism

USDA-ARS?s Scientific Manuscript database

Search for simple sequence repeat (SSR) motifs and design of flanking primers in expressed sequence tag (EST) sequences can be easily done at a large scale using bioinformatics programs. However, failed amplification and/or detection, along with lack of polymorphism, is often seen among randomly sel...
PRRSV strain VR-2332 Nsp2 deletion mutants attenuate clinical symptoms in swine

USDA-ARS?s Scientific Manuscript database

PRRSV nonstructural protein 2 (nsp2) contains a N-terminal cysteine proteinase (PL2) domain, a middle hypervariable region and C-terminal putative transmembrane domain. Prior studies had shown that as much as 403 amino acids could be removed from the hypervariable region without losing virus viabil...
Regional spread of HIV-1 M subtype B in middle-aged patients by random env-C2V4 region sequencing

PubMed Central

Stürmer, Martin; Zimmermann, Katrin; Fritzsche, Carlos; Reisinger, Emil; Doelken, Gottfried; Berger, Annemarie; Doerr, Hans W.; Eberle, Josef

2010-01-01

A transmission cluster of HIV-1 M:B was identified in 11 patients with a median age of 52 (range 26–65) in North-East Germany by C2V4 region sequencing of the env gene of HIV-1, who—except of one—were not aware of any risky behaviour. The 10 male and 1 female patients deteriorated immunologically, according to their information made available, within 4 years after a putative HIV acquisition. Nucleic acid sequence analysis showed a R5 virus in all patients and in 7 of 11 a crown motif of the V3 loop, GPGSALFTT, which is found rarely. Analysis of formation of this cluster showed that there is still a huge discrepancy between awareness and behaviour regarding HIV transmission in middle-aged patients, and that a local outbreak can be detected by nucleic acid analysis of the hypervariable env region. PMID:20217125
'Mitominis': multiplex PCR analysis of reduced size amplicons for compound sequence analysis of the entire mtDNA control region in highly degraded samples.

PubMed

Eichmann, Cordula; Parson, Walther

2008-09-01

The traditional protocol for forensic mitochondrial DNA (mtDNA) analyses involves the amplification and sequencing of the two hypervariable segments HVS-I and HVS-II of the mtDNA control region. The primers usually span fragment sizes of 300-400 bp each region, which may result in weak or failed amplification in highly degraded samples. Here we introduce an improved and more stable approach using shortened amplicons in the fragment range between 144 and 237 bp. Ten such amplicons were required to produce overlapping fragments that cover the entire human mtDNA control region. These were co-amplified in two multiplex polymerase chain reactions and sequenced with the individual amplification primers. The primers were carefully selected to minimize binding on homoplasic and haplogroup-specific sites that would otherwise result in loss of amplification due to mis-priming. The multiplexes have successfully been applied to ancient and forensic samples such as bones and teeth that showed a high degree of degradation.
Parallel gene analysis with allele-specific padlock probes and tag microarrays

PubMed Central

Banér, Johan; Isaksson, Anders; Waldenström, Erik; Jarvius, Jonas; Landegren, Ulf; Nilsson, Mats

2003-01-01

Parallel, highly specific analysis methods are required to take advantage of the extensive information about DNA sequence variation and of expressed sequences. We present a scalable laboratory technique suitable to analyze numerous target sequences in multiplexed assays. Sets of padlock probes were applied to analyze single nucleotide variation directly in total genomic DNA or cDNA for parallel genotyping or gene expression analysis. All reacted probes were then co-amplified and identified by hybridization to a standard tag oligonucleotide array. The technique was illustrated by analyzing normal and pathogenic variation within the Wilson disease-related ATP7B gene, both at the level of DNA and RNA, using allele-specific padlock probes. PMID:12930977
A new fusion protein platform for quantitatively measuring activity of multiple proteases

PubMed Central

2014-01-01

Background Recombinant proteins fused with specific cleavage sequences are widely used as substrate for quantitatively analyzing the activity of proteases. Here we propose a new fusion platform for multiple proteases, by using diaminopropionate ammonia-lyase (DAL) as the fusion protein. It was based on the finding that a fused His6-tag could significantly decreases the activities of DAL from E. coli (eDAL) and Salmonella typhimurium (sDAL). Previously, we have shown that His6GST-tagged eDAL could be used to determine the activity of tobacco etch virus protease (TEVp) under different temperatures or in the denaturant at different concentrations. In this report, we will assay different tags and cleavage sequences on DAL for expressing yield in E. coli, stability of the fused proteins and performance of substrate of other common proteases. Results We tested seven different protease cleavage sequences (rhinovirus 3C, TEV protease, factor Xa, Ssp DnaB intein, Sce VMA1 intein, thrombin and enterokinase), three different tags (His6, GST, CBD and MBP) and two different DALs (eDAL and sDAL), for their performance as substrate to the seven corresponding proteases. Among them, we found four active DAL-fusion substrates suitable for TEVp, factor Xa, thrombin and DnaB intein. Enterokinase cleaved eDAL at undesired positions and did not process sDAL. Substitution of GST with MBP increase the expression level of the fused eDAL and this fusion protein was suitable as a substrate for analyzing activity of rhinovirus 3C. We demonstrated that SUMO protease Ulp1 with a N-terminal His6-tag or MBP tag displayed different activity using the designed His6SUMO-eDAL as substrate. Finally, owing to the high level of the DAL-fusion protein in E. coli, these protein substrates can also be detected directly from the crude extract. Conclusion The results show that our designed DAL-fusion proteins can be used to quantify the activities of both sequence- and conformational-specific proteases, with sufficient substrate specificity. PMID:24649897
Microsatellite DNA in genomic survey sequences and UniGenes of loblolly pine

Treesearch

Craig S Echt; Surya Saha; Dennis L Deemer; C Dana Nelson

2011-01-01

Genomic DNA sequence databases are a potential and growing resource for simple sequence repeat (SSR) marker development in loblolly pine (Pinus taeda L.). Loblolly pine also has many expressed sequence tags (ESTs) available for microsatellite (SSR) marker development. We compared loblolly pine SSR densities in genome survey sequences (GSSs) to those in non-redundant...
A microsphere-based assay for mutation analysis of the biotinidase gene using dried blood spots

PubMed Central

Lindau-Shepard, Barbara; Janik, David K.; Pass, Kenneth A.

2012-01-01

Biotinidase deficiency is an autosomal recessive syndrome caused by defects in the biotinidase gene, the product of which affects biotin metabolism. Newborn screening (NBS) for biotinidase deficiency can identify affected infants prior to onset of symptoms; biotin supplementation can resolve or prevent the clinical features. In NBS, dry blood spots (DBS) are usually tested for biotinidase enzyme activity by colorimetric analysis. By taking advantage of the multiplexing capabilities of the Luminex platform, we have developed a microsphere-based array genotyping method for the simultaneous detection of six disease causing mutations in the biotinidase gene, thereby permitting a second tier of molecular analysis. Genomic DNA was extracted from 3.2 mm DBS. Biotinidase gene sequences, containing the mutations of interest, were amplified by multiplexed polymerase chain reaction, followed by multiplexed allele-specific primer extension using universally tagged genotyping primers. The products were then hybridized to anti-tag carrying xTAG microspheres and detected on the Luminex platform. Genotypes were verified by sequencing. Genotyping results of 22 known biotinidase deficient samples by our xTAG biotinidase assay was in concordance with the results obtained from DNA sequencing, for all 6 mutations used in our panel. These results indicate that genotyping by an xTAG microsphere-based array is accurate, flexible, and can be adapted for high-throughput. Since NBS for biotinidase deficiency is by enzymatic assay, less than optimal quality of the DBS itself can compromise enzyme activity, while the DNA from these samples mostly remains unaffected. This assay warrants evaluation as a viable complement to the biotinidase semi-quantitative colorimetric assay. PMID:27625817

Illumina MiSeq Sequencing for Preliminary Analysis of Microbiome Causing Primary Endodontic Infections in Egypt

PubMed Central

Azab, Marwa Mohamed; Fayyad, Dalia Mukhtar

2018-01-01

The use of high throughput next generation technologies has allowed more comprehensive analysis than traditional Sanger sequencing. The specific aim of this study was to investigate the microbial diversity of primary endodontic infections using Illumina MiSeq sequencing platform in Egyptian patients. Samples were collected from 19 patients in Suez Canal University Hospital (Endodontic Department) using sterile # 15K file and paper points. DNA was extracted using Mo Bio power soil DNA isolation extraction kit followed by PCR amplification and agarose gel electrophoresis. The microbiome was characterized on the basis of the V3 and V4 hypervariable region of the 16S rRNA gene by using paired-end sequencing on Illumina MiSeq device. MOTHUR software was used in sequence filtration and analysis of sequenced data. A total of 1858 operational taxonomic units at 97% similarity were assigned to 26 phyla, 245 families, and 705 genera. Four main phyla Firmicutes, Bacteroidetes, Proteobacteria, and Synergistetes were predominant in all samples. At genus level, Prevotella, Bacillus, Porphyromonas, Streptococcus, and Bacteroides were the most abundant. Illumina MiSeq platform sequencing can be used to investigate oral microbiome composition of endodontic infections. Elucidating the ecology of endodontic infections is a necessary step in developing effective intracanal antimicrobials. PMID:29849646
A rare variant of the mtDNA HVS1 sequence in the hairs of Napoléon's family.

PubMed

Lucotte, Gérard

2010-10-04

This paper describes the finding of a rare variant in the sequence of the hypervariable segment (HVS1) of mitochondrial (mtDNA) extracted from two preserved hairs, authenticated as belonging to the French Emperor Napoléon I (Napoléon Bonaparte). This rare variant is a mutation that changes the base C to T at position 16,184 (16184C→T), and it constitutes the only mutation found in this HVS1 sequence. This mutation is rare, because it was not found in a reference database (P < 0.05). In a personal database (M. Pala) comprising 37,000 different sequences, the 16184C→T mutation was found in only three samples, thus in this database the mutation frequency was 0.00008%. This mutation 16184C→T was also the only variant found subsequently in the HVS1 sequences of mtDNAs extracted from Napoléon's mother (Letizia) and from his youngest sister (Caroline), confirming that this mutation is maternally inherited. This 16184C→T variant could be used for genetic verification to authenticate any doubtful material and determine whether it should indeed be attributed to Napoléon.
A rare variant of the mtDNA HVS1 sequence in the hairs of Napoléon's family

PubMed Central

2010-01-01

This paper describes the finding of a rare variant in the sequence of the hypervariable segment (HVS1) of mitochondrial (mtDNA) extracted from two preserved hairs, authenticated as belonging to the French Emperor Napoléon I (Napoléon Bonaparte). This rare variant is a mutation that changes the base C to T at position 16,184 (16184C→T), and it constitutes the only mutation found in this HVS1 sequence. This mutation is rare, because it was not found in a reference database (P < 0.05). In a personal database (M. Pala) comprising 37,000 different sequences, the 16184C→T mutation was found in only three samples, thus in this database the mutation frequency was 0.00008%. This mutation 16184C→T was also the only variant found subsequently in the HVS1 sequences of mtDNAs extracted from Napoléon's mother (Letizia) and from his youngest sister (Caroline), confirming that this mutation is maternally inherited. This 16184C→T variant could be used for genetic verification to authenticate any doubtful material and determine whether it should indeed be attributed to Napoléon. PMID:21092341
SEAN: SNP prediction and display program utilizing EST sequence clusters.

PubMed

Huntley, Derek; Baldo, Angela; Johri, Saurabh; Sergot, Marek

2006-02-15

SEAN is an application that predicts single nucleotide polymorphisms (SNPs) using multiple sequence alignments produced from expressed sequence tag (EST) clusters. The algorithm uses rules of sequence identity and SNP abundance to determine the quality of the prediction. A Java viewer is provided to display the EST alignments and predicted SNPs.
Inventory of high-abundance mRNAs in skeletal muscle of normal men.

PubMed

Welle, S; Bhatt, K; Thornton, C A

1999-05-01

G42875rial analysis of gene expression (SAGE) method was used to generate a catalog of 53,875 short (14 base) expressed sequence tags from polyadenylated RNA obtained from vastus lateralis muscle of healthy young men. Over 12,000 unique tags were detected. The frequency of occurrence of each tag reflects the relative abundance of the corresponding mRNA. The mRNA species that were detected 10 or more times, each comprising >/=0.02% of the mRNA population, accounted for 64% of the mRNA mass but <10% of the total number of mRNA species detected. Almost all of the abundant tags matched mRNA or EST sequences cataloged in GenBank. Mitochondrial transcripts accounted for approximately 20% of the polyadenylated RNA. Transcripts encoding proteins of the myofibrils were the most abundant nuclear-encoded mRNAs. Transcripts encoding ribosomal proteins, and those encoding proteins involved in energy metabolism, also were very abundant. The database can be used as a reference for investigations of alterations in gene expression associated with conditions that influence muscle function, such as muscular dystrophies, aging, and exercise.
Rapid Covalent Fluorescence Labeling of Membrane Proteins on Live Cells via Coiled-Coil Templated Acyl Transfer.

PubMed

Reinhardt, Ulrike; Lotze, Jonathan; Mörl, Karin; Beck-Sickinger, Annette G; Seitz, Oliver

2015-10-21

Fluorescently labeled proteins enable the microscopic imaging of protein localization and function in live cells. In labeling reactions targeted against specific tag sequences, the size of the fluorophore-tag is of major concern. The tag should be small to prevent interference with protein function. Furthermore, rapid and covalent labeling methods are desired to enable the analysis of fast biological processes. Herein, we describe the development of a method in which the formation of a parallel coiled coil triggers the transfer of a fluorescence dye from a thioester-linked coil peptide conjugate onto a cysteine-modified coil peptide. This labeling method requires only small tag sequences (max 23 aa) and occurs with high tag specificity. We show that size matching of the coil peptides and a suitable thioester reactivity allow the acyl transfer reaction to proceed within minutes (rather than hours). We demonstrate the versatility of this method by applying it to the labeling of different G-protein coupled membrane receptors including the human neuropeptide Y receptors 1, 2, 4, 5, the neuropeptide FF receptors 1 and 2, and the dopamine receptor 1. The labeled receptors are fully functional and able to bind the respective ligand with high affinity. Activity is not impaired as demonstrated by activation, internalization, and recycling experiments.
Deep sequencing reveals exceptional diversity and modes of transmission for bacterial sponge symbionts.

PubMed

Webster, Nicole S; Taylor, Michael W; Behnam, Faris; Lücker, Sebastian; Rattei, Thomas; Whalan, Stephen; Horn, Matthias; Wagner, Michael

2010-08-01

Marine sponges contain complex bacterial communities of considerable ecological and biotechnological importance, with many of these organisms postulated to be specific to sponge hosts. Testing this hypothesis in light of the recent discovery of the rare microbial biosphere, we investigated three Australian sponges by massively parallel 16S rRNA gene tag pyrosequencing. Here we show bacterial diversity that is unparalleled in an invertebrate host, with more than 250,000 sponge-derived sequence tags being assigned to 23 bacterial phyla and revealing up to 2996 operational taxonomic units (95% sequence similarity) per sponge species. Of the 33 previously described 'sponge-specific' clusters that were detected in this study, 48% were found exclusively in adults and larvae - implying vertical transmission of these groups. The remaining taxa, including 'Poribacteria', were also found at very low abundance among the 135,000 tags retrieved from surrounding seawater. Thus, members of the rare seawater biosphere may serve as seed organisms for widely occurring symbiont populations in sponges and their host association might have evolved much more recently than previously thought. © 2009 Society for Applied Microbiology and Blackwell Publishing Ltd.
Deep sequencing reveals exceptional diversity and modes of transmission for bacterial sponge symbionts

PubMed Central

Webster, Nicole S; Taylor, Michael W; Behnam, Faris; Lücker, Sebastian; Rattei, Thomas; Whalan, Stephen; Horn, Matthias; Wagner, Michael

2010-01-01

Marine sponges contain complex bacterial communities of considerable ecological and biotechnological importance, with many of these organisms postulated to be specific to sponge hosts. Testing this hypothesis in light of the recent discovery of the rare microbial biosphere, we investigated three Australian sponges by massively parallel 16S rRNA gene tag pyrosequencing. Here we show bacterial diversity that is unparalleled in an invertebrate host, with more than 250 000 sponge-derived sequence tags being assigned to 23 bacterial phyla and revealing up to 2996 operational taxonomic units (95% sequence similarity) per sponge species. Of the 33 previously described ‘sponge-specific’ clusters that were detected in this study, 48% were found exclusively in adults and larvae – implying vertical transmission of these groups. The remaining taxa, including ‘Poribacteria’, were also found at very low abundance among the 135 000 tags retrieved from surrounding seawater. Thus, members of the rare seawater biosphere may serve as seed organisms for widely occurring symbiont populations in sponges and their host association might have evolved much more recently than previously thought. PMID:21966903
Developmental staging of male murine embryonic gonad by SAGE analysis

PubMed Central

Lee, Tin-Lap; Li, Yunmin; Alba, Diana; Vong, Queenie P.; Wu, Shao-Ming; Baxendale, Vanessa; Rennert, Owen M.; Lau, Yun-Fai Chris; Chan, Wai-Yee

2012-01-01

Despite the identification of key genes such as Sry integral to embryonic gonadal development, the genomic classification and identification of chromosomal activation of this process is still poorly understood. To better understand the genetic regulation of gonadal development, we performed Serial Analysis of Gene Expression (SAGE) to profile the genes and novel transcripts, and an average of 152,000 tags from male embryonic gonads at E10.5 (embryonic day 10.5), E11.5, E12.5, E13.5, E15.5 and E17.5 were analyzed. A total of 275,583 non-singleton tags that do not map to any annotated sequence were identified in the six gonad libraries, and 47,255 tags were mapped to 24,975 annotated sequences, among which 987 sequences were uncharacterized. Utilizing an unsupervised pattern identification technique, we established molecular staging of male gonadal development. Rather than providing a static descriptive analysis, we developed algorithms to cluster the SAGE data and assign SAGE tags to a corresponding chromosomal position; these data are displayed in chromosome graphic format. A prominent increase in global genomic activity from E10.5 to E17.5 was observed. Important chromosomal regions related to the developmental processes were identified and validated based on established mouse models with developmental disorders. These regions may represent markers for early diagnosis for disorders of male gonad development as well as potential treatment targets. PMID:19376482
Analysis of SSR information in EST resources of sugarcane

USDA-ARS?s Scientific Manuscript database

Expressed sequence tags ( ESTs) offer the opportunity to exploit single, low -copy, conserved sequence motifs for the development of simple sequence repeats ( SSRs). The total of 262 113 ESTs of sugarcane (Saccharum officinarum) in the database of NCBI were downloaded and analyzed, which resulted in...
Characterization of microbial associations with methanotrophic archaea and sulfate-reducing bacteria through statistical comparison of nested Magneto-FISH enrichments

DOE Office of Scientific and Technical Information (OSTI.GOV)

Trembath-Reichert, Elizabeth; Case, David H.; Orphan, Victoria J.

Methane seep systems along continental margins host diverse and dynamic microbial assemblages, sustained in large part through the microbially mediated process of sulfate-coupled Anaerobic Oxidation of Methane (AOM). This methanotrophic metabolism has been linked to consortia of anaerobic methane-oxidizing archaea (ANME) and sulfate-reducing bacteria (SRB). These two groups are the focus of numerous studies; however, less is known about the wide diversity of other seep associated microorganisms. We selected a hierarchical set of FISH probes targeting a range ofDeltaproteobacteriadiversity. Using the Magneto-FISH enrichment technique, we then magnetically captured CARD-FISH hybridized cells and their physically associated microorganisms from a methane seepmore » sediment incubation. DNA from nested Magneto-FISH experiments was analyzed using Illumina tag 16S rRNA gene sequencing (iTag). Enrichment success and potential bias with iTag was evaluated in the context of full-length 16S rRNA gene clone libraries, CARD-FISH, functional gene clone libraries, and iTag mock communities. We determined commonly used Earth Microbiome Project (EMP) iTAG primers introduced bias in some common methane seep microbial taxa that reduced the ability to directly compare OTU relative abundances within a sample, but comparison of relative abundances between samples (in nearly all cases) and whole community-based analyses were robust. The iTag dataset was subjected to statistical co-occurrence measures of the most abundant OTUs to determine which taxa in this dataset were most correlated across all samples. In addition, many non-canonical microbial partnerships were statistically significant in our co-occurrence network analysis, most of which were not recovered with conventional clone library sequencing, demonstrating the utility of combining Magneto-FISH and iTag sequencing methods for hypothesis generation of associations within complex microbial communities. Network analysis pointed to many co-occurrences containing putatively heterotrophic, candidate phyla such as OD1, Atribacteria, MBG-B, and Hyd24-12 and the potential for complex sulfur cycling involving Epsilon-, Delta-, and Gammaproteobacteria in methane seep ecosystems.« less
Characterization of microbial associations with methanotrophic archaea and sulfate-reducing bacteria through statistical comparison of nested Magneto-FISH enrichments

DOE PAGES

Trembath-Reichert, Elizabeth; Case, David H.; Orphan, Victoria J.

2016-04-18

Methane seep systems along continental margins host diverse and dynamic microbial assemblages, sustained in large part through the microbially mediated process of sulfate-coupled Anaerobic Oxidation of Methane (AOM). This methanotrophic metabolism has been linked to consortia of anaerobic methane-oxidizing archaea (ANME) and sulfate-reducing bacteria (SRB). These two groups are the focus of numerous studies; however, less is known about the wide diversity of other seep associated microorganisms. We selected a hierarchical set of FISH probes targeting a range ofDeltaproteobacteriadiversity. Using the Magneto-FISH enrichment technique, we then magnetically captured CARD-FISH hybridized cells and their physically associated microorganisms from a methane seepmore » sediment incubation. DNA from nested Magneto-FISH experiments was analyzed using Illumina tag 16S rRNA gene sequencing (iTag). Enrichment success and potential bias with iTag was evaluated in the context of full-length 16S rRNA gene clone libraries, CARD-FISH, functional gene clone libraries, and iTag mock communities. We determined commonly used Earth Microbiome Project (EMP) iTAG primers introduced bias in some common methane seep microbial taxa that reduced the ability to directly compare OTU relative abundances within a sample, but comparison of relative abundances between samples (in nearly all cases) and whole community-based analyses were robust. The iTag dataset was subjected to statistical co-occurrence measures of the most abundant OTUs to determine which taxa in this dataset were most correlated across all samples. In addition, many non-canonical microbial partnerships were statistically significant in our co-occurrence network analysis, most of which were not recovered with conventional clone library sequencing, demonstrating the utility of combining Magneto-FISH and iTag sequencing methods for hypothesis generation of associations within complex microbial communities. Network analysis pointed to many co-occurrences containing putatively heterotrophic, candidate phyla such as OD1, Atribacteria, MBG-B, and Hyd24-12 and the potential for complex sulfur cycling involving Epsilon-, Delta-, and Gammaproteobacteria in methane seep ecosystems.« less
Characterization of microbial associations with methanotrophic archaea and sulfate-reducing bacteria through statistical comparison of nested Magneto-FISH enrichments.

PubMed

Trembath-Reichert, Elizabeth; Case, David H; Orphan, Victoria J

2016-01-01

Methane seep systems along continental margins host diverse and dynamic microbial assemblages, sustained in large part through the microbially mediated process of sulfate-coupled Anaerobic Oxidation of Methane (AOM). This methanotrophic metabolism has been linked to consortia of anaerobic methane-oxidizing archaea (ANME) and sulfate-reducing bacteria (SRB). These two groups are the focus of numerous studies; however, less is known about the wide diversity of other seep associated microorganisms. We selected a hierarchical set of FISH probes targeting a range of Deltaproteobacteria diversity. Using the Magneto-FISH enrichment technique, we then magnetically captured CARD-FISH hybridized cells and their physically associated microorganisms from a methane seep sediment incubation. DNA from nested Magneto-FISH experiments was analyzed using Illumina tag 16S rRNA gene sequencing (iTag). Enrichment success and potential bias with iTag was evaluated in the context of full-length 16S rRNA gene clone libraries, CARD-FISH, functional gene clone libraries, and iTag mock communities. We determined commonly used Earth Microbiome Project (EMP) iTAG primers introduced bias in some common methane seep microbial taxa that reduced the ability to directly compare OTU relative abundances within a sample, but comparison of relative abundances between samples (in nearly all cases) and whole community-based analyses were robust. The iTag dataset was subjected to statistical co-occurrence measures of the most abundant OTUs to determine which taxa in this dataset were most correlated across all samples. Many non-canonical microbial partnerships were statistically significant in our co-occurrence network analysis, most of which were not recovered with conventional clone library sequencing, demonstrating the utility of combining Magneto-FISH and iTag sequencing methods for hypothesis generation of associations within complex microbial communities. Network analysis pointed to many co-occurrences containing putatively heterotrophic, candidate phyla such as OD1, Atribacteria, MBG-B, and Hyd24-12 and the potential for complex sulfur cycling involving Epsilon-, Delta-, and Gammaproteobacteria in methane seep ecosystems.
Land, language, and loci: mtDNA in Native Americans and the genetic history of Peru.

PubMed

Lewis, Cecil M; Tito, Raúl Y; Lizárraga, Beatriz; Stone, Anne C

2005-07-01

Despite a long history of complex societies and despite extensive present-day linguistic and ethnic diversity, relatively few populations in Peru have been sampled for population genetic investigations. In order to address questions about the relationships between South American populations and about the extent of correlation between genetic distance, language, and geography in the region, mitochondrial DNA (mtDNA) hypervariable region I sequences and mtDNA haplogroup markers were examined in 33 individuals from the state of Ancash, Peru. These sequences were compared to those from 19 American Indian populations using diversity estimates, AMOVA tests, mismatch distributions, a multidimensional scaling plot, and regressions. The results show correlations between genetics, linguistics, and geographical affinities, with stronger correlations between genetics and language. Additionally, the results suggest a pattern of differential gene flow and drift in western vs. eastern South America, supporting previous mtDNA and Y chromosome investigations. (c) 2004 Wiley-Liss, Inc
DXS10011: studies on structure, allele distribution in three populations and genetic linkage to further q-telomeric chromosome X markers.

PubMed

Hering, Sandra; Brundirs, Nicola; Kuhlisch, Eberhard; Edelmann, Jeanett; Plate, Ines; Benecke, Mark; Van, Pham Hung; Michael, Matthias; Szibor, Reinhard

2004-12-01

The hypervariable tetranucleotide STR polymorphism DXS10011 is a powerful marker for forensic purposes. Investigation of this STR led to an allele nomenclature which is in consensus with the ISFG recommendations. DXS10011 is located at Xq28 and genetically closely linked to DXS7423 and DXS8377 but is unlinked to HPRTB and more distant X-chromosomal STRs. DXS10011 is a very complex marker exhibiting some structural variants within alleles of identical length. Two types of repeat structure (regular and inter-alleles) are known and described as types A and B. Two SNPs which are in strong linkage disequilibrium to the different sequence types were found in the repeat flanking region. The type A sequence consists of a long stretch of uninterrupted homogenous repeats which is highly susceptible to slippage mutation during male meiosis.
A reanalysis of the indirect evidence for recombination in human mitochondrial DNA.

PubMed

Piganeau, G; Eyre-Walker, A

2004-04-01

In an attempt to resolve the controversy about whether recombination occurs in human mtDNA, we have analysed three recently published data sets of complete mtDNA sequences along with 10 RFLP data sets. We have analysed the relationship between linkage disequilibrium (LD) and distance between sites under a variety of conditions using two measures of LD, r2 and /D'/. We find that there is a negative correlation between r2 and distance in the majority of data sets, but no overall trend for /D'/. Five out of six mtDNA sequence data sets show an excess of homoplasy, but this could be due to either recombination or hypervariable sites. Two additional recombination detection methods used, Geneconv and Maximum Chi-Square, showed nonsignificant results. The overall significance of these findings is hard to quantify because of nonindependence, but our results suggest a lack of evidence for recombination in human mtDNA.
Insights into mechanisms of bacterial antigenic variation derived from the complete genome sequence of Anaplasma marginale.

PubMed

Palmer, Guy H; Futse, James E; Knowles, Donald P; Brayton, Kelly A

2006-10-01

Persistence of Anaplasma spp. in the animal reservoir host is required for efficient tick-borne transmission of these pathogens to animals and humans. Using A. marginale infection of its natural reservoir host as a model, persistent infection has been shown to reflect sequential cycles in which antigenic variants emerge, replicate, and are controlled by the immune system. Variation in the immunodominant outer-membrane protein MSP2 is generated by a process of gene conversion, in which unique hypervariable region sequences (HVRs) located in pseudogenes are recombined into a single operon-linked msp2 expression site. Although organisms expressing whole HVRs derived from pseudogenes emerge early in infection, long-term persistent infection is dependent on the generation of complex mosaics in which segments from different HVRs recombine into the expression site. The resulting combinatorial diversity generates the number of variants both predicted and shown to emerge during persistence.
Microchannel DNA Sequencing by End-Labelled Free Solution Electrophoresis

DOE Office of Scientific and Technical Information (OSTI.GOV)

Barron, A.

2005-09-29

The further development of End-Labeled Free-Solution Electrophoresis will greatly simplify DNA separation and sequencing on microfluidic devices. The development and optimization of drag-tags is critical to the success of this research.
Differential expression of a novel gene during seed triacylglycerol accumulation in lupin species ( Lupinus angustifolius L. and L. mutabilis L.).

PubMed

Francki, Michael G; Whitaker, Peta; Smith, Penelope M; Atkins, Craig A

2002-11-01

Seed triacylglycerols (TAGs) are stored as energy reserves and extracted for various end-product uses. In lupins, seed oil content varies from 16% in Lupinus mutabilisto 8% in L. angustifolius. We have shown that TAGs rapidly accumulate during mid-stages of seed development in L. mutabilis compared to the lower seed oil species, L. angustifolius. In this study, we have targeted the key enzymes of the lipid biosynthetic pathway, acetyl-CoA carboxylase (ACCase) and diacylglycerol acyltransferase (DAGAT), to determine factors regulating TAG accumulation between two lupin species. A twofold increase in ACCase activity was observed in L. mutabilis relative to L. angustifolius and correlated with rapid TAG accumulation. No difference in DAGAT activity was detected. We have identified, cloned and partially characterised a novel gene differentially expressed during TAG accumulation between L. angustifolius and L. mutabilis. The gene has some identity to the glucose dehydrogenase family previously described in barley and bacteria and the significance of its expression levels during seed development in relation to TAG accumulation is discussed. DNA sequence analysis of the promoter in both L. angustifolius and L. mutabilis identified putative matrix attachment regions and recognition sequences for transcription binding sites similar to those found in the Adh1 gene from Arabidopsis. The identical promoter regions between species indicate that differential gene expression is controlled by alternative transcription factors, accessibility to binding sites or a combination of both.
Tab2, a novel recombinant polypeptide tag offering sensitive and specific protein detection and reliable affinity purification.

PubMed

Crusius, Kerstin; Finster, Silke; McClary, John; Xia, Wei; Larsen, Brent; Schneider, Douglas; Lu, Hong-Tao; Biancalana, Sara; Xuan, Jian-Ai; Newton, Alicia; Allen, Debbie; Bringmann, Peter; Cobb, Ronald R

2006-10-01

The detection and purification of proteins are often time-consuming and frequently involve complicated protocols. The addition of a peptide tag to recombinant proteins can make this process more efficient. Many of the commonly used tags, such as Flagtrade mark, Myc, HA and V5 are recognized by specific monoclonal antibodies and therefore, allow immunoaffinity-based purification. Enhancing the current scope of flexibility in using diverse peptide tags, we report here the development of a novel, short polypeptide tag (Tab2) for detection and purification of recombinant proteins. The Tab2 epitope corresponds to the NH2-terminal seven amino acid residues of human TGFalpha. A monoclonal anti-Tab2 antibody was raised and characterized. To investigate the potential of this peptide sequence as a novel tag for recombinant proteins, we expressed several different recombinant proteins containing this tag in E. coli, baculovirus, and mammalian cells. The data presented demonstrates the Tab2 tag-anti-Tab2 antibody combination is a reliable tool enabling specific Western blot detection, FACS analysis, and immunoprecipitation as well as non-denaturing protein affinity purification.

Cytogenetic and molecular markers for detecting Aegilops uniaristata chromosomes in a wheat background.

PubMed

Gong, Wenping; Li, Guangrong; Zhou, Jianping; Li, Genying; Liu, Cheng; Huang, Chengyan; Zhao, Zhendong; Yang, Zujun

2014-09-01

Aegilops uniaristata has many agronomically useful traits that can be used for wheat breeding. So far, a Triticum turgidum - Ae. uniaristata amphiploid and one set of Chinese Spring (CS) - Ae. uniaristata addition lines have been produced. To guide Ae. uniaristata chromatin transformation from these lines into cultivated wheat through chromosome engineering, reliable cytogenetic and molecular markers specific for Ae. uniaristata chromosomes need to be developed. Standard C-banding shows that C-bands mainly exist in the centromeric regions of Ae. uniaristata but rarely at the distal ends. Fluorescence in situ hybridization (FISH) using (GAA)8 as a probe showed that the hybridization signal of chromosomes 1N-7N are different, thus (GAA)8 can be used to identify all Ae. uniaristata chromosomes in wheat background simultaneously. Moreover, a total of 42 molecular markers specific for Ae. uniaristata chromosomes were developed by screening expressed sequence tag - sequence tagged site (EST-STS), expressed sequence tag - simple sequence repeat (EST-SSR), and PCR-based landmark unique gene (PLUG) primers. The markers were subsequently localized using the CS - Ae. uniaristata addition lines and different wheat cultivars as controls. The cytogenetic and molecular markers developed herein will be helpful for screening and identifying wheat - Ae. uniaristata progeny.
DNA sequencing using fluorescence background electroblotting membrane

DOEpatents

Caldwell, Karin D.; Chu, Tun-Jen; Pitt, William G.

1992-01-01

A method for the multiplex sequencing on DNA is disclosed which comprises the electroblotting or specific base terminated DNA fragments, which have been resolved by gel electrophoresis, onto the surface of a neutral non-aromatic polymeric microporous membrane exhibiting low background fluorescence which has been surface modified to contain amino groups. Polypropylene membranes are preferably and the introduction of amino groups is accomplished by subjecting the membrane to radio or microwave frequency plasma discharge in the presence of an aminating agent, preferably ammonia. The membrane, containing physically adsorbed DNA fragments on its surface after the electroblotting, is then treated with crosslinking means such as UV radiation or a glutaraldehyde spray to chemically bind the DNA fragments to the membrane through said smino groups contained on the surface thereof. The DNA fragments chemically bound to the membrane are subjected to hybridization probing with a tagged probe specific to the sequence of the DNA fragments. The tagging may be by either fluorophores or radioisotopes. The tagged probes hybridized to said target DNA fragments are detected and read by laser induced fluorescence detection or autoradiograms. The use of aminated low fluorescent background membranes allows the use of fluorescent detection and reading even when the available amount of DNA to be sequenced is small. The DNA bound to the membrances may be reprobed numerous times.
DNA sequencing using fluorescence background electroblotting membrane

DOEpatents

Caldwell, K.D.; Chu, T.J.; Pitt, W.G.

1992-05-12

A method for the multiplex sequencing on DNA is disclosed which comprises the electroblotting or specific base terminated DNA fragments, which have been resolved by gel electrophoresis, onto the surface of a neutral non-aromatic polymeric microporous membrane exhibiting low background fluorescence which has been surface modified to contain amino groups. Polypropylene membranes are preferably and the introduction of amino groups is accomplished by subjecting the membrane to radio or microwave frequency plasma discharge in the presence of an aminating agent, preferably ammonia. The membrane, containing physically adsorbed DNA fragments on its surface after the electroblotting, is then treated with crosslinking means such as UV radiation or a glutaraldehyde spray to chemically bind the DNA fragments to the membrane through amino groups contained on the surface. The DNA fragments chemically bound to the membrane are subjected to hybridization probing with a tagged probe specific to the sequence of the DNA fragments. The tagging may be by either fluorophores or radioisotopes. The tagged probes hybridized to the target DNA fragments are detected and read by laser induced fluorescence detection or autoradiograms. The use of aminated low fluorescent background membranes allows the use of fluorescent detection and reading even when the available amount of DNA to be sequenced is small. The DNA bound to the membranes may be reprobed numerous times. No Drawings
Mining and gene ontology based annotation of SSR markers from expressed sequence tags of Humulus lupulus

PubMed Central

Singh, Swati; Gupta, Sanchita; Mani, Ashutosh; Chaturvedi, Anoop

2012-01-01

Humulus lupulus is commonly known as hops, a member of the family moraceae. Currently many projects are underway leading to the accumulation of voluminous genomic and expressed sequence tag sequences in public databases. The genetically characterized domains in these databases are limited due to non-availability of reliable molecular markers. The large data of EST sequences are available in hops. The simple sequence repeat markers extracted from EST data are used as molecular markers for genetic characterization, in the present study. 25,495 EST sequences were examined and assembled to get full-length sequences. Maximum frequency distribution was shown by mononucleotide SSR motifs i.e. 60.44% in contig and 62.16% in singleton where as minimum frequency are observed for hexanucleotide SSR in contig (0.09%) and pentanucleotide SSR in singletons (0.12%). Maximum trinucleotide motifs code for Glutamic acid (GAA) while AT/TA were the most frequent repeat of dinucleotide SSRs. Flanking primer pairs were designed in-silico for the SSR containing sequences. Functional categorization of SSRs containing sequences was done through gene ontology terms like biological process, cellular component and molecular function. PMID:22368382
Generation and analysis of a barcode-tagged insertion mutant library in the fission yeast Schizosaccharomyces pombe.

PubMed

Chen, Bo-Ruei; Hale, Devin C; Ciolek, Peter J; Runge, Kurt W

2012-05-03

Barcodes are unique DNA sequence tags that can be used to specifically label individual mutants. The barcode-tagged open reading frame (ORF) haploid deletion mutant collections in the budding yeast Saccharomyces cerevisiae and the fission yeast Schizosaccharomyces pombe allow for high-throughput mutant phenotyping because the relative growth of mutants in a population can be determined by monitoring the proportions of their associated barcodes. While these mutant collections have greatly facilitated genome-wide studies, mutations in essential genes are not present, and the roles of these genes are not as easily studied. To further support genome-scale research in S. pombe, we generated a barcode-tagged fission yeast insertion mutant library that has the potential of generating viable mutations in both essential and non-essential genes and can be easily analyzed using standard molecular biological techniques. An insertion vector containing a selectable ura4+ marker and a random barcode was used to generate a collection of 10,000 fission yeast insertion mutants stored individually in 384-well plates and as six pools of mixed mutants. Individual barcodes are flanked by Sfi I recognition sites and can be oligomerized in a unique orientation to facilitate barcode sequencing. Independent genetic screens on a subset of mutants suggest that this library contains a diverse collection of single insertion mutations. We present several approaches to determine insertion sites. This collection of S. pombe barcode-tagged insertion mutants is well-suited for genome-wide studies. Because insertion mutations may eliminate, reduce or alter the function of essential and non-essential genes, this library will contain strains with a wide range of phenotypes that can be assayed by their associated barcodes. The design of the barcodes in this library allows for barcode sequencing using next generation or standard benchtop cloning approaches.
Insight into the coordination and the binding sites of Cu(2+) by the histidyl-6-tag using experimental and computational tools.

PubMed

Watly, Joanna; Simonovsky, Eyal; Wieczorek, Robert; Barbosa, Nuno; Miller, Yifat; Kozlowski, Henryk

2014-07-07

His-tags are specific sequences containing six to nine subsequent histydyl residues, and they are used for purification of recombinant proteins by use of IMAC chromatography. Such polyhistydyl tags, often used in molecular biology, can be also found in nature. Proteins containing histidine-rich domains play a critical role in many life functions in both prokaryote and eukaryote organisms. Binding mode and the thermodynamic properties of the system depend on the specific metal ion and the histidine sequence. Despite the wide application of the His-tag for purification of proteins, little is known about the properties of metal-binding to such tag domains. This inspired us to undertake detailed studies on the coordination of Cu(2+) ion to hexa-His-tag. Experiments were performed using the potentiometric, UV-visible, CD, and EPR techniques. In addition, molecular dynamics (MD) simulations and density functional theory (DFT) calculations were applied. The experimental studies have shown that the Cu(2+) ion binds most likely to two imidazoles and one, two, or three amide nitrogens, depending on the pH. The structures and stabilities of the complexes for the Cu(2+)-Ac-(His)6-NH2 system using experimental and computational tools were established. Polymorphic binding states are suggested, with a possibility of the formation of α-helix structure induced by metal ion coordination. Metal ion is bound to various pairs of imidazole moieties derived from the tag with different efficiencies. The coordination sphere around the metal ion is completed by molecules of water. Finally, the Cu(2+) binding by Ac-(His)6-NH2 is much more efficient compared to other multihistidine protein domains.
Short hypervariable microhaplotypes: A novel set of very short high discriminating power loci without stutter artefacts.

PubMed

van der Gaag, Kristiaan J; de Leeuw, Rick H; Laros, Jeroen F J; den Dunnen, Johan T; de Knijff, Peter

2018-07-01

Since two decades, short tandem repeats (STRs) are the preferred markers for human identification, routinely analysed by fragment length analysis. Here we present a novel set of short hypervariable autosomal microhaplotypes (MH) that have four or more SNPs in a span of less than 70 nucleotides (nt). These MHs display a discriminating power approaching that of STRs and provide a powerful alternative for the analysis;1;is of forensic samples that are problematic when the STR fragment size range exceeds the integrity range of severely degraded DNA or when multiple donors contribute to an evidentiary stain and STR stutter artefacts complicate profile interpretation. MH typing was developed using the power of massively parallel sequencing (MPS) enabling new powerful, fast and efficient SNP-based approaches. MH candidates were obtained from queries in data of the 1000 Genomes, and Genome of the Netherlands (GoNL) projects. Wet-lab analysis of 276 globally dispersed samples and 97 samples of nine large CEPH families assisted locus selection and corroboration of informative value. We infer that MHs represent an alternative marker type with good discriminating power per locus (allowing the use of a limited number of loci), small amplicon sizes and absence of stutter artefacts that can be especially helpful when unbalanced mixed samples are submitted for human identification. Copyright © 2018 The Authors. Published by Elsevier B.V. All rights reserved.
Generation of a total of 6483 expressed sequence tags from 60 day-old bovine whole fetus and fetal placenta.

PubMed

Oishi, M; Gohma, H; Lejukole, H Y; Taniguchi, Y; Yamada, T; Suzuki, K; Shinkai, H; Uenishi, H; Yasue, H; Sasaki, Y

2004-05-01

Expressed sequence tags (ESTs) generated based on characterization of clones isolated randomly from cDNA libraries are used to study gene expression profiles in specific tissues and to provide useful information for characterizing tissue physiology. In this study, two directionally cloned cDNA libraries were constructed from 60 day-old bovine whole fetus and fetal placenta. We have characterized 5357 and 1126 clones, and then identified 3464 and 795 unique sequences for the fetus and placenta cDNA libraries: 1851 and 504 showed homology to already identified genes, and 1613 and 291 showed no significant matches to any of the sequences in DNA databases, respectively. Further, we found 94 unique sequences overlapping in both the fetus and the placenta, leading to a catalog of 4165 genes expressed in 60 day-old fetus and placenta. The catalog is used to examine expression profile of genes in 60 day-old bovine fetus and placenta.
Analysis and functional annotation of expressed sequence tags from the fall armyworm Spodoptera frugiperda

PubMed Central

Deng, Youping; Dong, Yinghua; Thodima, Venkata; Clem, Rollie J; Passarelli, A Lorena

2006-01-01

Background Little is known about the genome sequences of lepidopteran insects, although this group of insects has been studied extensively in the fields of endocrinology, development, immunity, and pathogen-host interactions. In addition, cell lines derived from Spodoptera frugiperda and other lepidopteran insects are routinely used for baculovirus foreign gene expression. This study reports the results of an expressed sequence tag (EST) sequencing project in cells from the lepidopteran insect S. frugiperda, the fall armyworm. Results We have constructed an EST database using two cDNA libraries from the S. frugiperda-derived cell line, SF-21. The database consists of 2,367 ESTs which were assembled into 244 contigs and 951 singlets for a total of 1,195 unique sequences. Conclusion S. frugiperda is an agriculturally important pest insect and genomic information will be instrumental for establishing initial transcriptional profiling and gene function studies, and for obtaining information about genes manipulated during infections by insect pathogens such as baculoviruses. PMID:17052344
Illuminator, a desktop program for mutation detection using short-read clonal sequencing.

PubMed

Carr, Ian M; Morgan, Joanne E; Diggle, Christine P; Sheridan, Eamonn; Markham, Alexander F; Logan, Clare V; Inglehearn, Chris F; Taylor, Graham R; Bonthron, David T

2011-10-01

Current methods for sequencing clonal populations of DNA molecules yield several gigabases of data per day, typically comprising reads of < 100 nt. Such datasets permit widespread genome resequencing and transcriptome analysis or other quantitative tasks. However, this huge capacity can also be harnessed for the resequencing of smaller (gene-sized) target regions, through the simultaneous parallel analysis of multiple subjects, using sample "tagging" or "indexing". These methods promise to have a huge impact on diagnostic mutation analysis and candidate gene testing. Here we describe a software package developed for such studies, offering the ability to resolve pooled samples carrying barcode tags and to align reads to a reference sequence using a mutation-tolerant process. The program, Illuminator, can identify rare sequence variants, including insertions and deletions, and permits interactive data analysis on standard desktop computers. It facilitates the effective analysis of targeted clonal sequencer data without dedicated computational infrastructure or specialized training. Copyright © 2011 Elsevier Inc. All rights reserved.
A universal TagModule collection for parallel genetic analysis of microorganisms

PubMed Central

Oh, Julia; Fung, Eula; Price, Morgan N.; Dehal, Paramvir S.; Davis, Ronald W.; Giaever, Guri; Nislow, Corey; Arkin, Adam P.; Deutschbauer, Adam

2010-01-01

Systems-level analyses of non-model microorganisms are limited by the existence of numerous uncharacterized genes and a corresponding over-reliance on automated computational annotations. One solution to this challenge is to disrupt gene function using DNA tag technology, which has been highly successful in parallelizing reverse genetics in Saccharomyces cerevisiae and has led to discoveries in gene function, genetic interactions and drug mechanism of action. To extend the yeast DNA tag methodology to a wide variety of microorganisms and applications, we have created a universal, sequence-verified TagModule collection. A hallmark of the 4280 TagModules is that they are cloned into a Gateway entry vector, thus facilitating rapid transfer to any compatible genetic system. Here, we describe the application of the TagModules to rapidly generate tagged mutants by transposon mutagenesis in the metal-reducing bacterium Shewanella oneidensis MR-1 and the pathogenic yeast Candida albicans. Our results demonstrate the optimal hybridization properties of the TagModule collection, the flexibility in applying the strategy to diverse microorganisms and the biological insights that can be gained from fitness profiling tagged mutant collections. The publicly available TagModule collection is a platform-independent resource for the functional genomics of a wide range of microbial systems in the post-genome era. PMID:20494978
Exploiting EST databases for the development and characterisation of 3425 gene-tagged CISP markers in biofuel crop sugarcane and their transferability in cereals and orphan tropical grasses.

PubMed

Chandra, Amaresh; Jain, Radha; Solomon, Sushil; Shrivastava, Shiksha; Roy, Ajoy K

2013-02-04

Sugarcane is an important cash crop, providing 70% of the global raw sugar as well as raw material for biofuel production. Genetic analysis is hindered in sugarcane because of its large and complex polyploid genome and lack of sufficiently informative gene-tagged markers. Modern genomics has produced large amount of ESTs, which can be exploited to develop molecular markers based on comparative analysis with EST datasets of related crops and whole rice genome sequence, and accentuate their cross-technical functionality in orphan crops like tropical grasses. Utilising 246,180 Saccharum officinarum EST sequences vis-à-vis its comparative analysis with ESTs of sorghum and barley and the whole rice genome sequence, we have developed 3425 novel gene-tagged markers - namely, conserved-intron scanning primers (CISP) - using the web program GeMprospector. Rice orthologue annotation results indicated homology of 1096 sequences with expressed proteins, 491 with hypothetical proteins. The remaining 1838 were miscellaneous in nature. A total of 367 primer-pairs were tested in diverse panel of samples. The data indicate amplification of 41% polymorphic bands leading to 0.52 PIC and 3.50 MI with a set of sugarcane varieties and Saccharum species. In addition, a moderate technical functionality of a set of such markers with orphan tropical grasses (22%) and fodder cum cereal oat (33%) is observed. Developed gene-tagged CISP markers exhibited considerable technical functionality with varieties of sugarcane and unexplored species of tropical grasses. These markers would thus be particularly useful in identifying the economical traits in sugarcane and developing conservation strategies for orphan tropical grasses.
Multiple tag labeling method for DNA sequencing

DOEpatents

Mathies, Richard A.; Huang, Xiaohua C.; Quesada, Mark A.

1995-01-01

A DNA sequencing method described which uses single lane or channel electrophoresis. Sequencing fragments are separated in said lane and detected using a laser-excited, confocal fluorescence scanner. Each set of DNA sequencing fragments is separated in the same lane and then distinguished using a binary coding scheme employing only two different fluorescent labels. Also described is a method of using radio-isotope labels.
Primer and platform effects on 16S rRNA tag sequencing

DOE Office of Scientific and Technical Information (OSTI.GOV)

Tremblay, Julien; Singh, Kanwar; Fern, Alison

Sequencing of 16S rRNA gene tags is a popular method for profiling and comparing microbial communities. The protocols and methods used, however, vary considerably with regard to amplification primers, sequencing primers, sequencing technologies; as well as quality filtering and clustering. How results are affected by these choices, and whether data produced with different protocols can be meaningfully compared, is often unknown. Here we compare results obtained using three different amplification primer sets (targeting V4, V6–V8, and V7–V8) and two sequencing technologies (454 pyrosequencing and Illumina MiSeq) using DNA from a mock community containing a known number of species as wellmore » as complex environmental samples whose PCR-independent profiles were estimated using shotgun sequencing. We find that paired-end MiSeq reads produce higher quality data and enabled the use of more aggressive quality control parameters over 454, resulting in a higher retention rate of high quality reads for downstream data analysis. While primer choice considerably influences quantitative abundance estimations, sequencing platform has relatively minor effects when matched primers are used. In conclusion, beta diversity metrics are surprisingly robust to both primer and sequencing platform biases.« less
Primer and platform effects on 16S rRNA tag sequencing

DOE PAGES

Tremblay, Julien; Singh, Kanwar; Fern, Alison; ...

2015-08-04

Sequencing of 16S rRNA gene tags is a popular method for profiling and comparing microbial communities. The protocols and methods used, however, vary considerably with regard to amplification primers, sequencing primers, sequencing technologies; as well as quality filtering and clustering. How results are affected by these choices, and whether data produced with different protocols can be meaningfully compared, is often unknown. Here we compare results obtained using three different amplification primer sets (targeting V4, V6–V8, and V7–V8) and two sequencing technologies (454 pyrosequencing and Illumina MiSeq) using DNA from a mock community containing a known number of species as wellmore » as complex environmental samples whose PCR-independent profiles were estimated using shotgun sequencing. We find that paired-end MiSeq reads produce higher quality data and enabled the use of more aggressive quality control parameters over 454, resulting in a higher retention rate of high quality reads for downstream data analysis. While primer choice considerably influences quantitative abundance estimations, sequencing platform has relatively minor effects when matched primers are used. In conclusion, beta diversity metrics are surprisingly robust to both primer and sequencing platform biases.« less
Rapid in silico cloning of genes using expressed sequence tags (ESTs).

PubMed

Gill, R W; Sanseau, P

2000-01-01

Expressed sequence tags (ESTs) are short single-pass DNA sequences obtained from either end of cDNA clones. These ESTs are derived from a vast number of cDNA libraries obtained from different species. Human ESTs are the bulk of the data and have been widely used to identify new members of gene families, as markers on the human chromosomes, to discover polymorphism sites and to compare expression patterns in different tissues or pathologies states. Information strategies have been devised to query EST databases. Since most of the analysis is performed with a computer, the term "in silico" strategy has been coined. In this chapter we will review the current status of EST databases, the pros and cons of EST-type data and describe possible strategies to retrieve meaningful information.
Genome-Wide Analysis of Differentially Expressed Genes Relevant to Rhizome Formation in Lotus Root (Nelumbo nucifera Gaertn)

PubMed Central

Yin, Jingjing; Li, Liangjun; Chen, Xuehao

2013-01-01

Lotus root is a popular wetland vegetable which produces edible rhizome. At the molecular level, the regulation of rhizome formation is very complex, which has not been sufficiently addressed in research. In this study, to identify differentially expressed genes (DEGs) in lotus root, four libraries (L1 library: stolon stage, L2 library: initial swelling stage, L3 library: middle swelling stage, L4: later swelling stage) were constructed from the rhizome development stages. High-throughput tag-sequencing technique was used which is based on Solexa Genome Analyzer Platform. Approximately 5.0 million tags were sequenced, and 4542104, 4474755, 4777919, and 4750348 clean tags including 151282, 137476, 215872, and 166005 distinct tags were obtained after removal of low quality tags from each library respectively. More than 43% distinct tags were unambiguous tags mapping to the reference genes, and 40% were unambiguous tag-mapped genes. From L1, L2, L3, and L4, total 20471, 18785, 23448, and 21778 genes were annotated, after mapping their functions in existing databases. Profiling of gene expression in L1/L2, L2/L3, and L3/L4 libraries were different among most of the selected 20 DEGs. Most of the DEGs in L1/L2 libraries were relevant to fiber development and stress response, while in L2/L3 and L3/L4 libraries, major of the DEGs were involved in metabolism of energy and storage. All up-regulated transcriptional factors in four libraries and 14 important rhizome formation-related genes in four libraries were also identified. In addition, the expression of 9 genes from identified DEGs was performed by qRT-PCR method. In a summary, this study provides a comprehensive understanding of gene expression during the rhizome formation in lotus root. PMID:23840598
Developing expressed sequence tag libraries and the discovery of simple sequence repeat markers for two species of raspberry (Rubus L.)

USDA-ARS?s Scientific Manuscript database

Background: Due to a relatively high level of codominant inheritance and transferability within and among taxonomic groups, simple sequence repeat (SSR) markers are important elements in comparative mapping and delineation of genomic regions associated with traits of economic importance. Expressed S...
Development of an Expressed Sequence Tag (EST) Resource for Wheat (Triticum aestivum L.)

PubMed Central

Lazo, G. R.; Chao, S.; Hummel, D. D.; Edwards, H.; Crossman, C. C.; Lui, N.; Matthews, D. E.; Carollo, V. L.; Hane, D. L.; You, F. M.; Butler, G. E.; Miller, R. E.; Close, T. J.; Peng, J. H.; Lapitan, N. L. V.; Gustafson, J. P.; Qi, L. L.; Echalier, B.; Gill, B. S.; Dilbirligi, M.; Randhawa, H. S.; Gill, K. S.; Greene, R. A.; Sorrells, M. E.; Akhunov, E. D.; Dvořák, J.; Linkiewicz, A. M.; Dubcovsky, J.; Hossain, K. G.; Kalavacharla, V.; Kianian, S. F.; Mahmoud, A. A.; Miftahudin; Ma, X.-F.; Conley, E. J.; Anderson, J. A.; Pathan, M. S.; Nguyen, H. T.; McGuire, P. E.; Qualset, C. O.; Anderson, O. D.

2004-01-01

This report describes the rationale, approaches, organization, and resource development leading to a large-scale deletion bin map of the hexaploid (2n = 6x = 42) wheat genome (Triticum aestivum L.). Accompanying reports in this issue detail results from chromosome bin-mapping of expressed sequence tags (ESTs) representing genes onto the seven homoeologous chromosome groups and a global analysis of the entire mapped wheat EST data set. Among the resources developed were the first extensive public wheat EST collection (113,220 ESTs). Described are protocols for sequencing, sequence processing, EST nomenclature, and the assembly of ESTs into contigs. These contigs plus singletons (unassembled ESTs) were used for selection of distinct sequence motif unigenes. Selected ESTs were rearrayed, validated by 5′ and 3′ sequencing, and amplified for probing a series of wheat aneuploid and deletion stocks. Images and data for all Southern hybridizations were deposited in databases and were used by the coordinators for each of the seven homoeologous chromosome groups to validate the mapping results. Results from this project have established the foundation for future developments in wheat genomics. PMID:15514037
Fluorescent signatures for variable DNA sequences

PubMed Central

Rice, John E.; Reis, Arthur H.; Rice, Lisa M.; Carver-Brown, Rachel K.; Wangh, Lawrence J.

2012-01-01

Life abounds with genetic variations writ in sequences that are often only a few hundred nucleotides long. Rapid detection of these variations for identification of genetic diseases, pathogens and organisms has become the mainstay of molecular science and medicine. This report describes a new, highly informative closed-tube polymerase chain reaction (PCR) strategy for analysis of both known and unknown sequence variations. It combines efficient quantitative amplification of single-stranded DNA targets through LATE-PCR with sets of Lights-On/Lights-Off probes that hybridize to their target sequences over a broad temperature range. Contiguous pairs of Lights-On/Lights-Off probes of the same fluorescent color are used to scan hundreds of nucleotides for the presence of mutations. Sets of probes in different colors can be combined in the same tube to analyze even longer single-stranded targets. Each set of hybridized Lights-On/Lights-Off probes generates a composite fluorescent contour, which is mathematically converted to a sequence-specific fluorescent signature. The versatility and broad utility of this new technology is illustrated in this report by characterization of variant sequences in three different DNA targets: the rpoB gene of Mycobacterium tuberculosis, a sequence in the mitochondrial cytochrome C oxidase subunit 1 gene of nematodes and the V3 hypervariable region of the bacterial 16 s ribosomal RNA gene. We anticipate widespread use of these technologies for diagnostics, species identification and basic research. PMID:22879378

Sequence variation and phylogenetic analysis of envelope glycoprotein of hepatitis G virus.

PubMed

Lim, M Y; Fry, K; Yun, A; Chong, S; Linnen, J; Fung, K; Kim, J P

1997-11-01

A transfusion-transmissible agent provisionally designated hepatitis G virus (HGV) was recently identified. In this study, we examined the variability of the HGV genome by analysing sequences in the putative envelope region from 72 isolates obtained from diverse geographical sources. The 1561 nucleotide sequence of the E1/E2/NS2a region of HGV was determined from 12 isolates, and compared with three published sequences. The most variability was observed in 400 nucleotides at the N terminus of E2. We next analysed this 400 nucleotide envelope variable region (EV) from an additional 60 HGV isolates. This sequence varied considerably among the 75 isolates, with overall identity ranging from 79.3% to 99.5% at the nucleotide level, and from 83.5% to 100% at the amino acid level. However, hypervariable regions were not identified. Phylogenetic analyses indicated that the 75 HGV isolates belong to a single genotype. A single-tier distribution of evolutionary distances was observed among the 15 E1/E2/NS2a sequences and the 75 EV sequences. In contrast, 11 isolates of HCV were analysed and showed a three-tiered distribution, representing genotypes, subtypes, and isolates. The 75 isolates of HGV fell into four clusters on the phylogenetic tree. Tight geographical clustering was observed among the HGV isolates from Japan and Korea.
DSAP: deep-sequencing small RNA analysis pipeline.

PubMed

Huang, Po-Jung; Liu, Yi-Chung; Lee, Chi-Ching; Lin, Wei-Chen; Gan, Richie Ruei-Chi; Lyu, Ping-Chiang; Tang, Petrus

2010-07-01

DSAP is an automated multiple-task web service designed to provide a total solution to analyzing deep-sequencing small RNA datasets generated by next-generation sequencing technology. DSAP uses a tab-delimited file as an input format, which holds the unique sequence reads (tags) and their corresponding number of copies generated by the Solexa sequencing platform. The input data will go through four analysis steps in DSAP: (i) cleanup: removal of adaptors and poly-A/T/C/G/N nucleotides; (ii) clustering: grouping of cleaned sequence tags into unique sequence clusters; (iii) non-coding RNA (ncRNA) matching: sequence homology mapping against a transcribed sequence library from the ncRNA database Rfam (http://rfam.sanger.ac.uk/); and (iv) known miRNA matching: detection of known miRNAs in miRBase (http://www.mirbase.org/) based on sequence homology. The expression levels corresponding to matched ncRNAs and miRNAs are summarized in multi-color clickable bar charts linked to external databases. DSAP is also capable of displaying miRNA expression levels from different jobs using a log(2)-scaled color matrix. Furthermore, a cross-species comparative function is also provided to show the distribution of identified miRNAs in different species as deposited in miRBase. DSAP is available at http://dsap.cgu.edu.tw.
Identification and molecular analysis of infectious bursal disease in broiler farms in the Kurdistan Regional Government of Iraq.

PubMed

Amin, Oumed Gerjis M; Jackwood, Daral J

2014-10-01

The present study was undertaken to characterize field isolates of infectious bursal disease virus (IBDV). The identification was done using reverse transcription-polymerase chain reaction (RT-PCR) and partial sequencing of the VP2 gene. Pooled bursal samples were collected from commercial broiler farms located in the Kurdistan Regional Government (KRG) of Iraq. The genetic material of the IBDV was detected in 10 out of 29 field samples. Sequences of the hypervariable VP2 region were determined for 10 of these viruses. Molecular analysis of the VP2 gene of five IBDVs showed amino acid sequences consistent with the very virulent (vv) IBDV. Two samples were identified as classic vaccine viruses, and three samples were classic vaccine viruses that appear to have mutated during replication in the field. Phylogenetic analysis showed that all five field IBDV strains of the present study were closely related to each other. On the basis of nucleotide sequencing and phylogenetic analysis, it is very likely that IBD-causing viruses in this part of Iraq are of the very virulent type. These IBDVs appear to be evolving relative to their type strains.
Sequence-length variation of mtDNA HVS-I C-stretch in Chinese ethnic groups.

PubMed

Chen, Feng; Dang, Yong-hui; Yan, Chun-xia; Liu, Yan-ling; Deng, Ya-jun; Fulton, David J R; Chen, Teng

2009-10-01

The purpose of this study was to investigate mitochondrial DNA (mtDNA) hypervariable segment-I (HVS-I) C-stretch variations and explore the significance of these variations in forensic and population genetics studies. The C-stretch sequence variation was studied in 919 unrelated individuals from 8 Chinese ethnic groups using both direct and clone sequencing approaches. Thirty eight C-stretch haplotypes were identified, and some novel and population specific haplotypes were also detected. The C-stretch genetic diversity (GD) values were relatively high, and probability (P) values were low. Additionally, C-stretch length heteroplasmy was observed in approximately 9% of individuals studied. There was a significant correlation (r=-0.961, P<0.01) between the expansion of the cytosine sequence length in the C-stretch of HVS-I and a reduction in the number of upstream adenines. These results indicate that the C-stretch could be a useful genetic maker in forensic identification of Chinese populations. The results from the Fst and dA genetic distance matrix, neighbor-joining tree, and principal component map also suggest that C-stretch could be used as a reliable genetic marker in population genetics.
Transferable green fluorescence-tagged pEI2 in Edwardsiella ictaluri

USDA-ARS?s Scientific Manuscript database

The pEI2 plasmid of Edwardsiella ictaluri isolate, I49, was tagged using a Tn10-GFP-kan cassette to create the green fluorescence-expressing derivative I49-gfp. The Tn10-GFP-kan insertion site was mapped by plasmid sequencing to 663 bp upstream of orf2 and appeared to be at a neutral site in the pla...
Digital gene expression analysis with sample multiplexing and PCR duplicate detection: A straightforward protocol.

PubMed

Rozenberg, Andrey; Leese, Florian; Weiss, Linda C; Tollrian, Ralph

2016-01-01

Tag-Seq is a high-throughput approach used for discovering SNPs and characterizing gene expression. In comparison to RNA-Seq, Tag-Seq eases data processing and allows detection of rare mRNA species using only one tag per transcript molecule. However, reduced library complexity raises the issue of PCR duplicates, which distort gene expression levels. Here we present a novel Tag-Seq protocol that uses the least biased methods for RNA library preparation combined with a novel approach for joint PCR template and sample labeling. In our protocol, input RNA is fragmented by hydrolysis, and poly(A)-bearing RNAs are selected and directly ligated to mixed DNA-RNA P5 adapters. The P5 adapters contain i5 barcodes composed of sample-specific (moderately) degenerate base regions (mDBRs), which later allow detection of PCR duplicates. The P7 adapter is attached via reverse transcription with individual i7 barcodes added during the amplification step. The resulting libraries can be sequenced on an Illumina sequencer. After sample demultiplexing and PCR duplicate removal with a free software tool we designed, the data are ready for downstream analysis. Our protocol was tested on RNA samples from predator-induced and control Daphnia microcrustaceans.
Stable isotope, site-specific mass tagging for protein identification

DOEpatents

Chen, Xian

2006-10-24

Proteolytic peptide mass mapping as measured by mass spectrometry provides an important method for the identification of proteins, which are usually identified by matching the measured and calculated m/z values of the proteolytic peptides. A unique identification is, however, heavily dependent upon the mass accuracy and sequence coverage of the fragment ions generated by peptide ionization. The present invention describes a method for increasing the specificity, accuracy and efficiency of the assignments of particular proteolytic peptides and consequent protein identification, by the incorporation of selected amino acid residue(s) enriched with stable isotope(s) into the protein sequence without the need for ultrahigh instrumental accuracy. Selected amino acid(s) are labeled with .sup.13C/.sup.15N/.sup.2H and incorporated into proteins in a sequence-specific manner during cell culturing. Each of these labeled amino acids carries a defined mass change encoded in its monoisotopic distribution pattern. Through their characteristic patterns, the peptides with mass tag(s) can then be readily distinguished from other peptides in mass spectra. The present method of identifying unique proteins can also be extended to protein complexes and will significantly increase data search specificity, efficiency and accuracy for protein identifications.
Molecular characterization of infectious bursal disease viruses from Pakistan.

PubMed

Shabbir, Muhammad Zubair; Ali, Muhammad; Abbas, Muhammad; Chaudhry, Umer Naveed; Zia-Ur-Rehman; Munir, Muhammad

2016-07-01

Since the first report of infectious bursal disease in Pakistan in 1987, outbreaks have been common even in vaccinated flocks. Despite appropriate administration of vaccines, concerns arise if the circulating strains are different from the ones used in the vaccine. Here, we sequenced the hypervariable region (HVR) of the VP2 gene of circulating strains of infectious bursal disease virus (IBDV) originating from outbreaks (n = 4) in broiler flocks in Pakistan. Nucleotide sequencing followed by phylogeny and deduced amino acid sequence analysis showed the circulating strains to be very virulent (vv) and identified characteristic residues at position 222 (A), 242 (I), 256 (I), 294 (I) and 299 (S). In addition, a substitution at positions 221 (Q→H) was found to be exclusive to Pakistani strains in our analysis, although a larger dataset is required to confirm this finding. Compared to vaccine strains that are commonly used in Pakistan, substitution mutations were found at key amino acid positions in VP2 that may be responsible for potential changes in neutralization epitopes and vaccine failure.
Improved Selection of Internal Transcribed Spacer-Specific Primers Enables Quantitative, Ultra-High-Throughput Profiling of Fungal Communities

PubMed Central

Bokulich, Nicholas A.

2013-01-01

Ultra-high-throughput sequencing (HTS) of fungal communities has been restricted by short read lengths and primer amplification bias, slowing the adoption of newer sequencing technologies to fungal community profiling. To address these issues, we evaluated the performance of several common internal transcribed spacer (ITS) primers and designed a novel primer set and work flow for simultaneous quantification and species-level interrogation of fungal consortia. Primer comparison and validation were predicted in silico and by sequencing a “mock community” of mixed yeast species to explore the challenges of amplicon length and amplification bias for reconstructing defined yeast community structures. The amplicon size and distribution of this primer set are smaller than for all preexisting ITS primer sets, maximizing sequencing coverage of hypervariable ITS domains by very-short-amplicon, high-throughput sequencing platforms. This feature also enables the optional integration of quantitative PCR (qPCR) directly into the HTS preparatory work flow by substituting qPCR with these primers for standard PCR, yielding quantification of individual community members. The complete work flow described here, utilizing any of the qualified primer sets evaluated, can rapidly profile mixed fungal communities and capably reconstructed well-characterized beer and wine fermentation fungal communities. PMID:23377949
Nested Machine Learning Facilitates Increased Sequence Content for Large-Scale Automated High Resolution Melt Genotyping

PubMed Central

Fraley, Stephanie I.; Athamanolap, Pornpat; Masek, Billie J.; Hardick, Justin; Carroll, Karen C.; Hsieh, Yu-Hsiang; Rothman, Richard E.; Gaydos, Charlotte A.; Wang, Tza-Huei; Yang, Samuel

2016-01-01

High Resolution Melt (HRM) is a versatile and rapid post-PCR DNA analysis technique primarily used to differentiate sequence variants among only a few short amplicons. We recently developed a one-vs-one support vector machine algorithm (OVO SVM) that enables the use of HRM for identifying numerous short amplicon sequences automatically and reliably. Herein, we set out to maximize the discriminating power of HRM + SVM for a single genetic locus by testing longer amplicons harboring significantly more sequence information. Using universal primers that amplify the hypervariable bacterial 16 S rRNA gene as a model system, we found that long amplicons yield more complex HRM curve shapes. We developed a novel nested OVO SVM approach to take advantage of this feature and achieved 100% accuracy in the identification of 37 clinically relevant bacteria in Leave-One-Out-Cross-Validation. A subset of organisms were independently tested. Those from pure culture were identified with high accuracy, while those tested directly from clinical blood bottles displayed more technical variability and reduced accuracy. Our findings demonstrate that long sequences can be accurately and automatically profiled by HRM with a novel nested SVM approach and suggest that clinical sample testing is feasible with further optimization. PMID:26778280
Multiple tag labeling method for DNA sequencing

DOEpatents

Mathies, R.A.; Huang, X.C.; Quesada, M.A.

1995-07-25

A DNA sequencing method is described which uses single lane or channel electrophoresis. Sequencing fragments are separated in the lane and detected using a laser-excited, confocal fluorescence scanner. Each set of DNA sequencing fragments is separated in the same lane and then distinguished using a binary coding scheme employing only two different fluorescent labels. Also described is a method of using radioisotope labels. 5 figs.
Identification of human chromosome 22 transcribed sequences with ORF expressed sequence tags

PubMed Central

de Souza, Sandro J.; Camargo, Anamaria A.; Briones, Marcelo R. S.; Costa, Fernando F.; Nagai, Maria Aparecida; Verjovski-Almeida, Sergio; Zago, Marco A.; Andrade, Luis Eduardo C.; Carrer, Helaine; El-Dorry, Hamza F. A.; Espreafico, Enilza M.; Habr-Gama, Angelita; Giannella-Neto, Daniel; Goldman, Gustavo H.; Gruber, Arthur; Hackel, Christine; Kimura, Edna T.; Maciel, Rui M. B.; Marie, Suely K. N.; Martins, Elizabeth A. L.; Nóbrega, Marina P.; Paçó-Larson, Maria Luisa; Pardini, Maria Inês M. C.; Pereira, Gonçalo G.; Pesquero, João Bosco; Rodrigues, Vanderlei; Rogatto, Silvia R.; da Silva, Ismael D. C. G.; Sogayar, Mari C.; de Fátima Sonati, Maria; Tajara, Eloiza H.; Valentini, Sandro R.; Acencio, Marcio; Alberto, Fernando L.; Amaral, Maria Elisabete J.; Aneas, Ivy; Bengtson, Mário Henrique; Carraro, Dirce M.; Carvalho, Alex F.; Carvalho, Lúcia Helena; Cerutti, Janete M.; Corrêa, Maria Lucia C.; Costa, Maria Cristina R.; Curcio, Cyntia; Gushiken, Tsieko; Ho, Paulo L.; Kimura, Elza; Leite, Luciana C. C.; Maia, Gustavo; Majumder, Paromita; Marins, Mozart; Matsukuma, Adriana; Melo, Analy S. A.; Mestriner, Carlos Alberto; Miracca, Elisabete C.; Miranda, Daniela C.; Nascimento, Ana Lucia T. O.; Nóbrega, Francisco G.; Ojopi, Élida P. B.; Pandolfi, José Rodrigo C.; Pessoa, Luciana Gilbert; Rahal, Paula; Rainho, Claudia A.; da Ro's, Nancy; de Sá, Renata G.; Sales, Magaly M.; da Silva, Neusa P.; Silva, Tereza C.; da Silva, Wilson; Simão, Daniel F.; Sousa, Josane F.; Stecconi, Daniella; Tsukumo, Fernando; Valente, Valéria; Zalcberg, Heloisa; Brentani, Ricardo R.; Reis, Luis F. L.; Dias-Neto, Emmanuel; Simpson, Andrew J. G.

2000-01-01

Transcribed sequences in the human genome can be identified with confidence only by alignment with sequences derived from cDNAs synthesized from naturally occurring mRNAs. We constructed a set of 250,000 cDNAs that represent partial expressed gene sequences and that are biased toward the central coding regions of the resulting transcripts. They are termed ORF expressed sequence tags (ORESTES). The 250,000 ORESTES were assembled into 81,429 contigs. Of these, 1,181 (1.45%) were found to match sequences in chromosome 22 with at least one ORESTES contig for 162 (65.6%) of the 247 known genes, for 67 (44.6%) of the 150 related genes, and for 45 of the 148 (30.4%) EST-predicted genes on this chromosome. Using a set of stringent criteria to validate our sequences, we identified a further 219 previously unannotated transcribed sequences on chromosome 22. Of these, 171 were in fact also defined by EST or full length cDNA sequences available in GenBank but not utilized in the initial annotation of the first human chromosome sequence. Thus despite representing less than 15% of all expressed human sequences in the public databases at the time of the present analysis, ORESTES sequences defined 48 transcribed sequences on chromosome 22 not defined by other sequences. All of the transcribed sequences defined by ORESTES coincided with DNA regions predicted as encoding exons by genscan. (http://genes.mit.edu/GENSCAN.html). PMID:11070084
Studies of a biochemical factory: tomato trichome deep expressed sequence tag sequencing and proteomics.

PubMed

Schilmiller, Anthony L; Miner, Dennis P; Larson, Matthew; McDowell, Eric; Gang, David R; Wilkerson, Curtis; Last, Robert L

2010-07-01

Shotgun proteomics analysis allows hundreds of proteins to be identified and quantified from a single sample at relatively low cost. Extensive DNA sequence information is a prerequisite for shotgun proteomics, and it is ideal to have sequence for the organism being studied rather than from related species or accessions. While this requirement has limited the set of organisms that are candidates for this approach, next generation sequencing technologies make it feasible to obtain deep DNA sequence coverage from any organism. As part of our studies of specialized (secondary) metabolism in tomato (Solanum lycopersicum) trichomes, 454 sequencing of cDNA was combined with shotgun proteomics analyses to obtain in-depth profiles of genes and proteins expressed in leaf and stem glandular trichomes of 3-week-old plants. The expressed sequence tag and proteomics data sets combined with metabolite analysis led to the discovery and characterization of a sesquiterpene synthase that produces beta-caryophyllene and alpha-humulene from E,E-farnesyl diphosphate in trichomes of leaf but not of stem. This analysis demonstrates the utility of combining high-throughput cDNA sequencing with proteomics experiments in a target tissue. These data can be used for dissection of other biochemical processes in these specialized epidermal cells.
Studies of a Biochemical Factory: Tomato Trichome Deep Expressed Sequence Tag Sequencing and Proteomics1[W][OA

PubMed Central

Schilmiller, Anthony L.; Miner, Dennis P.; Larson, Matthew; McDowell, Eric; Gang, David R.; Wilkerson, Curtis; Last, Robert L.

2010-01-01

Shotgun proteomics analysis allows hundreds of proteins to be identified and quantified from a single sample at relatively low cost. Extensive DNA sequence information is a prerequisite for shotgun proteomics, and it is ideal to have sequence for the organism being studied rather than from related species or accessions. While this requirement has limited the set of organisms that are candidates for this approach, next generation sequencing technologies make it feasible to obtain deep DNA sequence coverage from any organism. As part of our studies of specialized (secondary) metabolism in tomato (Solanum lycopersicum) trichomes, 454 sequencing of cDNA was combined with shotgun proteomics analyses to obtain in-depth profiles of genes and proteins expressed in leaf and stem glandular trichomes of 3-week-old plants. The expressed sequence tag and proteomics data sets combined with metabolite analysis led to the discovery and characterization of a sesquiterpene synthase that produces β-caryophyllene and α-humulene from E,E-farnesyl diphosphate in trichomes of leaf but not of stem. This analysis demonstrates the utility of combining high-throughput cDNA sequencing with proteomics experiments in a target tissue. These data can be used for dissection of other biochemical processes in these specialized epidermal cells. PMID:20431087
High Throughput Biological Analysis Using Multi-bit Magnetic Digital Planar Tags

NASA Astrophysics Data System (ADS)

Hong, B.; Jeong, J.-R.; Llandro, J.; Hayward, T. J.; Ionescu, A.; Trypiniotis, T.; Mitrelias, T.; Kopper, K. P.; Steinmuller, S. J.; Bland, J. A. C.

2008-06-01

We report a new magnetic labelling technology for high-throughput biomolecular identification and DNA sequencing. Planar multi-bit magnetic tags have been designed and fabricated, which comprise a magnetic barcode formed by an ensemble of micron-sized thin film Ni80Fe20 bars encapsulated in SU8. We show that by using a globally applied magnetic field and magneto-optical Kerr microscopy the magnetic elements in the multi-bit magnetic tags can be addressed individually and encoded/decoded remotely. The critical steps needed to show the feasibility of this technology are demonstrated, including fabrication, flow transport, remote writing and reading, and successful functionalization of the tags as verified by fluorescence detection. This approach is ideal for encoding information on tags in microfluidic flow or suspension, for such applications as labelling of chemical precursors during drug synthesis and combinatorial library-based high-throughput multiplexed bioassays.
Expressed sequence tag (EST) analysis of two subspecies of Metarhizium anisopliae reveals a plethora of secreted proteins with potential activity in insect hosts.

PubMed

Freimoser, Florian M; Screen, Steven; Bagga, Savita; Hu, Gang; St Leger, Raymond J

2003-01-01

Expressed sequence tag (EST) libraries for Metarhizium anisopliae, the causative agent of green muscardine disease, were developed from the broad host-range pathogen Metarhizium anisopliae sf. anisopliae and the specific grasshopper pathogen, M. anisopliae sf. acridum. Approximately 1,700 5' end sequences from each subspecies were generated from cDNA libraries representing fungi grown under conditions that maximize secretion of cuticle-degrading enzymes. Both subspecies had ESTs for virtually all pathogenicity-related genes cloned to date from M. anisopliae, but many novel genes encoding potential virulence factors were also tagged. Enzymes with potential targets in the insect host included proteases, chitinases, phospholipases, lipases, esterases, phosphatases and enzymes producing toxic secondary metabolites. A diverse array of proteases composed 36 % of all M. anisopliae sf. anisopliae ESTs. Eighty percent of the ESTs that could be clustered into functional groups had significant matches (E<10(-5)) in other ascomycete fungi. These included genes reported to have specific roles in pathogens with plant or vertebrate hosts. Many of the remaining ESTs had their best BLAST match among animal, plant and bacterial sequences. These include genes with plant and microbial counterparts that produce potent antimicrobials. The abundance of transcripts discovered for different functional groups varied between the two subspecies of M. anisopliae in a manner consistent with ecological adaptations of the two pathogens. By hastening gene discovery this project has enhanced development of improved mycoinsecticides. In addition, the M. anisopliae ESTs represent a significant contribution to the extensive database of sequences from ascomycetes that are saprophytes or plant and vertebrate pathogens. Comparative analyses of these sequences is providing important information about the biology and evolutionary history of this clade.
Phylogenetic Network for European mtDNA

PubMed Central

Finnilä, Saara; Lehtonen, Mervi S.; Majamaa, Kari

2001-01-01

The sequence in the first hypervariable segment (HVS-I) of the control region has been used as a source of evolutionary information in most phylogenetic analyses of mtDNA. Population genetic inference would benefit from a better understanding of the variation in the mtDNA coding region, but, thus far, complete mtDNA sequences have been rare. We determined the nucleotide sequence in the coding region of mtDNA from 121 Finns, by conformation-sensitive gel electrophoresis and subsequent sequencing and by direct sequencing of the D loop. Furthermore, 71 sequences from our previous reports were included, so that the samples represented all the mtDNA haplogroups present in the Finnish population. We found a total of 297 variable sites in the coding region, which allowed the compilation of unambiguous phylogenetic networks. The D loop harbored 104 variable sites, and, in most cases, these could be localized within the coding-region networks, without discrepancies. Interestingly, many homoplasies were detected in the coding region. Nucleotide variation in the rRNA and tRNA genes was 6%, and that in the third nucleotide positions of structural genes amounted to 22% of that in the HVS-I. The complete networks enabled the relationships between the mtDNA haplogroups to be analyzed. Phylogenetic networks based on the entire coding-region sequence in mtDNA provide a rich source for further population genetic studies, and complete sequences make it easier to differentiate between disease-causing mutations and rare polymorphisms. PMID:11349229
Analysis of MHC class I genes across horse MHC haplotypes

PubMed Central

Tallmadge, Rebecca L.; Campbell, Julie A.; Miller, Donald C.; Antczak, Douglas F.

2010-01-01

The genomic sequences of 15 horse Major Histocompatibility Complex (MHC) class I genes and a collection of MHC class I homozygous horses of five different haplotypes were used to investigate the genomic structure and polymorphism of the equine MHC. A combination of conserved and locus-specific primers was used to amplify horse MHC class I genes with classical and non-classical characteristics. Multiple clones from each haplotype identified three to five classical sequences per homozygous animal, and two to three non-classical sequences. Phylogenetic analysis was applied to these sequences and groups were identified which appear to be allelic series, but some sequences were left ungrouped. Sequences determined from MHC class I heterozygous horses and previously described MHC class I sequences were then added, representing a total of ten horse MHC haplotypes. These results were consistent with those obtained from the MHC homozygous horses alone, and 30 classical sequences were assigned to four previously confirmed loci and three new provisional loci. The non-classical genes had few alleles and the classical genes had higher levels of allelic polymorphism. Alleles for two classical loci with the expected pattern of polymorphism were found in the majority of haplotypes tested, but alleles at two other commonly detected loci had more variation outside of the hypervariable region than within. Our data indicate that the equine Major Histocompatibility Complex is characterized by variation in the complement of class I genes expressed in different haplotypes in addition to the expected allelic polymorphism within loci. PMID:20099063
Comparison of Sanger and next generation sequencing performance for genotyping Cryptosporidium isolates at the 18S rRNA and actin loci.

PubMed

Paparini, Andrea; Gofton, Alexander; Yang, Rongchang; White, Nicole; Bunce, Michael; Ryan, Una M

2015-01-01

Cryptosporidium is an important enteric pathogen that infects a wide range of humans and animals. Rapid and reliable detection and characterisation methods are essential for understanding the transmission dynamics of the parasite. Sanger sequencing, and high-throughput sequencing (HTS) on an Ion Torrent platform, were compared with each other for their sensitivity and accuracy in detecting and characterising 25 Cryptosporidium-positive human and animal faecal samples. Ion Torrent reads (n = 123,857) were obtained at both 18S rRNA and actin loci for 21 of the 25 samples. Of these, one isolate at the actin locus (Cattle 05) and three at the 18S rRNA locus (HTS 10, HTS 11 and HTS 12), suffered PCR drop-out (i.e. PCR failures) when using fusion-tagged PCR. Sanger sequences were obtained for both loci for 23 of the 25 samples and showed good agreement with Ion Torrent-based genotyping. Two samples both from pythons (SK 02 and SK 05) produced mixed 18S and actin chromatograms by Sanger sequencing but were clearly identified by Ion Torrent sequencing as C. muris. One isolate (SK 03) was typed as C. muris by Sanger sequencing but was identified as a mixed C. muris and C. tyzzeri infection by HTS. 18S rRNA Type B sequences were identified in 4/6 C. parvum isolates when deep sequenced but were undetected in Sanger sequencing. Sanger was cheaper than Ion Torrent when sequencing a small numbers of samples, but when larger numbers of samples are considered (n = 60), the costs were comparative. Fusion-tagged amplicon based approaches are a powerful way of approaching mixtures, the only draw-back being the loss of PCR efficiency on low-template samples when using primers coupled to MID tags and adaptors. Taken together these data show that HTS has excellent potential for revealing the "true" composition of species/types in a Cryptosporidium infection, but that HTS workflows need to be carefully developed to ensure sensitivity, accuracy and contamination are controlled. Copyright © 2015 Elsevier Inc. All rights reserved.
Database-independent Protein Sequencing (DiPS) Enables Full-length de Novo Protein and Antibody Sequence Determination.

PubMed

Savidor, Alon; Barzilay, Rotem; Elinger, Dalia; Yarden, Yosef; Lindzen, Moshit; Gabashvili, Alexandra; Adiv Tal, Ophir; Levin, Yishai

2017-06-01

Traditional "bottom-up" proteomic approaches use proteolytic digestion, LC-MS/MS, and database searching to elucidate peptide identities and their parent proteins. Protein sequences absent from the database cannot be identified, and even if present in the database, complete sequence coverage is rarely achieved even for the most abundant proteins in the sample. Thus, sequencing of unknown proteins such as antibodies or constituents of metaproteomes remains a challenging problem. To date, there is no available method for full-length protein sequencing, independent of a reference database, in high throughput. Here, we present Database-independent Protein Sequencing, a method for unambiguous, rapid, database-independent, full-length protein sequencing. The method is a novel combination of non-enzymatic, semi-random cleavage of the protein, LC-MS/MS analysis, peptide de novo sequencing, extraction of peptide tags, and their assembly into a consensus sequence using an algorithm named "Peptide Tag Assembler." As proof-of-concept, the method was applied to samples of three known proteins representing three size classes and to a previously un-sequenced, clinically relevant monoclonal antibody. Excluding leucine/isoleucine and glutamic acid/deamidated glutamine ambiguities, end-to-end full-length de novo sequencing was achieved with 99-100% accuracy for all benchmarking proteins and the antibody light chain. Accuracy of the sequenced antibody heavy chain, including the entire variable region, was also 100%, but there was a 23-residue gap in the constant region sequence. © 2017 by The American Society for Biochemistry and Molecular Biology, Inc.

Role of Host-Driven Mutagenesis in Determining Genome Evolution of Sigma Virus (DMelSV; Rhabdoviridae) in Drosophila melanogaster.

PubMed

Piontkivska, Helen; Matos, Luis F; Paul, Sinu; Scharfenberg, Brian; Farmerie, William G; Miyamoto, Michael M; Wayne, Marta L

2016-10-05

Sigma virus (DMelSV) is ubiquitous in natural populations of Drosophila melanogaster. Host-mediated, selective RNA editing of adenosines to inosines (ADAR) may contribute to control of viral infection by preventing transcripts from being transported into the cytoplasm or being translated accurately; or by increasing the viral genomic mutation rate. Previous PCR-based studies showed that ADAR mutations occur in DMelSV at low frequency. Here we use SOLiD TM deep sequencing of flies from a single host population from Athens, GA, USA to comprehensively evaluate patterns of sequence variation in DMelSV with respect to ADAR. GA dinucleotides, which are weak targets of ADAR, are strongly overrepresented in the positive strand of the virus, consistent with selection to generate ADAR resistance on this complement of the transient, double-stranded RNA intermediate in replication and transcription. Potential ADAR sites in a worldwide sample of viruses are more likely to be "resistant" if the sites do not vary among samples. Either variable sites are less constrained and hence are subject to weaker selection than conserved sites, or the variation is driven by ADAR. We also find evidence of mutations segregating within hosts, hereafter referred to as hypervariable sites. Some of these sites were variable only in one or two flies (i.e., rare); others were shared by four or even all five of the flies (i.e., common). Rare and common hypervariable sites were indistinguishable with respect to susceptibility to ADAR; however, polymorphism in rare sites were more likely to be consistent with the action of ADAR than in common ones, again suggesting that ADAR is deleterious to the virus. Thus, in DMelSV, host mutagenesis is constraining viral evolution both within and between hosts. © The Author 2016. Published by Oxford University Press on behalf of the Society for Molecular Biology and Evolution.
Distribution of Barley yellow dwarf virus-PAV in the Sub-Antarctic Kerguelen Islands and Characterization of Two New Luteovirus Species

PubMed Central

Svanella-Dumas, Laurence; Candresse, Thierry; Hullé, Maurice; Marais, Armelle

2013-01-01

A systematic search for viral infection was performed in the isolated Kerguelen Islands, using a range of polyvalent genus-specific PCR assays. Barley yellow dwarf virus (BYDV) was detected in both introduced and native grasses such as Poa cookii. The geographical distribution of BYDV and its prevalence in P. cookii were analyzed using samples collected from various sites of the archipelago. We estimate the average prevalence of BYDV to be 24.9% in P. cookii, with significant variability between sites. BYDV genetic diversity was assessed using sequence information from two genomic regions: the P3 open reading frame (ORF) (encoding the coat protein) and the hypervariable P6 ORF region. The phylogenetic analysis in the P3 region showed that BYDV sequences segregate into three major lineages, the most frequent of which (Ker-I cluster) showed close homology with BYDV-PAV-I isolates and had very low intra-lineage diversity (0.6%). A similarly low diversity was also recorded in the hypervariable P6 region, suggesting that Ker-I isolates derive from the recent introduction of BYDV-PAV-I. Divergence time estimation suggests that BYDV-PAV-I was likely introduced in the Kerguelen environment at the same time frame as its aphid vector, Rhopalosiphum padi, whose distribution shows good overlap with that of BYDV-Ker-I. The two other lineages show more than 22% amino acid divergence in the P3 region with other known species in the BYDV species complex, indicating that they represent distinct BYDV species. Using species-specific amplification primers, the distribution of these novel species was analyzed. The high prevalence of BYDV on native Poaceae and the presence of the vector R. padi, raises the question of its impact on the vulnerable plant communities of this remote ecosystem. PMID:23825645
Generation and analysis of expressed sequence tags from a cDNA library of the fruiting body of Ganoderma lucidum

PubMed Central

2010-01-01

Background Little genomic or trancriptomic information on Ganoderma lucidum (Lingzhi) is known. This study aims to discover the transcripts involved in secondary metabolite biosynthesis and developmental regulation of G. lucidum using an expressed sequence tag (EST) library. Methods A cDNA library was constructed from the G. lucidum fruiting body. Its high-quality ESTs were assembled into unique sequences with contigs and singletons. The unique sequences were annotated according to sequence similarities to genes or proteins available in public databases. The detection of simple sequence repeats (SSRs) was preformed by online analysis. Results A total of 1,023 clones were randomly selected from the G. lucidum library and sequenced, yielding 879 high-quality ESTs. These ESTs showed similarities to a diverse range of genes. The sequences encoding squalene epoxidase (SE) and farnesyl-diphosphate synthase (FPS) were identified in this EST collection. Several candidate genes, such as hydrophobin, MOB2, profilin and PHO84 were detected for the first time in G. lucidum. Thirteen (13) potential SSR-motif microsatellite loci were also identified. Conclusion The present study demonstrates a successful application of EST analysis in the discovery of transcripts involved in the secondary metabolite biosynthesis and the developmental regulation of G. lucidum. PMID:20230644
Genotyping variability of computationally categorized peach microsatellite markers

USDA-ARS?s Scientific Manuscript database

Numerous expressed sequence tag (EST) simple sequence repeat (SSR) primers can be easily mined out. The obstacle to develop them into usable markers is how to optimally select downsized subsets of the primers for genotyping, which accordingly reduces amplification failure and monomorphism often occu...
Application of the High Resolution Melting analysis for genetic mapping of Sequence Tagged Site markers in narrow-leafed lupin (Lupinus angustifolius L.).

PubMed

Kamel, Katarzyna A; Kroc, Magdalena; Święcicki, Wojciech

2015-01-01

Sequence tagged site (STS) markers are valuable tools for genetic and physical mapping that can be successfully used in comparative analyses among related species. Current challenges for molecular markers genotyping in plants include the lack of fast, sensitive and inexpensive methods suitable for sequence variant detection. In contrast, high resolution melting (HRM) is a simple and high-throughput assay, which has been widely applied in sequence polymorphism identification as well as in the studies of genetic variability and genotyping. The present study is the first attempt to use the HRM analysis to genotype STS markers in narrow-leafed lupin (Lupinus angustifolius L.). The sensitivity and utility of this method was confirmed by the sequence polymorphism detection based on melting curve profiles in the parental genotypes and progeny of the narrow-leafed lupin mapping population. Application of different approaches, including amplicon size and a simulated heterozygote analysis, has allowed for successful genetic mapping of 16 new STS markers in the narrow-leafed lupin genome.
The rescue and evaluation of FLAG and HIS epitope-tagged Asia 1 type foot-and-mouth disease viruses.

PubMed

Yang, Bo; Yang, Fan; Zhang, Yan; Liu, Huanan; Jin, Ye; Cao, Weijun; Zhu, Zixiang; Zheng, Haixue; Yin, Hong

2016-02-02

The VP1 G-H loop of the foot-and-mouth disease virus (FMDV) contains the primary antigenic site, as well as an Arg-Gly-Asp (RGD) binding motif for the αv-integrin family of cell surface receptors. We anticipated that introducing a foreign epitope tag sequence downstream of the RGD motif would be tolerated by the viral capsid and would not destroy the antigenic site of FMDV. In this study, we have designed, generated, and characterized two recombinant FMDVs with a FLAG tag or histidine (HIS) inserted in the VP1 G-H loop downstream of the RGD motif +9 position. The tagged viruses were genetically stable and exhibited similar growth properties with their parental virus. What is more, the recombinant viruses rFMDV-FLAG and rFMDV-HIS showed neutralization sensitivity to FMDV type Asia1-specific mAbs, as well as to polyclonal antibodies. Additionally, the r1 values of the recombinant viruses were similar to that of the parental virus, indicating that the insertion of FLAG or HIS tag sequences downstream of the RGD motif +9 position do not eradicate the antigenic site of FMDV and do not affect its antigenicity. These results indicated that the G-H loop of Asia1 FMDV is able to effectively display the foreign epitopes, making this a potential approach for novel FMDV vaccines development. Copyright © 2015 Elsevier B.V. All rights reserved.
Association between hMLH1 hypermethylation and JC virus (JCV) infection in human colorectal cancer (CRC).

PubMed

Vilkin, Alex; Niv, Yaron

2011-04-01

Incorporation of viral DNA may interfere with the normal sequence of human DNA bases on the genetic level or cause secondary epigenetic changes such as gene promoter methylation or histone acetylation. Colorectal cancer (CRC) is the second leading cause of cancer mortality in the USA. Chromosomal instability (CIN) was established as the key mechanism in cancer development. Later, it was found that CRC results not only from the progressive accumulation of genetic alterations but also from epigenetic changes. JC virus (JCV) is a candidate etiologic factor in sporadic CRC. It may act by stabilizing β-catenin, facilitating its entrance to the cell nucleus, initialing proliferation and cancer development. Diploid CRC cell lines transfected with JCV-containing plasmids developed CIN. This result provides direct experimental evidence for the ability of JCV T-Ag to induce CIN in the genome of colonic epithelial cells. The association of CRC hMLH1 methylation and tumor positivity for JCV was recently documented. JC virus T-Ag DNA sequences were found in 77% of CRCs and are associated with promoter methylation of multiple genes. hMLH1 was methylated in 25 out of 80 CRC patients positive for T-Ag (31%) in comparison with only one out of 11 T-Ag negative cases (9%). Thus, JCV can mediate both CIN and aberrant methylation in CRC. Like other viruses, chronic infection with JCV may induce CRC by different mechanisms which should be further investigated. Thus, gene promoter methylation induced by JCV may be an important process in CRC and the polyp-carcinoma sequence.
Generation and analysis of a barcode-tagged insertion mutant library in the fission yeast Schizosaccharomyces pombe

PubMed Central

2012-01-01

Background Barcodes are unique DNA sequence tags that can be used to specifically label individual mutants. The barcode-tagged open reading frame (ORF) haploid deletion mutant collections in the budding yeast Saccharomyces cerevisiae and the fission yeast Schizosaccharomyces pombe allow for high-throughput mutant phenotyping because the relative growth of mutants in a population can be determined by monitoring the proportions of their associated barcodes. While these mutant collections have greatly facilitated genome-wide studies, mutations in essential genes are not present, and the roles of these genes are not as easily studied. To further support genome-scale research in S. pombe, we generated a barcode-tagged fission yeast insertion mutant library that has the potential of generating viable mutations in both essential and non-essential genes and can be easily analyzed using standard molecular biological techniques. Results An insertion vector containing a selectable ura4+ marker and a random barcode was used to generate a collection of 10,000 fission yeast insertion mutants stored individually in 384-well plates and as six pools of mixed mutants. Individual barcodes are flanked by Sfi I recognition sites and can be oligomerized in a unique orientation to facilitate barcode sequencing. Independent genetic screens on a subset of mutants suggest that this library contains a diverse collection of single insertion mutations. We present several approaches to determine insertion sites. Conclusions This collection of S. pombe barcode-tagged insertion mutants is well-suited for genome-wide studies. Because insertion mutations may eliminate, reduce or alter the function of essential and non-essential genes, this library will contain strains with a wide range of phenotypes that can be assayed by their associated barcodes. The design of the barcodes in this library allows for barcode sequencing using next generation or standard benchtop cloning approaches. PMID:22554201
Transcript Profile of the Response of Two Soybean Genotypes to Potassium Deficiency

PubMed Central

Hao, QingNan; Sha, AiHua; Shan, ZhiHui; Chen, LiMiao; Zhou, Rong; Zhi, HaiJian; Zhou, XinAn

2012-01-01

The macronutrient potassium (K) is essential to plant growth and development. Crop yield potential is often affected by lack of soluble K. The molecular regulation mechanism of physiological and biochemical responses to K starvation in soybean roots and shoots is not fully understood. In the present study, two soybean varieties were subjected to low-K stress conditions: a low-K-tolerant variety (You06-71) and a low-K-sensitive variety (HengChun04-11). Eight libraries were generated for analysis: 2 genotypes ×2 tissues (roots and shoots) ×2 time periods [short term (0.5 to 12 h) and long term (3 to 12 d)]. RNA derived from the roots and shoots of these two varieties across two periods (short term and long term) were sequenced and the transcriptomes were compared using high-throughput tag-sequencing. To this end, a large number of clean tags (tags used for analysis after removal of dirty tags) corresponding to distinct tags (all types of clean tags) were identified in eight libraries (L1, You06-71-root short term; L2, HengChun04-11-root short term; L3, You06-71-shoot short term; L4, HengChun04-11-shoot short term; L5, You06-71-root long term; L6, HengChun04-11-root long term; L7, You06-71-shoot long term; L8, HengChun04-11-shoot long term). All clean tags were mapped to the available soybean (Glycine max) transcript database (http://www.soybase.org). Many genes showed substantial differences in expression across the libraries. In total, 5,440 transcripts involved in 118 KEGG pathways were either up- or down-regulated. Fifteen genes were randomly selected and their expression levels were confirmed using quantitative RT-PCR. Our results provide preliminary information on the molecular mechanism of potassium absorption and transport under low-K stress conditions in different soybean tissues. PMID:22792192
Transcriptome Analysis of Gene Expression during Chinese Water Chestnut Storage Organ Formation

PubMed Central

Chen, Sainan; Wang, Yan; Yu, Meizhen; Chen, Xuehao; Li, Liangjun; Yin, Jingjing

2016-01-01

The product organ (storage organ; corm) of the Chinese water chestnut has become a very popular food in Asian countries because of its unique nutritional value. Corm formation is a complex biological process, and extensive whole genome analysis of transcripts during corm development has not been carried out. In this study, four corm libraries at different developmental stages were constructed, and gene expression was identified using a high-throughput tag sequencing technique. Approximately 4.9 million tags were sequenced, and 4,371,386, 4,372,602, 4,782,494, and 5,276,540 clean tags, including 119,676, 110,701, 100,089, and 101,239 distinct tags, respectively, were obtained after removal of low-quality tags from each library. More than 39% of the distinct tags were unambiguous and could be mapped to reference genes, while 40% were unambiguous tag-mapped genes. After mapping their functions in existing databases, a total of 11,592, 10,949, 10,585, and 7,111 genes were annotated from the B1, B2, B3, and B4 libraries, respectively. Analysis of the differentially expressed genes (DEGs) in B1/B2, B2/B3, and B3/B4 libraries showed that most of the DEGs at the B1/B2 stages were involved in carbohydrate and hormone metabolism, while the majority of DEGs were involved in energy metabolism and carbohydrate metabolism at the B2/B3 and B3/B4 stages. All of the upregulated transcription factors and 9 important genes related to product organ formation in the above four stages were also identified. The expression changes of nine of the identified DEGs were validated using a quantitative PCR approach. This study provides a comprehensive understanding of gene expression during corm formation in the Chinese water chestnut. PMID:27716802
openSputnik--a database to ESTablish comparative plant genomics using unsaturated sequence collections.

PubMed

Rudd, Stephen

2005-01-01

The public expressed sequence tag collections are continually being enriched with high-quality sequences that represent an ever-expanding range of taxonomically diverse plant species. While these sequence collections provide biased insight into the populations of expressed genes available within individual species and their associated tissues, the information is conceivably of wider relevance in a comparative context. When we consider the available expressed sequence tag (EST) collections of summer 2004, most of the major plant taxonomic clades are at least superficially represented. Investigation of the five million available plant ESTs provides a wealth of information that has applications in modelling the routes of plant genome evolution and the identification of lineage-specific genes and gene families. Over four million ESTs from over 50 distinct plant species have been collated within an EST analysis pipeline called openSputnik. The ESTs were resolved down into approximately one million unigene sequences. These have been annotated using orthology-based annotation transfer from reference plant genomes and using a variety of contemporary bioinformatics methods to assign peptide, structural and functional attributes. The openSputnik database is available at http://sputnik.btk.fi.
Automated antibody structure prediction using Accelrys tools: Results and best practices

PubMed Central

Fasnacht, Marc; Butenhof, Ken; Goupil-Lamy, Anne; Hernandez-Guzman, Francisco; Huang, Hongwei; Yan, Lisa

2014-01-01

We describe the methodology and results from our participation in the second Antibody Modeling Assessment experiment. During the experiment we predicted the structure of eleven unpublished antibody Fv fragments. Our prediction methods centered on template-based modeling; potential templates were selected from an antibody database based on their sequence similarity to the target in the framework regions. Depending on the quality of the templates, we constructed models of the antibody framework regions either using a single, chimeric or multiple template approach. The hypervariable loop regions in the initial models were rebuilt by grafting the corresponding regions from suitable templates onto the model. For the H3 loop region, we further refined models using ab initio methods. The final models were subjected to constrained energy minimization to resolve severe local structural problems. The analysis of the models submitted show that Accelrys tools allow for the construction of quite accurate models for the framework and the canonical CDR regions, with RMSDs to the X-ray structure on average below 1 Å for most of these regions. The results show that accurate prediction of the H3 hypervariable loops remains a challenge. Furthermore, model quality assessment of the submitted models show that the models are of quite high quality, with local geometry assessment scores similar to that of the target X-ray structures. Proteins 2014; 82:1583–1598. © 2014 The Authors. Proteins published by Wiley Periodicals, Inc. PMID:24833271
In silico Analysis of 3′-End-Processing Signals in Aspergillus oryzae Using Expressed Sequence Tags and Genomic Sequencing Data

PubMed Central

Tanaka, Mizuki; Sakai, Yoshifumi; Yamada, Osamu; Shintani, Takahiro; Gomi, Katsuya

2011-01-01

To investigate 3′-end-processing signals in Aspergillus oryzae, we created a nucleotide sequence data set of the 3′-untranslated region (3′ UTR) plus 100 nucleotides (nt) sequence downstream of the poly(A) site using A. oryzae expressed sequence tags and genomic sequencing data. This data set comprised 1065 sequences derived from 1042 unique genes. The average 3′ UTR length in A. oryzae was 241 nt, which is greater than that in yeast but similar to that in plants. The 3′ UTR and 100 nt sequence downstream of the poly(A) site is notably U-rich, while the region located 15–30 nt upstream of the poly(A) site is markedly A-rich. The most frequently found hexanucleotide in this A-rich region is AAUGAA, although this sequence accounts for only 6% of all transcripts. These data suggested that A. oryzae has no highly conserved sequence element equivalent to AAUAAA, a mammalian polyadenylation signal. We identified that putative 3′-end-processing signals in A. oryzae, while less well conserved than those in mammals, comprised four sequence elements: the furthest upstream U-rich element, A-rich sequence, cleavage site, and downstream U-rich element flanking the cleavage site. Although these putative 3′-end-processing signals are similar to those in yeast and plants, some notable differences exist between them. PMID:21586533
Bacterial Communities in the Rhizosphere of Amilaceous Maize (Zea mays L.) as Assessed by Pyrosequencing.

PubMed

Correa-Galeote, David; Bedmar, Eulogio J; Fernández-González, Antonio J; Fernández-López, Manuel; Arone, Gregorio J

2016-01-01

Maize (Zea mays L.) is the staple diet of the native peasants in the Quechua region of the Peruvian Andes who continue growing it in small plots called chacras following ancestral traditions. The abundance and structure of bacterial communities associated with the roots of amilaceous maize has not been studied in Andean chacras. Accordingly, the main objective of this study was to describe the rhizospheric bacterial diversity of amilaceous maize grown either in the presence or the absence of bur clover cultivated in soils from the Quechua maize belt. Three 16S rRNA gene libraries, one corresponding to sequences of bacteria from bulk soil of a chacra maintained under fallow conditions, the second from the rhizosphere of maize-cultivated soils, and the third prepared from rhizospheric soil of maize cultivated in intercropping with bur clover were examined using pyrosequencing tags spanning the V4 and V5 hypervariable regions of the gene. A total of 26031 sequences were found that grouped into 5955 distinct operational taxonomic units which distributed in 309 genera. The numbers of OTUs in the libraries from the maize-cultivated soils were significantly higher than those found in the libraries from bulk soil. One hundred ninety seven genera were found in the bulk soil library and 234 and 203 were in those from the maize and maize/bur clover-cultivated soils. Sixteen out of the 309 genera had a relative abundance higher than 0.5% and the were (in decreasing order of abundance) Gp4, Gp6, Flavobacterium, Subdivision3 genera incertae sedis of the Verrucomicrobia phylum, Gemmatimonas, Dechloromonas, Ohtaekwangia, Rhodoferax, Gaiella, Opitutus, Gp7, Spartobacteria genera incertae sedis, Terrimonas, Gp5, Steroidobacter and Parcubacteria genera incertae sedis. Genera Gp4 and Gp6 of the Acidobacteria, Gemmatimonas and Rhodoferax were the most abundant in bulk soil, whereas Flavobacterium, Dechloromonas and Ohtaekwangia were the main genera in the rhizosphere of maize intercropped with bur clover, and Gp4, Subdivision3 genera incertae sedis of phylum Verrucomicrobia, Gp6 and Rhodoferax were the main genera in the rhizosphere of maize plants. Taken together, our results suggest that bur clover produces specific changes in rhizospheric bacterial diversity of amilaceous maize plants.
Bacterial Communities in the Rhizosphere of Amilaceous Maize (Zea mays L.) as Assessed by Pyrosequencing

PubMed Central

Correa-Galeote, David; Bedmar, Eulogio J.; Fernández-González, Antonio J.; Fernández-López, Manuel; Arone, Gregorio J.

2016-01-01

Maize (Zea mays L.) is the staple diet of the native peasants in the Quechua region of the Peruvian Andes who continue growing it in small plots called chacras following ancestral traditions. The abundance and structure of bacterial communities associated with the roots of amilaceous maize has not been studied in Andean chacras. Accordingly, the main objective of this study was to describe the rhizospheric bacterial diversity of amilaceous maize grown either in the presence or the absence of bur clover cultivated in soils from the Quechua maize belt. Three 16S rRNA gene libraries, one corresponding to sequences of bacteria from bulk soil of a chacra maintained under fallow conditions, the second from the rhizosphere of maize-cultivated soils, and the third prepared from rhizospheric soil of maize cultivated in intercropping with bur clover were examined using pyrosequencing tags spanning the V4 and V5 hypervariable regions of the gene. A total of 26031 sequences were found that grouped into 5955 distinct operational taxonomic units which distributed in 309 genera. The numbers of OTUs in the libraries from the maize-cultivated soils were significantly higher than those found in the libraries from bulk soil. One hundred ninety seven genera were found in the bulk soil library and 234 and 203 were in those from the maize and maize/bur clover-cultivated soils. Sixteen out of the 309 genera had a relative abundance higher than 0.5% and the were (in decreasing order of abundance) Gp4, Gp6, Flavobacterium, Subdivision3 genera incertae sedis of the Verrucomicrobia phylum, Gemmatimonas, Dechloromonas, Ohtaekwangia, Rhodoferax, Gaiella, Opitutus, Gp7, Spartobacteria genera incertae sedis, Terrimonas, Gp5, Steroidobacter and Parcubacteria genera incertae sedis. Genera Gp4 and Gp6 of the Acidobacteria, Gemmatimonas and Rhodoferax were the most abundant in bulk soil, whereas Flavobacterium, Dechloromonas and Ohtaekwangia were the main genera in the rhizosphere of maize intercropped with bur clover, and Gp4, Subdivision3 genera incertae sedis of phylum Verrucomicrobia, Gp6 and Rhodoferax were the main genera in the rhizosphere of maize plants. Taken together, our results suggest that bur clover produces specific changes in rhizospheric bacterial diversity of amilaceous maize plants. PMID:27524985
Single-step affinity and cost-effective purification of recombinant proteins using the Sepharose-binding lectin-tag from the mushroom Laetiporus sulphureus as fusion partner.

PubMed

Li, Xiao-Jing; Liu, Jin-Ling; Gao, Dong-Sheng; Wan, Wen-Yan; Yang, Xia; Li, Yong-Tao; Chang, Hong-Tao; Chen, Lu; Wang, Chuan-Qing; Zhao, Jun

2016-03-01

Previous research showed that a lectin from the mushroom Laetiporus sulphureus, designed LSL, bound to Sepharose and could be eluted by lactose. In this study, by taking advantage of the strong affinity of LSL-tag for Sepharose, we developed a single-step purification method for LSL-tagged fusion proteins. We utilized unmodified Sepharose-4B as a specific adsorbent and 0.2 M lactose solution as an elution buffer. Fusion proteins of LSL-tag and porcine circovirus capsid protein, designated LSL-Cap was recovered with purity of 90 ± 4%, and yield of 87 ± 3% from crude extract of recombinant Escherichia coli. To enable the remove of LSL-tag, tobacco etch virus (TEV) protease recognition sequence was placed downstream of LSL-tag in the expression vector, and LSL-tagged TEV protease, designated LSL-TEV, was also expressed in E. coli., and was recovered with purity of 82 ± 5%, and yield of 85 ± 2% from crude extract of recombinant E. coli. After digestion of LSL-tagged recombinant proteins with LSL-TEV, the LSL tag and LSL-TEV can be easily removed by passing the digested products through the Sepharose column. It is of worthy noting that the Sepharose can be reused after washing with PBS. The LSL affinity purification method enables rapid and inexpensive purification of LSL-tagged fusion proteins and scale-up production of native proteins. Copyright © 2015 Elsevier Inc. All rights reserved.
The unusually large Plasmodium telomerase reverse-transcriptase localizes in a discrete compartment associated with the nucleolus

PubMed Central

Figueiredo, Luisa M.; Rocha, Eduardo P. C.; Mancio-Silva, Liliana; Prevost, Christine; Hernandez-Verdun, Danièle; Scherf, Artur

2005-01-01

Telomerase replicates chromosome ends, a function necessary for maintaining genome integrity. We have identified the gene that encodes the catalytic reverse transcriptase (RT) component of this enzyme in the malaria parasite Plasmodium falciparum (PfTERT) as well as the orthologous genes from two rodent and one simian malaria species. PfTERT is predicted to encode a basic protein that contains the major sequence motifs previously identified in known telomerase RTs (TERTs). At ∼2500 amino acids, PfTERT is three times larger than other characterized TERTs. We observed remarkable sequence diversity between TERT proteins of different Plasmodial species, with conserved domains alternating with hypervariable regions. Immunofluorescence analysis revealed that PfTERT is expressed in asexual blood stage parasites that have begun DNA synthesis. Surprisingly, rather than at telomere clusters, PfTERT typically localizes into a discrete nuclear compartment. We further demonstrate that this compartment is associated with the nucleolus, hereby defined for the first time in P.falciparum. PMID:15722485
Babesia lengau sp. nov., a novel Babesia species in cheetah (Acinonyx jubatus, Schreber, 1775) populations in South Africa.

PubMed

Bosman, Anna-Mari; Oosthuizen, Marinda C; Peirce, Michael A; Venter, Estelle H; Penzhorn, Barend L

2010-08-01

In a previous paper, we reported on a large number of cheetah blood specimens that gave positive signals only for Babesia and/or Theileria genus-specific probes on the reverse line blot (RLB) assay, indicating the presence of a novel species or variant of an existing species. Some of these specimens were investigated further by microscopic, serological, sequencing, and phylogenetic analyses. The near-full-length 18S rRNA genes of 13 samples, as well as the second internal transcribed spacer (ITS2) region, were amplified, cloned, and sequenced. A species-specific RLB probe, designed to target the hypervariable V4 region of the 18S rRNA gene for detection of the novel Babesia sp., was used to screen an additional 137 cheetah blood specimens for the presence of the species. The prevalence of infection was 28.5%. Here we describe the morphology and phylogenetic relationships of the novel species, which we have named Babesia lengau sp. nov.
Babesia lengau sp. nov., a Novel Babesia Species in Cheetah (Acinonyx jubatus, Schreber, 1775) Populations in South Africa ▿

PubMed Central

Bosman, Anna-Mari; Oosthuizen, Marinda C.; Peirce, Michael A.; Venter, Estelle H.; Penzhorn, Barend L.

2010-01-01

In a previous paper, we reported on a large number of cheetah blood specimens that gave positive signals only for Babesia and/or Theileria genus-specific probes on the reverse line blot (RLB) assay, indicating the presence of a novel species or variant of an existing species. Some of these specimens were investigated further by microscopic, serological, sequencing, and phylogenetic analyses. The near-full-length 18S rRNA genes of 13 samples, as well as the second internal transcribed spacer (ITS2) region, were amplified, cloned, and sequenced. A species-specific RLB probe, designed to target the hypervariable V4 region of the 18S rRNA gene for detection of the novel Babesia sp., was used to screen an additional 137 cheetah blood specimens for the presence of the species. The prevalence of infection was 28.5%. Here we describe the morphology and phylogenetic relationships of the novel species, which we have named Babesia lengau sp. nov. PMID:20519464
Identification of novel Theileria genotypes from Grant's gazelle

PubMed Central

Hooge, Janis; Howe, Laryssa; Ezenwa, Vanessa O.

2015-01-01

Blood samples collected from Grant's gazelles (Nanger granti) in Kenya were screened for hemoparasites using a combination of microscopic and molecular techniques. All 69 blood smears examined by microscopy were positive for hemoparasites. In addition, Theileria/Babesia DNA was detected in all 65 samples screened by PCR for a ~450-base pair fragment of the V4 hypervariable region of the 18S rRNA gene. Sequencing and BLAST analysis of a subset of PCR amplicons revealed widespread co-infection (25/39) and the existence of two distinct Grant's gazelle Theileria subgroups. One group of 11 isolates clustered as a subgroup with previously identified Theileria ovis isolates from small ruminants from Europe, Asia and Africa; another group of 3 isolates clustered with previously identified Theileria spp. isolates from other African antelope. Based on extensive levels of sequence divergence (1.2–2%) from previously reported Theileria species within Kenya and worldwide, the Theileria isolates detected in Grant's gazelles appear to represent at least two novel Theileria genotypes. PMID:25973394

Identification of novel Theileria genotypes from Grant's gazelle.

PubMed

Hooge, Janis; Howe, Laryssa; Ezenwa, Vanessa O

2015-08-01

Blood samples collected from Grant's gazelles (Nanger granti) in Kenya were screened for hemoparasites using a combination of microscopic and molecular techniques. All 69 blood smears examined by microscopy were positive for hemoparasites. In addition, Theileria/Babesia DNA was detected in all 65 samples screened by PCR for a ~450-base pair fragment of the V4 hypervariable region of the 18S rRNA gene. Sequencing and BLAST analysis of a subset of PCR amplicons revealed widespread co-infection (25/39) and the existence of two distinct Grant's gazelle Theileria subgroups. One group of 11 isolates clustered as a subgroup with previously identified Theileria ovis isolates from small ruminants from Europe, Asia and Africa; another group of 3 isolates clustered with previously identified Theileria spp. isolates from other African antelope. Based on extensive levels of sequence divergence (1.2-2%) from previously reported Theileria species within Kenya and worldwide, the Theileria isolates detected in Grant's gazelles appear to represent at least two novel Theileria genotypes.
Extremophiles in Household Water Heaters

NASA Astrophysics Data System (ADS)

Wilpiszeski, R.; House, C. H.

2016-12-01

A significant fraction of Earth's microbial diversity comes from species living in extreme environments, but natural extreme environments can be difficult to access. Manmade systems like household water heaters serve as an effective proxy for thermophilic environments that are otherwise difficult to sample directly. As such, we are investigating the biogeography, taxonomic distribution, and evolution of thermophiles growing in domestic water heaters. Citizen scientists collected hot tap water culture- and filter- samples from 101 homes across the United States. We recovered a single species of thermophilic heterotroph from culture samples inoculated from water heaters across the United States, Thermus scotoductus. Whole-genome sequencing was conducted to better understand the distribution and evolution of this single species. We have also sequenced hyper-variable regions of the 16S rRNA gene from whole-community filter samples to identify the broad diversity and distribution of microbial cells captured from each water heater. These results shed light on the processes that shape thermophilic populations and genomes at a spatial resolution that is difficult to access in naturally occurring extreme ecosystems.
Analysis of ER Resident Proteins in S. cerevisiae: Implementation of H/KDEL Retrieval Sequences

PubMed Central

Young, Carissa L.; Raden, David L.; Robinson, Anne S.

2013-01-01

An elaborate quality control system regulates ER homeostasis by ensuring the fidelity of protein synthesis and maturation. In budding yeast, genomic analyses and high-throughput proteomic studies have identified ER resident proteins that restore homeostasis following local perturbations. Yet, how these folding factors modulate stress has been largely unexplored. In this study, we designed a series of PCR-based modules including codon-optimized epitopes and FP variants complete with C-terminal H/KDEL retrieval motifs. These conserved sequences are inherent to most soluble ER resident proteins. To monitor multiple proteins simultaneously, H/KDEL cassettes are available with six different selection markers, providing optimal flexibility for live-cell imaging and multicolor labeling in vivo. A single pair of PCR primers can be used for the amplification of these 26 modules, enabling numerous combinations of tags and selection markers. The versatility of pCY H/KDEL cassettes was demonstrated by labeling BiP/Kar2p, Pdi1p, and Scj1p with all novel tags, thus providing a direct comparison among FP variants. Furthermore, to advance in vitro studies of yeast ER proteins, Strep-tag II was engineered with a C-terminal retrieval sequence. Here, an efficient purification strategy was established for BiP under physiological conditions. PMID:23324027
Construction and Cloning of Reporter-Tagged Replicon cDNA for an In Vitro Replication Study of Murine Norovirus-1 (MNV-1).

PubMed

Ahmad, Muhammad Khairi; Tabana, Yasser M; Ahmed, Mowaffaq Adam; Sandai, Doblin Anak; Mohamed, Rafeezul; Ismail, Ida Shazrina; Zulkiflie, Nurulisa; Yunus, Muhammad Amir

2017-12-01

A norovirus maintains its viability, infectivity and virulence by its ability to replicate. However, the biological mechanisms of the process remain to be explored. In this work, the NanoLuc™ Luciferase gene was used to develop a reporter-tagged replicon system to study norovirus replication. The NanoLuc™ Luciferase reporter protein was engineered to be expressed as a fusion protein for MNV-1 minor capsid protein, VP2. The foot-and-mouth disease virus 2A (FMDV2A) sequence was inserted between the 3'end of the reporter gene and the VP2 start sequence to allow co-translational 'cleavage' of fusion proteins during intracellular transcript expression. Amplification of the fusion gene was performed using a series of standard and overlapping polymerase chain reactions. The resulting amplicon was then cloned into three readily available backbones of MNV-1 cDNA clones. Restriction enzyme analysis indicated that the NanoLucTM Luciferase gene was successfully inserted into the parental MNV-1 cDNA clone. The insertion was further confirmed by using DNA sequencing. NanoLuc™ Luciferase-tagged MNV-1 cDNA clones were successfully engineered. Such clones can be exploited to develop robust experimental assays for in vitro assessments of viral RNA replication.
Monitoring Arthrobacter protophormiae RKJ100 in a 'tag and chase' method during p-nitrophenol bio-remediation in soil microcosms.

PubMed

Pandey, Gunjan; Pandey, Janmejay; Jain, Rakesh K

2006-05-01

Monitoring of micro-organisms released deliberately into the environment is essential to assess their movement during the bio-remediation process. During the last few years, DNA-based genetic methods have emerged as the preferred method for such monitoring; however, their use is restricted in cases where organisms used for bio-remediation are not well characterized or where the public domain databases do not provide sufficient information regarding their sequence. For monitoring of such micro-organisms, alternate approaches have to be undertaken. In this study, we have specifically monitored a p-nitrophenol (PNP)-degrading organism, Arthrobacter protophormiae RKJ100, using molecular methods during PNP degradation in soil microcosm. Cells were tagged with a transposon-based foreign DNA sequence prior to their introduction into PNP-contaminated microcosms. Later, this artificially introduced DNA sequence was PCR-amplified to distinguish the bio-augmented organism from the indigenous microflora during PNP bio-remediation.
Distorted Immunodominance by Linker Sequences or other Epitopes from a Second Protein Antigen During Antigen-Processing

PubMed Central

Kim, AeRyon; Boronina, Tatiana N.; Cole, Robert N.; Darrah, Erika; Sadegh-Nasseri, Scheherazade

2017-01-01

The immune system focuses on and responds to very few representative immunodominant epitopes from pathogenic insults. However, due to the complexity of the antigen processing, understanding the parameters that lead to immunodominance has proved difficult. In an attempt to uncover the determinants of immunodominance among several dominant epitopes, we utilized a cell free antigen processing system and allowed the system to identify the hierarchies among potential determinants. We then tested the results in vivo; in mice and in human. We report here, that immunodominance of known sequences in a given protein can change if two or more proteins are being processed and presented simultaneously. Surprisingly, we find that new spacer/tag sequences commonly added to proteins for purification purposes can distort the capture of the physiological immunodominant epitopes. We warn against adding tags and spacers to candidate vaccines, or recommend cleaving it off before using for vaccination. PMID:28422163
ScanRanker: Quality Assessment of Tandem Mass Spectra via Sequence Tagging

PubMed Central

Ma, Ze-Qiang; Chambers, Matthew C.; Ham, Amy-Joan L.; Cheek, Kristin L.; Whitwell, Corbin W.; Aerni, Hans-Rudolf; Schilling, Birgit; Miller, Aaron W.; Caprioli, Richard M.; Tabb, David L.

2011-01-01

In shotgun proteomics, protein identification by tandem mass spectrometry relies on bioinformatics tools. Despite recent improvements in identification algorithms, a significant number of high quality spectra remain unidentified for various reasons. Here we present ScanRanker, an open-source tool that evaluates the quality of tandem mass spectra via sequence tagging with reliable performance in data from different instruments. The superior performance of ScanRanker enables it not only to find unassigned high quality spectra that evade identification through database search, but also to select spectra for de novo sequencing and cross-linking analysis. In addition, we demonstrate that the distribution of ScanRanker scores predicts the richness of identifiable spectra among multiple LC-MS/MS runs in an experiment, and ScanRanker scores assist the process of peptide assignment validation to increase confident spectrum identifications. The source code and executable versions of ScanRanker are available from http://fenchurch.mc.vanderbilt.edu. PMID:21520941
Humanized Antibodies for Antiviral Therapy

NASA Astrophysics Data System (ADS)

Co, Man Sung; Deschamps, Marguerite; Whitley, Richard J.; Queen, Cary

1991-04-01

Antibody therapy holds great promise for the treatment of cancer, autoimmune disorders, and viral infections. Murine monoclonal antibodies are relatively easy to produce but are severely restricted for therapeutic use by their immunogenicity in humans. Production of human monoclonal antibodies has been problematic. Humanized antibodies can be generated by introducing the six hypervariable regions from the heavy and light chains of a murine antibody into a human framework sequence and combining it with human constant regions. We humanized, with the aid of computer modeling, two murine monoclonal antibodies against herpes simplex virus gB and gD glycoproteins. The binding, virus neutralization, and cell protection results all indicate that both humanized antibodies have retained the binding activities and the biological properties of the murine monoclonal antibodies.
Analysis of expressed sequence tags from Prunus mume flower and fruit and development of simple sequence repeat markers

PubMed Central

2010-01-01

Background Expressed Sequence Tag (EST) has been a cost-effective tool in molecular biology and represents an abundant valuable resource for genome annotation, gene expression, and comparative genomics in plants. Results In this study, we constructed a cDNA library of Prunus mume flower and fruit, sequenced 10,123 clones of the library, and obtained 8,656 expressed sequence tag (EST) sequences with high quality. The ESTs were assembled into 4,473 unigenes composed of 1,492 contigs and 2,981 singletons and that have been deposited in NCBI (accession IDs: GW868575 - GW873047), among which 1,294 unique ESTs were with known or putative functions. Furthermore, we found 1,233 putative simple sequence repeats (SSRs) in the P. mume unigene dataset. We randomly tested 42 pairs of PCR primers flanking potential SSRs, and 14 pairs were identified as true-to-type SSR loci and could amplify polymorphic bands from 20 individual plants of P. mume. We further used the 14 EST-SSR primer pairs to test the transferability on peach and plum. The result showed that nearly 89% of the primer pairs produced target PCR bands in the two species. A high level of marker polymorphism was observed in the plum species (65%) and low in the peach (46%), and the clustering analysis of the three species indicated that these SSR markers were useful in the evaluation of genetic relationships and diversity between and within the Prunus species. Conclusions We have constructed the first cDNA library of P. mume flower and fruit, and our data provide sets of molecular biology resources for P. mume and other Prunus species. These resources will be useful for further study such as genome annotation, new gene discovery, gene functional analysis, molecular breeding, evolution and comparative genomics between Prunus species. PMID:20626882
Analyses of Expressed Sequence Tags from Apple1

PubMed Central

Newcomb, Richard D.; Crowhurst, Ross N.; Gleave, Andrew P.; Rikkerink, Erik H.A.; Allan, Andrew C.; Beuning, Lesley L.; Bowen, Judith H.; Gera, Emma; Jamieson, Kim R.; Janssen, Bart J.; Laing, William A.; McArtney, Steve; Nain, Bhawana; Ross, Gavin S.; Snowden, Kimberley C.; Souleyre, Edwige J.F.; Walton, Eric F.; Yauk, Yar-Khing

2006-01-01

The domestic apple (Malus domestica; also known as Malus pumila Mill.) has become a model fruit crop in which to study commercial traits such as disease and pest resistance, grafting, and flavor and health compound biosynthesis. To speed the discovery of genes involved in these traits, develop markers to map genes, and breed new cultivars, we have produced a substantial expressed sequence tag collection from various tissues of apple, focusing on fruit tissues of the cultivar Royal Gala. Over 150,000 expressed sequence tags have been collected from 43 different cDNA libraries representing 34 different tissues and treatments. Clustering of these sequences results in a set of 42,938 nonredundant sequences comprising 17,460 tentative contigs and 25,478 singletons, together representing what we predict are approximately one-half the expressed genes from apple. Many potential molecular markers are abundant in the apple transcripts. Dinucleotide repeats are found in 4,018 nonredundant sequences, mainly in the 5′-untranslated region of the gene, with a bias toward one repeat type (containing AG, 88%) and against another (repeats containing CG, 0.1%). Trinucleotide repeats are most common in the predicted coding regions and do not show a similar degree of sequence bias in their representation. Bi-allelic single-nucleotide polymorphisms are highly abundant with one found, on average, every 706 bp of transcribed DNA. Predictions of the numbers of representatives from protein families indicate the presence of many genes involved in disease resistance and the biosynthesis of flavor and health-associated compounds. Comparisons of some of these gene families with Arabidopsis (Arabidopsis thaliana) suggest instances where there have been duplications in the lineages leading to apple of biosynthetic and regulatory genes that are expressed in fruit. This resource paves the way for a concerted functional genomics effort in this important temperate fruit crop. PMID:16531485
Sequencing degraded DNA from non-destructively sampled museum specimens for RAD-tagging and low-coverage shotgun phylogenetics.

PubMed

Tin, Mandy Man-Ying; Economo, Evan Philip; Mikheyev, Alexander Sergeyevich

2014-01-01

Ancient and archival DNA samples are valuable resources for the study of diverse historical processes. In particular, museum specimens provide access to biotas distant in time and space, and can provide insights into ecological and evolutionary changes over time. However, archival specimens are difficult to handle; they are often fragile and irreplaceable, and typically contain only short segments of denatured DNA. Here we present a set of tools for processing such samples for state-of-the-art genetic analysis. First, we report a protocol for minimally destructive DNA extraction of insect museum specimens, which produced sequenceable DNA from all of the samples assayed. The 11 specimens analyzed had fragmented DNA, rarely exceeding 100 bp in length, and could not be amplified by conventional PCR targeting the mitochondrial cytochrome oxidase I gene. Our approach made these samples amenable to analysis with commonly used next-generation sequencing-based molecular analytic tools, including RAD-tagging and shotgun genome re-sequencing. First, we used museum ant specimens from three species, each with its own reference genome, for RAD-tag mapping. Were able to use the degraded DNA sequences, which were sequenced in full, to identify duplicate reads and filter them prior to base calling. Second, we re-sequenced six Hawaiian Drosophila species, with millions of years of divergence, but with only a single available reference genome. Despite a shallow coverage of 0.37 ± 0.42 per base, we could recover a sufficient number of overlapping SNPs to fully resolve the species tree, which was consistent with earlier karyotypic studies, and previous molecular studies, at least in the regions of the tree that these studies could resolve. Although developed for use with degraded DNA, all of these techniques are readily applicable to more recent tissue, and are suitable for liquid handling automation.
The bovine lactation genome: Insights into the evolution of mammalian milk

USDA-ARS?s Scientific Manuscript database

The newly assembled Bos Taurus genome sequence enables the linkage of bovine milk and lactation data with other mammalian genomes. Using publicly available milk proteome data and mammary expressed sequence tags, 197 milk protein genes and over 6,000 mammary genes were identified in the bovine genome...
Differential transferability of EST-SSR primers developed from diploid species Pseudoroegneria spicata, Thinopyrum bessarabicum, and Th. elongatum

USDA-ARS?s Scientific Manuscript database

Simple sequence repeat technology based on expressed sequence tag (EST-SSR) is a useful genomic tool for genome mapping, characterizing plant species relationships, elucidating genome evolution, and tracing genes on alien chromosome segments. EST-SSR primers developed from three perennial diploid T...
Characteristics of the Lotus japonicus gene repertoire deduced from large-scale expressed sequence tag (EST) analysis.

PubMed

Asamizu, Erika; Nakamura, Yasukazu; Sato, Shusei; Tabata, Satoshi

2004-02-01

To perform a comprehensive analysis of genes expressed in a model legume, Lotus japonicus, a total of 74472 3'-end expressed sequence tags (EST) were generated from cDNA libraries produced from six different organs. Clustering of sequences was performed with an identity criterion of 95% for 50 bases, and a total of 20457 non-redundant sequences, 8503 contigs and 11954 singletons were generated. EST sequence coverage was analyzed by using the annotated L. japonicus genomic sequence and 1093 of the 1889 predicted protein-encoding genes (57.9%) were hit by the EST sequence(s). Gene content was compared to several plant species. Among the 8503 contigs, 471 were identified as sequences conserved only in leguminous species and these included several disease resistance-related genes. This suggested that in legumes, these genes may have evolved specifically to resist pathogen attack. The rate of gene sequence divergence was assessed by comparing similarity level and functional category based on the Gene Ontology (GO) annotation of Arabidopsis genes. This revealed that genes encoding ribosomal proteins, as well as those related to translation, photosynthesis, and cellular structure were more abundantly represented in the highly conserved class, and that genes encoding transcription factors and receptor protein kinases were abundantly represented in the less conserved class. To make the sequence information and the cDNA clones available to the research community, a Web database with useful services was created at http://www.kazusa.or.jp/en/plant/lotus/EST/.
Programmable polyproteams built using twin peptide superglues

PubMed Central

Veggiani, Gianluca; Nakamura, Tomohiko; Brenner, Michael D.; Yan, Jun; Robinson, Carol V.; Howarth, Mark

2016-01-01

Programmed connection of amino acids or nucleotides into chains introduced a revolution in control of biological function. Reacting proteins together is more complex because of the number of reactive groups and delicate stability. Here we achieved sequence-programmed irreversible connection of protein units, forming polyprotein teams by sequential amidation and transamidation. SpyTag peptide is engineered to spontaneously form an isopeptide bond with SpyCatcher protein. By engineering the adhesin RrgA from Streptococcus pneumoniae, we developed the peptide SnoopTag, which formed a spontaneous isopeptide bond to its protein partner SnoopCatcher with >99% yield and no cross-reaction to SpyTag/SpyCatcher. Solid-phase attachment followed by sequential SpyTag or SnoopTag reaction between building-blocks enabled iterative extension. Linear, branched, and combinatorial polyproteins were synthesized, identifying optimal combinations of ligands against death receptors and growth factor receptors for cancer cell death signal activation. This simple and modular route to programmable “polyproteams” should enable exploration of a new area of biological space. PMID:26787909
Programmable polyproteams built using twin peptide superglues.

PubMed

Veggiani, Gianluca; Nakamura, Tomohiko; Brenner, Michael D; Gayet, Raphaël V; Yan, Jun; Robinson, Carol V; Howarth, Mark

2016-02-02

Programmed connection of amino acids or nucleotides into chains introduced a revolution in control of biological function. Reacting proteins together is more complex because of the number of reactive groups and delicate stability. Here we achieved sequence-programmed irreversible connection of protein units, forming polyprotein teams by sequential amidation and transamidation. SpyTag peptide is engineered to spontaneously form an isopeptide bond with SpyCatcher protein. By engineering the adhesin RrgA from Streptococcus pneumoniae, we developed the peptide SnoopTag, which formed a spontaneous isopeptide bond to its protein partner SnoopCatcher with >99% yield and no cross-reaction to SpyTag/SpyCatcher. Solid-phase attachment followed by sequential SpyTag or SnoopTag reaction between building-blocks enabled iterative extension. Linear, branched, and combinatorial polyproteins were synthesized, identifying optimal combinations of ligands against death receptors and growth factor receptors for cancer cell death signal activation. This simple and modular route to programmable "polyproteams" should enable exploration of a new area of biological space.
Mitochondrial sequence analysis for forensic identification using pyrosequencing technology.

PubMed

Andréasson, H; Asp, A; Alderborn, A; Gyllensten, U; Allen, M

2002-01-01

Over recent years, requests for mtDNA analysis in the field of forensic medicine have notably increased, and the results of such analyses have proved to be very useful in forensic cases where nuclear DNA analysis cannot be performed. Traditionally, mtDNA has been analyzed by DNA sequencing of the two hypervariable regions, HVI and HVII, in the D-loop. DNA sequence analysis using the conventional Sanger sequencing is very robust but time consuming and labor intensive. By contrast, mtDNA analysis based on the pyrosequencing technology provides fast and accurate results from the human mtDNA present in many types of evidence materials in forensic casework. The assay has been developed to determine polymorphic sites in the mitochondrial D-loop as well as the coding region to further increase the discrimination power of mtDNA analysis. The pyrosequencing technology for analysis of mtDNA polymorphisms has been tested with regard to sensitivity, reproducibility, and success rate when applied to control samples and actual casework materials. The results show that the method is very accurate and sensitive; the results are easily interpreted and provide a high success rate on casework samples. The panel of pyrosequencing reactions for the mtDNA polymorphisms were chosen to result in an optimal discrimination power in relation to the number of bases determined.
Classification of European Mtdnas from an Analysis of Three European Populations

PubMed Central

Torroni, A.; Huoponen, K.; Francalacci, P.; Petrozzi, M.; Morelli, L.; Scozzari, R.; Obinu, D.; Savontaus, M. L.; Wallace, D. C.

1996-01-01

Mitochondrial DNA (mtDNA) sequence variation was examined in Finns, Swedes and Tuscans by PCR amplification and restriction analysis. About 99% of the mtDNAs were subsumed within 10 mtDNA haplogroups (H, I, J, K, M, T, U, V, W, and X) suggesting that the identified haplogroups could encompass virtually all European mtDNAs. Because both hypervariable segments of the mtDNA control region were previously sequenced in the Tuscan samples, the mtDNA haplogroups and control region sequences could be compared. Using a combination of haplogroup-specific restriction site changes and control region nucleotide substitutions, the distribution of the haplogroups was surveyed through the published restriction site polymorphism and control region sequence data of Caucasoids. This supported the conclusion that most haplogroups observed in Europe are Caucasoid-specific, and that at least some of them occur at varying frequencies in different Caucasoid populations. The classification of almost all European mtDNA variation in a number of well defined haplogroups could provide additional insights about the origin and relationships of Caucasoid populations and the process of human colonization of Europe, and is valuable for the definition of the role played by mtDNA backgrounds in the expression of pathological mtDNA mutations PMID:8978068
Antimicrobial Peptides from Plants

PubMed Central

Tam, James P.; Wang, Shujing; Wong, Ka H.; Tan, Wei Liang

2015-01-01

Plant antimicrobial peptides (AMPs) have evolved differently from AMPs from other life forms. They are generally rich in cysteine residues which form multiple disulfides. In turn, the disulfides cross-braced plant AMPs as cystine-rich peptides to confer them with extraordinary high chemical, thermal and proteolytic stability. The cystine-rich or commonly known as cysteine-rich peptides (CRPs) of plant AMPs are classified into families based on their sequence similarity, cysteine motifs that determine their distinctive disulfide bond patterns and tertiary structure fold. Cystine-rich plant AMP families include thionins, defensins, hevein-like peptides, knottin-type peptides (linear and cyclic), lipid transfer proteins, α-hairpinin and snakins family. In addition, there are AMPs which are rich in other amino acids. The ability of plant AMPs to organize into specific families with conserved structural folds that enable sequence variation of non-Cys residues encased in the same scaffold within a particular family to play multiple functions. Furthermore, the ability of plant AMPs to tolerate hypervariable sequences using a conserved scaffold provides diversity to recognize different targets by varying the sequence of the non-cysteine residues. These properties bode well for developing plant AMPs as potential therapeutics and for protection of crops through transgenic methods. This review provides an overview of the major families of plant AMPs, including their structures, functions, and putative mechanisms. PMID:26580629
Molecular characterization of allelic variants of (GATA)n microsatellite loci in parthenogenetic lizards Darevskia unisexualis (Lacertidae).

PubMed

Korchagin, V I; Badaeva, T N; Tokarskaya, O N; Martirosyan, I A; Darevsky, I S; Ryskov, A P

2007-05-01

Populations of parthenogenetic lizards of the genus Darevskia consist of genetically identical animals, and represent a unique model for studying the molecular mechanisms underlying the variability and evolution of hypervariable DNA repeats. As unisexual lineages, parthenogenetic lizards are characterized by some level of genetic diversity at microsatellite loci. We cloned and sequenced a number of (GATA)n microsatellite loci of Darevskia unisexualis. PCR products from these loci were also sequenced and the degree of intraspecific polymorphism was assessed. Among the five (GATA)n loci analysed, two (Du215 and Du281) were polymorphic. Cross-species analysis of Du215 and Du281 indicate that the priming sites at the D. unisexualis loci are conserved in the bisexual parental species, D. raddei and D. valentini. Sequencing the PCR products amplified from Du215 and Du281 and from monomorphic Du323 showed that allelic differences at the polymorphic loci are caused by microsatellite mutations and by point mutations in the flanking regions. The haplotypes identified among the allelic variants of Du281 and among its orthologues in the parental species provide new evidence of the cross-species origin of D. unisexualis. To our knowledge, these data are the first to characterize the nucleotide sequences of allelic variants at microsatellite loci within parthenogenetic vertebrate animals.

High Rhodotorula sequences in skin transcriptome of patients with diffuse systemic sclerosis

PubMed Central

Arron, Sarah T.; Dimon, Michelle T.; Li, Zhenghui; Johnson, Michael E.; Wood, Tammara; Feeney, Luzviminda; Angeles, Jorge Gil; Lafyatis, Robert; Whitfield, Michael L.

2014-01-01

Previous studies have suggested a role for pathogens as a trigger of systemic sclerosis (SSc), though neither a pathogen nor a mechanism of pathogenesis is known. Here we show enrichment of Rhodotorula sequences in the skin of patients with early, diffuse SSc compared to normal controls. RNA-seq was performed on four SSc and four controls, to a depth of 200 million reads per patient. Data were analyzed to quantify the non-human sequence reads in each sample. We found little difference between bacterial microbiome and viral read counts, but found a significant difference between the read counts for a mycobiome component, R. glutinis. Normal samples contained almost no detected R. glutinis or other Rhodotorula sequence reads (mean score 0.021 for R. glutinis, 0.024 for all Rhodotorula). In contrast, SSc samples had a mean score of 5.039 for R. glutinis (5.232 for Rhodotorula). We were able to assemble the D1–D2 hypervariable region of the 28S rRNA of R. glutinis from each of the SSc samples. Taken together, these results suggest R. glutinis may be present in the skin of early SSc patients at higher levels than normal skin, raising the possibility that it may be triggering the inflammatory response found in SSc. PMID:24608988
High Rhodotorula sequences in skin transcriptome of patients with diffuse systemic sclerosis.

PubMed

Arron, Sarah T; Dimon, Michelle T; Li, Zhenghui; Johnson, Michael E; Wood, Tammara A; Feeney, Luzviminda; Angeles, Jorge G; Lafyatis, Robert; Whitfield, Michael L

2014-08-01

Previous studies have suggested a role for pathogens as a trigger of systemic sclerosis (SSc), although neither a pathogen nor a mechanism of pathogenesis is known. Here we show enrichment of Rhodotorula sequences in the skin of patients with early, diffuse SSc compared with that in normal controls. RNA-seq was performed on four SSc patients and four controls, to a depth of 200 million reads per patient. Data were analyzed to quantify the nonhuman sequence reads in each sample. We found little difference between bacterial microbiome and viral read counts, but found a significant difference between the read counts for a mycobiome component, R. glutinis. Normal samples contained almost no detected R. glutinis or other Rhodotorula sequence reads (mean score 0.021 for R. glutinis, 0.024 for all Rhodotorula). In contrast, SSc samples had a mean score of 5.039 for R. glutinis (5.232 for Rhodotorula). We were able to assemble the D1-D2 hypervariable region of the 28S ribosomal RNA (rRNA) of R. glutinis from each of the SSc samples. Taken together, these results suggest that R. glutinis may be present in the skin of early SSc patients at higher levels than in normal skin, raising the possibility that it may be triggering the inflammatory response found in SSc.
Intrinsic challenges in ancient microbiome reconstruction using 16S rRNA gene amplification.

PubMed

Ziesemer, Kirsten A; Mann, Allison E; Sankaranarayanan, Krithivasan; Schroeder, Hannes; Ozga, Andrew T; Brandt, Bernd W; Zaura, Egija; Waters-Rist, Andrea; Hoogland, Menno; Salazar-García, Domingo C; Aldenderfer, Mark; Speller, Camilla; Hendy, Jessica; Weston, Darlene A; MacDonald, Sandy J; Thomas, Gavin H; Collins, Matthew J; Lewis, Cecil M; Hofman, Corinne; Warinner, Christina

2015-11-13

To date, characterization of ancient oral (dental calculus) and gut (coprolite) microbiota has been primarily accomplished through a metataxonomic approach involving targeted amplification of one or more variable regions in the 16S rRNA gene. Specifically, the V3 region (E. coli 341-534) of this gene has been suggested as an excellent candidate for ancient DNA amplification and microbial community reconstruction. However, in practice this metataxonomic approach often produces highly skewed taxonomic frequency data. In this study, we use non-targeted (shotgun metagenomics) sequencing methods to better understand skewed microbial profiles observed in four ancient dental calculus specimens previously analyzed by amplicon sequencing. Through comparisons of microbial taxonomic counts from paired amplicon (V3 U341F/534R) and shotgun sequencing datasets, we demonstrate that extensive length polymorphisms in the V3 region are a consistent and major cause of differential amplification leading to taxonomic bias in ancient microbiome reconstructions based on amplicon sequencing. We conclude that systematic amplification bias confounds attempts to accurately reconstruct microbiome taxonomic profiles from 16S rRNA V3 amplicon data generated using universal primers. Because in silico analysis indicates that alternative 16S rRNA hypervariable regions will present similar challenges, we advocate for the use of a shotgun metagenomics approach in ancient microbiome reconstructions.
Intrinsic challenges in ancient microbiome reconstruction using 16S rRNA gene amplification

PubMed Central

Ziesemer, Kirsten A.; Mann, Allison E.; Sankaranarayanan, Krithivasan; Schroeder, Hannes; Ozga, Andrew T.; Brandt, Bernd W.; Zaura, Egija; Waters-Rist, Andrea; Hoogland, Menno; Salazar-García, Domingo C.; Aldenderfer, Mark; Speller, Camilla; Hendy, Jessica; Weston, Darlene A.; MacDonald, Sandy J.; Thomas, Gavin H.; Collins, Matthew J.; Lewis, Cecil M.; Hofman, Corinne; Warinner, Christina

2015-01-01

To date, characterization of ancient oral (dental calculus) and gut (coprolite) microbiota has been primarily accomplished through a metataxonomic approach involving targeted amplification of one or more variable regions in the 16S rRNA gene. Specifically, the V3 region (E. coli 341–534) of this gene has been suggested as an excellent candidate for ancient DNA amplification and microbial community reconstruction. However, in practice this metataxonomic approach often produces highly skewed taxonomic frequency data. In this study, we use non-targeted (shotgun metagenomics) sequencing methods to better understand skewed microbial profiles observed in four ancient dental calculus specimens previously analyzed by amplicon sequencing. Through comparisons of microbial taxonomic counts from paired amplicon (V3 U341F/534R) and shotgun sequencing datasets, we demonstrate that extensive length polymorphisms in the V3 region are a consistent and major cause of differential amplification leading to taxonomic bias in ancient microbiome reconstructions based on amplicon sequencing. We conclude that systematic amplification bias confounds attempts to accurately reconstruct microbiome taxonomic profiles from 16S rRNA V3 amplicon data generated using universal primers. Because in silico analysis indicates that alternative 16S rRNA hypervariable regions will present similar challenges, we advocate for the use of a shotgun metagenomics approach in ancient microbiome reconstructions. PMID:26563586
The hypervariable domain of the mitochondrial control region in Atlantic spiny lobsters and its potential as a marker for investigating phylogeographic structuring.

PubMed

Diniz, Fabio M; Maclean, Norman; Ogawa, Masayoshi; Cintra, Israel H A; Bentzen, Paul

2005-01-01

Atlantic spiny lobsters support major fisheries in northeastern Brazilian waters and in the Caribbean Sea. To avoid reduction in diversity and elimination of distinct stocks, understanding their population dynamics, including structuring of populations and genetic diversity, is critical. We here explore the potential of using the hypervariable domain in the control region of the mitochondrial DNA as a genetic marker to characterize population subdivision in spiny lobsters, using Panulirus argus as the species model. The primers designed on the neighboring conserved genes have amplified the entire control region (approx. 780 bases) of P. argus and other closely related species. Average nucleotide and haplotype diversity within P. argus were found to be high, and population structuring was hypothesized. The data suggest a division of P. argus into genetically different phylogeographic groups. The hypervariable domain seems to be useful for determining genetic differentiation of geographically distinct stocks of P. argus and other Atlantic spiny lobsters.
The application of new software tools to quantitative protein profiling via isotope-coded affinity tag (ICAT) and tandem mass spectrometry: I. Statistically annotated datasets for peptide sequences and proteins identified via the application of ICAT and tandem mass spectrometry to proteins copurifying with T cell lipid rafts.

PubMed

von Haller, Priska D; Yi, Eugene; Donohoe, Samuel; Vaughn, Kelly; Keller, Andrew; Nesvizhskii, Alexey I; Eng, Jimmy; Li, Xiao-jun; Goodlett, David R; Aebersold, Ruedi; Watts, Julian D

2003-07-01

Lipid rafts were prepared according to standard protocols from Jurkat T cells stimulated via T cell receptor/CD28 cross-linking and from control (unstimulated) cells. Co-isolating proteins from the control and stimulated cell preparations were labeled with isotopically normal (d0) and heavy (d8) versions of the same isotope-coded affinity tag (ICAT) reagent, respectively. Samples were combined, proteolyzed, and resultant peptides fractionated via cation exchange chromatography. Cysteine-containing (ICAT-labeled) peptides were recovered via the biotin tag component of the ICAT reagents by avidin-affinity chromatography. On-line micro-capillary liquid chromatography tandem mass spectrometry was performed on both avidin-affinity (ICAT-labeled) and flow-through (unlabeled) fractions. Initial peptide sequence identification was by searching recorded tandem mass spectrometry spectra against a human sequence data base using SEQUEST software. New statistical data modeling algorithms were then applied to the SEQUEST search results. These allowed for discrimination between likely "correct" and "incorrect" peptide assignments, and from these the inferred proteins that they collectively represented, by calculating estimated probabilities that each peptide assignment and subsequent protein identification was a member of the "correct" population. For convenience, the resultant lists of peptide sequences assigned and the proteins to which they corresponded were filtered at an arbitrarily set cut-off of 0.5 (i.e. 50% likely to be "correct") and above and compiled into two separate datasets. In total, these data sets contained 7667 individual peptide identifications, which represented 2669 unique peptide sequences, corresponding to 685 proteins and related protein groups.
An accurate and efficient experimental approach for characterization of the complex oral microbiota.

PubMed

Zheng, Wei; Tsompana, Maria; Ruscitto, Angela; Sharma, Ashu; Genco, Robert; Sun, Yijun; Buck, Michael J

2015-10-05

Currently, taxonomic interrogation of microbiota is based on amplification of 16S rRNA gene sequences in clinical and scientific settings. Accurate evaluation of the microbiota depends heavily on the primers used, and genus/species resolution bias can arise with amplification of non-representative genomic regions. The latest Illumina MiSeq sequencing chemistry has extended the read length to 300 bp, enabling deep profiling of large number of samples in a single paired-end reaction at a fraction of the cost. An increasingly large number of researchers have adopted this technology for various microbiome studies targeting the 16S rRNA V3-V4 hypervariable region. To expand the applicability of this powerful platform for further descriptive and functional microbiome studies, we standardized and tested an efficient, reliable, and straightforward workflow for the amplification, library construction, and sequencing of the 16S V1-V3 hypervariable region using the new 2 × 300 MiSeq platform. Our analysis involved 11 subgingival plaque samples from diabetic and non-diabetic human subjects suffering from periodontitis. The efficiency and reliability of our experimental protocol was compared to 16S V3-V4 sequencing data from the same samples. Comparisons were based on measures of observed taxonomic richness and species evenness, along with Procrustes analyses using beta(β)-diversity distance metrics. As an experimental control, we also analyzed a total of eight technical replicates for the V1-V3 and V3-V4 regions from a synthetic community with known bacterial species operon counts. We show that our experimental protocol accurately measures true bacterial community composition. Procrustes analyses based on unweighted UniFrac β-diversity metrics depicted significant correlation between oral bacterial composition for the V1-V3 and V3-V4 regions. However, measures of phylotype richness were higher for the V1-V3 region, suggesting that V1-V3 offers a deeper assessment of population diversity and community ecology for the complex oral microbiota. This study provides researchers with valuable experimental evidence for the selection of appropriate 16S amplicons for future human oral microbiome studies. We expect that the tested 16S V1-V3 framework will be widely applicable to other types of microbiota, allowing robust, time-efficient, and inexpensive examination of thousands of samples for population, phylogenetic, and functional crossectional and longitutidal studies.
A new fungal large subunit ribosomal RNA primer for high throughput sequencing surveys

DOE Office of Scientific and Technical Information (OSTI.GOV)

Mueller, Rebecca C.; Gallegos-Graves, La Verne; Kuske, Cheryl R.

The inclusion of phylogenetic metrics in community ecology has provided insights into important ecological processes, particularly when combined with high-throughput sequencing methods; however, these approaches have not been widely used in studies of fungal communities relative to other microbial groups. Two obstacles have been considered: (1) the internal transcribed spacer (ITS) region has limited utility for constructing phylogenies and (2) most PCR primers that target the large subunit (LSU) ribosomal unit generate amplicons that exceed current limits of high-throughput sequencing platforms. We designed and tested a PCR primer (LR22R) to target approximately 300–400 bp region of the D2 hypervariable regionmore » of the fungal LSU for use with the Illumina MiSeq platform. Both in silico and empirical analyses showed that the LR22R–LR3 pair captured a broad range of fungal taxonomic groups with a small fraction of non-fungal groups. Phylogenetic placement of publically available LSU D2 sequences showed broad agreement with taxonomic classification. Comparisons of the LSU D2 and the ITS2 ribosomal regions from environmental samples and known communities showed similar discriminatory abilities of the two primer sets. Altogether, these findings show that the LR22R–LR3 primer pair has utility for phylogenetic analyses of fungal communities using high-throughput sequencing methods.« less
Limited Genetic Diversity Preceded Extinction of the Tasmanian Tiger

PubMed Central

Menzies, Brandon R.; Renfree, Marilyn B.; Heider, Thomas; Mayer, Frieder; Hildebrandt, Thomas B.; Pask, Andrew J.

2012-01-01

The Tasmanian tiger or thylacine was the largest carnivorous marsupial when Europeans first reached Australia. Sadly, the last known thylacine died in captivity in 1936. A recent analysis of the genome of the closely related and extant Tasmanian devil demonstrated limited genetic diversity between individuals. While a similar lack of diversity has been reported for the thylacine, this analysis was based on just two individuals. Here we report the sequencing of an additional 12 museum-archived specimens collected between 102 and 159 years ago. We examined a portion of the mitochondrial DNA hyper-variable control region and determined that all sequences were on average 99.5% identical at the nucleotide level. As a measure of accuracy we also sequenced mitochondrial DNA from a mother and two offspring. As expected, these samples were found to be 100% identical, validating our methods. We also used 454 sequencing to reconstruct 2.1 kilobases of the mitochondrial genome, which shared 99.91% identity with the two complete thylacine mitochondrial genomes published previously. Our thylacine genomic data also contained three highly divergent putative nuclear mitochondrial sequences, which grouped phylogenetically with the published thylacine mitochondrial homologs but contained 100-fold more polymorphisms than the conserved fragments. Together, our data suggest that the thylacine population in Tasmania had limited genetic diversity prior to its extinction, possibly as a result of their geographic isolation from mainland Australia approximately 10,000 years ago. PMID:22530022
A new fungal large subunit ribosomal RNA primer for high throughput sequencing surveys

DOE PAGES

Mueller, Rebecca C.; Gallegos-Graves, La Verne; Kuske, Cheryl R.

2015-12-09

The inclusion of phylogenetic metrics in community ecology has provided insights into important ecological processes, particularly when combined with high-throughput sequencing methods; however, these approaches have not been widely used in studies of fungal communities relative to other microbial groups. Two obstacles have been considered: (1) the internal transcribed spacer (ITS) region has limited utility for constructing phylogenies and (2) most PCR primers that target the large subunit (LSU) ribosomal unit generate amplicons that exceed current limits of high-throughput sequencing platforms. We designed and tested a PCR primer (LR22R) to target approximately 300–400 bp region of the D2 hypervariable regionmore » of the fungal LSU for use with the Illumina MiSeq platform. Both in silico and empirical analyses showed that the LR22R–LR3 pair captured a broad range of fungal taxonomic groups with a small fraction of non-fungal groups. Phylogenetic placement of publically available LSU D2 sequences showed broad agreement with taxonomic classification. Comparisons of the LSU D2 and the ITS2 ribosomal regions from environmental samples and known communities showed similar discriminatory abilities of the two primer sets. Altogether, these findings show that the LR22R–LR3 primer pair has utility for phylogenetic analyses of fungal communities using high-throughput sequencing methods.« less
Analysis of expressed sequence tags for Frankliniella occidentalis, the western flower thrips.

PubMed

Rotenberg, D; Whitfield, A E

2010-08-01

Thrips are members of the insect order Thysanoptera and Frankliniella occidentalis (the western flower thrips) is the most economically important pest within this order. F. occidentalis is both a direct pest of crops and an efficient vector of plant viruses, including Tomato spotted wilt virus (TSWV). Despite the world-wide importance of thrips in agriculture, there is little knowledge of the F. occidentalis genome or gene functions at this time. A normalized cDNA library was constructed from first instar thrips and 13 839 expressed sequence tags (ESTs) were obtained. Our EST data assembled into 894 contigs and 11 806 singletons (12 700 nonredundant sequences). We found that 31% of these sequences had significant similarity (E< or = 10(-10)) to protein sequences in the National Center for Biotechnology Information nonredundant (nr) protein database, and 25% were functionally annotated using Blast 2GO. We identified 74 sequences with putative homology to proteins associated with insect innate immunity. Sixteen sequences had significant similarity to proteins associated with small RNA-mediated gene silencing pathways (RNA interference; RNAi), including the antiviral pathway (short interfering RNA-mediated pathway). Our EST collection provides new sequence resources for characterizing gene functions in F. occidentalis and other thrips species with regards to vital biological processes, studying the mechanism of interactions with the viruses harboured and transmitted by the vector, and identifying new insect gene-centred targets for plant disease and insect control.
Survey of duckweed diversity in Lake Chao and total fatty acid, triacylglycerol, profiles of representative strains.

PubMed

Tang, J; Li, Y; Ma, J; Cheng, J J

2015-09-01

Lemnaceae (duckweeds) are widely distributed aquatic flowering plants. Their high growth rate, starch content and suitability for bioremediation make them potential feedstock for biofuels. However, few natural duckweed resources have been investigated in China, and there is no information about total fatty acid (TFA) and triacylglycerol (TAG) composition of duckweeds from China. Here, the genetic diversity of a natural duckweed population collected from Lake Chao, China, was investigated using multilocus sequence typing (MLST). The 54 strains were categorised into four species in four genera, representing 12 distinct sequence types. Strains representing Lemna aequinoctialis and Spirodela polyrhiza were predominant. Interestingly, a surprisingly high degree of genetic diversification within L. aequinoctialis was observed. The four duckweed species revealed a uniform fatty acid composition, with three fatty acids, palmitic acid, linoleic acid and linolenic acid, accounting for more than 80% of the TFA. The TFA in biomass varied among species, ranging from 1.05% (of dry weight, DW) for L. punctata and S. polyrhiza to 1.62% for Wolffia globosa. The four duckweed species contained similar TAG contents, 0.02% mg · DW(-1). The fatty acid profiles of TAG were different from those of TFA, and also varied among the four species. The survey investigated the genetic diversity of duckweeds from Lake Chao, and provides an initial insight into TFA and TAG of four duckweed species, indicating that intraspecific and interspecific variations exist in the content and composition of both TFA and TAG in comparison with other studies. © 2015 German Botanical Society and The Royal Botanical Society of the Netherlands.
In-plane "superresolution" MRI with phaseless sub-pixel encoding.

PubMed

Hennel, Franciszek; Tian, Rui; Engel, Maria; Pruessmann, Klaas P

2018-04-15

Acquisition of high-resolution imaging data using multiple excitations without the sensitivity to fluctuations of the transverse magnetization phase, which is a major problem of multi-shot MRI. The concept of superresolution MRI based on microscopic tagging is analyzed using an analogy with the optical method of structured illumination. Sinusoidal tagging is shown to provide subpixel resolution by mixing of neighboring spatial frequency (k-space) bands. It represents a phaseless modulation added on top of the standard Fourier encoding, which allows the phase fluctuations to be discarded at an intermediate reconstruction step. Improvements are proposed to correct for tag distortions due to magnetic field inhomogeneity and to avoid the propagation of Gibbs ringing from intermediate low-resolution images to the final image. The method was applied to diffusion-weighted EPI. Artifact-free superresolution images can be obtained despite a finite duration of the tagging sequence and related pattern distortions by a field map based phase correction of band-wise reconstructed images. The ringing effect present in the intermediate images can be suppressed by partial overlapping of the mixed k-space bands in combination with an adapted filter. High-resolution diffusion-weighted images of the human head were obtained with a three-shot EPI sequence despite motion-related phase fluctuations between the shots. Due to its phaseless character, tagging-based sub-pixel encoding is an alternative to k-space segmenting in the presence of unknown phase fluctuations, in particular those due to motion under strong diffusion gradients. Proposed improvements render the method practicable in realistic conditions. © 2018 International Society for Magnetic Resonance in Medicine.
hSAGEing: an improved SAGE-based software for identification of human tissue-specific or common tumor markers and suppressors.

PubMed

Yang, Cheng-Hong; Chuang, Li-Yeh; Shih, Tsung-Mu; Chang, Hsueh-Wei

2010-12-17

SAGE (serial analysis of gene expression) is a powerful method of analyzing gene expression for the entire transcriptome. There are currently many well-developed SAGE tools. However, the cross-comparison of different tissues is seldom addressed, thus limiting the identification of common- and tissue-specific tumor markers. To improve the SAGE mining methods, we propose a novel function for cross-tissue comparison of SAGE data by combining the mathematical set theory and logic with a unique "multi-pool method" that analyzes multiple pools of pair-wise case controls individually. When all the settings are in "inclusion", the common SAGE tag sequences are mined. When one tissue type is in "inclusion" and the other types of tissues are not in "inclusion", the selected tissue-specific SAGE tag sequences are generated. They are displayed in tags-per-million (TPM) and fold values, as well as visually displayed in four kinds of scales in a color gradient pattern. In the fold visualization display, the top scores of the SAGE tag sequences are provided, along with cluster plots. A user-defined matrix file is designed for cross-tissue comparison by selecting libraries from publically available databases or user-defined libraries. The hSAGEing tool provides a combination of friendly cross-tissue analysis and an interface for comparing SAGE libraries for the first time. Some up- or down-regulated genes with tissue-specific or common tumor markers and suppressors are identified computationally. The tool is useful and convenient for in silico cancer transcriptomic studies and is freely available at http://bio.kuas.edu.tw/hSAGEing.
Geographically widespread swordfish barcode stock identification: a case study of its application.

PubMed

Pappalardo, Anna Maria; Guarino, Francesca; Reina, Simona; Messina, Angela; De Pinto, Vito

2011-01-01

The swordfish (Xiphias gladius) is a cosmopolitan large pelagic fish inhabiting tempered and tropical waters and it is a target species for fisheries all around the world. The present study investigated the ability of COI barcoding to reliably identify swordfish and particularly specific stocks of this commercially important species. We applied the classical DNA barcoding technology, upon a 682 bp segment of COI, and compared swordfish sequences from different geographical sources (Atlantic, Indian Oceans and Mediterranean Sea). The sequences of the 5' hyper-variable fragment of the control region (5'dloop), were also used to validate the efficacy of COI as a stock-specific marker. This information was successfully applied to the discrimination of unknown samples from the market, detecting in some cases mislabeled seafood products. The NJ distance-based phenogram (K2P model) obtained with COI sequences allowed us to correlate the swordfish haplotypes to the different geographical stocks. Similar results were obtained with 5'dloop. Our preliminary data in swordfish Xiphias gladius confirm that Cytochrome Oxidase I can be proposed as an efficient species-specific marker that has also the potential to assign geographical provenance. This information might speed the samples analysis in commercial application of barcoding.
The minisatellite of the GPI/AMF/NLK/MF gene: interspecies conservation and transcriptional activity.

PubMed

Williams, R R; Hassan-Walker, A F; Lavender, F L; Morgan, M; Faik, P; Ragoussis, J

2001-05-16

Minisatellites are tandemly repeated DNA sequences found throughout the genomes of all eukaryotes. They are regions often prone to instability and hence hypervariability; thus repeat unit sequence is generally not conserved beyond closely related species. We have studied the minisatellite located in intron 9 of the human glucose phosphate isomerase (GPI) gene (also known as neuroleukin, autocrine motility factor, maturation and differentiation factor) and have found, by Zoo blotting coupled with PCR amplification and DNA sequencing, that similar repeat units are present in seven other species of mammal. There is also evidence for the presence of the minisatellite in chicken. The repeat unit does not appear to be present at any other locus in these genomes. Minisatellite DNA has been reported to be involved in recombination activity, control of gene expression of nearby gene(s) (both transcriptional and translational), whilst others form protein coding regions. The high level of conservation exhibited by the GPI minisatellite, coupled with the unique location, strongly suggests a functional role. Our results from transient and stable transfections using luciferase reporter constructs have shown that the GPI minisatellite region can act to increase transcription from the SV40 promoter, CMV promoter and the human GPI promoter.
Bacterial community comparisons by taxonomy-supervised analysis independent of sequence alignment and clustering

PubMed Central

Sul, Woo Jun; Cole, James R.; Jesus, Ederson da C.; Wang, Qiong; Farris, Ryan J.; Fish, Jordan A.; Tiedje, James M.

2011-01-01

High-throughput sequencing of 16S rRNA genes has increased our understanding of microbial community structure, but now even higher-throughput methods to the Illumina scale allow the creation of much larger datasets with more samples and orders-of-magnitude more sequences that swamp current analytic methods. We developed a method capable of handling these larger datasets on the basis of assignment of sequences into an existing taxonomy using a supervised learning approach (taxonomy-supervised analysis). We compared this method with a commonly used clustering approach based on sequence similarity (taxonomy-unsupervised analysis). We sampled 211 different bacterial communities from various habitats and obtained ∼1.3 million 16S rRNA sequences spanning the V4 hypervariable region by pyrosequencing. Both methodologies gave similar ecological conclusions in that β-diversity measures calculated by using these two types of matrices were significantly correlated to each other, as were the ordination configurations and hierarchical clustering dendrograms. In addition, our taxonomy-supervised analyses were also highly correlated with phylogenetic methods, such as UniFrac. The taxonomy-supervised analysis has the advantages that it is not limited by the exhaustive computation required for the alignment and clustering necessary for the taxonomy-unsupervised analysis, is more tolerant of sequencing errors, and allows comparisons when sequences are from different regions of the 16S rRNA gene. With the tremendous expansion in 16S rRNA data acquisition underway, the taxonomy-supervised approach offers the potential to provide more rapid and extensive community comparisons across habitats and samples. PMID:21873204
Generation of Recombinant Polioviruses Harboring RNA Affinity Tags in the 5′ and 3′ Noncoding Regions of Genomic RNAs

PubMed Central

Flather, Dylan; Cathcart, Andrea L.; Cruz, Casey; Baggs, Eric; Ngo, Tuan; Gershon, Paul D.; Semler, Bert L.

2016-01-01

Despite being intensely studied for more than 50 years, a complete understanding of the enterovirus replication cycle remains elusive. Specifically, only a handful of cellular proteins have been shown to be involved in the RNA replication cycle of these viruses. In an effort to isolate and identify additional cellular proteins that function in enteroviral RNA replication, we have generated multiple recombinant polioviruses containing RNA affinity tags within the 3′ or 5′ noncoding region of the genome. These recombinant viruses retained RNA affinity sequences within the genome while remaining viable and infectious over multiple passages in cell culture. Further characterization of these viruses demonstrated that viral protein production and growth kinetics were unchanged or only slightly altered relative to wild type poliovirus. However, attempts to isolate these genetically-tagged viral genomes from infected cells have been hindered by high levels of co-purification of nonspecific proteins and the limited matrix-binding efficiency of RNA affinity sequences. Regardless, these recombinant viruses represent a step toward more thorough characterization of enterovirus ribonucleoprotein complexes involved in RNA replication. PMID:26861382
New features of triacylglycerol biosynthetic pathways of peanut seeds in early developmental stages.

PubMed

Yu, Mingli; Liu, Fengzhen; Zhu, Weiwei; Sun, Meihong; Liu, Jiang; Li, Xinzheng

2015-11-01

The peanut (Arachis hypogaea L.) is one of the three most important oil crops in the world due to its high average oil content (50 %). To reveal the biosynthetic pathways of seed oil in the early developmental stages of peanut pods with the goal of improving the oil quality, we presented a method combining deep sequencing analysis of the peanut pod transcriptome and quantitative real-time PCR (RT-PCR) verification of seed oil-related genes. From the sequencing data, approximately 1500 lipid metabolism-associated Unigenes were identified. The RT-PCR results quantified the different expression patterns of these triacylglycerol (TAG) synthesis-related genes in the early developmental stages of peanut pods. Based on these results and analysis, we proposed a novel construct of the metabolic pathways involved in the biosynthesis of TAG, including the Kennedy pathway, acyl-CoA-independent pathway and proposed monoacylglycerol pathway. It showed that the biosynthetic pathways of TAG in the early developmental stages of peanut pods were much more complicated than a simple, unidirectional, linear pathway.
Micropreparative capillary gel electrophoresis of DNA: rapid expressed sequence tag library construction.

PubMed

Shi, Liang; Khandurina, Julia; Ronai, Zsolt; Li, Bi-Yu; Kwan, Wai King; Wang, Xun; Guttman, András

2003-01-01

A capillary gel electrophoresis based automated DNA fraction collection technique was developed to support a novel DNA fragment-pooling strategy for expressed sequence tag (EST) library construction. The cDNA population is first cleaved by BsaJ I and EcoR I restriction enzymes, and then subpooled by selective ligation with specific adapters followed by polymerase chain reaction (PCR) amplification and labeling. Combination of this cDNA fingerprinting method with high-resolution capillary gel electrophoresis separation and precise fractionation of individual cDNA transcript representatives avoids redundant fragment selection and concomitant repetitive sequencing of abundant transcripts. Using a computer-controlled capillary electrophoresis device the transcript representatives were separated by their size and fractions were automatically collected in every 30 s into 96-well plates. The high resolving power of the sieving matrix ensured sequencing grade separation of the DNA fragments (i.e., single-base resolution) and successful fraction collection. Performance and precision of the fraction collection procedure was validated by PCR amplification of the collected DNA fragments followed by capillary electrophoresis analysis for size and purity verification. The collected and PCR-amplified transcript representatives, ranging up to several hundred base pairs, were then sequenced to create an EST library.

Expressed sequence tags from the plant trypanosomatid Phytomonas serpens.

PubMed

Pappas, Georgios J; Benabdellah, Karim; Zingales, Bianca; González, Antonio

2005-08-01

We have generated 2190 expressed sequence tags (ESTs) from a cDNA library of the plant trypanosomatid Phytomonas serpens. Upon processing and clustering the set of 1893 accepted sequences was reduced to 697 clusters consisting of 452 singletons and 245 contigs. Functional categories were assigned based on BLAST searches against a database of the eukaryotic orthologous groups of proteins (KOG). Thirty six percent of the generated sequences showed no hits against the KOG database and 39.6% presented similarity to the KOG classes corresponding to translation, ribosomal structure and biogenesis. The most populated cluster contained 45 ESTs homologous to members of the glucose transporter family. This fact can be immediately correlated to the reported Phytomonas dependence on anaerobic glycolytic ATP production due to the lack of cytochrome-mediated respiratory chain. In this context, not only a number of enzymes of the glycolytic pathway were identified but also of the Krebs cycle as well as specific components of the respiratory chain. The data here reported, including a few hundred unique sequences and the description of tandemly repeated motifs and putative transcript stability motifs at untranslated mRNA ends, represent an initial approach to overcome the lack of information on the molecular biology of this organism.
Poly(A)-tag deep sequencing data processing to extract poly(A) sites.

PubMed

Wu, Xiaohui; Ji, Guoli; Li, Qingshun Quinn

2015-01-01

Polyadenylation [poly(A)] is an essential posttranscriptional processing step in the maturation of eukaryotic mRNA. The advent of next-generation sequencing (NGS) technology has offered feasible means to generate large-scale data and new opportunities for intensive study of polyadenylation, particularly deep sequencing of the transcriptome targeting the junction of 3'-UTR and the poly(A) tail of the transcript. To take advantage of this unprecedented amount of data, we present an automated workflow to identify polyadenylation sites by integrating NGS data cleaning, processing, mapping, normalizing, and clustering. In this pipeline, a series of Perl scripts are seamlessly integrated to iteratively map the single- or paired-end sequences to the reference genome. After mapping, the poly(A) tags (PATs) at the same genome coordinate are grouped into one cleavage site, and the internal priming artifacts removed. Then the ambiguous region is introduced to parse the genome annotation for cleavage site clustering. Finally, cleavage sites within a close range of 24 nucleotides and from different samples can be clustered into poly(A) clusters. This procedure could be used to identify thousands of reliable poly(A) clusters from millions of NGS sequences in different tissues or treatments.
An efficient tag derived from the common epitope of tospoviral NSs proteins for monitoring recombinant proteins expressed in both bacterial and plant systems.

PubMed

Cheng, Hao-Wen; Chen, Kuan-Chun; Raja, Joseph A J; Li, Jian-Xian; Yeh, Shyi-Dong

2013-04-15

NSscon (23 aa), a common epitope in the gene silencing suppressor NSs proteins of the members of the Watermelon silver mottle virus (WSMoV) serogroup, was previously identified. In this investigation, we expressed different green fluorescent protein (GFP)-fused deletions of NSscon in bacteria and reacted with NSscon monoclonal antibody (MAb). Our results indicated that the core 9 amino acids, "(109)KFTMHNQIF(117)", denoted as "nss", retain the reactivity of NSscon. In bacterial pET system, four different recombinant proteins labeled with nss, either at N- or C-extremes, were readily detectable without position effects, with sensitivity superior to that for the polyhistidine-tag. When the nss-tagged Zucchini yellow mosaic virus (ZYMV) helper component-protease (HC-Pro) and WSMoV nucleocapsid protein were transiently expressed by agroinfiltration in tobacco, they were readily detectable and the tag's possible efficacy for gene silencing suppression was not noticed. Co-immunoprecipitation of nss-tagged and non-tagged proteins expressed from bacteria confirmed the interaction of potyviral HC-Pro and coat protein. Thus, we conclude that this novel nss sequence is highly valuable for tagging recombinant proteins in both bacterial and plant expression systems. Copyright © 2013 Elsevier B.V. All rights reserved.
Identification and characterization of TCRgamma and TCRdelta chains in channel catfish, Ictalurus punctatus

USDA-ARS?s Scientific Manuscript database

Channel catfish, Ictalurus punctatus, T cell receptors (TCR) gamma and delta were identified by mining of expressed sequence tag databases and full length sequences were obtained by 5'-RACE and RT-PCR protocols. cDNAs for each of these TCR chains encode typical variable (V), (diversity; D), joining ...
Characterization of genic microsatellite markers derived from expressed sequence tags in Pacific abalone ( Haliotis discus hannai)

NASA Astrophysics Data System (ADS)

Li, Qi; Shu, Jing; Zhao, Cui; Liu, Shikai; Kong, Lingfeng; Zheng, Xiaodong

2010-01-01

Simple sequence repeat (SSR) markers were developed from the expressed sequence tags (ESTs) of Pacific abalone ( Haliotis discus hannai). Repeat motifs were found in 4.95% of the ESTs at a frequency of one repeat every 10.04 kb of EST sequences, after redundancy elimination. Seventeen polymorphic EST-SSRs were developed. The number of alleles per locus varied from 2-17, with an average of 6.8 alleles per locus. The expected and observed heterozygosities ranged from 0.159 to 0.928 and from 0.132 to 0.922, respectively. Twelve of the 17 loci (70.6%) were successfully amplified in H. diversicolor. Seventeen loci segregated in three families, with three showing the presence of null alleles (17.6%). The adequate level of variability and low frequency of null alleles observed in H. discus hannai, together with the high rate of transportability across Haliotis species, make this set of EST-SSR markers an important tool for comparative mapping, marker-assisted selection, and evolutionary studies, not only in the Pacific abalone, but also in related species.
Discovery and mapping of a new expressed sequence tag-single nucleotide polymorphism and simple sequence repeat panel for large-scale genetic studies and breeding of Theobroma cacao L.

PubMed Central

Allegre, Mathilde; Argout, Xavier; Boccara, Michel; Fouet, Olivier; Roguet, Yolande; Bérard, Aurélie; Thévenin, Jean Marc; Chauveau, Aurélie; Rivallan, Ronan; Clement, Didier; Courtois, Brigitte; Gramacho, Karina; Boland-Augé, Anne; Tahi, Mathias; Umaharan, Pathmanathan; Brunel, Dominique; Lanaud, Claire

2012-01-01

Theobroma cacao is an economically important tree of several tropical countries. Its genetic improvement is essential to provide protection against major diseases and improve chocolate quality. We discovered and mapped new expressed sequence tag-single nucleotide polymorphism (EST-SNP) and simple sequence repeat (SSR) markers and constructed a high-density genetic map. By screening 149 650 ESTs, 5246 SNPs were detected in silico, of which 1536 corresponded to genes with a putative function, while 851 had a clear polymorphic pattern across a collection of genetic resources. In addition, 409 new SSR markers were detected on the Criollo genome. Lastly, 681 new EST-SNPs and 163 new SSRs were added to the pre-existing 418 co-dominant markers to construct a large consensus genetic map. This high-density map and the set of new genetic markers identified in this study are a milestone in cocoa genomics and for marker-assisted breeding. The data are available at http://tropgenedb.cirad.fr. PMID:22210604
Cell-free translational screening of an expression sequence tag library of Clonorchis sinensis for novel antigen discovery.

PubMed

Kasi, Devi; Catherine, Christy; Lee, Seung-Won; Lee, Kyung-Ho; Kim, Yu Jung; Ro Lee, Myeong; Ju, Jung Won; Kim, Dong-Myung

2017-05-01

The rapidly evolving cloning and sequencing technologies have enabled understanding of genomic structure of parasite genomes, opening up new ways of combatting parasite-related diseases. To make the most of the exponentially accumulating genomic data, however, it is crucial to analyze the proteins encoded by these genomic sequences. In this study, we adopted an engineered cell-free protein synthesis system for large-scale expression screening of an expression sequence tag (EST) library of Clonorchis sinensis to identify potential antigens that can be used for diagnosis and treatment of clonorchiasis. To allow high-throughput expression and identification of individual genes comprising the library, a cell-free synthesis reaction was designed such that both the template DNA and the expressed proteins were co-immobilized on the same microbeads, leading to microbead-based linkage of the genotype and phenotype. This reaction configuration allowed streamlined expression, recovery, and analysis of proteins. This approach enabled us to identify 21 antigenic proteins. © 2017 American Institute of Chemical Engineers Biotechnol. Prog., 33:832-837, 2017. © 2017 American Institute of Chemical Engineers.
Sequence variation in the env gene of simian immunodeficiency virus recovered from immunized macaques is predominantly in the V1 region.

PubMed

Almond, N; Jenkins, A; Heath, A B; Kitchin, P

1993-05-01

Three cynomolgus macaques were immunized with recombinant envelope protein preparations derived from simian immunodeficiency virus (SIV). Although humoral and cellular responses were elicited by the immunization regime, all macaques became infected upon challenge with 10 MID50 of the 11/88 virus challenge stock of SIVmac251-32H. The polymerase chain reaction was used to amplify proviral SIV gp120 sequences present in the blood of both immunized and control macaques at 2 months post-infection. A comparison of the predominant sequences found in the region from V2 to V5 of gp120 failed to differentiate provirus recovered from either immunized or control animals. A detailed investigation of sequences obtained from the hypervariable V1 region identified a mixture of sequences in both immunized and control macaques. Some sequences were identical to those previously detected in the virus challenge stock, whereas others had not been detected previously. Phenogram analysis of the new V1 sequences found in immunized animals revealed that they were quite distinct from those from the virus challenge stock and that they included alterations to potential N-linked glycosylation sites. In contrast, new sequence variants recovered from the control animals were closely related to sequences from the virus challenge stock. The difference in diversity of new V1 sequences recovered from immunized and control macaques was highly significant (P < 0.001). Thus, the presence of pre-existing immune responses to SIV envelope protein is associated with greater genetic change in the V1 region of gp120. These data are discussed in relation to the epitopes of SIV gp120 that may confer protection from in vivo challenge.
PET-Tool: a software suite for comprehensive processing and managing of Paired-End diTag (PET) sequence data.

PubMed

Chiu, Kuo Ping; Wong, Chee-Hong; Chen, Qiongyu; Ariyaratne, Pramila; Ooi, Hong Sain; Wei, Chia-Lin; Sung, Wing-Kin Ken; Ruan, Yijun

2006-08-25

We recently developed the Paired End diTag (PET) strategy for efficient characterization of mammalian transcriptomes and genomes. The paired end nature of short PET sequences derived from long DNA fragments raised a new set of bioinformatics challenges, including how to extract PETs from raw sequence reads, and correctly yet efficiently map PETs to reference genome sequences. To accommodate and streamline data analysis of the large volume PET sequences generated from each PET experiment, an automated PET data process pipeline is desirable. We designed an integrated computation program package, PET-Tool, to automatically process PET sequences and map them to the genome sequences. The Tool was implemented as a web-based application composed of four modules: the Extractor module for PET extraction; the Examiner module for analytic evaluation of PET sequence quality; the Mapper module for locating PET sequences in the genome sequences; and the Project Manager module for data organization. The performance of PET-Tool was evaluated through the analyses of 2.7 million PET sequences. It was demonstrated that PET-Tool is accurate and efficient in extracting PET sequences and removing artifacts from large volume dataset. Using optimized mapping criteria, over 70% of quality PET sequences were mapped specifically to the genome sequences. With a 2.4 GHz LINUX machine, it takes approximately six hours to process one million PETs from extraction to mapping. The speed, accuracy, and comprehensiveness have proved that PET-Tool is an important and useful component in PET experiments, and can be extended to accommodate other related analyses of paired-end sequences. The Tool also provides user-friendly functions for data quality check and system for multi-layer data management.
Candidate Genes Expressed in Tolerant Common Wheat With Resistant to English Grain Aphid.

PubMed

Luo, Kun; Zhang, Gaisheng; Wang, Chunping; Ouellet, Thérèse; Wu, Jingjing; Zhu, Qidi; Zhao, Huiyan

2014-10-01

The English grain aphid, Sitobion avenae (F.) (Hemiptera: Aphididae), is a common worldwide pest of wheat (Triticum aestivum L.). The use of improved resistant cultivars by the farmers is the most effective and environmentally friendly method to control this aphid in the field. The winter wheat genotypes 98-10-35 and Amigo are resistant to S. avenae. To identify genes responsible for resistance to S. avenae in these genotypes, differential-display reverse transcription-polymerase chain reaction was used to identify the corresponding differentially expressed sequences in current study. Two backcross progenies were obtained by crossing the two resistant genotypes with the susceptible genotype 1376. Six potential expected-differential bands were sequenced. Lengths of the expressed sequence tags ranged from 128 to 532 bp. Although these expressed sequences were likely associated with S. avenae resistance, there was one expressed sequence tag located on 7DL chromosome, and its potential function may associate with the ability to maintain photosynthesis in wheat. That serves as an active way for tolerant common wheat with resistant to S. avenae. Cloning the full length of these sequences would help us thoroughly understand the mechanism of wheat resistance to S. avenae and be valuable for breeding cultivars with S. avenae resistance. © 2014 Entomological Society of America.
Profiling cellular protein complexes by proximity ligation with dual tag microarray readout.

PubMed

Hammond, Maria; Nong, Rachel Yuan; Ericsson, Olle; Pardali, Katerina; Landegren, Ulf

2012-01-01

Patterns of protein interactions provide important insights in basic biology, and their analysis plays an increasing role in drug development and diagnostics of disease. We have established a scalable technique to compare two biological samples for the levels of all pairwise interactions among a set of targeted protein molecules. The technique is a combination of the proximity ligation assay with readout via dual tag microarrays. In the proximity ligation assay protein identities are encoded as DNA sequences by attaching DNA oligonucleotides to antibodies directed against the proteins of interest. Upon binding by pairs of antibodies to proteins present in the same molecular complexes, ligation reactions give rise to reporter DNA molecules that contain the combined sequence information from the two DNA strands. The ligation reactions also serve to incorporate a sample barcode in the reporter molecules to allow for direct comparison between pairs of samples. The samples are evaluated using a dual tag microarray where information is decoded, revealing which pairs of tags that have become joined. As a proof-of-concept we demonstrate that this approach can be used to detect a set of five proteins and their pairwise interactions both in cellular lysates and in fixed tissue culture cells. This paper provides a general strategy to analyze the extent of any pairwise interactions in large sets of molecules by decoding reporter DNA strands that identify the interacting molecules.
Individual specific DNA fingerprints from a hypervariable region probe: alpha-globin 3'HVR.

PubMed

Fowler, S J; Gill, P; Werrett, D J; Higgs, D R

1988-06-01

A probe detecting a hypervariable region (HVR) 3' to the alpha globin locus on chromosome 16 has been used to produce DNA fingerprints. Segregation analysis has revealed multiple, randomly dispersed DNA fragments inherited in a Mendelian fashion with minimal allelism and linkage. The fingerprints are highly polymorphic (probability of chance association between random individuals much less than 10(-14]. The probe is, therefore, a powerful discriminating tool: it is envisaged that this probe will have forensic applications, including paternity cases, and will be informative in linkage analysis.
Electrostatically driven immobilization of peptides onto (Maleic anhydride-alt-methyl vinyl ether) copolymers in aqueous media.

PubMed

Ladavière, C; Lorenzo, C; Elaïssari, A; Mandrand, B; Delair, T

2000-01-01

The covalent immobilization of a model peptide onto the MAMVE copolymer, via the formation of amide bonds, occurred in moderate yields in aqueous conditions. The improvement of the grafting reaction was achieved by adding at the amino terminus of the model peptide a sequence (tag) of three positively charged amino acids, lysine or arginine, and by taking profit of electrostatic attractive interactions between the negatively charged copolymer and the tagged peptides. The arginine tag was more efficient than the lysine tag for enhancing the immobilization reaction, proving that the effect was due to an electrostic driving force. On the basis of these results, a tentative mechanism is discussed, and Scatchard plots pointed out two regimes of binding. With the first, at low polymer load (up to 50% of saturation for a lysine tag and 60-70% for an arginine tag), the binding occurred with a positive cooperative effect, the already bound peptide participating to the binding of others. A second one for higher coverages, for which the binding occurred with a negative cooperativity, and saturation was reached in the presence of a large excess of peptide.
Serial analysis of gene expression in the silkworm, Bombyx mori.

PubMed

Huang, Jianhua; Miao, Xuexia; Jin, Weirong; Couble, Pierre; Mita, Kasuei; Zhang, Yong; Liu, Wenbin; Zhuang, Leijun; Shen, Yan; Keime, Celine; Gandrillon, Olivier; Brouilly, Patrick; Briolay, Jerome; Zhao, Guoping; Huang, Yongping

2005-08-01

The silkworm Bombyx mori is one of the most economically important insects and serves as a model for Lepidoptera insects. We used serial analysis of gene expression (SAGE) to derive profiles of expressed genes during the developmental life cycle of the silkworm and to create a reference for understanding silkworm metamorphosis. We generated four SAGE libraries, one from each of the four developmental stages of the silkworm. In total we obtained 257,964 SAGE tags, of which 39,485 were unique tags. Sorted by copy number, 14.1% of the unique tags were detected at a median to high level (five or more copies), 24.2% at lower levels (two to four copies), and 61.7% as single copies. Using a basic local alignment search tool on the EST database, 35% of the tags matched known silkworm expressed sequence tags. SAGE demonstrated that a number of the genes were up- or down-regulated during the four developmental phases of the egg, larva, pupa, and adult. Furthermore, we found that the generation of longer cDNA fragments from SAGE tags constituted the most efficient method of gene identification, which facilitated the analysis of a large number of unknown genes.
De Novo Transcriptomic Analysis of an Oleaginous Microalga: Pathway Description and Gene Discovery for Production of Next-Generation Biofuels

PubMed Central

Wan, LingLin; Han, Juan; Sang, Min; Li, AiFen; Wu, Hong; Yin, ShunJi; Zhang, ChengWu

2012-01-01

Background Eustigmatos cf. polyphem is a yellow-green unicellular soil microalga belonging to the eustimatophyte with high biomass and considerable production of triacylglycerols (TAGs) for biofuels, which is thus referred to as an oleaginous microalga. The paucity of microalgae genome sequences, however, limits development of gene-based biofuel feedstock optimization studies. Here we describe the sequencing and de novo transcriptome assembly for a non-model microalgae species, E. cf. polyphem, and identify pathways and genes of importance related to biofuel production. Results We performed the de novo assembly of E. cf. polyphem transcriptome using Illumina paired-end sequencing technology. In a single run, we produced 29,199,432 sequencing reads corresponding to 2.33 Gb total nucleotides. These reads were assembled into 75,632 unigenes with a mean size of 503 bp and an N50 of 663 bp, ranging from 100 bp to >3,000 bp. Assembled unigenes were subjected to BLAST similarity searches and annotated with Gene Ontology (GO) and Kyoto Encyclopedia of Genes and Genomes (KEGG) orthology identifiers. These analyses identified the majority of carbohydrate, fatty acids, TAG and carotenoids biosynthesis and catabolism pathways in E. cf. polyphem. Conclusions Our data provides the construction of metabolic pathways involved in the biosynthesis and catabolism of carbohydrate, fatty acids, TAG and carotenoids in E. cf. polyphem and provides a foundation for the molecular genetics and functional genomics required to direct metabolic engineering efforts that seek to enhance the quantity and character of microalgae-based biofuel feedstock. PMID:22536352
Expressed sequence tag analysis of human RPE/choroid for the NEIBank Project: over 6000 non-redundant transcripts, novel genes and splice variants.

PubMed

Wistow, Graeme; Bernstein, Steven L; Wyatt, M Keith; Fariss, Robert N; Behal, Amita; Touchman, Jeffrey W; Bouffard, Gerald; Smith, Don; Peterson, Katherine

2002-06-15

The retinal pigment epithelium (RPE) and choroid comprise a functional unit of the eye that is essential to normal retinal health and function. Here we describe expressed sequence tag (EST) analysis of human RPE/choroid as part of a project for ocular bioinformatics. A cDNA library (cs) was made from human RPE/choroid and sequenced. Data were analyzed and assembled using the program GRIST (GRouping and Identification of Sequence Tags). Complete sequencing, Northern and Western blots, RH mapping, peptide antibody synthesis and immunofluorescence (IF) have been used to examine expression patterns and genome location for selected transcripts and proteins. Ten thousand individual sequence reads yield over 6300 unique gene clusters of which almost half have no matches with named genes. One of the most abundant transcripts is from a gene (named "alpha") that maps to the BBS1 region of chromosome 11. A number of tissue preferred transcripts are common to both RPE/choroid and iris. These include oculoglycan/opticin, for which an alternative splice form is detected in RPE/choroid, and "oculospanin" (Ocsp), a novel tetraspanin that maps to chromosome 17q. Antiserum to Ocsp detects expression in RPE, iris, ciliary body, and retinal ganglion cells by IF. A newly identified gene for a zinc-finger protein (TIRC) maps to 19q13.4. Variant transcripts of several genes were also detected. Most notably, the predominant form of Bestrophin represented in cs contains a longer open reading frame as a result of splice junction skipping. The unamplified cs library gives a view of the transcriptional repertoire of the adult RPE/choroid. A large number of potentially novel genes and splice forms and candidates for genetic diseases are revealed. Clones from this collection are being included in a large, nonredundant set for cDNA microarray construction.
Fluorescence turn-on detection of target sequence DNA based on silicon nanodot-mediated quenching.

PubMed

Zhang, Yanan; Ning, Xinping; Mao, Guobin; Ji, Xinghu; He, Zhike

2018-05-01

We have developed a new enzyme-free method for target sequence DNA detection based on the dynamic quenching of fluorescent silicon nanodots (SiNDs) toward Cy5-tagged DNA probe. Fascinatingly, the water-soluble SiNDs can quench the fluorescence of cyanine (Cy5) in Cy5-tagged DNA probe in homogeneous solution, and the fluorescence of Cy5-tagged DNA probe can be restored in the presence of target sequence DNA (the synthetic target miRNA-27a). Based on this phenomenon, a SiND-featured fluorescent sensor has been constructed for "turn-on" detection of the synthetic target miRNA-27a for the first time. This newly developed approach possesses the merits of low cost, simple design, and convenient operation since no enzymatic reaction, toxic reagents, or separation procedures are involved. The established method achieves a detection limit of 0.16 nM, and the relative standard deviation of this method is 9% (1 nM, n = 5). The linear range is 0.5-20 nM, and the recoveries in spiked human fluids are in the range of 90-122%. This protocol provides a new tactic in the development of the nonenzymic miRNA biosensors and opens a promising avenue for early diagnosis of miRNA-associated disease. Graphical abstract The SiND-based fluorescent sensor for detection of S-miR-27a.
Transcriptome-wide analysis of WRKY transcription factors in wheat and their leaf rust responsive expression profiling.

PubMed

Satapathy, Lopamudra; Singh, Dharmendra; Ranjan, Prashant; Kumar, Dhananjay; Kumar, Manish; Prabhu, Kumble Vinod; Mukhopadhyay, Kunal

2014-12-01

WRKY, a plant-specific transcription factor family, has important roles in pathogen defense, abiotic cues and phytohormone signaling, yet little is known about their roles and molecular mechanism of function in response to rust diseases in wheat. We identified 100 TaWRKY sequences using wheat Expressed Sequence Tag database of which 22 WRKY sequences were novel. Identified proteins were characterized based on their zinc finger motifs and phylogenetic analysis clustered them into six clades consisting of class IIc and class III WRKY proteins. Functional annotation revealed major functions in metabolic and cellular processes in control plants; whereas response to stimuli, signaling and defense in pathogen inoculated plants, their major molecular function being binding to DNA. Tag-based expression analysis of the identified genes revealed differential expression between mock and Puccinia triticina inoculated wheat near isogenic lines. Gene expression was also performed with six rust-related microarray experiments at Gene Expression Omnibus database. TaWRKY10, 15, 17 and 56 were common in both tag-based and microarray-based differential expression analysis and could be representing rust specific WRKY genes. The obtained results will bestow insight into the functional characterization of WRKY transcription factors responsive to leaf rust pathogenesis that can be used as candidate genes in molecular breeding programs to improve biotic stress tolerance in wheat.
Parallel tagged next-generation sequencing on pooled samples - a new approach for population genetics in ecology and conservation.

PubMed

Zavodna, Monika; Grueber, Catherine E; Gemmell, Neil J

2013-01-01

Next-generation sequencing (NGS) on pooled samples has already been broadly applied in human medical diagnostics and plant and animal breeding. However, thus far it has been only sparingly employed in ecology and conservation, where it may serve as a useful diagnostic tool for rapid assessment of species genetic diversity and structure at the population level. Here we undertake a comprehensive evaluation of the accuracy, practicality and limitations of parallel tagged amplicon NGS on pooled population samples for estimating species population diversity and structure. We obtained 16S and Cyt b data from 20 populations of Leiopelma hochstetteri, a frog species of conservation concern in New Zealand, using two approaches - parallel tagged NGS on pooled population samples and individual Sanger sequenced samples. Data from each approach were then used to estimate two standard population genetic parameters, nucleotide diversity (π) and population differentiation (FST), that enable population genetic inference in a species conservation context. We found a positive correlation between our two approaches for population genetic estimates, showing that the pooled population NGS approach is a reliable, rapid and appropriate method for population genetic inference in an ecological and conservation context. Our experimental design also allowed us to identify both the strengths and weaknesses of the pooled population NGS approach and outline some guidelines and suggestions that might be considered when planning future projects.
High-resolution melt analysis to identify and map sequence-tagged site anchor points onto linkage maps: a white lupin (Lupinus albus) map as an exemplar.

PubMed

Croxford, Adam E; Rogers, Tom; Caligari, Peter D S; Wilkinson, Michael J

2008-01-01

* The provision of sequence-tagged site (STS) anchor points allows meaningful comparisons between mapping studies but can be a time-consuming process for nonmodel species or orphan crops. * Here, the first use of high-resolution melt analysis (HRM) to generate STS markers for use in linkage mapping is described. This strategy is rapid and low-cost, and circumvents the need for labelled primers or amplicon fractionation. * Using white lupin (Lupinus albus, x = 25) as a case study, HRM analysis was applied to identify 91 polymorphic markers from expressed sequence tag (EST)-derived and genomic libraries. Of these, 77 generated STS anchor points in the first fully resolved linkage map of the species. The map also included 230 amplified fragment length polymorphisms (AFLP) loci, spanned 1916 cM (84.2% coverage) and divided into the expected 25 linkage groups. * Quantitative trait loci (QTL) analyses performed on the population revealed genomic regions associated with several traits, including the agronomically important time to flowering (tf), alkaloid synthesis and stem height (Ph). Use of HRM-STS markers also allowed us to make direct comparisons between our map and that of the related crop, Lupinus angustifolius, based on the conversion of RFLP, microsatellite and single nucleotide polymorphism (SNP) markers into HRM markers.

Construction and Cloning of Reporter-Tagged Replicon cDNA for an In Vitro Replication Study of Murine Norovirus-1 (MNV-1)

PubMed Central

Ahmad, Muhammad Khairi; Tabana, Yasser M; Ahmed, Mowaffaq Adam; Sandai, Doblin Anak; Mohamed, Rafeezul; Ismail, Ida Shazrina; Zulkiflie, Nurulisa; Yunus, Muhammad Amir

2017-01-01

Background A norovirus maintains its viability, infectivity and virulence by its ability to replicate. However, the biological mechanisms of the process remain to be explored. In this work, the NanoLuc™ Luciferase gene was used to develop a reporter-tagged replicon system to study norovirus replication. Methods The NanoLuc™ Luciferase reporter protein was engineered to be expressed as a fusion protein for MNV-1 minor capsid protein, VP2. The foot-and-mouth disease virus 2A (FMDV2A) sequence was inserted between the 3′end of the reporter gene and the VP2 start sequence to allow co-translational ‘cleavage’ of fusion proteins during intracellular transcript expression. Amplification of the fusion gene was performed using a series of standard and overlapping polymerase chain reactions. The resulting amplicon was then cloned into three readily available backbones of MNV-1 cDNA clones. Results Restriction enzyme analysis indicated that the NanoLucTM Luciferase gene was successfully inserted into the parental MNV-1 cDNA clone. The insertion was further confirmed by using DNA sequencing. Conclusion NanoLuc™ Luciferase-tagged MNV-1 cDNA clones were successfully engineered. Such clones can be exploited to develop robust experimental assays for in vitro assessments of viral RNA replication. PMID:29379384
A high-resolution whole genome radiation hybrid map of human chromosome 17q22-q25.3 across the genes for GH and TK

DOE Office of Scientific and Technical Information (OSTI.GOV)

Foster, J.W.; Schafer, A.J.; Critcher, R.

1996-04-15

We have constructed a whole genome radiation hybrid (WG-RH) map across a region of human chromosome 17q, from growth hormone (GH) to thymidine kinase (TK). A panel of 128 WG-RH hybrid cell lines generated by X-irradiation and fusion has been tested for the retention of 39 sequence-tagged site (STS) markers by the polymerase chain reaction. This genome mapping technique has allowed the integration of existing VNTR and microsatellite markers with additional new markers and existing STS markers previously mapped to this region by other means. The WG-RH map includes eight expressed sequence tag (EST) and three anonymous markers developed formore » this study, together with 23 anonymous microsatellites and five existing ESTs. Analysis of these data resulted in a high-density comprehensive map across this region of the genome. A subset of these markers has been used to produce a framework map consisting of 20 loci ordered with odds greater than 1000:1. The markers are of sufficient density to build a YAC contig across this region based on marker content. We have developed sequence tags for both ends of a 2.1-Mb YAC and mapped these using the WG-RH panel, allowing a direct comparison of cRay{sub 6000} to physical distance. 31 refs., 3 figs., 2 tabs.« less
Genome sequence determination and metagenomic characterization of a Dehalococcoides mixed culture grown on cis-1,2-dichloroethene.

PubMed

Yohda, Masafumi; Yagi, Osami; Takechi, Ayane; Kitajima, Mizuki; Matsuda, Hisashi; Miyamura, Naoaki; Aizawa, Tomoko; Nakajima, Mutsuyasu; Sunairi, Michio; Daiba, Akito; Miyajima, Takashi; Teruya, Morimi; Teruya, Kuniko; Shiroma, Akino; Shimoji, Makiko; Tamotsu, Hinako; Juan, Ayaka; Nakano, Kazuma; Aoyama, Misako; Terabayashi, Yasunobu; Satou, Kazuhito; Hirano, Takashi

2015-07-01

A Dehalococcoides-containing bacterial consortium that performed dechlorination of 0.20 mM cis-1,2-dichloroethene to ethene in 14 days was obtained from the sediment mud of the lotus field. To obtain detailed information of the consortium, the metagenome was analyzed using the short-read next-generation sequencer SOLiD 3. Matching the obtained sequence tags with the reference genome sequences indicated that the Dehalococcoides sp. in the consortium was highly homologous to Dehalococcoides mccartyi CBDB1 and BAV1. Sequence comparison with the reference sequence constructed from 16S rRNA gene sequences in a public database showed the presence of Sedimentibacter, Sulfurospirillum, Clostridium, Desulfovibrio, Parabacteroides, Alistipes, Eubacterium, Peptostreptococcus and Proteocatella in addition to Dehalococcoides sp. After further enrichment, the members of the consortium were narrowed down to almost three species. Finally, the full-length circular genome sequence of the Dehalococcoides sp. in the consortium, D. mccartyi IBARAKI, was determined by analyzing the metagenome with the single-molecule DNA sequencer PacBio RS. The accuracy of the sequence was confirmed by matching it to the tag sequences obtained by SOLiD 3. The genome is 1,451,062 nt and the number of CDS is 1566, which includes 3 rRNA genes and 47 tRNA genes. There exist twenty-eight RDase genes that are accompanied by the genes for anchor proteins. The genome exhibits significant sequence identity with other Dehalococcoides spp. throughout the genome, but there exists significant difference in the distribution RDase genes. The combination of a short-read next-generation DNA sequencer and a long-read single-molecule DNA sequencer gives detailed information of a bacterial consortium. Copyright © 2014 The Society for Biotechnology, Japan. Published by Elsevier B.V. All rights reserved.
The contribution of 700,000 ORF sequence tags to the definition of the human transcriptome

PubMed Central

Camargo, Anamaria A.; Samaia, Helena P. B.; Dias-Neto, Emmanuel; Simão, Daniel F.; Migotto, Italo A.; Briones, Marcelo R. S.; Costa, Fernando F.; Aparecida Nagai, Maria; Verjovski-Almeida, Sergio; Zago, Marco A.; Andrade, Luis Eduardo C.; Carrer, Helaine; El-Dorry, Hamza F. A.; Espreafico, Enilza M.; Habr-Gama, Angelita; Giannella-Neto, Daniel; Goldman, Gustavo H.; Gruber, Arthur; Hackel, Christine; Kimura, Edna T.; Maciel, Rui M. B.; Marie, Suely K. N.; Martins, Elizabeth A. L.; Nóbrega, Marina P.; Paçó-Larson, Maria Luisa; Pardini, Maria Inês M. C.; Pereira, Gonçalo G.; Pesquero, João Bosco; Rodrigues, Vanderlei; Rogatto, Silvia R.; da Silva, Ismael D. C. G.; Sogayar, Mari C.; Sonati, Maria de Fátima; Tajara, Eloiza H.; Valentini, Sandro R.; Alberto, Fernando L.; Amaral, Maria Elisabete J.; Aneas, Ivy; Arnaldi, Liliane A. T.; de Assis, Angela M.; Bengtson, Mário Henrique; Bergamo, Nadia Aparecida; Bombonato, Vanessa; de Camargo, Maria E. R.; Canevari, Renata A.; Carraro, Dirce M.; Cerutti, Janete M.; Corrêa, Maria Lucia C.; Corrêa, Rosana F. R.; Costa, Maria Cristina R.; Curcio, Cyntia; Hokama, Paula O. M.; Ferreira, Ari J. S.; Furuzawa, Gilberto K.; Gushiken, Tsieko; Ho, Paulo L.; Kimura, Elza; Krieger, José E.; Leite, Luciana C. C.; Majumder, Paromita; Marins, Mozart; Marques, Everaldo R.; Melo, Analy S. A.; Melo, Monica; Mestriner, Carlos Alberto; Miracca, Elisabete C.; Miranda, Daniela C.; Nascimento, Ana Lucia T. O.; Nóbrega, Francisco G.; Ojopi, Élida P. B.; Pandolfi, José Rodrigo C.; Pessoa, Luciana G.; Prevedel, Aline C.; Rahal, Paula; Rainho, Claudia A.; Reis, Eduardo M. R.; Ribeiro, Marcelo L.; da Rós, Nancy; de Sá, Renata G.; Sales, Magaly M.; Sant'anna, Simone Cristina; dos Santos, Mariana L.; da Silva, Aline M.; da Silva, Neusa P.; Silva, Wilson A.; da Silveira, Rosana A.; Sousa, Josane F.; Stecconi, Daniella; Tsukumo, Fernando; Valente, Valéria; Soares, Fernando; Moreira, Eloisa S.; Nunes, Diana N.; Correa, Ricardo G.; Zalcberg, Heloisa; Carvalho, Alex F.; Reis, Luis F. L.; Brentani, Ricardo R.; Simpson, Andrew J. G.; de Souza, Sandro J.

2001-01-01

Open reading frame expressed sequences tags (ORESTES) differ from conventional ESTs by providing sequence data from the central protein coding portion of transcripts. We generated a total of 696,745 ORESTES sequences from 24 human tissues and used a subset of the data that correspond to a set of 15,095 full-length mRNAs as a means of assessing the efficiency of the strategy and its potential contribution to the definition of the human transcriptome. We estimate that ORESTES sampled over 80% of all highly and moderately expressed, and between 40% and 50% of rarely expressed, human genes. In our most thoroughly sequenced tissue, the breast, the 130,000 ORESTES generated are derived from transcripts from an estimated 70% of all genes expressed in that tissue, with an equally efficient representation of both highly and poorly expressed genes. In this respect, we find that the capacity of the ORESTES strategy both for gene discovery and shotgun transcript sequence generation significantly exceeds that of conventional ESTs. The distribution of ORESTES is such that many human transcripts are now represented by a scaffold of partial sequences distributed along the length of each gene product. The experimental joining of the scaffold components, by reverse transcription–PCR, represents a direct route to transcript finishing that may represent a useful alternative to full-length cDNA cloning. PMID:11593022
The contribution of 700,000 ORF sequence tags to the definition of the human transcriptome.

PubMed

Camargo, A A; Samaia, H P; Dias-Neto, E; Simão, D F; Migotto, I A; Briones, M R; Costa, F F; Nagai, M A; Verjovski-Almeida, S; Zago, M A; Andrade, L E; Carrer, H; El-Dorry, H F; Espreafico, E M; Habr-Gama, A; Giannella-Neto, D; Goldman, G H; Gruber, A; Hackel, C; Kimura, E T; Maciel, R M; Marie, S K; Martins, E A; Nobrega, M P; Paco-Larson, M L; Pardini, M I; Pereira, G G; Pesquero, J B; Rodrigues, V; Rogatto, S R; da Silva, I D; Sogayar, M C; Sonati, M F; Tajara, E H; Valentini, S R; Alberto, F L; Amaral, M E; Aneas, I; Arnaldi, L A; de Assis, A M; Bengtson, M H; Bergamo, N A; Bombonato, V; de Camargo, M E; Canevari, R A; Carraro, D M; Cerutti, J M; Correa, M L; Correa, R F; Costa, M C; Curcio, C; Hokama, P O; Ferreira, A J; Furuzawa, G K; Gushiken, T; Ho, P L; Kimura, E; Krieger, J E; Leite, L C; Majumder, P; Marins, M; Marques, E R; Melo, A S; Melo, M B; Mestriner, C A; Miracca, E C; Miranda, D C; Nascimento, A L; Nobrega, F G; Ojopi, E P; Pandolfi, J R; Pessoa, L G; Prevedel, A C; Rahal, P; Rainho, C A; Reis, E M; Ribeiro, M L; da Ros, N; de Sa, R G; Sales, M M; Sant'anna, S C; dos Santos, M L; da Silva, A M; da Silva, N P; Silva, W A; da Silveira, R A; Sousa, J F; Stecconi, D; Tsukumo, F; Valente, V; Soares, F; Moreira, E S; Nunes, D N; Correa, R G; Zalcberg, H; Carvalho, A F; Reis, L F; Brentani, R R; Simpson, A J; de Souza, S J; Melo, M

2001-10-09

Open reading frame expressed sequences tags (ORESTES) differ from conventional ESTs by providing sequence data from the central protein coding portion of transcripts. We generated a total of 696,745 ORESTES sequences from 24 human tissues and used a subset of the data that correspond to a set of 15,095 full-length mRNAs as a means of assessing the efficiency of the strategy and its potential contribution to the definition of the human transcriptome. We estimate that ORESTES sampled over 80% of all highly and moderately expressed, and between 40% and 50% of rarely expressed, human genes. In our most thoroughly sequenced tissue, the breast, the 130,000 ORESTES generated are derived from transcripts from an estimated 70% of all genes expressed in that tissue, with an equally efficient representation of both highly and poorly expressed genes. In this respect, we find that the capacity of the ORESTES strategy both for gene discovery and shotgun transcript sequence generation significantly exceeds that of conventional ESTs. The distribution of ORESTES is such that many human transcripts are now represented by a scaffold of partial sequences distributed along the length of each gene product. The experimental joining of the scaffold components, by reverse transcription-PCR, represents a direct route to transcript finishing that may represent a useful alternative to full-length cDNA cloning.
Myocardial tagging by Cardiovascular Magnetic Resonance: evolution of techniques--pulse sequences, analysis algorithms, and applications

PubMed Central

2011-01-01

Cardiovascular magnetic resonance (CMR) tagging has been established as an essential technique for measuring regional myocardial function. It allows quantification of local intramyocardial motion measures, e.g. strain and strain rate. The invention of CMR tagging came in the late eighties, where the technique allowed for the first time for visualizing transmural myocardial movement without having to implant physical markers. This new idea opened the door for a series of developments and improvements that continue up to the present time. Different tagging techniques are currently available that are more extensive, improved, and sophisticated than they were twenty years ago. Each of these techniques has different versions for improved resolution, signal-to-noise ratio (SNR), scan time, anatomical coverage, three-dimensional capability, and image quality. The tagging techniques covered in this article can be broadly divided into two main categories: 1) Basic techniques, which include magnetization saturation, spatial modulation of magnetization (SPAMM), delay alternating with nutations for tailored excitation (DANTE), and complementary SPAMM (CSPAMM); and 2) Advanced techniques, which include harmonic phase (HARP), displacement encoding with stimulated echoes (DENSE), and strain encoding (SENC). Although most of these techniques were developed by separate groups and evolved from different backgrounds, they are in fact closely related to each other, and they can be interpreted from more than one perspective. Some of these techniques even followed parallel paths of developments, as illustrated in the article. As each technique has its own advantages, some efforts have been made to combine different techniques together for improved image quality or composite information acquisition. In this review, different developments in pulse sequences and related image processing techniques are described along with the necessities that led to their invention, which makes this article easy to read and the covered techniques easy to follow. Major studies that applied CMR tagging for studying myocardial mechanics are also summarized. Finally, the current article includes a plethora of ideas and techniques with over 300 references that motivate the reader to think about the future of CMR tagging. PMID:21798021
Quantitative Tracking of Salmonella Enteritidis Transmission Routes Using Barcode-Tagged Isogenic Strains in Chickens: Proof-of-Concept Study

PubMed Central

Yang, Yichao; Ricke, Steven C.; Tellez, Guillermo; Kwon, Young Min

2017-01-01

Salmonella is an important foodborne bacterial pathogen, however, a fundamental understanding on Salmonella transmission routes within a poultry flock remains unclear. In this study, a series of barcode-tagged strains were constructed by inserting six random nucleotides into a functionally neutral region on the chromosome of S. Enteritidis as a tool for quantitative tracking of Salmonella transmission in chickens. Six distinct barcode-tagged strains were used for infection or contamination at either low dose (103 CFUs; three strains) or high dose (105 CFUs; three strains) in three independent experiments (Experiment 1 oral gavage; Experiment 2 contaminated feed; Experiment 3 contaminated water). For all chick experiments, cecal and foot-wash samples were collected from a subset of the chickens at days 7 or/and 14, from which genomic DNA was extracted and used to amplify the barcode regions. After the resulting PCR amplicons were pooled and analyzed by MiSeq sequencing, a total of approximately 1.5 million reads containing the barcode sequences were analyzed to determine the relative frequency of every barcode-tagged strain in each sample. In Experiment 1, the high dose of oral infection was correlated with greater dominance of the strains in the ceca of the respective seeder chickens and also in the contact chickens yet at lesser degrees. When chicks were exposed to contaminated feed (Experiment 2) or water (Experiment 3), there were no clear patterns of the barcode-tagged strains in relation to the dosage, except that the strains introduced at low dose required a longer time to colonize the ceca with contaminated feed. Most foot-wash samples contained only one to three strains for the majority of the samples, suggesting potential existence of an unknown mechanism(s) for strain exclusion. These results demonstrated the proof of concept of using barcode tagged to investigate transmission dynamics of Salmonella in chickens in a quantitative manner. PMID:28261587
Highly Selective End-Tagged Antimicrobial Peptides Derived from PRELP

PubMed Central

Malmsten, Martin; Kasetty, Gopinath; Pasupuleti, Mukesh; Alenfall, Jan; Schmidtchen, Artur

2011-01-01

Background Antimicrobial peptides (AMPs) are receiving increasing attention due to resistance development against conventional antibiotics. Pseudomonas aeruginosa and Staphylococcus aureus are two major pathogens involved in an array of infections such as ocular infections, cystic fibrosis, wound and post-surgery infections, and sepsis. The goal of the study was to design novel AMPs against these pathogens. Methodology and Principal Findings Antibacterial activity was determined by radial diffusion, viable count, and minimal inhibitory concentration assays, while toxicity was evaluated by hemolysis and effects on human epithelial cells. Liposome and fluorescence studies provided mechanistic information. Protease sensitivity was evaluated after subjection to human leukocyte elastase, staphylococcal aureolysin and V8 proteinase, as well as P. aeruginosa elastase. Highly active peptides were evaluated in ex vivo skin infection models. C-terminal end-tagging by W and F amino acid residues increased antimicrobial potency of the peptide sequences GRRPRPRPRP and RRPRPRPRP, derived from proline arginine-rich and leucine-rich repeat protein (PRELP). The optimized peptides were antimicrobial against a range of Gram-positive S. aureus and Gram-negative P. aeruginosa clinical isolates, also in the presence of human plasma and blood. Simultaneously, they showed low toxicity against mammalian cells. Particularly W-tagged peptides displayed stability against P. aeruginosa elastase, and S. aureus V8 proteinase and aureolysin, and the peptide RRPRPRPRPWWWW-NH2 was effective against various “superbugs” including vancomycin-resistant enterococci, multi-drug resistant P. aeruginosa, and methicillin-resistant S. aureus, as well as demonstrated efficiency in an ex vivo skin wound model of S. aureus and P. aeruginosa infection. Conclusions/Significance Hydrophobic C-terminal end-tagging of the cationic sequence RRPRPRPRP generates highly selective AMPs with potent activity against multiresistant bacteria and efficiency in ex vivo wound infection models. A precise “tuning” of toxicity and proteolytic stability may be achieved by changing tag-length and adding W- or F-amino acid tags. PMID:21298015
Systematics of Cladophora spp. (Chlorophyta) from North Carolina, USA, based upon morphology and DNA sequence data with a description of Cladophora subtilissima sp. nov.

PubMed

Taylor, Robin L; Bailey, Jeffrey Craig; Freshwater, David Wilson

2017-06-01

Identification of Cladophora species is challenging due to conservation of gross morphology, few discrete autapomorphies, and environmental influences on morphology. Twelve species of marine Cladophora were reported from North Carolina waters. Cladophora specimens were collected from inshore and offshore marine waters for DNA sequence and morphological analyses. The nuclear-encoded rRNA internal transcribed spacer regions (ITS) were sequenced for 105 specimens and used in molecular assisted identification. The ITS1 and ITS2 region was highly variable, and sequences were sorted into ITS Sets of Alignable Sequences (SASs). Sequencing of short hyper-variable ITS1 sections from Cladophora type specimens was used to positively identify species represented by SASs when the types were made available. Secondary structures for the ITS1 locus were also predicted for each specimen and compared to predicted structures from Cladophora sequences available in GenBank. Nine ITS SASs were identified and representative specimens chosen for phylogenetic analyses of 18S and 28S rRNA gene sequences to reveal relationships with other Cladophora species. Phylogenetic analyses indicated that marine Cladophorales were polyphyletic and separated into two clades, the Cladophora clade and the "Siphonocladales" clade. Morphological analyses were performed to assess the consistency of character states within species, and complement the DNA sequence analyses. These analyses revealed intra- and interspecific character state variation, and that combined molecular and morphological analyses were required for the identification of species. One new report, Cladophora dotyana, and one new species Cladophora subtilissima sp. nov., were revealed, and increased the biodiversity of North Carolina marine Cladophora to 14 species. © 2017 Phycological Society of America.
Characterization by Suppression Subtractive Hybridization of Transcripts That Are Differentially Expressed in Leaves of Anthracnose-Resistant Ramie Cultivar.

PubMed

Xuxia, Wang; Jie, Chen; Bo, Wang; Lijun, Liu; Hui, Jiang; Diluo, Tang; Dingxiang, Peng

2012-01-01

For the purpose of screening putative anthracnose resistance-related genes of ramie ( Boehmeria nivea L. Gaud), a cDNA library was constructed by suppression subtractive hybridization using anthracnose-resistant cultivar Huazhu no. 4. The cDNAs from Huazhu no. 4, which were infected with Colletotrichum gloeosporioides , were used as the tester and cDNAs from uninfected Huazhu no. 4 as the driver. Sequencing analysis and homology searching showed that these clones represented 132 single genes, which were assigned to functional categories, including 14 putative cellular functions, according to categories established for Arabidopsis . These 132 genes included 35 disease resistance and stress tolerance-related genes including putative heat-shock protein 90, metallothionein, PR-1.2 protein, catalase gene, WRKY family genes, and proteinase inhibitor-like protein. Partial disease-related genes were further analyzed by reverse transcription PCR and RNA gel blot. These expressed sequence tags are the first anthracnose resistance-related expressed sequence tags reported in ramie.
Construction of a Lotus japonicus late nodulin expressed sequence tag library and identification of novel nodule-specific genes.

PubMed Central

Szczyglowski, K; Hamburger, D; Kapranov, P; de Bruijn, F J

1997-01-01

A range of novel expressed sequence tags (ESTs) associated with late developmental events during nodule organogenesis in the legume Lotus japonicus were identified using mRNA differential display; 110 differentially displayed polymerase chain reaction products were cloned and analyzed. Of 88 unique cDNAs obtained, 22 shared significant homology to DNA/protein sequences in the respective databases. This group comprises, among others, a nodule-specific homolog of protein phosphatase 2C, a peptide transporter protein, and a nodule-specific form of cytochrome P450. RNA gel-blot analysis of 16 differentially displayed ESTs confirmed their nodule-specific expression pattern. The kinetics of mRNA accumulation of the majority of the ESTs analyzed were found to resemble the expression pattern observed for the L. japonicus leghemoglobin gene. These results indicate that the newly isolated molecular markers correspond to genes induced during late developmental stages of L. japonicus nodule organogenesis and provide important, novel tools for the study of nodulation. PMID:9276951
Small RNA Library Preparation Method for Next-Generation Sequencing Using Chemical Modifications to Prevent Adapter Dimer Formation.

PubMed

Shore, Sabrina; Henderson, Jordana M; Lebedev, Alexandre; Salcedo, Michelle P; Zon, Gerald; McCaffrey, Anton P; Paul, Natasha; Hogrefe, Richard I

2016-01-01

For most sample types, the automation of RNA and DNA sample preparation workflows enables high throughput next-generation sequencing (NGS) library preparation. Greater adoption of small RNA (sRNA) sequencing has been hindered by high sample input requirements and inherent ligation side products formed during library preparation. These side products, known as adapter dimer, are very similar in size to the tagged library. Most sRNA library preparation strategies thus employ a gel purification step to isolate tagged library from adapter dimer contaminants. At very low sample inputs, adapter dimer side products dominate the reaction and limit the sensitivity of this technique. Here we address the need for improved specificity of sRNA library preparation workflows with a novel library preparation approach that uses modified adapters to suppress adapter dimer formation. This workflow allows for lower sample inputs and elimination of the gel purification step, which in turn allows for an automatable sRNA library preparation protocol.
Identification and characterization of 43 microsatellite markers derived from expressed sequence tags of the sea cucumber ( Apostichopus japonicus)

NASA Astrophysics Data System (ADS)

Jiang, Qun; Li, Qi; Yu, Hong; Kong, Lingfeng

2011-06-01

The sea cucumber Apostichopus japonicus is a commercially and ecologically important species in China. A total of 3056 potential unigenes were generated after assembling 7597 A. japonicus expressed sequence tags (ESTs) downloaded from Gen-Bank. Two hundred and fifty microsatellite-containing ESTs (8.18%) and 299 simple sequence repeats (SSRs) were detected. The average density of SSRs was 1 per 7.403 kb of EST after redundancy elimination. Di-nucleotide repeat motifs appeared to be the most abundant type with a percentage of 69.90%. Of the 126 primer pairs designed, 90 amplified the expected products and 43 showed polymorphism in 30 individuals tested. The number of alleles per locus ranged from 2 to 26 with an average of 7.0 alleles, and the observed and expected heterozygosities varied from 0.067 to 1.000 and from 0.066 to 0.959, respectively. These new EST-derived microsatellite markers would provide sufficient polymorphism for population genetic studies and genome mapping of this sea cucumber species.
JC Virus Mediates Invasion and Migration in Colorectal Metastasis

PubMed Central

Link, Alexander; Shin, Sung Kwan; Nagasaka, Takeshi; Balaguer, Francesc; Koi, Minoru; Jung, Barbara; Boland, C. Richard; Goel, Ajay

2009-01-01

Introduction JC Virus (JCV), a human polyomavirus, is frequently present in colorectal cancers (CRCs). JCV large T-Ag (T-Ag) expressed in approximately half of all CRC's, however, its functional role in CRC is poorly understood. We hypothesized that JCV T-Ag may mediate metastasis in CRC cells through increased migration and invasion. Material and Methods CRC cell lines (HCT116 and SW837) were stably transfected with JCV early transcript sequences cloned into pCR3 or empty vectors. Migration and invasion assays were performed using Boyden chambers. Global gene expression analysis was performed to identify genetic targets and pathways altered by T-Ag expression. Microarray results were validated by qRT-PCR, protein expression analyses and immunohistochemistry. Matching primary CRCs and liver metastases from 33 patients were analyzed for T-Ag expression by immunohistochemistry. Results T-Ag expressing cell lines showed 2 to 3-fold increase in migration and invasion compared to controls. JCV T-Ag expression resulted in differential expression of several genetic targets, including genes that mediate cell migration and invasion. Pathway analysis suggested a significant involvement of these genes with AKT and MAPK signaling. Treatment with selective PI3K/AKT and MAPK pathway inhibitors resulted in reduced migration and invasion. In support of our in-vitro results, immunohistochemical staining of the advanced stage tumors revealed frequent JCV T-Ag expression in metastatic primary tumors (92%) as well as in their matching liver metastasis (73%). Conclusion These data suggest that JCV T-Ag expression in CRC associates with a metastatic phenotype, which may partly be mediated through the AKT/MAPK signaling pathway. Frequent expression of JCV T-Ag in CRC liver metastasis provides further clues supporting a mechanistic role for JCV as a possible mediator of cellular motility and invasion in CRC. PMID:19997600
GSyellow, a Multifaceted Tag for Functional Protein Analysis in Monocot and Dicot Plants.

PubMed

Besbrugge, Nienke; Van Leene, Jelle; Eeckhout, Dominique; Cannoot, Bernard; Kulkarni, Shubhada R; De Winne, Nancy; Persiau, Geert; Van De Slijke, Eveline; Bontinck, Michiel; Aesaert, Stijn; Impens, Francis; Gevaert, Kris; Van Damme, Daniel; Van Lijsebettens, Mieke; Inzé, Dirk; Vandepoele, Klaas; Nelissen, Hilde; De Jaeger, Geert

2018-06-01

The ability to tag proteins has boosted the emergence of generic molecular methods for protein functional analysis. Fluorescent protein tags are used to visualize protein localization, and affinity tags enable the mapping of molecular interactions by, for example, tandem affinity purification or chromatin immunoprecipitation. To apply these widely used molecular techniques on a single transgenic plant line, we developed a multifunctional tandem affinity purification tag, named GS yellow , which combines the streptavidin-binding peptide tag with citrine yellow fluorescent protein. We demonstrated the versatility of the GS yellow tag in the dicot Arabidopsis ( Arabidopsis thaliana ) using a set of benchmark proteins. For proof of concept in monocots, we assessed the localization and dynamic interaction profile of the leaf growth regulator ANGUSTIFOLIA3 (AN3), fused to the GS yellow tag, along the growth zone of the maize ( Zea mays ) leaf. To further explore the function of ZmAN3, we mapped its DNA-binding landscape in the growth zone of the maize leaf through chromatin immunoprecipitation sequencing. Comparison with AN3 target genes mapped in the developing maize tassel or in Arabidopsis cell cultures revealed strong conservation of AN3 target genes between different maize tissues and across monocots and dicots, respectively. In conclusion, the GS yellow tag offers a powerful molecular tool for distinct types of protein functional analyses in dicots and monocots. As this approach involves transforming a single construct, it is likely to accelerate both basic and translational plant research. © 2018 American Society of Plant Biologists. All rights reserved.
Genetic variation patterns of American chestnut populations at EST-SSRs

Treesearch

Oliver Gailing; C. Dana Nelson

2017-01-01

The objective of this study is to analyze patterns of genetic variation at genic expressed sequence tag - simple sequence repeats (EST-SSRs) and at chloroplast DNA markers in populations of American chestnut (Castanea dentata Borkh.) to assist in conservation and breeding efforts. Allelic diversity at EST-SSRs decreased significantly from southwest to northeast along...
Identification of genotyping-by-sequencing sequence tags associated with milling performance and end-use quality traits in hard red spring wheat (Triticum aestivum L.)

USDA-ARS?s Scientific Manuscript database

Wheat quality is defined by culinary end-uses and processing characteristics. Wheat breeders are interested to identify quantitative trait loci for grain, milling, and end-use quality traits because it is imperative to understand the genetic complexity underlying quantitatively inherited traits to ...
Genome Comparisons Reveal a Dominant Mechanism of Chromosome Number Reduction in Grasses and Accelerated Genome Evolution in Triticeae

USDA-ARS?s Scientific Manuscript database

Single nucleotide polymorphism was employed in the construction of a high-resolution, expressed sequence tag (EST) map of Aegilops tauschii, the diploid source of the wheat D genome. Comparison of the map with the rice and sorghum genome sequences revealed 50 inversions and translocations; 2, 8, and...
Passive wireless tags for tongue controlled assistive technology interfaces.

PubMed

Rakibet, Osman O; Horne, Robert J; Kelly, Stephen W; Batchelor, John C

2016-03-01

Tongue control with low profile, passive mouth tags is demonstrated as a human-device interface by communicating values of tongue-tag separation over a wireless link. Confusion matrices are provided to demonstrate user accuracy in targeting by tongue position. Accuracy is found to increase dramatically after short training sequences with errors falling close to 1% in magnitude with zero missed targets. The rate at which users are able to learn accurate targeting with high accuracy indicates that this is an intuitive device to operate. The significance of the work is that innovative very unobtrusive, wireless tags can be used to provide intuitive human-computer interfaces based on low cost and disposable mouth mounted technology. With the development of an appropriate reading system, control of assistive devices such as computer mice or wheelchairs could be possible for tetraplegics and others who retain fine motor control capability of their tongues. The tags contain no battery and are intended to fit directly on the hard palate, detecting tongue position in the mouth with no need for tongue piercings.
An Ambystoma mexicanum EST sequencing project: analysis of 17,352 expressed sequence tags from embryonic and regenerating blastema cDNA libraries

PubMed Central

Habermann, Bianca; Bebin, Anne-Gaelle; Herklotz, Stephan; Volkmer, Michael; Eckelt, Kay; Pehlke, Kerstin; Epperlein, Hans Henning; Schackert, Hans Konrad; Wiebe, Glenis; Tanaka, Elly M

2004-01-01

Background The ambystomatid salamander, Ambystoma mexicanum (axolotl), is an important model organism in evolutionary and regeneration research but relatively little sequence information has so far been available. This is a major limitation for molecular studies on caudate development, regeneration and evolution. To address this lack of sequence information we have generated an expressed sequence tag (EST) database for A. mexicanum. Results Two cDNA libraries, one made from stage 18-22 embryos and the other from day-6 regenerating tail blastemas, generated 17,352 sequences. From the sequenced ESTs, 6,377 contigs were assembled that probably represent 25% of the expressed genes in this organism. Sequence comparison revealed significant homology to entries in the NCBI non-redundant database. Further examination of this gene set revealed the presence of genes involved in important cell and developmental processes, including cell proliferation, cell differentiation and cell-cell communication. On the basis of these data, we have performed phylogenetic analysis of key cell-cycle regulators. Interestingly, while cell-cycle proteins such as the cyclin B family display expected evolutionary relationships, the cyclin-dependent kinase inhibitor 1 gene family shows an unusual evolutionary behavior among the amphibians. Conclusions Our analysis reveals the importance of a comprehensive sequence set from a representative of the Caudata and illustrates that the EST sequence database is a rich source of molecular, developmental and regeneration studies. To aid in data mining, the ESTs have been organized into an easily searchable database that is freely available online. PMID:15345051

EST-PAC a web package for EST annotation and protein sequence prediction

PubMed Central

Strahm, Yvan; Powell, David; Lefèvre, Christophe

2006-01-01

With the decreasing cost of DNA sequencing technology and the vast diversity of biological resources, researchers increasingly face the basic challenge of annotating a larger number of expressed sequences tags (EST) from a variety of species. This typically consists of a series of repetitive tasks, which should be automated and easy to use. The results of these annotation tasks need to be stored and organized in a consistent way. All these operations should be self-installing, platform independent, easy to customize and amenable to using distributed bioinformatics resources available on the Internet. In order to address these issues, we present EST-PAC a web oriented multi-platform software package for expressed sequences tag (EST) annotation. EST-PAC provides a solution for the administration of EST and protein sequence annotations accessible through a web interface. Three aspects of EST annotation are automated: 1) searching local or remote biological databases for sequence similarities using Blast services, 2) predicting protein coding sequence from EST data and, 3) annotating predicted protein sequences with functional domain predictions. In practice, EST-PAC integrates the BLASTALL suite, EST-Scan2 and HMMER in a relational database system accessible through a simple web interface. EST-PAC also takes advantage of the relational database to allow consistent storage, powerful queries of results and, management of the annotation process. The system allows users to customize annotation strategies and provides an open-source data-management environment for research and education in bioinformatics. PMID:17147782
Genome-wide analysis of differential transcriptional and epigenetic variability across human immune cell types.

PubMed

Ecker, Simone; Chen, Lu; Pancaldi, Vera; Bagger, Frederik O; Fernández, José María; Carrillo de Santa Pau, Enrique; Juan, David; Mann, Alice L; Watt, Stephen; Casale, Francesco Paolo; Sidiropoulos, Nikos; Rapin, Nicolas; Merkel, Angelika; Stunnenberg, Hendrik G; Stegle, Oliver; Frontini, Mattia; Downes, Kate; Pastinen, Tomi; Kuijpers, Taco W; Rico, Daniel; Valencia, Alfonso; Beck, Stephan; Soranzo, Nicole; Paul, Dirk S

2017-01-26

A healthy immune system requires immune cells that adapt rapidly to environmental challenges. This phenotypic plasticity can be mediated by transcriptional and epigenetic variability. We apply a novel analytical approach to measure and compare transcriptional and epigenetic variability genome-wide across CD14 + CD16 - monocytes, CD66b + CD16 + neutrophils, and CD4 + CD45RA + naïve T cells from the same 125 healthy individuals. We discover substantially increased variability in neutrophils compared to monocytes and T cells. In neutrophils, genes with hypervariable expression are found to be implicated in key immune pathways and are associated with cellular properties and environmental exposure. We also observe increased sex-specific gene expression differences in neutrophils. Neutrophil-specific DNA methylation hypervariable sites are enriched at dynamic chromatin regions and active enhancers. Our data highlight the importance of transcriptional and epigenetic variability for the key role of neutrophils as the first responders to inflammatory stimuli. We provide a resource to enable further functional studies into the plasticity of immune cells, which can be accessed from: http://blueprint-dev.bioinfo.cnio.es/WP10/hypervariability .
Human mtDNA hypervariable regions, HVR I and II, hint at deep common maternal founder and subsequent maternal gene flow in Indian population groups.

PubMed

Sharma, Swarkar; Saha, Anjana; Rai, Ekta; Bhat, Audesh; Bamezai, Ramesh

2005-01-01

We have analysed the hypervariable regions (HVR I and II) of human mitochondrial DNA (mtDNA) in individuals from Uttar Pradesh (UP), Bihar (BI) and Punjab (PUNJ), belonging to the Indo-European linguistic group, and from South India (SI), that have their linguistic roots in Dravidian language. Our analysis revealed the presence of known and novel mutations in both hypervariable regions in the studied population groups. Median joining network analyses based on mtDNA showed extensive overlap in mtDNA lineages despite the extensive cultural and linguistic diversity. MDS plot analysis based on Fst distances suggested increased maternal genetic proximity for the studied population groups compared with other world populations. Mismatch distribution curves, respective neighbour joining trees and other statistical analyses showed that there were significant expansions. The study revealed an ancient common ancestry for the studied population groups, most probably through common founder female lineage(s), and also indicated that human migrations occurred (maybe across and within the Indian subcontinent) even after the initial phase of female migration to India.
Distinguishing aspartic and isoaspartic acids in peptides by several mass spectrometric fragmentation methods

PubMed Central

DeGraan-Weber, Nick; Zhang, Jun; Reilly, James P.

2016-01-01

Six ion fragmentation techniques that can distinguish aspartic acid from its isomer, isoaspartic acid, were compared. MALDI post source decay (PSD), MALDI 157 nm photodissociation, TMPP charge tagging in PSD and photodissociation, ESI collision-induced dissociation (CID), electron transfer dissociation (ETD), and free-radical initiated peptide sequencing (FRIPS) with CID were applied to peptides containing either aspartic or isoaspartic acid. Diagnostic ions, such as the y-46 and b+H2O, are present in PSD, photodissociation, and charge tagging. c•+57 and z-57 ions are observed in ETD and FRIPS experiments. For some molecules, aspartic and isoaspartic acid yield ion fragments with significantly different intensities. ETD and charge tagging appear to be most effective at distinguishing these residues. PMID:27613306
OSIRIS-REx Touch-And-Go (TAG) Mission Design and Analysis

NASA Technical Reports Server (NTRS)

Berry, Kevin; Sutter, Brian; May, Alex; Williams, Ken; Barbee, Brent W.; Beckman, Mark; Williams, Bobby

2013-01-01

The Origins Spectral Interpretation Resource Identification Security Regolith Explorer (OSIRIS-REx) mission is a NASA New Frontiers mission launching in 2016 to rendezvous with the near-Earth asteroid (101955) 1999 RQ36 in late 2018. After several months in formation with and orbit about the asteroid, OSIRIS-REx will fly a Touch-And-Go (TAG) trajectory to the asteroid s surface to obtain a regolith sample. This paper describes the mission design of the TAG sequence and the propulsive maneuvers required to achieve the trajectory. This paper also shows preliminary results of orbit covariance analysis and Monte-Carlo analysis that demonstrate the ability to arrive at a targeted location on the surface of RQ36 within a 25 meter radius with 98.3% confidence.
Frequency tagging to track the neural processing of contrast in fast, continuous sound sequences.

PubMed

Nozaradan, Sylvie; Mouraux, André; Cousineau, Marion

2017-07-01

The human auditory system presents a remarkable ability to detect rapid changes in fast, continuous acoustic sequences, as best illustrated in speech and music. However, the neural processing of rapid auditory contrast remains largely unclear, probably due to the lack of methods to objectively dissociate the response components specifically related to the contrast from the other components in response to the sequence of fast continuous sounds. To overcome this issue, we tested a novel use of the frequency-tagging approach allowing contrast-specific neural responses to be tracked based on their expected frequencies. The EEG was recorded while participants listened to 40-s sequences of sounds presented at 8Hz. A tone or interaural time contrast was embedded every fifth sound (AAAAB), such that a response observed in the EEG at exactly 8 Hz/5 (1.6 Hz) or harmonics should be the signature of contrast processing by neural populations. Contrast-related responses were successfully identified, even in the case of very fine contrasts. Moreover, analysis of the time course of the responses revealed a stable amplitude over repetitions of the AAAAB patterns in the sequence, except for the response to perceptually salient contrasts that showed a buildup and decay across repetitions of the sounds. Overall, this new combination of frequency-tagging with an oddball design provides a valuable complement to the classic, transient, evoked potentials approach, especially in the context of rapid auditory information. Specifically, we provide objective evidence on the neural processing of contrast embedded in fast, continuous sound sequences. NEW & NOTEWORTHY Recent theories suggest that the basis of neurodevelopmental auditory disorders such as dyslexia might be an impaired processing of fast auditory changes, highlighting how the encoding of rapid acoustic information is critical for auditory communication. Here, we present a novel electrophysiological approach to capture in humans neural markers of contrasts in fast continuous tone sequences. Contrast-specific responses were successfully identified, even for very fine contrasts, providing direct insight on the encoding of rapid auditory information. Copyright © 2017 the American Physiological Society.
A resource of large-scale molecular markers for monitoring Agropyron cristatum chromatin introgression in wheat background based on transcriptome sequences.

PubMed

Zhang, Jinpeng; Liu, Weihua; Lu, Yuqing; Liu, Qunxing; Yang, Xinming; Li, Xiuquan; Li, Lihui

2017-09-20

Agropyron cristatum is a wild grass of the tribe Triticeae and serves as a gene donor for wheat improvement. However, very few markers can be used to monitor A. cristatum chromatin introgressions in wheat. Here, we reported a resource of large-scale molecular markers for tracking alien introgressions in wheat based on transcriptome sequences. By aligning A. cristatum unigenes with the Chinese Spring reference genome sequences, we designed 9602 A. cristatum expressed sequence tag-sequence-tagged site (EST-STS) markers for PCR amplification and experimental screening. As a result, 6063 polymorphic EST-STS markers were specific for the A. cristatum P genome in the single-receipt wheat background. A total of 4956 randomly selected polymorphic EST-STS markers were further tested in eight wheat variety backgrounds, and 3070 markers displaying stable and polymorphic amplification were validated. These markers covered more than 98% of the A. cristatum genome, and the marker distribution density was approximately 1.28 cM. An application case of all EST-STS markers was validated on the A. cristatum 6 P chromosome. These markers were successfully applied in the tracking of alien A. cristatum chromatin. Altogether, this study provided a universal method of large-scale molecular marker development to monitor wild relative chromatin in wheat.
Microbial Communities on Seafloor Basalts at Dorado Outcrop Reflect Level of Alteration and Highlight Global Lithic Clades

PubMed Central

Lee, Michael D.; Walworth, Nathan G.; Sylvan, Jason B.; Edwards, Katrina J.; Orcutt, Beth N.

2015-01-01

Areas of exposed basalt along mid-ocean ridges and at seafloor outcrops serve as conduits of fluid flux into and out of a subsurface ocean, and microbe–mineral interactions can influence alteration reactions at the rock–water interface. Located on the eastern flank of the East Pacific Rise, Dorado Outcrop is a site of low-temperature (<20°C) hydrothermal venting and represents a new end-member in the current survey of seafloor basalt biomes. Consistent with prior studies, a survey of 16S rRNA gene sequence diversity using universal primers targeting the V4 hypervariable region revealed much greater richness and diversity on the seafloor rocks than in surrounding seawater. Overall, Gamma-, Alpha-, and Deltaproteobacteria, and Thaumarchaeota dominated the sequenced communities, together making up over half of the observed diversity, though bacterial sequences were more abundant than archaeal in all samples. The most abundant bacterial reads were closely related to the obligate chemolithoautotrophic, sulfur-oxidizing Thioprofundum lithotrophicum, suggesting carbon and sulfur cycling as dominant metabolic pathways in this system. Representatives of Thaumarchaeota were detected in relatively high abundance on the basalts in comparison to bottom water, possibly indicating ammonia oxidation. In comparison to other sequence datasets from globally distributed seafloor basalts, this study reveals many overlapping and cosmopolitan phylogenetic groups and also suggests that substrate age correlates with community structure. PMID:26779122
Analysis of sequence variation among smeDEF multi drug efflux pump genes and flanking DNA from defined 16S rRNA subgroups of clinical Stenotrophomonas maltophilia isolates.

PubMed

Gould, Virginia C; Okazaki, Aki; Howe, Robin A; Avison, Matthew B

2004-08-01

To determine the level of variation in the smeDEF efflux pump and smeT transcriptional regulator genes among three defined 16S rRNA sequence subgroups of clinical Stenotrophomonas maltophilia isolates. smeDEF sequencing used a PCR genome walking approach. Determination of the sequence surrounding smeDEF used a flanking primer PCR method and specific primers anchored in smeD or smeF together with random primers. smeDEF is chromosomal and located in the same position in the chromosome in all three subgroups of isolates. Flanking smeD is a gene, smeT, encoding a putative transcriptional repressor for smeDEF. Variation at these loci among the isolates is considerably lower (up to 10%) than at intrinsic beta-lactamase loci (up to 30%) in the same isolates, implying greater functional constraint. The smeD-smeT intergenic region contains a highly conserved section, which maps with previously predicted promoter/operator regions, and a hypervariable untranslated region, which can be used to subgroup clinical isolates. These data provide further evidence that it is possible to group clinical isolates of the inherently variable species, S. maltophilia, based on genotypic properties. Isolate D457, in which most work concerning smeDEF expression has been performed, does not fall into S. maltophilia subgroup A, which is the most typical.
Molecular characterization and phylogenetic analysis of infectious bursal disease viruses isolated from chicken in South China in 2011.

PubMed

Liu, Di; Zhang, Xiang-Bin; Yan, Zhuan-Qiang; Chen, Feng; Ji, Jun; Qin, Jian-Ping; Li, Hai-Yan; Lu, Jun-Peng; Xue, Yu; Liu, Jia-Jia; Xie, Qing-Mei; Ma, Jing-Yun; Xue, Chun-Yi; Bee, Ying-Zuo

2013-06-01

Infectious bursal disease virus (IBDV) is a double-stranded RNA virus that causes immunosuppressive disease in young chickens. Thousands of cases of IBDV infection are reported each year in South China, and these infections can result in considerable economic losses to the poultry industry. To monitor variations of the virus during the outbreaks, 30 IBDVs were identified from vaccinated chicken flocks from nine provinces in South China in 2011. VP2 fragments from different virus strains were sequenced and analyzed by comparison with the published sequences of IBDV strains from China and around the world. Phylogenetic analysis of hypervariable regions of the VP2 (vVP2) gene showed that 29 of the isolates were very virulent (vv) IBDVs, and were closely related to vvIBDV strains from Europe and Asia. Alignment analysis of the deduced amino acid (aa) sequences of vVP2 showed the 29 vv isolates had high uniformity, indicated low variability and slow evolution of the virus. The non-vvIBDV isolate JX2-11 was associated with higher than expected mortality, and had high deduced aa sequence similarity (99.2 %) with the attenuated vaccine strain B87 (BJ). The present study has demonstrated the continued circulation of IBDV strains in South China, and emphasizes the importance of reinforcing IBDV surveillance.
Strong positive selection and recombination drive the antigenic variation of the PilE protein of the human pathogen Neisseria meningitidis.

PubMed

Andrews, T Daniel; Gojobori, Takashi

2004-01-01

The PilE protein is the major component of the Neisseria meningitidis pilus, which is encoded by the pilE/pilS locus that includes an expressed gene and eight homologous silent fragments. The silent gene fragments have been shown to recombine through gene conversion with the expressed gene and thereby provide a means by which novel antigenic variants of the PilE protein can be generated. We have analyzed the evolutionary rate of the pilE gene using the nucleotide sequence of two complete pilE/pilS loci. The very high rate of evolution displayed by the PilE protein appears driven by both recombination and positive selection. Within the semivariable region of the pilE and pilS genes, recombination appears to occur within multiple small sequence blocks that lie between conserved sequence elements. Within the hypervariable region, positive selection was identified from comparison of the silent and expressed genes. The unusual gene conversion mechanism that operates at the pilE/pilS locus is a strategy employed by N. meningitidis to enhance mutation of certain regions of the PilE protein. The silent copies of the gene effectively allow "parallelized" evolution of pilE, thus enabling the encoded protein to rapidly explore a large area of sequence space in an effort to find novel antigenic variants.
A morphogenetic survey on ciliate plankton from a mountain lake pinpoints the necessity of lineage-specific barcode markers in microbial ecology.

PubMed

Stoeck, Thorsten; Breiner, Hans-Werner; Filker, Sabine; Ostermaier, Veronika; Kammerlander, Barbara; Sonntag, Bettina

2014-02-01

Analyses of high-throughput environmental sequencing data have become the 'gold-standard' to address fundamental questions of microbial diversity, ecology and biogeography. Findings that emerged from sequencing are, e.g. the discovery of the extensive 'rare microbial biosphere' and its potential function as a seed-bank. Even though applied since several years, results from high-throughput environmental sequencing have hardly been validated. We assessed how well pyrosequenced amplicons [the hypervariable eukaryotic V4 region of the small subunit ribosomal RNA (SSU rRNA) gene] reflected morphotype ciliate plankton. Moreover, we assessed if amplicon sequencing had the potential to detect the annual ciliate plankton stock. In both cases, we identified significant quantitative and qualitative differences. Our study makes evident that taxon abundance distributions inferred from amplicon data are highly biased and do not mirror actual morphotype abundances at all. Potential reasons included cell losses after fixation, cryptic morphotypes, resting stages, insufficient sequence data availability of morphologically described species and the unsatisfying resolution of the V4 SSU rRNA fragment for accurate taxonomic assignments. The latter two underline the necessity of barcoding initiatives for eukaryotic microbes to better and fully exploit environmental amplicon data sets, which then will also allow studying the potential of seed-bank taxa as a buffer for environmental changes. © 2013 The Authors. Environmental Microbiology published by Society for Applied Microbiology and John Wiley & Sons Ltd.
Molecular and phylogenetic characterization of bovine coronavirus virus isolated from dairy cattle in Central Region, Thailand.

PubMed

Singasa, Kanokwan; Songserm, Taweesak; Lertwatcharasarakul, Preeda; Arunvipas, Pipat

2017-10-01

Bovine coronavirus (BCoV) is involved mainly in enteric infections in cattle. This study reports the first molecular detection of BCoV in a diarrhea outbreak in dairy cows in the Central Region, Thailand. BCoV was molecularly detected from bloody diarrheic cattle feces by using nested PCR. Agarose gel electrophoresis of three diarrheic fecal samples yielded from the 25 samples desired amplicons that were 488 base pairs and sequencing substantiated that have BCoV. The sequence alignment indicated that nucleotide and amino acid sequences, the three TWD isolated in Thailand, were more quite homologous to each other (amino acid at position 39 of TWD1, TWD3 was proline, but TWD2 was serine) and closely related to OK-0514-3strain (virulent respiratory strain; RBCoV).The amino acid sequencing identities among TWD1, TWD2,TWD3, and OK-0514-3 strain were 96.0 to 96.6%, those at which T3I, H65N, D87G, H127Y, andQ136R were changed. In addition, the phylogenetic tree of the hypervariable region S1subunit spike glycoprotein BCoV gene was composed of three major clades by using the 54 sequences generated and showed that the evolutionally distance, TWD1, TWD2, and TWD3 were the isolated group together and most similar to OK-0514-3 strain (98.2 to 98.5% similarity). Further study will develop ELISA assay for serologic detection of winter dysentery disease.
Analysis of beta-carotene hydroxylase gene cDNA isolated from the American oil-palm (Elaeis oleifera) mesocarp tissue cDNA library

PubMed Central

Bhore, Subhash J; Kassim, Amelia; Loh, Chye Ying; Shah, Farida H

2010-01-01

It is well known that the nutritional quality of the American oil-palm (Elaeis oleifera) mesocarp oil is superior to that of African oil-palm (Elaeis guineensis Jacq. Tenera) mesocarp oil. Therefore, it is of important to identify the genetic features for its superior value. This could be achieved through the genome sequencing of the oil-palm. However, the genome sequence is not available in the public domain due to commercial secrecy. Hence, we constructed a cDNA library and generated expressed sequence tags (3,205) from the mesocarp tissue of the American oil-palm. We continued to annotate each of these cDNAs after submitting to GenBank/DDBJ/EMBL. A rough analysis turned our attention to the beta-carotene hydroxylase (Chyb) enzyme encoding cDNA. Then, we completed the full sequencing of cDNA clone for its both strands using M13 forward and reverse primers. The full nucleotide and protein sequence was further analyzed and annotated using various Bioinformatics tools. The analysis results showed the presence of fatty acid hydroxylase superfamily domain in the protein sequence. The multiple sequence alignment of selected Chyb amino acid sequences from other plant species and algal members with E. oleifera Chyb using ClustalW and its phylogenetic analysis suggest that Chyb from monocotyledonous plant species, Lilium hubrid, Crocus sativus and Zea mays are the most evolutionary related with E. oleifera Chyb. This study reports the annotation of E. oleifera Chyb. Abbreviations ESTs - expressed sequence tags, EoChyb - Elaeis oleifera beta-carotene hydroxylase, MC - main cluster PMID:21364789
Cloning, analysis and functional annotation of expressed sequence tags from the Earthworm Eisenia fetida

PubMed Central

Pirooznia, Mehdi; Gong, Ping; Guan, Xin; Inouye, Laura S; Yang, Kuan; Perkins, Edward J; Deng, Youping

2007-01-01

Background Eisenia fetida, commonly known as red wiggler or compost worm, belongs to the Lumbricidae family of the Annelida phylum. Little is known about its genome sequence although it has been extensively used as a test organism in terrestrial ecotoxicology. In order to understand its gene expression response to environmental contaminants, we cloned 4032 cDNAs or expressed sequence tags (ESTs) from two E. fetida libraries enriched with genes responsive to ten ordnance related compounds using suppressive subtractive hybridization-PCR. Results A total of 3144 good quality ESTs (GenBank dbEST accession number EH669363–EH672369 and EL515444–EL515580) were obtained from the raw clone sequences after cleaning. Clustering analysis yielded 2231 unique sequences including 448 contigs (from 1361 ESTs) and 1783 singletons. Comparative genomic analysis showed that 743 or 33% of the unique sequences shared high similarity with existing genes in the GenBank nr database. Provisional function annotation assigned 830 Gene Ontology terms to 517 unique sequences based on their homology with the annotated genomes of four model organisms Drosophila melanogaster, Mus musculus, Saccharomyces cerevisiae, and Caenorhabditis elegans. Seven percent of the unique sequences were further mapped to 99 Kyoto Encyclopedia of Genes and Genomes pathways based on their matching Enzyme Commission numbers. All the information is stored and retrievable at a highly performed, web-based and user-friendly relational database called EST model database or ESTMD version 2. Conclusion The ESTMD containing the sequence and annotation information of 4032 E. fetida ESTs is publicly accessible at . PMID:18047730
Gene family encoding the major toxins of lethal Amanita mushrooms

PubMed Central

Hallen, Heather E.; Luo, Hong; Scott-Craig, John S.; Walton, Jonathan D.

2007-01-01

Amatoxins, the lethal constituents of poisonous mushrooms in the genus Amanita, are bicyclic octapeptides. Two genes in A. bisporigera, AMA1 and PHA1, directly encode α-amanitin, an amatoxin, and the related bicyclic heptapeptide phallacidin, a phallotoxin, indicating that these compounds are synthesized on ribosomes and not by nonribosomal peptide synthetases. α-Amanitin and phallacidin are synthesized as proproteins of 35 and 34 amino acids, respectively, from which they are predicted to be cleaved by a prolyl oligopeptidase. AMA1 and PHA1 are present in other toxic species of Amanita section Phalloidae but are absent from nontoxic species in other sections. The genomes of A. bisporigera and A. phalloides contain multiple sequences related to AMA1 and PHA1. The predicted protein products of this family of genes are characterized by a hypervariable “toxin” region capable of encoding a wide variety of peptides of 7–10 amino acids flanked by conserved sequences. Our results suggest that these fungi have a broad capacity to synthesize cyclic peptides on ribosomes. PMID:18025465
Unexpected sequences and structures of mtDNA required for efficient transcription from the first heavy-strand promoter

PubMed Central

Uchida, Akira; Murugesapillai, Divakaran; Kastner, Markus; Wang, Yao; Lodeiro, Maria F; Prabhakar, Shaan; Oliver, Guinevere V; Arnold, Jamie J; Maher, L James; Williams, Mark C; Cameron, Craig E

2017-01-01

Human mtDNA contains three promoters, suggesting a need for differential expression of the mitochondrial genome. Studies of mitochondrial transcription have used a reductionist approach, perhaps masking differential regulation. Here we evaluate transcription from light-strand (LSP) and heavy-strand (HSP1) promoters using templates that mimic their natural context. These studies reveal sequences upstream, hypervariable in the human population (HVR3), and downstream of the HSP1 transcription start site required for maximal yield. The carboxy-terminal tail of TFAM is essential for activation of HSP1 but not LSP. Images of the template obtained by atomic force microscopy show that TFAM creates loops in a discrete region, the formation of which correlates with activation of HSP1; looping is lost in tail-deleted TFAM. Identification of HVR3 as a transcriptional regulatory element may contribute to between-individual variability in mitochondrial gene expression. The unique requirement of HSP1 for the TFAM tail may enable its regulation by post-translational modifications. DOI: http://dx.doi.org/10.7554/eLife.27283.001 PMID:28745586
Molecular epidemiological investigation of velogenic Newcastle disease viruses from village chickens in Cambodia.

PubMed

Choi, Kang-Seuk; Kye, Soo-Jeong; Kim, Ji-Ye; Damasco, Vanessa R; Sorn, San; Lee, Youn-Jeong; Choi, Jun-Gu; Kang, Hyun-Mi; Kim, Kwang-Il; Song, Byung-Min; Lee, Hee-Soo

2013-10-01

Three isolates of Newcastle disease virus (NDV) were isolated from tracheal samples of dead village chickens in two provinces (Phnom Penh and Kampong Cham) in Cambodia during 2011-2012. All of these Cambodian NDV isolates were categorized as velogenic pathotype, based on in vivo pathogenicity tests and F cleavage site motif sequence ((112)RRRKRF(117)). The phylogenetic analysis and the evolutionary distances based on the sequences of the F gene revealed that all the three field isolates of NDV from Cambodia form a distinct cluster (VIIh) together with three Indonesian strains and were assigned to the genotype VII within the class II. Further phylogenetic analysis based on the hyper-variable region of the F gene revealed that some of NDV strains from Malaysia since the mid-2000s were also classified into the VIIh virus. This indicates that the VIIh NDVs are spreading through Southeast Asia. The present investigation, therefore, emphasizes the importance of further surveillance of NDV in neighboring countries as well as throughout Southeast Asia to contain further spreading of these VIIh viruses.
Transcriptome sequencing and de novo analysis of the copepod Calanus sinicus using 454 GS FLX.

PubMed

Ning, Juan; Wang, Minxiao; Li, Chaolun; Sun, Song

2013-01-01

Despite their species abundance and primary economic importance, genomic information about copepods is still limited. In particular, genomic resources are lacking for the copepod Calanus sinicus, which is a dominant species in the coastal waters of East Asia. In this study, we performed de novo transcriptome sequencing to produce a large number of expressed sequence tags for the copepod C. sinicus. Copepodid larvae and adults were used as the basic material for transcriptome sequencing. Using 454 pyrosequencing, a total of 1,470,799 reads were obtained, which were assembled into 56,809 high quality expressed sequence tags. Based on their sequence similarity to known proteins, about 14,000 different genes were identified, including members of all major conserved signaling pathways. Transcripts that were putatively involved with growth, lipid metabolism, molting, and diapause were also identified among these genes. Differentially expressed genes related to several processes were found in C. sinicus copepodid larvae and adults. We detected 284,154 single nucleotide polymorphisms (SNPs) that provide a resource for gene function studies. Our data provide the most comprehensive transcriptome resource available for C. sinicus. This resource allowed us to identify genes associated with primary physiological processes and SNPs in coding regions, which facilitated the quantitative analysis of differential gene expression. These data should provide foundation for future genetic and genomic studies of this and related species.
Identification of the maize gravitropism gene lazy plant1 by a transposon-tagging genome resequencing strategy.

PubMed

Howard, Thomas P; Hayward, Andrew P; Tordillos, Anthony; Fragoso, Christopher; Moreno, Maria A; Tohme, Joe; Kausch, Albert P; Mottinger, John P; Dellaporta, Stephen L

2014-01-01

Since their initial discovery, transposons have been widely used as mutagens for forward and reverse genetic screens in a range of organisms. The problems of high copy number and sequence divergence among related transposons have often limited the efficiency at which tagged genes can be identified. A method was developed to identity the locations of Mutator (Mu) transposons in the Zea mays genome using a simple enrichment method combined with genome resequencing to identify transposon junction fragments. The sequencing library was prepared from genomic DNA by digesting with a restriction enzyme that cuts within a perfectly conserved motif of the Mu terminal inverted repeats (TIR). Paired-end reads containing Mu TIR sequences were computationally identified and chromosomal sequences flanking the transposon were mapped to the maize reference genome. This method has been used to identify Mu insertions in a number of alleles and to isolate the previously unidentified lazy plant1 (la1) gene. The la1 gene is required for the negatively gravitropic response of shoots and mutant plants lack the ability to sense gravity. Using bioinformatic and fluorescence microscopy approaches, we show that the la1 gene encodes a cell membrane and nuclear localized protein. Our Mu-Taq method is readily adaptable to identify the genomic locations of any insertion of a known sequence in any organism using any sequencing platform.

Defining the transcriptome assembly and its use for genome dynamics and transcriptome profiling studies in pigeonpea (Cajanus cajan L.).

PubMed

Dubey, Anuja; Farmer, Andrew; Schlueter, Jessica; Cannon, Steven B; Abernathy, Brian; Tuteja, Reetu; Woodward, Jimmy; Shah, Trushar; Mulasmanovic, Benjamin; Kudapa, Himabindu; Raju, Nikku L; Gothalwal, Ragini; Pande, Suresh; Xiao, Yongli; Town, Chris D; Singh, Nagendra K; May, Gregory D; Jackson, Scott; Varshney, Rajeev K

2011-06-01

This study reports generation of large-scale genomic resources for pigeonpea, a so-called 'orphan crop species' of the semi-arid tropic regions. FLX/454 sequencing carried out on a normalized cDNA pool prepared from 31 tissues produced 494 353 short transcript reads (STRs). Cluster analysis of these STRs, together with 10 817 Sanger ESTs, resulted in a pigeonpea trancriptome assembly (CcTA) comprising of 127 754 tentative unique sequences (TUSs). Functional analysis of these TUSs highlights several active pathways and processes in the sampled tissues. Comparison of the CcTA with the soybean genome showed similarity to 10 857 and 16 367 soybean gene models (depending on alignment methods). Additionally, Illumina 1G sequencing was performed on Fusarium wilt (FW)- and sterility mosaic disease (SMD)-challenged root tissues of 10 resistant and susceptible genotypes. More than 160 million sequence tags were used to identify FW- and SMD-responsive genes. Sequence analysis of CcTA and the Illumina tags identified a large new set of markers for use in genetics and breeding, including 8137 simple sequence repeats, 12 141 single-nucleotide polymorphisms and 5845 intron-spanning regions. Genomic resources developed in this study should be useful for basic and applied research, not only for pigeonpea improvement but also for other related, agronomically important legumes.
Identification of the Maize Gravitropism Gene lazy plant1 by a Transposon-Tagging Genome Resequencing Strategy

PubMed Central

Howard, Thomas P.; Hayward, Andrew P.; Tordillos, Anthony; Fragoso, Christopher; Moreno, Maria A.; Tohme, Joe; Kausch, Albert P.; Mottinger, John P.; Dellaporta, Stephen L.

2014-01-01

Since their initial discovery, transposons have been widely used as mutagens for forward and reverse genetic screens in a range of organisms. The problems of high copy number and sequence divergence among related transposons have often limited the efficiency at which tagged genes can be identified. A method was developed to identity the locations of Mutator (Mu) transposons in the Zea mays genome using a simple enrichment method combined with genome resequencing to identify transposon junction fragments. The sequencing library was prepared from genomic DNA by digesting with a restriction enzyme that cuts within a perfectly conserved motif of the Mu terminal inverted repeats (TIR). Paired-end reads containing Mu TIR sequences were computationally identified and chromosomal sequences flanking the transposon were mapped to the maize reference genome. This method has been used to identify Mu insertions in a number of alleles and to isolate the previously unidentified lazy plant1 (la1) gene. The la1 gene is required for the negatively gravitropic response of shoots and mutant plants lack the ability to sense gravity. Using bioinformatic and fluorescence microscopy approaches, we show that the la1 gene encodes a cell membrane and nuclear localized protein. Our Mu-Taq method is readily adaptable to identify the genomic locations of any insertion of a known sequence in any organism using any sequencing platform. PMID:24498020
Developing expressed sequence tag libraries and the discovery of simple sequence repeat markers for two species of raspberry (Rubus L.).

PubMed

Bushakra, Jill M; Lewers, Kim S; Staton, Margaret E; Zhebentyayeva, Tetyana; Saski, Christopher A

2015-10-26

Due to a relatively high level of codominant inheritance and transferability within and among taxonomic groups, simple sequence repeat (SSR) markers are important elements in comparative mapping and delineation of genomic regions associated with traits of economic importance. Expressed sequence tags (ESTs) are a source of SSRs that can be used to develop markers to facilitate plant breeding and for more basic research across genera and higher plant orders. Leaf and meristem tissue from 'Heritage' red raspberry (Rubus idaeus) and 'Bristol' black raspberry (R. occidentalis) were utilized for RNA extraction. After conversion to cDNA and library construction, ESTs were sequenced, quality verified, assembled and scanned for SSRs. Primers flanking the SSRs were designed and a subset tested for amplification, polymorphism and transferability across species. ESTs containing SSRs were functionally annotated using the GenBank non-redundant (nr) database and further classified using the gene ontology database. To accelerate development of EST-SSRs in the genus Rubus (Rosaceae), 1149 and 2358 cDNA sequences were generated from red raspberry and black raspberry, respectively. The cDNA sequences were screened using rigorous filtering criteria which resulted in the identification of 121 and 257 SSR loci for red and black raspberry, respectively. Primers were designed from the surrounding sequences resulting in 131 and 288 primer pairs, respectively, as some sequences contained more than one SSR locus. Sequence analysis revealed that the SSR-containing genes span a diversity of functions and share more sequence identity with strawberry genes than with other Rosaceous species. This resource of Rubus-specific, gene-derived markers will facilitate the construction of linkage maps composed of transferable markers for studying and manipulating important traits in this economically important genus.
High-resolution phylogenetic microbial community profiling

DOE Office of Scientific and Technical Information (OSTI.GOV)

Singer, Esther; Coleman-Derr, Devin; Bowman, Brett

2014-03-17

The representation of bacterial and archaeal genome sequences is strongly biased towards cultivated organisms, which belong to merely four phylogenetic groups. Functional information and inter-phylum level relationships are still largely underexplored for candidate phyla, which are often referred to as microbial dark matter. Furthermore, a large portion of the 16S rRNA gene records in the GenBank database are labeled as environmental samples and unclassified, which is in part due to low read accuracy, potential chimeric sequences produced during PCR amplifications and the low resolution of short amplicons. In order to improve the phylogenetic classification of novel species and advance ourmore » knowledge of the ecosystem function of uncultivated microorganisms, high-throughput full length 16S rRNA gene sequencing methodologies with reduced biases are needed. We evaluated the performance of PacBio single-molecule real-time (SMRT) sequencing in high-resolution phylogenetic microbial community profiling. For this purpose, we compared PacBio and Illumina metagenomic shotgun and 16S rRNA gene sequencing of a mock community as well as of an environmental sample from Sakinaw Lake, British Columbia. Sakinaw Lake is known to contain a large age of microbial species from candidate phyla. Sequencing results show that community structure based on PacBio shotgun and 16S rRNA gene sequences is highly similar in both the mock and the environmental communities. Resolution power and community representation accuracy from SMRT sequencing data appeared to be independent of GC content of microbial genomes and was higher when compared to Illumina-based metagenome shotgun and 16S rRNA gene (iTag) sequences, e.g. full-length sequencing resolved all 23 OTUs in the mock community, while iTags did not resolve closely related species. SMRT sequencing hence offers various potential benefits when characterizing uncharted microbial communities.« less
Anchoring 9,371 Maize Expressed Sequence Tagged Unigenes to the Bacterial Artificial Chromosome Contig Map by Two-Dimensional Overgo Hybridization1

PubMed Central

Gardiner, Jack; Schroeder, Steven; Polacco, Mary L.; Sanchez-Villeda, Hector; Fang, Zhiwei; Morgante, Michele; Landewe, Tim; Fengler, Kevin; Useche, Francisco; Hanafey, Michael; Tingey, Scott; Chou, Hugh; Wing, Rod; Soderlund, Carol; Coe, Edward H.

2004-01-01

Our goal is to construct a robust physical map for maize (Zea mays) comprehensively integrated with the genetic map. We have used a two-dimensional 24 × 24 overgo pooling strategy to anchor maize expressed sequence tagged (EST) unigenes to 165,888 bacterial artificial chromosomes (BACs) on high-density filters. A set of 70,716 public maize ESTs seeded derivation of 10,723 EST unigene assemblies. From these assemblies, 10,642 overgo sequences of 40 bp were applied as hybridization probes. BAC addresses were obtained for 9,371 overgo probes, representing an 88% success rate. More than 96% of the successful overgo probes identified two or more BACs, while 5% identified more than 50 BACs. The majority of BACs identified (79%) were hybridized with one or two overgos. A small number of BACs hybridized with eight or more overgos, suggesting that these BACs must be gene rich. Approximately 5,670 overgos identified BACs assembled within one contig, indicating that these probes are highly locus specific. A total of 1,795 megabases (Mb; 87%) of the total 2,050 Mb in BAC contigs were associated with one or more overgos, which are serving as sequence-tagged sites for single nucleotide polymorphism development. Overgo density ranged from less than one overgo per megabase to greater than 20 overgos per megabase. The majority of contigs (52%) hit by overgos contained three to nine overgos per megabase. Analysis of approximately 1,022 Mb of genetically anchored BAC contigs indicates that 9,003 of the total 13,900 overgo-contig sites are genetically anchored. Our results indicate overgos are a powerful approach for generating gene-specific hybridization probes that are facilitating the assembly of an integrated genetic and physical map for maize. PMID:15020742
A High-Throughput Data Mining of Single Nucleotide Polymorphisms in Coffea Species Expressed Sequence Tags Suggests Differential Homeologous Gene Expression in the Allotetraploid Coffea arabica1[W

PubMed Central

Vidal, Ramon Oliveira; Mondego, Jorge Maurício Costa; Pot, David; Ambrósio, Alinne Batista; Andrade, Alan Carvalho; Pereira, Luiz Filipe Protasio; Colombo, Carlos Augusto; Vieira, Luiz Gonzaga Esteves; Carazzolle, Marcelo Falsarella; Pereira, Gonçalo Amarante Guimarães

2010-01-01

Polyploidization constitutes a common mode of evolution in flowering plants. This event provides the raw material for the divergence of function in homeologous genes, leading to phenotypic novelty that can contribute to the success of polyploids in nature or their selection for use in agriculture. Mounting evidence underlined the existence of homeologous expression biases in polyploid genomes; however, strategies to analyze such transcriptome regulation remained scarce. Important factors regarding homeologous expression biases remain to be explored, such as whether this phenomenon influences specific genes, how paralogs are affected by genome doubling, and what is the importance of the variability of homeologous expression bias to genotype differences. This study reports the expressed sequence tag assembly of the allopolyploid Coffea arabica and one of its direct ancestors, Coffea canephora. The assembly was used for the discovery of single nucleotide polymorphisms through the identification of high-quality discrepancies in overlapped expressed sequence tags and for gene expression information indirectly estimated by the transcript redundancy. Sequence diversity profiles were evaluated within C. arabica (Ca) and C. canephora (Cc) and used to deduce the transcript contribution of the Coffea eugenioides (Ce) ancestor. The assignment of the C. arabica haplotypes to the C. canephora (CaCc) or C. eugenioides (CaCe) ancestral genomes allowed us to analyze gene expression contributions of each subgenome in C. arabica. In silico data were validated by the quantitative polymerase chain reaction and allele-specific combination TaqMAMA-based method. The presence of differential expression of C. arabica homeologous genes and its implications in coffee gene expression, ontology, and physiology are discussed. PMID:20864545
Difficulties in Generating Specific Antibodies for Immunohistochemical Detection of Nitrosylated Tubulins

PubMed Central

Kamnev, Anton; Muhar, Matthias; Preinreich, Martina; Ammer, Hermann; Propst, Friedrich

2013-01-01

Protein S-nitrosylation, the covalent attachment of a nitroso moiety to thiol groups of specific cysteine residues, is one of the major pathways of nitric oxide signaling. Hundreds of proteins are subject to this transient post-translational modification and for some the functional consequences have been identified. Biochemical assays for the analysis of protein S-nitrosylation have been established and can be used to study if and under what conditions a given protein is S-nitrosylated. In contrast, the equally desirable subcellular localization of specific S-nitrosylated protein isoforms has not been achieved to date. In the current study we attempted to specifically localize S-nitrosylated α- and β-tubulin isoforms in primary neurons after fixation. The approach was based on in situ replacement of the labile cysteine nitroso modification with a stable tag and the subsequent use of antibodies which recognize the tag in the context of the tubulin polypeptide sequence flanking the cysteine residue of interest. We established a procedure for tagging S-nitrosylated proteins in cultured primary neurons and obtained polyclonal anti-tag antibodies capable of specifically detecting tagged proteins on immunoblots and in fixed cells. However, the antibodies were not specific for tubulin isoforms. We suggest that different tagging strategies or alternative methods such as fluorescence resonance energy transfer techniques might be more successful. PMID:23840827
A scalable strategy for high-throughput GFP tagging of endogenous human proteins.

PubMed

Leonetti, Manuel D; Sekine, Sayaka; Kamiyama, Daichi; Weissman, Jonathan S; Huang, Bo

2016-06-21

A central challenge of the postgenomic era is to comprehensively characterize the cellular role of the ∼20,000 proteins encoded in the human genome. To systematically study protein function in a native cellular background, libraries of human cell lines expressing proteins tagged with a functional sequence at their endogenous loci would be very valuable. Here, using electroporation of Cas9 nuclease/single-guide RNA ribonucleoproteins and taking advantage of a split-GFP system, we describe a scalable method for the robust, scarless, and specific tagging of endogenous human genes with GFP. Our approach requires no molecular cloning and allows a large number of cell lines to be processed in parallel. We demonstrate the scalability of our method by targeting 48 human genes and show that the resulting GFP fluorescence correlates with protein expression levels. We next present how our protocols can be easily adapted for the tagging of a given target with GFP repeats, critically enabling the study of low-abundance proteins. Finally, we show that our GFP tagging approach allows the biochemical isolation of native protein complexes for proteomic studies. Taken together, our results pave the way for the large-scale generation of endogenously tagged human cell lines for the proteome-wide analysis of protein localization and interaction networks in a native cellular context.
A large scale analysis of cDNA in Arabidopsis thaliana: generation of 12,028 non-redundant expressed sequence tags from normalized and size-selected cDNA libraries.

PubMed

Asamizu, E; Nakamura, Y; Sato, S; Tabata, S

2000-06-30

For comprehensive analysis of genes expressed in the model dicotyledonous plant, Arabidopsis thaliana, expressed sequence tags (ESTs) were accumulated. Normalized and size-selected cDNA libraries were constructed from aboveground organs, flower buds, roots, green siliques and liquid-cultured seedlings, respectively, and a total of 14,026 5'-end ESTs and 39,207 3'-end ESTs were obtained. The 3'-end ESTs could be clustered into 12,028 non-redundant groups. Similarity search of the non-redundant ESTs against the public non-redundant protein database indicated that 4816 groups show similarity to genes of known function, 1864 to hypothetical genes, and the remaining 5348 are novel sequences. Gene coverage by the non-redundant ESTs was analyzed using the annotated genomic sequences of approximately 10 Mb on chromosomes 3 and 5. A total of 923 regions were hit by at least one EST, among which only 499 regions were hit by the ESTs deposited in the public database. The result indicates that the EST source generated in this project complements the EST data in the public database and facilitates new gene discovery.
Genes are differentially expressed at transcriptional level of Neocaridina denticulata following short-term exposure to nonylphenol.

PubMed

Liu, Chang-Lun; Sung, Hung-Hung

2011-09-01

To assess the toxicity of nonylphenol towards aquatic crustaceans, Neocaridina denticulata were exposed short-term to sublethal concentration (0.001-0.5 mg/L). Following treatment, differentially expressed genes were identified using suppression subtractive hybridization on samples prepared from whole specimens. There were 20 differentially expressed sequence tags that corresponded to known genes and could be divided into six functional classes: defence, translation, metabolism, ribosomal gene expression, respiration, and genes involved in the stress response. Using semi-quantitative RT-PCR, we found that 14 of the differentially expressed sequence tags significantly responded to nonylphenol, including six at a nominal concentration of 0.01 mg/L; among them, 12 genes were down-regulated. These results suggest that under non-lethal concentrations of nonylphenol, the polluted aquatic environment may still present a potential risk to N. denticulata.
Evaluation of the genetic diversity of Plum pox virus in a single plum tree.

PubMed

Predajňa, Lukáš; Šubr, Zdeno; Candresse, Thierry; Glasa, Miroslav

2012-07-01

Genetic diversity of Plum pox virus (PPV) and its distribution within a single perennial woody host (plum, Prunus domestica) has been evaluated. A plum tree was triply infected by chip-budding with PPV-M, PPV-D and PPV-Rec isolates in 2003 and left to develop untreated under open field conditions. In September 2010 leaf and fruit samples were collected from different parts of the tree canopy. A 745-bp NIb-CP fragment of PPV genome, containing the hypervariable region encoding the CP N-terminal end was amplified by RT-PCR from each sample and directly sequenced to determine the dominant sequence. In parallel, the PCR products were cloned and a total of 105 individual clones were sequenced. Sequence analysis revealed that after 7 years of infection, only PPV-M was still detectable in the tree and that the two other isolates (PPV-Rec and PPV-D) had been displaced. Despite the fact that the analysis targeted a relatively short portion of the genome, a substantial amount of intra-isolate variability was observed for PPV-M. A total of 51 different haplotypes could be identified from the 105 individual sequences, two of which were largely dominant. However, no clear-cut structuration of the viral population by the tree architecture could be highlighted although the results obtained suggest the possibility of intra-leaf/fruit differentiation of the viral population. Comparison of the consensus sequence with the original source isolate showed no difference, suggesting within-plant stability of this original isolate under open field conditions. Copyright © 2012 Elsevier B.V. All rights reserved.
Forensic strategy to ensure the quality of sequencing data of mitochondrial DNA in highly degraded samples.

PubMed

Adachi, Noboru; Umetsu, Kazuo; Shojo, Hideki

2014-01-01

Mitochondrial DNA (mtDNA) is widely used for DNA analysis of highly degraded samples because of its polymorphic nature and high number of copies in a cell. However, as endogenous mtDNA in deteriorated samples is scarce and highly fragmented, it is not easy to obtain reliable data. In the current study, we report the risks of direct sequencing mtDNA in highly degraded material, and suggest a strategy to ensure the quality of sequencing data. It was observed that direct sequencing data of the hypervariable segment (HVS) 1 by using primer sets that generate an amplicon of 407 bp (long-primer sets) was different from results obtained by using newly designed primer sets that produce an amplicon of 120-139 bp (mini-primer sets). The data aligned with the results of mini-primer sets analysis in an amplicon length-dependent manner; the shorter the amplicon, the more evident the endogenous sequence became. Coding region analysis using multiplex amplified product-length polymorphisms revealed the incongruence of single nucleotide polymorphisms between the coding region and HVS 1 caused by contamination with exogenous mtDNA. Although the sequencing data obtained using long-primer sets turned out to be erroneous, it was unambiguous and reproducible. These findings suggest that PCR primers that produce amplicons shorter than those currently recognized should be used for mtDNA analysis in highly degraded samples. Haplogroup motif analysis of the coding region and HVS should also be performed to improve the reliability of forensic mtDNA data. Copyright © 2013 Elsevier Ireland Ltd. All rights reserved.
Structured oligonucleotides for target indexing to allow single-vessel PCR amplification and solid support microarray hybridization.

PubMed

Girard, Laurie D; Boissinot, Karel; Peytavi, Régis; Boissinot, Maurice; Bergeron, Michel G

2015-02-07

The combination of molecular diagnostic technologies is increasingly used to overcome limitations on sensitivity, specificity or multiplexing capabilities, and provide efficient lab-on-chip devices. Two such techniques, PCR amplification and microarray hybridization are used serially to take advantage of the high sensitivity and specificity of the former combined with high multiplexing capacities of the latter. These methods are usually performed in different buffers and reaction chambers. However, these elaborate methods have high complexity and cost related to reagent requirements, liquid storage and the number of reaction chambers to integrate into automated devices. Furthermore, microarray hybridizations have a sequence dependent efficiency not always predictable. In this work, we have developed the concept of a structured oligonucleotide probe which is activated by cleavage from polymerase exonuclease activity. This technology is called SCISSOHR for Structured Cleavage Induced Single-Stranded Oligonucleotide Hybridization Reaction. The SCISSOHR probes enable indexing the target sequence to a tag sequence. The SCISSOHR technology also allows the combination of nucleic acid amplification and microarray hybridization in a single vessel in presence of the PCR buffer only. The SCISSOHR technology uses an amplification probe that is irreversibly modified in presence of the target, releasing a single-stranded DNA tag for microarray hybridization. Each tag is composed of a 3-nucleotide sequence-dependent segment and a unique "target sequence-independent" 14-nucleotide segment allowing for optimal hybridization with minimal cross-hybridization. We evaluated the performance of five (5) PCR buffers to support microarray hybridization, compared to a conventional hybridization buffer. Finally, as a proof of concept, we developed a multiplexed assay for the amplification, detection, and identification of three (3) DNA targets. This new technology will facilitate the design of lab-on-chip microfluidic devices, while also reducing consumable costs. At term, it will allow the cost-effective automation of highly multiplexed assays for detection and identification of genetic targets.
Expressed sequence tag analysis of adult human lens for the NEIBank Project: over 2000 non-redundant transcripts, novel genes and splice variants.

PubMed

Wistow, Graeme; Bernstein, Steven L; Wyatt, M Keith; Behal, Amita; Touchman, Jeffrey W; Bouffard, Gerald; Smith, Don; Peterson, Katherine

2002-06-15

To explore the expression profile of the human lens and to provide a resource for microarray studies, expressed sequence tag (EST) analysis has been performed on cDNA libraries from adult lenses. A cDNA library was constructed from two adult (40 year old) human lenses. Over two thousand clones were sequenced from the unamplified, un-normalized library. The library was then normalized and a further 2200 sequences were obtained. All the data were analyzed using GRIST (GRouping and Identification of Sequence Tags), a procedure for gene identification and clustering. The lens library (by) contains a low percentage of non-mRNA contaminants and a high fraction (over 75%) of apparently full length cDNA clones. Approximately 2000 reads from the unamplified library yields 810 clusters, potentially representing individual genes expressed in the lens. After normalization, the content of crystallins and other abundant cDNAs is markedly reduced and a similar number of reads from this library (fs) yields 1455 unique groups of which only two thirds correspond to named genes in GenBank. Among the most abundant cDNAs is one for a novel gene related to glutamine synthetase, which was designated "lengsin" (LGS). Analyses of ESTs also reveal examples of alternative transcripts, including a major alternative splice form for the lens specific membrane protein MP19. Variant forms for other transcripts, including those encoding the apoptosis inhibitor Livin and the armadillo repeat protein ARVCF, are also described. The lens cDNA libraries are a resource for gene discovery, full length cDNAs for functional studies and microarrays. The discovery of an abundant, novel transcript, lengsin, and a major novel splice form of MP19 reflect the utility of unamplified libraries constructed from dissected tissue. Many novel transcripts and splice forms are represented, some of which may be candidates for genetic diseases.
Construction of a yeast artificial chromosome contig encompassing the chromosome 14 Alzheimer`s disease locus

DOE Office of Scientific and Technical Information (OSTI.GOV)

Sharma, V.; Bonnycastle, L.; Poorkai, P.

1994-09-01

We have constructed a yeast artificial chromosome (YAC) contig of chromosome 14q24.3 which encompasses the chromosome 14 Alzheimer`s disease locus (AD3). Determined by linkage analysis of early-onset Alzheimer`s disease kindreds, this interval is bounded by the genetic markers D14S61-D14S63 and spans approximately 15 centimorgans. The contig consists of 29 markers and 74 YACs of which 57 are defined by one or more sequence tagged sites (STSs). The STS markers comprise 5 genes, 16 short tandem repeat polymorphisms and 8 cDNA clones. An additional number of genes, expressed sequence tags and cDNA fragments have been identified and localized to the contigmore » by hybridization and sequence analysis of anonymous clones isolated by cDNA direct selection techniques. A minimal contig of about 15 YACs averaging 0.5-1.5 megabase in length will span this interval and is, at first approximation, in rough agreement with the genetic map. For two regions of the contig, our coverage has relied on L1/THE fingerprint and Alu-PCR hybridization data of YACs provided by CEPH/Genethon. We are currently developing sequence tagged sites from these to confirm the overlaps revealed by the fingerprint data. Among the genes which map to the contig are transforming growth factor beta 3, c-fos, and heat shock protein 2A (HSPA2). C-fos is not a candidate gene for AD3 based on the sequence analysis of affected and unaffected individuals. HSPA2 maps to the proximal edge of the contig and Calmodulin 1, a candidate gene from 4q24.3, maps outside of the region. The YAC contig is a framework physical map from which cosmid or P1 clone contigs can be constructed. As more genes and cDNAs are mapped, a highly resolved transcription map will emerge, a necessary step towards positionally cloning the AD3 gene.« less
Proposal of a Consensus Set of Hypervariable Mycobacterial Interspersed Repetitive-Unit–Variable-Number Tandem-Repeat Loci for Subtyping of Mycobacterium tuberculosis Beijing Isolates

PubMed Central

Allix-Béguec, Caroline; Wahl, Céline; Hanekom, Madeleine; Nikolayevskyy, Vladyslav; Drobniewski, Francis; Maeda, Shinji; Campos-Herrero, Isolina; Mokrousov, Igor; Niemann, Stefan; Kontsevaya, Irina; Rastogi, Nalin; Samper, Sofia; Sng, Li-Hwei; Warren, Robin M.

2014-01-01

Mycobacterium tuberculosis Beijing strains represent targets of special importance for molecular surveillance of tuberculosis (TB), especially because they are associated with spread of multidrug resistance in some world regions. Standard 24-locus mycobacterial interspersed repetitive-unit–variable-number tandem-repeat (MIRU-VNTR) typing lacks resolution power for accurately discriminating closely related clones that often compose Beijing strain populations. Therefore, we evaluated a set of 7 additional, hypervariable MIRU-VNTR loci for better resolution and tracing of such strains, using a collection of 535 Beijing isolates from six world regions where these strains are known to be prevalent. The typeability and interlaboratory reproducibility of these hypervariable loci were lower than those of the 24 standard loci. Three loci (2163a, 3155, and 3336) were excluded because of their redundant variability and/or more frequent noninterpretable results compared to the 4 other markers. The use of the remaining 4-locus set (1982, 3232, 3820, and 4120) increased the number of types by 52% (from 223 to 340) and reduced the clustering rate from 58.3 to 36.6%, when combined with the use of the standard 24-locus set. Known major clonal complexes/24-locus-based clusters were all subdivided, although the degree of subdivision varied depending on the complex. Only five single-locus variations were detected among the hypervariable loci of an additional panel of 92 isolates, representing 15 years of clonal spread of a single Beijing strain in a geographically restricted setting. On this calibrated basis, we propose this 4-locus set as a consensus for subtyping Beijing clonal complexes and clusters, after standard typing. PMID:24172154
Proposal of a consensus set of hypervariable mycobacterial interspersed repetitive-unit-variable-number tandem-repeat loci for subtyping of Mycobacterium tuberculosis Beijing isolates.

PubMed

Allix-Béguec, Caroline; Wahl, Céline; Hanekom, Madeleine; Nikolayevskyy, Vladyslav; Drobniewski, Francis; Maeda, Shinji; Campos-Herrero, Isolina; Mokrousov, Igor; Niemann, Stefan; Kontsevaya, Irina; Rastogi, Nalin; Samper, Sofia; Sng, Li-Hwei; Warren, Robin M; Supply, Philip

2014-01-01

Mycobacterium tuberculosis Beijing strains represent targets of special importance for molecular surveillance of tuberculosis (TB), especially because they are associated with spread of multidrug resistance in some world regions. Standard 24-locus mycobacterial interspersed repetitive-unit-variable-number tandem-repeat (MIRU-VNTR) typing lacks resolution power for accurately discriminating closely related clones that often compose Beijing strain populations. Therefore, we evaluated a set of 7 additional, hypervariable MIRU-VNTR loci for better resolution and tracing of such strains, using a collection of 535 Beijing isolates from six world regions where these strains are known to be prevalent. The typeability and interlaboratory reproducibility of these hypervariable loci were lower than those of the 24 standard loci. Three loci (2163a, 3155, and 3336) were excluded because of their redundant variability and/or more frequent noninterpretable results compared to the 4 other markers. The use of the remaining 4-locus set (1982, 3232, 3820, and 4120) increased the number of types by 52% (from 223 to 340) and reduced the clustering rate from 58.3 to 36.6%, when combined with the use of the standard 24-locus set. Known major clonal complexes/24-locus-based clusters were all subdivided, although the degree of subdivision varied depending on the complex. Only five single-locus variations were detected among the hypervariable loci of an additional panel of 92 isolates, representing 15 years of clonal spread of a single Beijing strain in a geographically restricted setting. On this calibrated basis, we propose this 4-locus set as a consensus for subtyping Beijing clonal complexes and clusters, after standard typing.
Tandem SUMO fusion vectors for improving soluble protein expression and purification.

PubMed

Guerrero, Fernando; Ciragan, Annika; Iwaï, Hideo

2015-12-01

Availability of highly purified proteins in quantity is crucial for detailed biochemical and structural investigations. Fusion tags are versatile tools to facilitate efficient protein purification and to improve soluble overexpression of proteins. Various purification and fusion tags have been widely used for overexpression in Escherichia coli. However, these tags might interfere with biological functions and/or structural investigations of the protein of interest. Therefore, an additional purification step to remove fusion tags by proteolytic digestion might be required. Here, we describe a set of new vectors in which yeast SUMO (SMT3) was used as the highly specific recognition sequence of ubiquitin-like protease 1, together with other commonly used solubility enhancing proteins, such as glutathione S-transferase, maltose binding protein, thioredoxin and trigger factor for optimizing soluble expression of protein of interest. This tandem SUMO (T-SUMO) fusion system was tested for soluble expression of the C-terminal domain of TonB from different organisms and for the antiviral protein scytovirin. Copyright © 2015 Elsevier Inc. All rights reserved.
Optimized Design and Synthesis of Cell Permeable Biarsenical Cyanine Probe for Imaging Tagged Cytosolic Bacterial Proteins

DOE Office of Scientific and Technical Information (OSTI.GOV)

Fu, Na; Xiong, Yijia; Squier, Thomas C.

2013-01-21

To optimize cellular delivery and specific labeling of tagged cytosolic proteins by biarsenical fluorescent probes build around a cyanine dye scaffold, we have systematically varied the polarity of the hydrophobic tails (i.e., 4-5 methylene groups appended by a sulfonate or methoxy ester moiety) and arsenic capping reagent (ethanedithiol versus benzenedithiol). Targeted labeling of the cytosolic proteins SlyD and the alpha subunit of RNA polymerase engineered with a tetracysteine tagging sequences demonstrate the utility of the newly synthesized probes for live-cell visualization, albeit with varying efficiencies and background intensities. Optimal routine labeling and visualization is apparent using the ethanedithiol capping reagentmore » with the uncharged methoxy ester functionalized acyl chains. These measurements demonstrate the general utility of this class of photostable and highly fluorescent biarsenical reagents based on the cyanine scaffold for in vivo targeting of tagged cellular proteins for live cell measurements of protein dynamics.« less
Completely monodisperse, highly repetitive proteins for bioconjugate capillary electrophoresis: Development and characterization

PubMed Central

Lin, Jennifer S.; Albrecht, Jennifer Coyne; Meagher, Robert J.; Wang, Xiaoxiao; Barron, Annelise E.

2011-01-01

Protein-based polymers are increasingly being used in biomaterial applications due to their ease of customization and potential monodispersity. These advantages make protein polymers excellent candidates for bioanalytical applications. Here we describe improved methods for producing drag-tags for Free-Solution Conjugate Electrophoresis (FSCE). FSCE utilizes a pure, monodisperse recombinant protein, tethered end-on to a ssDNA molecule, to enable DNA size separation in aqueous buffer. FSCE also provides a highly sensitive method to evaluate the polydispersity of a protein drag-tag and thus its suitability for bioanalytical uses. This method is able to detect slight differences in drag-tag charge or mass. We have devised an improved cloning, expression, and purification strategy that enables us to generate, for the first time, a truly monodisperse 20 kDa protein polymer and a nearly monodisperse 38 kDa protein. These newly produced proteins can be used as drag-tags to enable longer read DNA sequencing by free-solution microchannel electrophoresis. PMID:21553840

Association of ESR1 gene tagging SNPs with breast cancer risk

PubMed Central

Dunning, Alison M.; Healey, Catherine S.; Baynes, Caroline; Maia, Ana-Teresa; Scollen, Serena; Vega, Ana; Rodríguez, Raquel; Barbosa-Morais, Nuno L.; Ponder, Bruce A.J.; Low, Yen-Ling; Bingham, Sheila; Haiman, Christopher A.; Le Marchand, Loic; Broeks, Annegien; Schmidt, Marjanka K.; Hopper, John; Southey, Melissa; Beckmann, Matthias W.; Fasching, Peter A.; Peto, Julian; Johnson, Nichola; Bojesen, Stig E.; Nordestgaard, Børge; Milne, Roger L.; Benitez, Javier; Hamann, Ute; Ko, Yon; Schmutzler, Rita K.; Burwinkel, Barbara; Schürmann, Peter; Dörk, Thilo; Heikkinen, Tuomas; Nevanlinna, Heli; Lindblom, Annika; Margolin, Sara; Mannermaa, Arto; Kosma, Veli-Matti; Chen, Xiaoqing; Spurdle, Amanda; Change-Claude, Jenny; Flesch-Janys, Dieter; Couch, Fergus J.; Olson, Janet E.; Severi, Gianluca; Baglietto, Laura; Børresen-Dale, Anne-Lise; Kristensen, Vessela; Hunter, David J.; Hankinson, Susan E.; Devilee, Peter; Vreeswijk, Maaike; Lissowska, Jolanta; Brinton, Louise; Liu, Jianjun; Hall, Per; Kang, Daehee; Yoo, Keun-Young; Shen, Chen-Yang; Yu, Jyh-Cherng; Anton-Culver, Hoda; Ziogoas, Argyrios; Sigurdson, Alice; Struewing, Jeff; Easton, Douglas F.; Garcia-Closas, Montserrat; Humphreys, Manjeet K.; Morrison, Jonathan; Pharoah, Paul D.P.; Pooley, Karen A.; Chenevix-Trench, Georgia

2009-01-01

We have conducted a three-stage, comprehensive single nucleotide polymorphism (SNP)-tagging association study of ESR1 gene variants (SNPs) in more than 55 000 breast cancer cases and controls from studies within the Breast Cancer Association Consortium (BCAC). No large risks or highly significant associations were revealed. SNP rs3020314, tagging a region of ESR1 intron 4, is associated with an increase in breast cancer susceptibility with a dominant mode of action in European populations. Carriers of the c-allele have an odds ratio (OR) of 1.05 [95% Confidence Intervals (CI) 1.02–1.09] relative to t-allele homozygotes, P = 0.004. There is significant heterogeneity between studies, P = 0.002. The increased risk appears largely confined to oestrogen receptor-positive tumour risk. The region tagged by SNP rs3020314 contains sequence that is more highly conserved across mammalian species than the rest of intron 4, and it may subtly alter the ratio of two mRNA splice forms. PMID:19126777
A set of plastid loci for use in multiplex fragment length genotyping for intraspecific variation in Pinus (Pinaceae)1

PubMed Central

Wofford, Austin M.; Finch, Kristen; Bigott, Adam; Willyard, Ann

2014-01-01

• Premise of the study: Recently released Pinus plastome sequences support characterization of 15 plastid simple sequence repeat (cpSSR) loci originally published for P. contorta and P. thunbergii. This allows selection of loci for single-tube PCR multiplexed genotyping in any subsection of the genus. • Methods: Unique placement of primers and primer conservation across the genus were investigated, and a set of six loci were selected for single-tube multiplexing. We compared interspecific variation between cpSSRs and nucleotide sequences of ycf1 and tested intraspecific variation for cpSSRs using 911 samples in the P. ponderosa species complex. • Results: The cpSSR loci contain mononucleotide and complex repeats with additional length variation in flanking regions. They are not located in hypervariable regions, and most primers are conserved across the genus. A single PCR per sample multiplexed for six loci yielded 45 alleles in 911 samples. • Discussion: The protocol allows efficient genotyping of many samples. The cpSSR loci are too variable for Pinus phylogenies but are useful for the study of genetic structure within and among populations. The multiplex method could easily be extended to other plant groups by choosing primers for cpSSR loci in a plastome alignment for the target group. PMID:25202625
mtDNA variation in the Yanomami: evidence for additional New World founding lineages.

PubMed

Easton, R D; Merriwether, D A; Crews, D E; Ferrell, R E

1996-07-01

Native Americans have been classified into four founding haplogroups with as many as seven founding lineages based on mtDNA RFLPs and DNA sequence data. mtDNA analysis was completed for 83 Yanomami from eight villages in the Surucucu and Catrimani Plateau regions of Roraima in northwestern Brazil. Samples were typed for 15 polymorphic mtDNA sites (14 RFLP sites and 1 deletion site), and a subset was sequenced for both hypervariable regions of the mitochondrial D-loop. Substantial mitochondrial diversity was detected among the Yanomami, five of seven accepted founding haplotypes and three others were observed. Of the 83 samples, 4 (4.8%) were lineage B1, 1 (1.2%) was lineage B2, 31 (37.4%) were lineage C1, 29 (34.9%) were lineage C2, 2 (2.4%) were lineage D1, 6 (7.2%) were lineage D2, 7 (8.4%) were a haplotype we designated "X6," and 3 (3.6%) were a haplotype we designated "X7." Sequence analysis found 43 haplotypes in 50 samples. B2, X6, and X7 are previously unrecognized mitochondrial founding lineage types of Native Americans. The widespread distribution of these haplotypes in the New World and Asia provides support for declaring these lineages to be New World founding types.
mtDNA variation in the Yanomami: evidence for additional New World founding lineages.

PubMed Central

Easton, R. D.; Merriwether, D. A.; Crews, D. E.; Ferrell, R. E.

1996-01-01

Native Americans have been classified into four founding haplogroups with as many as seven founding lineages based on mtDNA RFLPs and DNA sequence data. mtDNA analysis was completed for 83 Yanomami from eight villages in the Surucucu and Catrimani Plateau regions of Roraima in northwestern Brazil. Samples were typed for 15 polymorphic mtDNA sites (14 RFLP sites and 1 deletion site), and a subset was sequenced for both hypervariable regions of the mitochondrial D-loop. Substantial mitochondrial diversity was detected among the Yanomami, five of seven accepted founding haplotypes and three others were observed. Of the 83 samples, 4 (4.8%) were lineage B1, 1 (1.2%) was lineage B2, 31 (37.4%) were lineage C1, 29 (34.9%) were lineage C2, 2 (2.4%) were lineage D1, 6 (7.2%) were lineage D2, 7 (8.4%) were a haplotype we designated "X6," and 3 (3.6%) were a haplotype we designated "X7." Sequence analysis found 43 haplotypes in 50 samples. B2, X6, and X7 are previously unrecognized mitochondrial founding lineage types of Native Americans. The widespread distribution of these haplotypes in the New World and Asia provides support for declaring these lineages to be New World founding types. PMID:8659527
[Application of multiple polymorphism genetic markers in determination of half sibling sharing a same mother].

PubMed

Que, Ting-zhi; Zhao, Shu-min; Li, Cheng-tao

2010-08-01

Determination strategies for half sibling sharing a same mother were investigated through the detection of autosomal and X-chromosomal STR (X-STR) loci and polymorphisms on hypervariable (HV) region of mitochondrial DNA (mtDNA). Genomic DNA were extracted from blood stain samples of the 3 full siblings and one dubious half sibling sharing the same mother with them. Fifteen autosomal STR loci were genotyped by Sinofiler kit, and 19 X-STR loci were genotyped by Mentype Argus X-8 kit and 16 plex in-house system. Polymorphisms of mtDNA HV-I and HV-II were also detected with sequencing technology. Full sibling relationship between the dubious half sibling and each of the 3 full siblings were excluded based on the results of autosomal STR genotyping and calculation of full sibling index (FSI) and half sibling index (HIS). Results of sequencing for mtDNA HV-I and HV-II showed that all of the 4 samples came from a same maternal line. X-STR genotyping results determined that the dubious half sibling shared a same mother with the 3 full siblings. It is reliable to combine three different genotyping technologies including autosomal STR, X-STR and sequencing of mtDNA HV-I and HV-II for determination of half sibling sharing a same mother.
Genetic analysis of 7 medieval skeletons from Aragonese Pyrenees

PubMed Central

Núńez, Carolina; Sosa, Cecilia; Baeta, Miriam; Geppert, Maria; Turnbough, Meredith; Phillips, Nicole; Casalod, Yolanda; Bolea, Miguel; Roby, Rhonda; Budowle, Bruce; Martínez-Jarreta, Begońa

2011-01-01

Aim To perform a genetic characterization of 7 skeletons from medieval age found in a burial site in the Aragonese Pyrenees. Methods Allele frequencies of autosomal short tandem repeats (STR) loci were determined by 3 different STR systems. Mitochondrial DNA (mtDNA) and Y-chromosome haplogroups were determined by sequencing of the hypervariable segment 1 of mtDNA and typing of phylogenetic Y chromosome single nucleotide polymorphisms (Y-SNP) markers, respectively. Possible familial relationships were also investigated. Results Complete or partial STR profiles were obtained in 3 of the 7 samples. Mitochondrial DNA haplogroup was determined in 6 samples, with 5 of them corresponding to the haplogroup H and 1 to the haplogroup U5a. Y-chromosome haplogroup was determined in 2 samples, corresponding to the haplogroup R. In one of them, the sub-branch R1b1b2 was determined. mtDNA sequences indicated that some of the individuals could be maternally related, while STR profiles indicated no direct family relationships. Conclusions Despite the antiquity of the samples and great difficulty that genetic analyses entail, the combined use of autosomal STR markers, Y-chromosome informative SNPs, and mtDNA sequences allowed us to genotype a group of skeletons from the medieval age. PMID:21674829
Myticalins: A Novel Multigenic Family of Linear, Cationic Antimicrobial Peptides from Marine Mussels (Mytilus spp.).

PubMed

Leoni, Gabriele; De Poli, Andrea; Mardirossian, Mario; Gambato, Stefano; Florian, Fiorella; Venier, Paola; Wilson, Daniel N; Tossi, Alessandro; Pallavicini, Alberto; Gerdol, Marco

2017-08-22

The application of high-throughput sequencing technologies to non-model organisms has brought new opportunities for the identification of bioactive peptides from genomes and transcriptomes. From this point of view, marine invertebrates represent a potentially rich, yet largely unexplored resource for de novo discovery due to their adaptation to diverse challenging habitats. Bioinformatics analyses of available genomic and transcriptomic data allowed us to identify myticalins, a novel family of antimicrobial peptides (AMPs) from the mussel Mytilus galloprovincialis , and a similar family of AMPs from Modiolus spp., named modiocalins. Their coding sequence encompasses two conserved N-terminal (signal peptide) and C-terminal (propeptide) regions and a hypervariable central cationic region corresponding to the mature peptide. Myticalins are taxonomically restricted to Mytiloida and they can be classified into four subfamilies. These AMPs are subject to considerable interindividual sequence variability and possibly to presence/absence variation. Functional assays performed on selected members of this family indicate a remarkable tissue-specific expression (in gills) and broad spectrum of activity against both Gram-positive and Gram-negative bacteria. Overall, we present the first linear AMPs ever described in marine mussels and confirm the great potential of bioinformatics tools for the de novo discovery of bioactive peptides in non-model organisms.
The paradox of MHC-DRB exon/intron evolution: alpha-helix and beta-sheet encoding regions diverge while hypervariable intronic simple repeats coevolve with beta-sheet codons.

PubMed

Schwaiger, F W; Weyers, E; Epplen, C; Brün, J; Ruff, G; Crawford, A; Epplen, J T

1993-09-01

Twenty-one different caprine and 13 ovine MHC-DRB exon 2 sequences were determined including part of the adjacent introns containing simple repetitive (gt)n(ga)m elements. The positions for highly polymorphic DRB amino acids vary slightly among ungulates and other mammals. From man and mouse to ungulates the basic (gt)n(ga)m structure is fixed in evolution for 7 x 10(7) years whereas ample variations exist in the tandem (gt)n and (ga)m dinucleotides and especially their "degenerated" derivatives. Phylogenetic trees for the alpha-helices and beta-pleated sheets of the ungulate DRB sequences suggest different evolutionary histories. In hoofed animals as well as in humans DRB beta-sheet encoding sequences and adjacent intronic repeats can be assembled into virtually identical groups suggesting coevolution of noncoding as well as coding DNA. In contrast alpha-helices and C-terminal parts of the first DRB domain evolve distinctly. In the absence of a defined mechanism causing specific, site-directed mutations, double-recombination or gene-conversion-like events would readily explain this fact. The role of the intronic simple (gt)n(ga)m repeat is discussed with respect to these genetic exchange mechanisms during evolution.
Generation and analysis of expressed sequence tags from the bone marrow of Chinese Sika deer.

PubMed

Yao, Baojin; Zhao, Yu; Zhang, Mei; Li, Juan

2012-03-01

Sika deer is one of the best-known and highly valued animals of China. Despite its economic, cultural, and biological importance, there has not been a large-scale sequencing project for Sika deer to date. With the ultimate goal of sequencing the complete genome of this organism, we first established a bone marrow cDNA library for Sika deer and generated a total of 2,025 reads. After processing the sequences, 2,017 high-quality expressed sequence tags (ESTs) were obtained. These ESTs were assembled into 1,157 unigenes, including 238 contigs and 919 singletons. Comparative analyses indicated that 888 (76.75%) of the unigenes had significant matches to sequences in the non-redundant protein database, In addition to highly expressed genes, such as stearoyl-CoA desaturase, cytochrome c oxidase, adipocyte-type fatty acid-binding protein, adiponectin and thymosin beta-4, we also obtained vascular endothelial growth factor-A and heparin-binding growth-associated molecule, both of which are of great importance for angiogenesis research. There were 244 (21.09%) unigenes with no significant match to any sequence in current protein or nucleotide databases, and these sequences may represent genes with unknown function in Sika deer. Open reading frame analysis of the sequences was performed using the getorf program. In addition, the sequences were functionally classified using the gene ontology hierarchy, clusters of orthologous groups of proteins and Kyoto encyclopedia of genes and genomes databases. Analysis of ESTs described in this paper provides an important resource for the transcriptome exploration of Sika deer, and will also facilitate further studies on functional genomics, gene discovery and genome annotation of Sika deer.
Leveraging algal omics to reveal potential targets for augmenting TAG accumulation

DOE Office of Scientific and Technical Information (OSTI.GOV)

Arora, Neha; Pienkos, Philip T.; Pruthi, Vikas

Ongoing global efforts to commercialize microalgal biofuels have expedited the use of multi-omics techniques to gain insights into lipid biosynthetic pathways. Functional genomics analyses have recently been employed to complement existing sequence-level omics studies, shedding light on the dynamics of lipid synthesis and its interplay with other cellular metabolic pathways, thus revealing possible targets for metabolic engineering. Here, we review the current status of algal omics studies to reveal potential targets to augment TAG accumulation in various microalgae. Here, this review specifically aims to examine and catalog systems level data related to stress-induced TAG accumulation in oleaginous microalgae and informmore » future metabolic engineering strategies to develop strains with enhanced bioproductivity, which could pave a path for sustainable green energy.« less
Leveraging Algal Omics to Reveal Potential Targets for Augmenting TAG Accumulation

DOE Office of Scientific and Technical Information (OSTI.GOV)

Guarnieri, Michael T; Pienkos, Philip T; Arora, Neha

2018-04-18

Ongoing global efforts to commercialize microalgal biofuels have expedited the use of multi-omics techniques to gain insights into lipid biosynthetic pathways. Functional genomics analyses have recently been employed to complement existing sequence-level omics studies, shedding light on the dynamics of lipid synthesis and its interplay with other cellular metabolic pathways, thus revealing possible targets for metabolic engineering. Here, we review the current status of algal omics studies to reveal potential targets to augment TAG accumulation in various microalgae. This review specifically aims to examine and catalog systems level data related to stress-induced TAG accumulation in oleaginous microalgae and inform futuremore » metabolic engineering strategies to develop strains with enhanced bioproductivity, which could pave a path for sustainable green energy.« less
Distinguishing Aspartic and Isoaspartic Acids in Peptides by Several Mass Spectrometric Fragmentation Methods

NASA Astrophysics Data System (ADS)

DeGraan-Weber, Nick; Zhang, Jun; Reilly, James P.

2016-12-01

Six ion fragmentation techniques that can distinguish aspartic acid from its isomer, isoaspartic acid, were compared. MALDI post-source decay (PSD), MALDI 157 nm photodissociation, tris(2,4,6-trimethoxyphenyl)phosphonium bromide (TMPP) charge tagging in PSD and photodissociation, ESI collision-induced dissociation (CID), electron transfer dissociation (ETD), and free-radical initiated peptide sequencing (FRIPS) with CID were applied to peptides containing either aspartic or isoaspartic acid. Diagnostic ions, such as the y-46 and b+H2O, are present in PSD, photodissociation, and charge tagging. c•+57 and z-57 ions are observed in ETD and FRIPS experiments. For some molecules, aspartic and isoaspartic acid yield ion fragments with significantly different intensities. ETD and charge tagging appear to be most effective at distinguishing these residues.
Expression, purification, and DNA-binding activity of the Herbaspirillum seropedicae RecX protein.

PubMed

Galvão, Carolina W; Pedrosa, Fábio O; Souza, Emanuel M; Yates, M Geoffrey; Chubatsu, Leda S; Steffens, Maria Berenice R

2004-06-01

The Herbaspirillum seropedicae RecX protein participates in the SOS response: a process in which the RecA protein plays a central role. The RecX protein of the H. seropedicae, fused to a His-tag sequence (RecX His-tagged), was over-expressed in Escherichia coli and purified by metal-affinity chromatography to yield a highly purified and active protein. DNA band-shift assays showed that the RecX His-tagged protein bound to both circular and linear double-stranded DNA and also to circular single-stranded DNA. The apparent affinity of RecX for DNA decreased in the presence of Mg(2+) ions. The ability of RecX to bind DNA may be relevant to its function in the SOS response.
Leveraging algal omics to reveal potential targets for augmenting TAG accumulation

DOE PAGES

Arora, Neha; Pienkos, Philip T.; Pruthi, Vikas; ...

2018-04-18

Ongoing global efforts to commercialize microalgal biofuels have expedited the use of multi-omics techniques to gain insights into lipid biosynthetic pathways. Functional genomics analyses have recently been employed to complement existing sequence-level omics studies, shedding light on the dynamics of lipid synthesis and its interplay with other cellular metabolic pathways, thus revealing possible targets for metabolic engineering. Here, we review the current status of algal omics studies to reveal potential targets to augment TAG accumulation in various microalgae. Here, this review specifically aims to examine and catalog systems level data related to stress-induced TAG accumulation in oleaginous microalgae and informmore » future metabolic engineering strategies to develop strains with enhanced bioproductivity, which could pave a path for sustainable green energy.« less
Leveraging algal omics to reveal potential targets for augmenting TAG accumulation.

PubMed

Arora, Neha; Pienkos, Philip T; Pruthi, Vikas; Poluri, Krishna Mohan; Guarnieri, Michael T

2018-04-18

Ongoing global efforts to commercialize microalgal biofuels have expedited the use of multi-omics techniques to gain insights into lipid biosynthetic pathways. Functional genomics analyses have recently been employed to complement existing sequence-level omics studies, shedding light on the dynamics of lipid synthesis and its interplay with other cellular metabolic pathways, thus revealing possible targets for metabolic engineering. Here, we review the current status of algal omics studies to reveal potential targets to augment TAG accumulation in various microalgae. This review specifically aims to examine and catalog systems level data related to stress-induced TAG accumulation in oleaginous microalgae and inform future metabolic engineering strategies to develop strains with enhanced bioproductivity, which could pave a path for sustainable green energy. Copyright © 2018. Published by Elsevier Inc.
Method for rapid base sequencing in DNA and RNA with two base labeling

DOEpatents

Jett, J.H.; Keller, R.A.; Martin, J.C.; Posner, R.G.; Marrone, B.L.; Hammond, M.L.; Simpson, D.J.

1995-04-11

A method is described for rapid-base sequencing in DNA and RNA with two-base labeling and employing fluorescent detection of single molecules at two wavelengths. Bases modified to accept fluorescent labels are used to replicate a single DNA or RNA strand to be sequenced. The bases are then sequentially cleaved from the replicated strand, excited with a chosen spectrum of electromagnetic radiation, and the fluorescence from individual, tagged bases detected in the order of cleavage from the strand. 4 figures.
Method for rapid base sequencing in DNA and RNA with two base labeling

DOEpatents

Jett, James H.; Keller, Richard A.; Martin, John C.; Posner, Richard G.; Marrone, Babetta L.; Hammond, Mark L.; Simpson, Daniel J.

1995-01-01

Method for rapid-base sequencing in DNA and RNA with two-base labeling and employing fluorescent detection of single molecules at two wavelengths. Bases modified to accept fluorescent labels are used to replicate a single DNA or RNA strand to be sequenced. The bases are then sequentially cleaved from the replicated strand, excited with a chosen spectrum of electromagnetic radiation, and the fluorescence from individual, tagged bases detected in the order of cleavage from the strand.
Expressed sequence tags from heat-shocked seagrass Zostera noltii (Hornemann) from its southern distribution range.

PubMed

Massa, Sónia I; Pearson, Gareth A; Aires, Tânia; Kube, Michael; Olsen, Jeanine L; Reinhardt, Richard; Serrão, Ester A; Arnaud-Haond, Sophie

2011-09-01

Predicted global climate change threatens the distributional ranges of species worldwide. We identified genes expressed in the intertidal seagrass Zostera noltii during recovery from a simulated low tide heat-shock exposure. Five Expressed Sequence Tag (EST) libraries were compared, corresponding to four recovery times following sub-lethal temperature stress, and a non-stressed control. We sequenced and analyzed 7009 sequence reads from 30min, 2h, 4h and 24h after the beginning of the heat-shock (AHS), and 1585 from the control library, for a total of 8594 sequence reads. Among 51 Tentative UniGenes (TUGs) exhibiting significantly different expression between libraries, 19 (37.3%) were identified as 'molecular chaperones' and were over-expressed following heat-shock, while 12 (23.5%) were 'photosynthesis TUGs' generally under-expressed in heat-shocked plants. A time course analysis of expression showed a rapid increase in expression of the molecular chaperone class, most of which were heat-shock proteins; which increased from 2 sequence reads in the control library to almost 230 in the 30min AHS library, followed by a slow decrease during further recovery. In contrast, 'photosynthesis TUGs' were under-expressed 30min AHS compared with the control library, and declined progressively with recovery time in the stress libraries, with a total of 29 sequence reads 24h AHS, compared with 125 in the control. A total of 4734 TUGs were screened for EST-Single Sequence Repeats (EST-SSRs) and 86 microsatellites were identified. Copyright © 2011 Elsevier B.V. All rights reserved.
Genetic polymorphisms in prehistoric Pacific islanders determined by analysis of ancient bone DNA.

PubMed

Hagelberg, E; Clegg, J B

1993-05-22

A previously characterized Asian-specific mitochondrial DNA (mtDNA) length mutation has been detected in DNA isolated from prehistoric human bones from Polynesia, including Hawaii, Chatham Islands and Society Islands. In contrast, the Asian mutation was absent in skeletal samples from the Melanesian archipelagos of New Britain and Vanuatu and in the oldest samples from Fiji, Tonga and Samoa in the central Pacific (2700-1600 years BP) although it was present in a more recent prehistoric sample from Tonga. These results, augmented by informative DNA sequence data from the hypervariable region of mtDNA, fail to support current views that the central Pacific was settled directly by voyagers from island Southeast Asia, the putative ancestors of modern Polynesians. An earlier occupation by peoples from the neighbouring Melanesian archipelagos seems more likely.
Genetic analysis and ethnic affinities from two Scytho-Siberian skeletons.

PubMed

Ricaut, François-Xavier; Keyser-Tracqui, Christine; Cammaert, Laurence; Crubézy, Eric; Ludes, Bertrand

2004-04-01

We extracted DNA from two skeletons belonging to the Sytho-Siberian population, which were excavated from the Sebÿstei site (dating back 2,500 years) in the Altai Republic (Central Asia). Ancient DNA was analyzed by autosomal short tandem repeats (STRs) and by the sequencing of the hypervariable region 1 (HV1) of the mitochondrial DNA (mtDNA) control region. The results showed that these two skeletons were not close relatives. Moreover, their haplogroups were characteristic of Asian populations. Comparison with the haplogroup of 3,523 Asian and American individuals linked one skeleton with a putative ancestral paleo-Asiatic population and the other with Chinese populations. It appears that the genetic study of ancient populations of Central Asia brings important elements to the understanding of human population movements in Asia. Copyright 2003 Wiley-Liss, Inc.

In vivo expression and purification of aptamer-tagged small RNA regulators

PubMed Central

Said, Nelly; Rieder, Renate; Hurwitz, Robert; Deckert, Jochen; Urlaub, Henning; Vogel, Jörg

2009-01-01

Small non-coding RNAs (sRNAs) are an emerging class of post-transcriptional regulators of bacterial gene expression. To study sRNAs and their potential protein interaction partners, it is desirable to purify sRNAs from cells in their native form. Here, we used RNA-based affinity chromatography to purify sRNAs following their expression as aptamer-tagged variants in vivo. To this end, we developed a family of plasmids to express sRNAs with any of three widely used aptamer sequences (MS2, boxB, eIF4A), and systematically tested how the aptamer tagging impacted on intracellular accumulation and target regulation of the Salmonella GcvB, InvR or RybB sRNAs. In addition, we successfully tagged the chromosomal rybB gene with MS2 to observe that RybB-MS2 is fully functional as an envelope stress-induced repressor of ompN mRNA following induction of sigmaE. We further demonstrate that the common sRNA-binding protein, Hfq, co-purifies with MS2-tagged sRNAs of Salmonella. The presented affinity purification strategy may facilitate the isolation of in vivo assembled sRNA–protein complexes in a wide range of bacteria. PMID:19726584
Synthesis of Fe3O4@nickel-silicate core-shell nanoparticles for His-tagged enzyme immobilizing agents

NASA Astrophysics Data System (ADS)

Shin, Moo-Kwang; Kang, Byunghoon; Yoon, Nam-Kyung; Kim, Myeong-Hoon; Ki, Jisun; Han, Seungmin; Ahn, Jung-Oh; Haam, Seungjoo

2016-12-01

Immobilizing enzymes on artificially fabricated carriers for their efficient use and easy removal from reactants has attracted enormous interest for decades. Specifically, binding platforms using inorganic nanoparticles have been widely explored because of the benefits of their large surface area, easy surface modification, and high stability in various pH and temperatures. Herein, we fabricated Fe3O4 encapsulated ‘sea-urchin’ shaped nickel-silicate nanoparticles with a facile synthetic route. The enzymes were then rapidly and easily immobilized with poly-histidine tags (His-tags) and nickel ion affinity. Porous nickel silicate covered nanoparticles achieved a high immobilization capacity (85 μg mg-1) of His-tagged tobacco etch virus (TEV) protease. To investigate immobilized TEV protease enzymatic activity, we analyzed the cleaved quantity of maltose binding protein-exendin-fused immunoglobulin fusion protein, which connected with the TEV protease-specific cleavage peptide sequence. Moreover, TEV protease immobilized nanocomplexes conveniently removed and recollected from the reactant by applying an external magnetic field, maintained their enzymatic activity after reuse. Therefore, our newly developed nanoplatform for His-tagged enzyme immobilization provides advantageous features for biotechnological industries including recombinant protein processing.
Development and evaluation of 200 novel SNP assays for population genetic studies of westslope cutthroat trout and genetic identification of related taxa

Treesearch

N. R. Campbell; S. J. Amish; V. L. Prichard; K. M. McKelvey; M. K. Young; M. K. Schwartz; J. C. Garza; G. Luikart; S. R. Narum

2012-01-01

DNA sequence data were collected and screened for single nucleotide polymorphisms (SNPs) in westslope cutthroat trout (Oncorhynchus clarki lewisi) and also for substitutions that could be used to genetically discriminate rainbow trout (O. mykiss) and cutthroat trout, as well as several cutthroat trout subspecies. In total, 260 expressed sequence tag-derived loci were...
Integration of transcriptomic and proteomic data from a single wheat cultivar provides new tools for understanding the roles of individual alpha gliadin proteins in flour quality and celiac disease

USDA-ARS?s Scientific Manuscript database

One-hundred-thirty-six expressed sequence tags (ESTs) encoding alpha gliadins from Triticum aestivum cv Butte 86 were identified in public databases and assembled into 19 contigs. Consensus sequences for 12 of the contigs encoded complete alpha gliadin proteins, but only two were identical to protei...
Use of continuous/contiguous stacking hybridization as a diagnostic tool

DOEpatents

Mirzabekov, Andrei Darievich; Kirillov, Eugene Vladislavovich; Parinov, Sergei Valeryevich; Barski, Victor Evgenievich; Dubiley, Svetlana Alekseevna

2002-01-01

A method for detecting disease-associated alleles in patient genetic material is provided whereby a first group of oligonucleotide molecules, synthesized to compliment base sequences of the disease associated alleles is immobilized on a predetermined position on a substrate, and then contacted with patient genetic material to form duplexes. The duplexes are then contacted with a second group of oligonucleotide molecules which are synthesized to extend the predetermined length of the oligonucleotide molecules of the first group, and where each of the oligonucleotide molecules of the second group are tagged and either incorporate universal bases or a mixture of guanine, cytosine, thymine, and adenine, or complementary nucleotide strands that are tagged with a different fluorochrome which radiates light at a predetermined wavelength. The treated substrate is then washed and the light patterns radiating therefrom are compared with predetermined light patterns of various diseases that were prepared on identical substrates. A method is also provided for determining the length of a repeat sequence in DNA or RNA, and also for determining the base sequence of unknown DNA or RNA.
Use of continuous/contiguous stacking hybridization as a diagnostic tool

DOEpatents

Mirzabekov, Andrei Darievich; Kirillov, Eugene Vladislavovich; Parinov, Sergei Valeryevich; Barski, Victor Evgenievich; Dubiley, Svetlana Alekseevna

2000-01-01

A method for detecting disease-associated alleles in patient genetic material is provided whereby a first group of oligonucleotide molecules, synthesized to compliment base sequences of the disease associated alleles is immobilized on a predetermined position on a substrate, and then contacted with patient genetic material to form duplexes. The duplexes are then contacted with a second group of oligonucleotide molecules which are synthesized to extend the predetermined length of the oligonucleotide molecules of the first group, and where each of the oligonucleotide molecules of the second group are tagged and either incorporate universal bases or a mixture of guanine, cytosine, thymine, and adenine, or complementary nucleotide strands that are tagged with a different fluorochrome which radiates light at a predetermined wavelength. The treated substrate is then washed and the light patterns radiating therefrom are compared with predetermined light patterns of various diseases that were prepared on identical substrates. A method is also provided for determining the length of a repeat sequence in DNA or RNA, and also for determining the base sequence of unknown DNA or RNA.
Saponin Biosynthesis in Saponaria vaccaria. cDNAs Encoding β-Amyrin Synthase and a Triterpene Carboxylic Acid Glucosyltransferase1[OA

PubMed Central

Meesapyodsuk, Dauenpen; Balsevich, John; Reed, Darwin W.; Covello, Patrick S.

2007-01-01

Saponaria vaccaria (Caryophyllaceae), a soapwort, known in western Canada as cowcockle, contains bioactive oleanane-type saponins similar to those found in soapbark tree (Quillaja saponaria; Rosaceae). To improve our understanding of the biosynthesis of these saponins, a combined polymerase chain reaction and expressed sequence tag approach was taken to identify the genes involved. A cDNA encoding a β-amyrin synthase (SvBS) was isolated by reverse transcription-polymerase chain reaction and characterized by expression in yeast (Saccharomyces cerevisiae). The SvBS gene is predominantly expressed in leaves. A S. vaccaria developing seed expressed sequence tag collection was developed and used for the isolation of a full-length cDNA bearing sequence similarity to ester-forming glycosyltransferases. The gene product of the cDNA, classified as UGT74M1, was expressed in Escherichia coli, purified, and identified as a triterpene carboxylic acid glucosyltransferase. UGT74M1 is expressed in roots and leaves and appears to be involved in monodesmoside biosynthesis in S. vaccaria. PMID:17172290
Uncovering the Salt Response of Soybean by Unraveling Its Wild and Cultivated Functional Genomes Using Tag Sequencing

PubMed Central

Ali, Zulfiqar; Zhang, Da Yong; Xu, Zhao Long; Xu, Ling; Yi, Jin Xin; He, Xiao Lan; Huang, Yi Hong; Liu, Xiao Qing; Khan, Asif Ali; Trethowan, Richard M.; Ma, Hong Xiang

2012-01-01

Soil salinity has very adverse effects on growth and yield of crop plants. Several salt tolerant wild accessions and cultivars are reported in soybean. Functional genomes of salt tolerant Glycine soja and a salt sensitive genotype of Glycine max were investigated to understand the mechanism of salt tolerance in soybean. For this purpose, four libraries were constructed for Tag sequencing on Illumina platform. We identify around 490 salt responsive genes which included a number of transcription factors, signaling proteins, translation factors and structural genes like transporters, multidrug resistance proteins, antiporters, chaperons, aquaporins etc. The gene expression levels and ratio of up/down-regulated genes was greater in tolerant plants. Translation related genes remained stable or showed slightly higher expression in tolerant plants under salinity stress. Further analyses of sequenced data and the annotations for gene ontology and pathways indicated that soybean adapts to salt stress through ABA biosynthesis and regulation of translation and signal transduction of structural genes. Manipulation of these pathways may mitigate the effect of salt stress thus enhancing salt tolerance. PMID:23209559
On improving the speed and reliability of T2-Relaxation-Under-Spin-Tagging (TRUST) MRI

PubMed Central

Xu, Feng; Uh, Jinsoo; Liu, Peiying; Lu, Hanzhang

2011-01-01

A T2-Relaxation-Under-Spin-Tagging (TRUST) technique was recently developed to estimate cerebral blood oxygenation, providing potentials for non-invasive assessment of the brain's oxygen consumption. A limitation of the current sequence is the need for long TR, as shorter TR causes an over-estimation in blood R2. The present study proposes a post-saturation TRUST by placing a non-selective 90° pulse after the signal acquisition to reset magnetization in the whole brain. This scheme was found to eliminate estimation bias at a slight cost of precision. To improve the precision, TE of the sequence was optimized and it was found that a modest TE shortening of 3.4ms can reduce the estimation error by 49%. We recommend the use of post-saturation TRUST sequence with a TR of 3000ms and a TE of 3.6ms, which allows the determination of global venous oxygenation with scan duration of 1 minute 12 seconds and an estimation precision of ±1% (in units of oxygen saturation percentage). PMID:22127845
Hypervariable Domain of Nonstructural Protein nsP3 of Venezuelan Equine Encephalitis Virus Determines Cell-Specific Mode of Virus Replication

PubMed Central

Foy, Niall J.; Akhrymuk, Maryna; Shustov, Alexander V.; Frolova, Elena I.

2013-01-01

Venezuelan equine encephalitis virus (VEEV) is one of the most pathogenic members of the Alphavirus genus in the Togaviridae family. This genus is divided into the Old World and New World alphaviruses, which demonstrate profound differences in pathogenesis, replication, and virus-host interactions. VEEV is a representative member of the New World alphaviruses. The biology of this virus is still insufficiently understood, particularly the function of its nonstructural proteins in RNA replication and modification of the intracellular environment. One of these nonstructural proteins, nsP3, contains a hypervariable domain (HVD), which demonstrates very low overall similarity between different alphaviruses, suggesting the possibility of its function in virus adaptation to different hosts and vectors. The results of our study demonstrate the following. (i) Phosphorylation of the VEEV nsP3-specific HVD does not play a critical role in virus replication in cells of vertebrate origin but is important for virus replication in mosquito cells. (ii) The VEEV HVD is not required for viral RNA replication in the highly permissive BHK-21 cell line. In fact, it can be either completely deleted or replaced by a heterologous protein sequence. These variants require only one or two additional adaptive mutations in nsP3 and/or nsP2 proteins to achieve an efficiently replicating phenotype. (iii) However, the carboxy-terminal repeat in the VEEV HVD is indispensable for VEEV replication in the cell lines other than BHK-21 and plays a critical role in formation of VEEV-specific cytoplasmic protein complexes. Natural VEEV variants retain at least one of the repeated elements in their nsP3 HVDs. PMID:23637407
Heparin-binding peptide as a novel affinity tag for purification of recombinant proteins.

PubMed

Morris, Jacqueline; Jayanthi, Srinivas; Langston, Rebekah; Daily, Anna; Kight, Alicia; McNabb, David S; Henry, Ralph; Kumar, Thallapuranam Krishnaswamy Suresh

2016-10-01

Purification of recombinant proteins constitutes a significant part of the downstream processing in biopharmaceutical industries. Major costs involved in the production of bio-therapeutics mainly depend on the number of purification steps used during the downstream process. Affinity chromatography is a widely used method for the purification of recombinant proteins expressed in different expression host platforms. Recombinant protein purification is achieved by fusing appropriate affinity tags to either N- or C- terminus of the target recombinant proteins. Currently available protein/peptide affinity tags have proved quite useful in the purification of recombinant proteins. However, these affinity tags suffer from specific limitations in their use under different conditions of purification. In this study, we have designed a novel 34-amino acid heparin-binding affinity tag (HB-tag) for the purification of recombinant proteins expressed in Escherichia coli (E. coli) cells. HB-tag fused recombinant proteins were overexpressed in E. coli in high yields. A one-step heparin-Sepharose-based affinity chromatography protocol was developed to purify HB-fused recombinant proteins to homogeneity using a simple sodium chloride step gradient elution. The HB-tag has also been shown to facilitate the purification of target recombinant proteins from their 8 M urea denatured state(s). The HB-tag has been demonstrated to be successfully released from the fusion protein by an appropriate protease treatment to obtain the recombinant target protein(s) in high yields. Results of the two-dimensional NMR spectroscopy experiments indicate that the purified recombinant target protein(s) exist in the native conformation. Polyclonal antibodies raised against the HB-peptide sequence, exhibited high binding specificity and sensitivity to the HB-fused recombinant proteins (∼10 ng) in different crude cell extracts obtained from diverse expression hosts. In our opinion, the HB-tag provides a cost-effective, rapid, and reliable avenue for the purification of recombinant proteins in heterologous hosts. Copyright © 2016 Elsevier Inc. All rights reserved.
Fast discovery and visualization of conserved regions in DNA sequences using quasi-alignment

PubMed Central

2013-01-01

Background Next Generation Sequencing techniques are producing enormous amounts of biological sequence data and analysis becomes a major computational problem. Currently, most analysis, especially the identification of conserved regions, relies heavily on Multiple Sequence Alignment and its various heuristics such as progressive alignment, whose run time grows with the square of the number and the length of the aligned sequences and requires significant computational resources. In this work, we present a method to efficiently discover regions of high similarity across multiple sequences without performing expensive sequence alignment. The method is based on approximating edit distance between segments of sequences using p-mer frequency counts. Then, efficient high-throughput data stream clustering is used to group highly similar segments into so called quasi-alignments. Quasi-alignments have numerous applications such as identifying species and their taxonomic class from sequences, comparing sequences for similarities, and, as in this paper, discovering conserved regions across related sequences. Results In this paper, we show that quasi-alignments can be used to discover highly similar segments across multiple sequences from related or different genomes efficiently and accurately. Experiments on a large number of unaligned 16S rRNA sequences obtained from the Greengenes database show that the method is able to identify conserved regions which agree with known hypervariable regions in 16S rRNA. Furthermore, the experiments show that the proposed method scales well for large data sets with a run time that grows only linearly with the number and length of sequences, whereas for existing multiple sequence alignment heuristics the run time grows super-linearly. Conclusion Quasi-alignment-based algorithms can detect highly similar regions and conserved areas across multiple sequences. Since the run time is linear and the sequences are converted into a compact clustering model, we are able to identify conserved regions fast or even interactively using a standard PC. Our method has many potential applications such as finding characteristic signature sequences for families of organisms and studying conserved and variable regions in, for example, 16S rRNA. PMID:24564200
Fast discovery and visualization of conserved regions in DNA sequences using quasi-alignment.

PubMed

Nagar, Anurag; Hahsler, Michael

2013-01-01

Next Generation Sequencing techniques are producing enormous amounts of biological sequence data and analysis becomes a major computational problem. Currently, most analysis, especially the identification of conserved regions, relies heavily on Multiple Sequence Alignment and its various heuristics such as progressive alignment, whose run time grows with the square of the number and the length of the aligned sequences and requires significant computational resources. In this work, we present a method to efficiently discover regions of high similarity across multiple sequences without performing expensive sequence alignment. The method is based on approximating edit distance between segments of sequences using p-mer frequency counts. Then, efficient high-throughput data stream clustering is used to group highly similar segments into so called quasi-alignments. Quasi-alignments have numerous applications such as identifying species and their taxonomic class from sequences, comparing sequences for similarities, and, as in this paper, discovering conserved regions across related sequences. In this paper, we show that quasi-alignments can be used to discover highly similar segments across multiple sequences from related or different genomes efficiently and accurately. Experiments on a large number of unaligned 16S rRNA sequences obtained from the Greengenes database show that the method is able to identify conserved regions which agree with known hypervariable regions in 16S rRNA. Furthermore, the experiments show that the proposed method scales well for large data sets with a run time that grows only linearly with the number and length of sequences, whereas for existing multiple sequence alignment heuristics the run time grows super-linearly. Quasi-alignment-based algorithms can detect highly similar regions and conserved areas across multiple sequences. Since the run time is linear and the sequences are converted into a compact clustering model, we are able to identify conserved regions fast or even interactively using a standard PC. Our method has many potential applications such as finding characteristic signature sequences for families of organisms and studying conserved and variable regions in, for example, 16S rRNA.
De Novo Transcriptome Sequencing Reveals Important Molecular Networks and Metabolic Pathways of the Plant, Chlorophytum borivilianum

PubMed Central

Kalra, Shikha; Puniya, Bhanwar Lal; Kulshreshtha, Deepika; Kumar, Sunil; Kaur, Jagdeep; Ramachandran, Srinivasan; Singh, Kashmir

2013-01-01

Chlorophytum borivilianum, an endangered medicinal plant species is highly recognized for its aphrodisiac properties provided by saponins present in the plant. The transcriptome information of this species is limited and only few hundred expressed sequence tags (ESTs) are available in the public databases. To gain molecular insight of this plant, high throughput transcriptome sequencing of leaf RNA was carried out using Illumina's HiSeq 2000 sequencing platform. A total of 22,161,444 single end reads were retrieved after quality filtering. Available (e.g., De-Bruijn/Eulerian graph) and in-house developed bioinformatics tools were used for assembly and annotation of transcriptome. A total of 101,141 assembled transcripts were obtained, with coverage size of 22.42 Mb and average length of 221 bp. Guanine-cytosine (GC) content was found to be 44%. Bioinformatics analysis, using non-redundant proteins, gene ontology (GO), enzyme commission (EC) and kyoto encyclopedia of genes and genomes (KEGG) databases, extracted all the known enzymes involved in saponin and flavonoid biosynthesis. Few genes of the alkaloid biosynthesis, along with anticancer and plant defense genes, were also discovered. Additionally, several cytochrome P450 (CYP450) and glycosyltransferase unique sequences were also found. We identified simple sequence repeat motifs in transcripts with an abundance of di-nucleotide simple sequence repeat (SSR; 43.1%) markers. Large scale expression profiling through Reads per Kilobase per Million mapped reads (RPKM) showed major genes involved in different metabolic pathways of the plant. Genes, expressed sequence tags (ESTs) and unique sequences from this study provide an important resource for the scientific community, interested in the molecular genetics and functional genomics of C. borivilianum. PMID:24376689
De Novo transcriptome sequencing reveals important molecular networks and metabolic pathways of the plant, Chlorophytum borivilianum.

PubMed

Kalra, Shikha; Puniya, Bhanwar Lal; Kulshreshtha, Deepika; Kumar, Sunil; Kaur, Jagdeep; Ramachandran, Srinivasan; Singh, Kashmir

2013-01-01

Chlorophytum borivilianum, an endangered medicinal plant species is highly recognized for its aphrodisiac properties provided by saponins present in the plant. The transcriptome information of this species is limited and only few hundred expressed sequence tags (ESTs) are available in the public databases. To gain molecular insight of this plant, high throughput transcriptome sequencing of leaf RNA was carried out using Illumina's HiSeq 2000 sequencing platform. A total of 22,161,444 single end reads were retrieved after quality filtering. Available (e.g., De-Bruijn/Eulerian graph) and in-house developed bioinformatics tools were used for assembly and annotation of transcriptome. A total of 101,141 assembled transcripts were obtained, with coverage size of 22.42 Mb and average length of 221 bp. Guanine-cytosine (GC) content was found to be 44%. Bioinformatics analysis, using non-redundant proteins, gene ontology (GO), enzyme commission (EC) and kyoto encyclopedia of genes and genomes (KEGG) databases, extracted all the known enzymes involved in saponin and flavonoid biosynthesis. Few genes of the alkaloid biosynthesis, along with anticancer and plant defense genes, were also discovered. Additionally, several cytochrome P450 (CYP450) and glycosyltransferase unique sequences were also found. We identified simple sequence repeat motifs in transcripts with an abundance of di-nucleotide simple sequence repeat (SSR; 43.1%) markers. Large scale expression profiling through Reads per Kilobase per Million mapped reads (RPKM) showed major genes involved in different metabolic pathways of the plant. Genes, expressed sequence tags (ESTs) and unique sequences from this study provide an important resource for the scientific community, interested in the molecular genetics and functional genomics of C. borivilianum.
The salt-responsive transcriptome of chickpea roots and nodules via deepSuperSAGE

PubMed Central

2011-01-01

Background The combination of high-throughput transcript profiling and next-generation sequencing technologies is a prerequisite for genome-wide comprehensive transcriptome analysis. Our recent innovation of deepSuperSAGE is based on an advanced SuperSAGE protocol and its combination with massively parallel pyrosequencing on Roche's 454 sequencing platform. As a demonstration of the power of this combination, we have chosen the salt stress transcriptomes of roots and nodules of the third most important legume crop chickpea (Cicer arietinum L.). While our report is more technology-oriented, it nevertheless addresses a major world-wide problem for crops generally: high salinity. Together with low temperatures and water stress, high salinity is responsible for crop losses of millions of tons of various legume (and other) crops. Continuously deteriorating environmental conditions will combine with salinity stress to further compromise crop yields. As a good example for such stress-exposed crop plants, we started to characterize salt stress responses of chickpeas on the transcriptome level. Results We used deepSuperSAGE to detect early global transcriptome changes in salt-stressed chickpea. The salt stress responses of 86,919 transcripts representing 17,918 unique 26 bp deepSuperSAGE tags (UniTags) from roots of the salt-tolerant variety INRAT-93 two hours after treatment with 25 mM NaCl were characterized. Additionally, the expression of 57,281 transcripts representing 13,115 UniTags was monitored in nodules of the same plants. From a total of 144,200 analyzed 26 bp tags in roots and nodules together, 21,401 unique transcripts were identified. Of these, only 363 and 106 specific transcripts, respectively, were commonly up- or down-regulated (>3.0-fold) under salt stress in both organs, witnessing a differential organ-specific response to stress. Profiting from recent pioneer works on massive cDNA sequencing in chickpea, more than 9,400 UniTags were able to be linked to UniProt entries. Additionally, gene ontology (GO) categories over-representation analysis enabled to filter out enriched biological processes among the differentially expressed UniTags. Subsequently, the gathered information was further cross-checked with stress-related pathways. From several filtered pathways, here we focus exemplarily on transcripts associated with the generation and scavenging of reactive oxygen species (ROS), as well as on transcripts involved in Na+ homeostasis. Although both processes are already very well characterized in other plants, the information generated in the present work is of high value. Information on expression profiles and sequence similarity for several hundreds of transcripts of potential interest is now available. Conclusions This report demonstrates, that the combination of the high-throughput transcriptome profiling technology SuperSAGE with one of the next-generation sequencing platforms allows deep insights into the first molecular reactions of a plant exposed to salinity. Cross validation with recent reports enriched the information about the salt stress dynamics of more than 9,000 chickpea ESTs, and enlarged their pool of alternative transcripts isoforms. As an example for the high resolution of the employed technology that we coin deepSuperSAGE, we demonstrate that ROS-scavenging and -generating pathways undergo strong global transcriptome changes in chickpea roots and nodules already 2 hours after onset of moderate salt stress (25 mM NaCl). Additionally, a set of more than 15 candidate transcripts are proposed to be potential components of the salt overly sensitive (SOS) pathway in chickpea. Newly identified transcript isoforms are potential targets for breeding novel cultivars with high salinity tolerance. We demonstrate that these targets can be integrated into breeding schemes by micro-arrays and RT-PCR assays downstream of the generation of 26 bp tags by SuperSAGE. PMID:21320317
The salt-responsive transcriptome of chickpea roots and nodules via deepSuperSAGE.

PubMed

Molina, Carlos; Zaman-Allah, Mainassara; Khan, Faheema; Fatnassi, Nadia; Horres, Ralf; Rotter, Björn; Steinhauer, Diana; Amenc, Laurie; Drevon, Jean-Jacques; Winter, Peter; Kahl, Günter

2011-02-14

The combination of high-throughput transcript profiling and next-generation sequencing technologies is a prerequisite for genome-wide comprehensive transcriptome analysis. Our recent innovation of deepSuperSAGE is based on an advanced SuperSAGE protocol and its combination with massively parallel pyrosequencing on Roche's 454 sequencing platform. As a demonstration of the power of this combination, we have chosen the salt stress transcriptomes of roots and nodules of the third most important legume crop chickpea (Cicer arietinum L.). While our report is more technology-oriented, it nevertheless addresses a major world-wide problem for crops generally: high salinity. Together with low temperatures and water stress, high salinity is responsible for crop losses of millions of tons of various legume (and other) crops. Continuously deteriorating environmental conditions will combine with salinity stress to further compromise crop yields. As a good example for such stress-exposed crop plants, we started to characterize salt stress responses of chickpeas on the transcriptome level. We used deepSuperSAGE to detect early global transcriptome changes in salt-stressed chickpea. The salt stress responses of 86,919 transcripts representing 17,918 unique 26 bp deepSuperSAGE tags (UniTags) from roots of the salt-tolerant variety INRAT-93 two hours after treatment with 25 mM NaCl were characterized. Additionally, the expression of 57,281 transcripts representing 13,115 UniTags was monitored in nodules of the same plants. From a total of 144,200 analyzed 26 bp tags in roots and nodules together, 21,401 unique transcripts were identified. Of these, only 363 and 106 specific transcripts, respectively, were commonly up- or down-regulated (>3.0-fold) under salt stress in both organs, witnessing a differential organ-specific response to stress.Profiting from recent pioneer works on massive cDNA sequencing in chickpea, more than 9,400 UniTags were able to be linked to UniProt entries. Additionally, gene ontology (GO) categories over-representation analysis enabled to filter out enriched biological processes among the differentially expressed UniTags. Subsequently, the gathered information was further cross-checked with stress-related pathways. From several filtered pathways, here we focus exemplarily on transcripts associated with the generation and scavenging of reactive oxygen species (ROS), as well as on transcripts involved in Na+ homeostasis. Although both processes are already very well characterized in other plants, the information generated in the present work is of high value. Information on expression profiles and sequence similarity for several hundreds of transcripts of potential interest is now available. This report demonstrates, that the combination of the high-throughput transcriptome profiling technology SuperSAGE with one of the next-generation sequencing platforms allows deep insights into the first molecular reactions of a plant exposed to salinity. Cross validation with recent reports enriched the information about the salt stress dynamics of more than 9,000 chickpea ESTs, and enlarged their pool of alternative transcripts isoforms. As an example for the high resolution of the employed technology that we coin deepSuperSAGE, we demonstrate that ROS-scavenging and -generating pathways undergo strong global transcriptome changes in chickpea roots and nodules already 2 hours after onset of moderate salt stress (25 mM NaCl). Additionally, a set of more than 15 candidate transcripts are proposed to be potential components of the salt overly sensitive (SOS) pathway in chickpea. Newly identified transcript isoforms are potential targets for breeding novel cultivars with high salinity tolerance. We demonstrate that these targets can be integrated into breeding schemes by micro-arrays and RT-PCR assays downstream of the generation of 26 bp tags by SuperSAGE.
A blackberry (Rubus L.) expressed sequence tag library for the development of simple sequence repeat markers

PubMed Central

Lewers, Kim S; Saski, Chris A; Cuthbertson, Brandon J; Henry, David C; Staton, Meg E; Main, Dorrie S; Dhanaraj, Anik L; Rowland, Lisa J; Tomkins, Jeff P

2008-01-01

Background The recent development of novel repeat-fruiting types of blackberry (Rubus L.) cultivars, combined with a long history of morphological marker-assisted selection for thornlessness by blackberry breeders, has given rise to increased interest in using molecular markers to facilitate blackberry breeding. Yet no genetic maps, molecular markers, or even sequences exist specifically for cultivated blackberry. The purpose of this study is to begin development of these tools by generating and annotating the first blackberry expressed sequence tag (EST) library, designing primers from the ESTs to amplify regions containing simple sequence repeats (SSR), and testing the usefulness of a subset of the EST-SSRs with two blackberry cultivars. Results A cDNA library of 18,432 clones was generated from expanding leaf tissue of the cultivar Merton Thornless, a progenitor of many thornless commercial cultivars. Among the most abundantly expressed of the 3,000 genes annotated were those involved with energy, cell structure, and defense. From individual sequences containing SSRs, 673 primer pairs were designed. Of a randomly chosen set of 33 primer pairs tested with two blackberry cultivars, 10 detected an average of 1.9 polymorphic PCR products. Conclusion This rate predicts that this library may yield as many as 940 SSR primer pairs detecting 1,786 polymorphisms. This may be sufficient to generate a genetic map that can be used to associate molecular markers with phenotypic traits, making possible molecular marker-assisted breeding to compliment existing morphological marker-assisted breeding in blackberry. PMID:18570660
The characterisation of novel secreted Ly-6 proteins from rat urine by the combined use of two-dimensional gel electrophoresis, microbore high performance liquid chromatography and expressed sequence tag data.

PubMed

Southan, Christopher; Cutler, Paul; Birrell, Helen; Connell, John; Fantom, Kenneth G M; Sims, Matthew; Shaikh, Narjis; Schneider, Klaus

2002-02-01

A proteomic study of rat urine was undertaken using two-dimensional gel electrophoresis, microbore high performance liquid chromatography, mass spectrometry and N-terminal sequencing. Five known urinary proteins were identified but two novel peptide fragments matched a large number of rat expressed sequence tags (ESTs) from a liver library. By combining protein chemical and nucleotide data, two 101-residue open reading frames with 90% amino acid identity were determined, rat urinary protein 1 (RUP-1) and RUP-2. The data established signal peptide removal and provided evidence for N-glycosylation. A third related sequence, rat spleen protein (RSP-1) was confirmed from EST searches. These three proteins have been submitted to SWISS-PROT as P81827, P81828 and Q9QXN2, respectively. A fourth novel homologue was found in porcine and bovine ESTs from embryo libraries. Alignment with known homologues showed conserved cysteine positions characteristic of a secreted subfamily of Ly-6 proteins. In two cases, antineoplastic urinary protein and caltrin, these homologues have unverified functional annotations. The RUP sequences showed high scoring matches to three unrelated rat mRNAs subsequently established to be chimeric. Two of these share extended sectional identity to RUP-1 but the third may represent another novel Ly-6 homologue. These chimeras have caused serious annotation errors in secondary databases.
DOE Office of Scientific and Technical Information (OSTI.GOV)

Simpson, Jeffrey P.; Thrower, Nicholas; Ohlrogge, John B.

Bayberry (Myrica pensylvanica) fruits are covered with a remarkably thick layer of crystalline wax consisting of triacylglycerol (TAG) and diacylglycerol (DAG) esterified exclusively with saturated fatty acids. As the only plant known to accumulate soluble glycerolipids as a major component of surface waxes, Bayberry represents a novel system to investigate neutral lipid biosynthesis and lipid secretion by vegetative plant cells. The assembly of Bayberry wax is distinct from conventional TAG and other surface waxes, and instead proceeds through a pathway related to cutin synthesis (Simpson and Ohlrogge, 2016). In this study, microscopic examination revealed that the fruit tissue that producesmore » and secretes wax (Bayberry knobs) is fully developed before wax accumulates and that wax is secreted to the surface without cell disruption. Comparison of transcript expression to genetically related tissues (Bayberry leaves, M. rubra fruits), cutin-rich tomato and cherry fruit epidermis, and to oil-rich mesocarp and seeds, revealed exceptionally high expression of 13 transcripts for acyl-lipid metabolism together with down-regulation of fatty acid oxidases and desaturases. The predicted protein sequences of the most highly expressed lipid-related enzyme-encoding transcripts in Bayberry knobs are 100% identical to the sequences from Bayberry leaves,which do not produce surface DAG or TAG. Together, these results indicate that TAG biosynthesis and secretion in Bayberry is achieved by both up and down-regulation of a small subset of genes related to the biosynthesis of cutin and saturated fatty acids, and also implies that modifications in gene expression, rather than evolution of new gene functions, was the major mechanism by which Bayberry evolved its specialized lipid metabolism.« less

Discovery and characterization of Coturnix chinensis avian β-defensin 10, with broad antibacterial activity.

PubMed

Ma, Deying; Lin, Lijuan; Zhang, Kexin; Han, Zongxi; Shao, Yuhao; Wang, Ruiqin; Liu, Shengwang

2012-04-01

A novel avian β-defensin (AvBD), AvBD10, was discovered in the liver and bone marrow tissues from Chinese painted quail (Coturnix chinensis) in the present study. The complete nucleotide sequence of quail AvBD10 contains a 207-bp open reading frame that encodes 68 amino acids. The quail AvBD10 was expressed widely in all the tissues from quails except the tongue, crop, breast muscle, and thymus and was highly expressed in the bone marrow. In contrast to the expression pattern of AvBD10 in tissues from quail, the chicken AvBD10 was expressed in all 21 tissues from the layer hens investigated, with a high level of expression in the kidney, lung, liver, bone marrow, and Harderian glands. Recombinant glutathione S-transferase (GST)-tagged AvBD10s of both quail and chicken were produced and purified by expression of the two cDNAs in Escherichia coli, respectively. In addition, peptide according to the respective AvBD10s sequence was synthesized, named synthetic AvBD10s. As expected, both recombinant GST-tagged AvBD10s and synthetic AvBD10s of quail and chicken exhibited similar bactericidal properties against most bacteria, including Gram-positive and Gram-negative forms. However, no significant bactericidal activity was found for quail recombinant GST-tagged AvBD10 against Salmonella choleraesuis or for chicken recombinant GST-tagged AvBD10 against Proteus mirabilis. Copyright © 2012 European Peptide Society and John Wiley & Sons, Ltd.
Application of volcanic ash particles for protein affinity purification with a minimized silica-binding tag.

PubMed

Abdelhamid, Mohamed A A; Ikeda, Takeshi; Motomura, Kei; Tanaka, Tatsuya; Ishida, Takenori; Hirota, Ryuichi; Kuroda, Akio

2016-11-01

We recently reported that the spore coat protein, CotB1 (171 amino acids), from Bacillus cereus mediates silica biomineralization and that the polycationic C-terminal sequence of CotB1 (14 amino acids), designated CotB1p, serves as a silica-binding tag when fused to other proteins. Here, we reduced the length of this silica-binding tag to only seven amino acids (SB7 tag: RQSSRGR) while retaining its affinity for silica. Alanine scanning mutagenesis indicated that the three arginine residues in the SB7 tag play important roles in binding to a silica surface. Monomeric l-arginine, at concentrations of 0.3-0.5 M, was found to serve as a competitive eluent to release bound SB7-tagged proteins from silica surfaces. To develop a low-cost, silica-based affinity purification procedure, we used natural volcanic ash particles with a silica content of ∼70%, rather than pure synthetic silica particles, as an adsorbent for SB7-tagged proteins. Using green fluorescent protein, mCherry, and mKate2 as model proteins, our purification method achieved 75-90% recovery with ∼90% purity. These values are comparable to or even higher than that of the commonly used His-tag affinity purification. In addition to low cost, another advantage of our method is the use of l-arginine as the eluent because its protein-stabilizing effect would help minimize alteration of the intrinsic properties of the purified proteins. Our approach paves the way for the use of naturally occurring materials as adsorbents for simple, low-cost affinity purification. Copyright © 2016 The Society for Biotechnology, Japan. Published by Elsevier B.V. All rights reserved.
The first genome-level transcriptome of the wood-degrading fungus Phanerochaete chrysosporium grown on red oak.

PubMed

Sato, Shin; Feltus, F Alex; Iyer, Prashanti; Tien, Ming

2009-06-01

As part of an effort to determine all the gene products involved in wood degradation, we have performed massively parallel pyrosequencing on an expression library from the white rot fungus Phanerochaete chrysosporium grown in shallow stationary cultures with red oak as the carbon source. Approximately 48,000 high quality sequence tags (246 bp average length) were generated. 53% of the sequence tags aligned to 4,262 P. chrysosporium gene models, and an additional 18.5% of the tags reliably aligned to the P. chrysosporium genome providing evidence for 961 putative novel fragmented gene models. Due to their role in lignocellulose degradation, the secreted proteins were focused upon. Our results show that the four enzymes required for cellulose degradation: endocellulase, exocellulase CBHI, exocellulase CBHII, and beta-glucosidase are all produced. For hemicellulose degradation, not all known enzymes were produced, but endoxylanases, acetyl xylan esterases and mannosidases were detected. For lignin degradation, the role of peroxidases has been questioned; however, our results show that lignin peroxidase is highly expressed along with the H(2)O(2) generating enzyme, alcohol oxidase. The transcriptome snapshot reveals that H(2)O(2) generation and utilization are central in wood degradation. Our results also reveal new transcripts that encode extracellular proteins with no known function.
The ABC transporter Rv1272c of Mycobacterium tuberculosis enhances the import of long-chain fatty acids in Escherichia coli.

PubMed

Martin, Audrey; Daniel, Jaiyanth

2018-02-05

Mycobacterium tuberculosis (Mtb), which causes tuberculosis, is capable of accumulating triacylglycerol (TAG) by utilizing fatty acids from host cells. ATP-binding cassette (ABC) transporters are involved in transport processes in all organisms. Among the classical ABC transporters in Mtb none have been implicated in fatty acid import. Since the transport of fatty acids from the host cell is important for dormancy-associated TAG synthesis in the pathogen, mycobacterial ABC transporter(s) could potentially be involved in this process. Based on sequence identities with a bacterial ABC transporter that mediates fatty acid import for TAG synthesis, we identified Rv1272c, a hitherto uncharacterized ABC-transporter in Mtb that also shows sequence identities with a plant ABC transporter involved in fatty acid transport. We expressed Rv1272c in E. coli and show that it enhances the import of radiolabeled fatty acids. We also show that Rv1272c causes a significant increase in the metabolic incorporation of radiolabeled long-chain fatty acids into cardiolipin, a tetra-acylated phospholipid, and phosphatidylglycerol in E. coli. This is the first report on the function of Rv1272c showing that it displays a long-chain fatty acid transport function. Copyright © 2018 Elsevier Inc. All rights reserved.
Distinguishing Aspartic and Isoaspartic Acids in Peptides by Several Mass Spectrometric Fragmentation Methods.

PubMed

DeGraan-Weber, Nick; Zhang, Jun; Reilly, James P

2016-12-01

Six ion fragmentation techniques that can distinguish aspartic acid from its isomer, isoaspartic acid, were compared. MALDI post-source decay (PSD), MALDI 157 nm photodissociation, tris(2,4,6-trimethoxyphenyl)phosphonium bromide (TMPP) charge tagging in PSD and photodissociation, ESI collision-induced dissociation (CID), electron transfer dissociation (ETD), and free-radical initiated peptide sequencing (FRIPS) with CID were applied to peptides containing either aspartic or isoaspartic acid. Diagnostic ions, such as the y-46 and b+H 2 O, are present in PSD, photodissociation, and charge tagging. c • +57 and z-57 ions are observed in ETD and FRIPS experiments. For some molecules, aspartic and isoaspartic acid yield ion fragments with significantly different intensities. ETD and charge tagging appear to be most effective at distinguishing these residues. Graphical Abstract ᅟ.
Expression, purification, and functional analysis of the C-terminal domain of Herbaspirillum seropedicae NifA protein.

PubMed

Monteiro, Rose A; Souza, Emanuel M; Geoffrey Yates, M; Steffens, M Berenice R; Pedrosa, Fábio O; Chubatsu, Leda S

2003-02-01

The Herbaspirillum seropedicae NifA protein is responsible for nif gene expression. The C-terminal domain of the H. seropedicae NifA protein, fused to a His-Tag sequence (His-Tag-C-terminal), was over-expressed and purified by metal-affinity chromatography to yield a highly purified and active protein. Band-shift assays showed that the NifA His-Tag-C-terminal bound specifically to the H. seropedicae nifB promoter region in vitro. In vivo analysis showed that this protein inhibited the Central + C-terminal domains of NifA protein from activating the nifH promoter of K. pneumoniae in Escherichia coli, indicating that the protein must be bound to the NifA-binding site (UAS site) at the nifH promoter region to activate transcription. Copyright 2002 Elsevier Science (USA)
Gene expression profiling via LongSAGE in a non-model plant species: a case study in seeds of Brassica napus

PubMed Central

Obermeier, Christian; Hosseini, Bashir; Friedt, Wolfgang; Snowdon, Rod

2009-01-01

Background Serial analysis of gene expression (LongSAGE) was applied for gene expression profiling in seeds of oilseed rape (Brassica napus ssp. napus). The usefulness of this technique for detailed expression profiling in a non-model organism was demonstrated for the highly complex, neither fully sequenced nor annotated genome of B. napus by applying a tag-to-gene matching strategy based on Brassica ESTs and the annotated proteome of the closely related model crucifer A. thaliana. Results Transcripts from 3,094 genes were detected at two time-points of seed development, 23 days and 35 days after pollination (DAP). Differential expression showed a shift from gene expression involved in diverse developmental processes including cell proliferation and seed coat formation at 23 DAP to more focussed metabolic processes including storage protein accumulation and lipid deposition at 35 DAP. The most abundant transcripts at 23 DAP were coding for diverse protease inhibitor proteins and proteases, including cysteine proteases involved in seed coat formation and a number of lipid transfer proteins involved in embryo pattern formation. At 35 DAP, transcripts encoding napin, cruciferin and oleosin storage proteins were most abundant. Over both time-points, 18.6% of the detected genes were matched by Brassica ESTs identified by LongSAGE tags in antisense orientation. This suggests a strong involvement of antisense transcript expression in regulatory processes during B. napus seed development. Conclusion This study underlines the potential of transcript tagging approaches for gene expression profiling in Brassica crop species via EST matching to annotated A. thaliana genes. Limits of tag detection for low-abundance transcripts can today be overcome by ultra-high throughput sequencing approaches, so that tag-based gene expression profiling may soon become the method of choice for global expression profiling in non-model species. PMID:19575793
Sequencing small genomic targets with high efficiency and extreme accuracy

PubMed Central

Schmitt, Michael W.; Fox, Edward J.; Prindle, Marc J.; Reid-Bayliss, Kate S.; True, Lawrence D.; Radich, Jerald P.; Loeb, Lawrence A.

2015-01-01

The detection of minority variants in mixed samples demands methods for enrichment and accurate sequencing of small genomic intervals. We describe an efficient approach based on sequential rounds of hybridization with biotinylated oligonucleotides, enabling more than one-million fold enrichment of genomic regions of interest. In conjunction with error correcting double-stranded molecular tags, our approach enables the quantification of mutations in individual DNA molecules. PMID:25849638
Mark report satellite tags (mrPATs) to detail large-scale horizontal movements of deep water species: First results for the Greenland shark (Somniosus microcephalus)

NASA Astrophysics Data System (ADS)

Hussey, Nigel E.; Orr, Jack; Fisk, Aaron T.; Hedges, Kevin J.; Ferguson, Steven H.; Barkley, Amanda N.

2018-04-01

The deep-sea is increasingly viewed as a lucrative environment for the growth of resource extraction industries. To date, our ability to study deep-sea species lags behind that of those inhabiting the photic zone limiting scientific data available for management. In particular, knowledge of horizontal movements is restricted to two locations; capture and recapture, with no temporal information on absolute animal locations between endpoints. To elucidate the horizontal movements of a large deep-sea fish, a novel tagging approach was adopted using the smallest available prototype satellite tag - the mark-report pop-up archival tag (mrPAT). Five Greenland sharks (Somniosus microcephalus) were equipped with multiple mrPATs as well as a standard archival satellite tag (miniPAT) that were programmed to release in sequence at 8-10 day intervals. The performance of the mrPATs was quantified. The tagging approach provided multiple locations per individual and revealed a previously unknown directed migration of Greenland sharks from the Canadian high Arctic to Northwest Greenland. All tags reported locations, however, the accuracy and time from expected release were variable among tags (average time to an accurate location from expected release = 30.8 h, range: 4.9-227.6 h). Average mrPAT drift rate estimated from best quality messages (LQ1,2,3) was 0.37 ± 0.09 m/s indicating tags were on average 41.1 ± 63.4 km (range: 6.5-303.1 km) from the location of the animal when they transmitted. mrPATs provided daily temperature values that were highly correlated among tags and with the miniPAT (70.8% of tag pairs were significant). In contrast, daily tilt sensor data were variable among tags on the same animal (12.5% of tag pairs were significant). Tracking large-scale movements of deep-sea fish has historically been limited by the remote environment they inhabit. The current study provides a new approach to document reliable coarse scale horizontal movements to understand migrations, stock structure and habitat use of large species. Opportunities to apply mrPATs to understand the movements of medium size fish, marine mammals and to validate retrospective movement modeling approaches based on archival data are presented.
Genomic analysis of expressed sequence tags in American black bear Ursus americanus

PubMed Central

2010-01-01

Background Species of the bear family (Ursidae) are important organisms for research in molecular evolution, comparative physiology and conservation biology, but relatively little genetic sequence information is available for this group. Here we report the development and analyses of the first large scale Expressed Sequence Tag (EST) resource for the American black bear (Ursus americanus). Results Comprehensive analyses of molecular functions, alternative splicing, and tissue-specific expression of 38,757 black bear EST sequences were conducted using the dog genome as a reference. We identified 18 genes, involved in functions such as lipid catabolism, cell cycle, and vesicle-mediated transport, that are showing rapid evolution in the bear lineage Three genes, Phospholamban (PLN), cysteine glycine-rich protein 3 (CSRP3) and Troponin I type 3 (TNNI3), are related to heart contraction, and defects in these genes in humans lead to heart disease. Two genes, biphenyl hydrolase-like (BPHL) and CSRP3, contain positively selected sites in bear. Global analysis of evolution rates of hibernation-related genes in bear showed that they are largely conserved and slowly evolving genes, rather than novel and fast-evolving genes. Conclusion We provide a genomic resource for an important mammalian organism and our study sheds new light on the possible functions and evolution of bear genes. PMID:20338065
Genomic analysis of expressed sequence tags in American black bear Ursus americanus.

PubMed

Zhao, Sen; Shao, Chunxuan; Goropashnaya, Anna V; Stewart, Nathan C; Xu, Yichi; Tøien, Øivind; Barnes, Brian M; Fedorov, Vadim B; Yan, Jun

2010-03-26

Species of the bear family (Ursidae) are important organisms for research in molecular evolution, comparative physiology and conservation biology, but relatively little genetic sequence information is available for this group. Here we report the development and analyses of the first large scale Expressed Sequence Tag (EST) resource for the American black bear (Ursus americanus). Comprehensive analyses of molecular functions, alternative splicing, and tissue-specific expression of 38,757 black bear EST sequences were conducted using the dog genome as a reference. We identified 18 genes, involved in functions such as lipid catabolism, cell cycle, and vesicle-mediated transport, that are showing rapid evolution in the bear lineage Three genes, Phospholamban (PLN), cysteine glycine-rich protein 3 (CSRP3) and Troponin I type 3 (TNNI3), are related to heart contraction, and defects in these genes in humans lead to heart disease. Two genes, biphenyl hydrolase-like (BPHL) and CSRP3, contain positively selected sites in bear. Global analysis of evolution rates of hibernation-related genes in bear showed that they are largely conserved and slowly evolving genes, rather than novel and fast-evolving genes. We provide a genomic resource for an important mammalian organism and our study sheds new light on the possible functions and evolution of bear genes.
Molecular Identification of Unusual Pathogenic Yeast Isolates by Large Ribosomal Subunit Gene Sequencing: 2 Years of Experience at the United Kingdom Mycology Reference Laboratory▿

PubMed Central

Linton, Christopher J.; Borman, Andrew M.; Cheung, Grace; Holmes, Ann D.; Szekely, Adrien; Palmer, Michael D.; Bridge, Paul D.; Campbell, Colin K.; Johnson, Elizabeth M.

2007-01-01

Rapid identification of yeast isolates from clinical samples is particularly important given their innately variable antifungal susceptibility profiles. We present here an analysis of the utility of PCR amplification and sequence analysis of the hypervariable D1/D2 region of the 26S rRNA gene for the identification of yeast species submitted to the United Kingdom Mycology Reference Laboratory over a 2-year period. A total of 3,033 clinical isolates were received from 2004 to 2006 encompassing 50 different yeast species. While more than 90% of the isolates, corresponding to the most common Candida species, could be identified by using the AUXACOLOR2 yeast identification kit, 153 isolates (5%), comprised of 47 species, could not be identified by using this system and were subjected to molecular identification via 26S rRNA gene sequencing. These isolates included some common species that exhibited atypical biochemical and phenotypic profiles and also many rarer yeast species that are infrequently encountered in the clinical setting. All 47 species requiring molecular identification were unambiguously identified on the basis of D1/D2 sequences, and the molecular identities correlated well with the observed biochemical profiles of the various organisms. Together, our data underscore the utility of molecular techniques as a reference adjunct to conventional methods of yeast identification. Further, we show that PCR amplification and sequencing of the D1/D2 region reliably identifies more than 45 species of clinically significant yeasts and can also potentially identify new pathogenic yeast species. PMID:17251397
Detection and identification of Theileria infection in sika deer ( Cervus nippon ) in China.

PubMed

He, Lan; Khan, Muhanmad Kasib; Zhang, Wen-Jie; Zhang, Qing-Li; Zhou, Yan-Qin; Hu, Min; Zhao, Junlong

2012-06-01

The sika deer ( Cervus nippon ) is a first-grade state-protected animal in China and designated a threatened species by the World Conservation Union. To detect hemoparasite infection of sika deer, blood samples were collected from 24 animals in the Hubei Province Deer Center. Genomic DNA was extracted, and the V4 hypervariable region encoding 18S rRNA was analyzed by reverse line blot hybridization assay. PCR products hybridized with Babesia / Theileria genus-specific probes but failed to hybridize with any of the Babesia or Theileria species-specific probes, suggesting the presence of a novel, or variant, species. Here 18S rRNA and internal transcribed spacer (ITS) genes were amplified, cloned, and sequenced from 7 isolates. Alignment and BlastN of the cloned sequences revealed high similarities to the homologous 18S rRNA genes and ITS genes of Theileria cervi (AY735122), Theileria sp. CNY1A (AB012194), and Theileria sp. ex Yamaguchi (AF529272). Phylogenetic analysis based on the 18S rRNA gene and ITS sequences showed that all cloned sequences were grouped within the Theileria clade. Phylogeny based on the 18S rRNA gene divided the organisms into 2 groups. Group 1 was closest to Theileria sp. ex Yamaguchi (AF529272), and group 2 was distinct from all other identified Theileria and Babesia species. These results suggest the existence of Theileria sp. infection in sika deer in China. To our knowledge, this is the first report of cervine Theileria sp. in China.
Phylogeographic Analysis of Mitochondrial DNA in Northern Asian Populations

PubMed Central

Derenko, Miroslava ; Malyarchuk, Boris ; Grzybowski, Tomasz ; Denisova, Galina ; Dambueva, Irina ; Perkova, Maria ; Dorzhu, Choduraa ; Luzina, Faina ; Lee, Hong Kyu ; Vanecek, Tomas ; Villems, Richard ; Zakharov, Ilia

2007-01-01

To elucidate the human colonization process of northern Asia and human dispersals to the Americas, a diverse subset of 71 mitochondrial DNA (mtDNA) lineages was chosen for complete genome sequencing from the collection of 1,432 control-region sequences sampled from 18 autochthonous populations of northern, central, eastern, and southwestern Asia. On the basis of complete mtDNA sequencing, we have revised the classification of haplogroups A, D2, G1, M7, and I; identified six new subhaplogroups (I4, N1e, G1c, M7d, M7e, and J1b2a); and fully characterized haplogroups N1a and G1b, which were previously described only by the first hypervariable segment (HVS1) sequencing and coding-region restriction-fragment–length polymorphism analysis. Our findings indicate that the southern Siberian mtDNA pool harbors several lineages associated with the Late Upper Paleolithic and/or early Neolithic dispersals from both eastern Asia and southwestern Asia/southern Caucasus. Moreover, the phylogeography of the D2 lineages suggests that southern Siberia is likely to be a geographical source for the last postglacial maximum spread of this subhaplogroup to northern Siberia and that the expansion of the D2b branch occurred in Beringia ∼7,000 years ago. In general, a detailed analysis of mtDNA gene pools of northern Asians provides the additional evidence to rule out the existence of a northern Asian route for the initial human colonization of Asia. PMID:17924343
Phylogeographic analysis of mitochondrial DNA in northern Asian populations.

PubMed

Derenko, Miroslava; Malyarchuk, Boris; Grzybowski, Tomasz; Denisova, Galina; Dambueva, Irina; Perkova, Maria; Dorzhu, Choduraa; Luzina, Faina; Lee, Hong Kyu; Vanecek, Tomas; Villems, Richard; Zakharov, Ilia

2007-11-01

To elucidate the human colonization process of northern Asia and human dispersals to the Americas, a diverse subset of 71 mitochondrial DNA (mtDNA) lineages was chosen for complete genome sequencing from the collection of 1,432 control-region sequences sampled from 18 autochthonous populations of northern, central, eastern, and southwestern Asia. On the basis of complete mtDNA sequencing, we have revised the classification of haplogroups A, D2, G1, M7, and I; identified six new subhaplogroups (I4, N1e, G1c, M7d, M7e, and J1b2a); and fully characterized haplogroups N1a and G1b, which were previously described only by the first hypervariable segment (HVS1) sequencing and coding-region restriction-fragment-length polymorphism analysis. Our findings indicate that the southern Siberian mtDNA pool harbors several lineages associated with the Late Upper Paleolithic and/or early Neolithic dispersals from both eastern Asia and southwestern Asia/southern Caucasus. Moreover, the phylogeography of the D2 lineages suggests that southern Siberia is likely to be a geographical source for the last postglacial maximum spread of this subhaplogroup to northern Siberia and that the expansion of the D2b branch occurred in Beringia ~7,000 years ago. In general, a detailed analysis of mtDNA gene pools of northern Asians provides the additional evidence to rule out the existence of a northern Asian route for the initial human colonization of Asia.
Evolutionary dynamics of an expressed MHC class IIβ locus in the Ranidae (Anura) uncovered by genome walking and high-throughput amplicon sequencing

USGS Publications Warehouse

Mulder, Kevin P.; Cortazar-Chinarro, Maria; Harris, D. James; Crottini, Angelica; Grant, Evan H. Campbell; Fleischer, Robert C.; Savage, Anna E.

2017-01-01

The Major Histocompatibility Complex (MHC) is a genomic region encoding immune loci that are important and frequently used markers in studies of adaptive genetic variation and disease resistance. Given the primary role of infectious diseases in contributing to global amphibian declines, we characterized the hypervariable exon 2 and flanking introns of the MHC Class IIβ chain for 17 species of frogs in the Ranidae, a speciose and cosmopolitan family facing widespread pathogen infections and declines. We find high levels of genetic variation concentrated in the Peptide Binding Region (PBR) of the exon. Ten codons are under positive selection, nine of which are located in the mammal-defined PBR. We hypothesize that the tenth codon (residue 21) is an amphibian-specific PBR site that may be important in disease resistance. Trans-species and trans-generic polymorphisms are evident from exon-based genealogies, and co-phylogenetic analyses between intron, exon and mitochondrial based reconstructions reveal incongruent topologies, likely due to different locus histories. We developed two sets of barcoded adapters that reliably amplify a single and likely functional locus in all screened species using both 454 and Illumina based sequencing methods. These primers provide a resource for multiplexing and directly sequencing hundreds of samples in a single sequencing run, avoiding the labour and chimeric sequences associated with cloning, and enabling MHC population genetic analyses. Although the primers are currently limited to the 17 species we tested, these sequences and protocols provide a useful genetic resource and can serve as a starting point for future disease, adaptation and conservation studies across a range of anuran taxa.
Some maternal lineages of domestic horses may have origins in East Asia revealed with further evidence of mitochondrial genomes and HVR-1 sequences.

PubMed

Ma, Hongying; Wu, Yajiang; Xiang, Hai; Yang, Yunzhou; Wang, Min; Zhao, Chunjiang; Wu, Changxin

2018-01-01

There are large populations of indigenous horse ( Equus caballus ) in China and some other parts of East Asia. However, their matrilineal genetic diversity and origin remained poorly understood. Using a combination of mitochondrial DNA (mtDNA) and hypervariable region (HVR-1) sequences, we aim to investigate the origin of matrilineal inheritance in these domestic horses. To investigate patterns of matrilineal inheritance in domestic horses, we conducted a phylogenetic study using 31 de novo mtDNA genomes together with 317 others from the GenBank. In terms of the updated phylogeny, a total of 5,180 horse mitochondrial HVR-1 sequences were analyzed. Eightteen haplogroups (Aw-Rw) were uncovered from the analysis of the whole mitochondrial genomes. Most of which have a divergence time before the earliest domestication of wild horses (about 5,800 years ago) and during the Upper Paleolithic (35-10 KYA). The distribution of some haplogroups shows geographic patterns. The Lw haplogroup contained a significantly higher proportion of European horses than the horses from other regions, while haplogroups Jw, Rw, and some maternal lineages of Cw, have a higher frequency in the horses from East Asia. The 5,180 sequences of horse mitochondrial HVR-1 form nine major haplogroups (A-I). We revealed a corresponding relationship between the haplotypes of HVR-1 and those of whole mitochondrial DNA sequences. The data of the HVR-1 sequences also suggests that Jw, Rw, and some haplotypes of Cw may have originated in East Asia while Lw probably formed in Europe. Our study supports the hypothesis of the multiple origins of the maternal lineage of domestic horses and some maternal lineages of domestic horses may have originated from East Asia.
Evolutionary dynamics of an expressed MHC class IIβ locus in the Ranidae (Anura) uncovered by genome walking and high-throughput amplicon sequencing.

PubMed

Mulder, Kevin P; Cortazar-Chinarro, Maria; Harris, D James; Crottini, Angelica; Campbell Grant, Evan H; Fleischer, Robert C; Savage, Anna E

2017-11-01

The Major Histocompatibility Complex (MHC) is a genomic region encoding immune loci that are important and frequently used markers in studies of adaptive genetic variation and disease resistance. Given the primary role of infectious diseases in contributing to global amphibian declines, we characterized the hypervariable exon 2 and flanking introns of the MHC Class IIβ chain for 17 species of frogs in the Ranidae, a speciose and cosmopolitan family facing widespread pathogen infections and declines. We find high levels of genetic variation concentrated in the Peptide Binding Region (PBR) of the exon. Ten codons are under positive selection, nine of which are located in the mammal-defined PBR. We hypothesize that the tenth codon (residue 21) is an amphibian-specific PBR site that may be important in disease resistance. Trans-species and trans-generic polymorphisms are evident from exon-based genealogies, and co-phylogenetic analyses between intron, exon and mitochondrial based reconstructions reveal incongruent topologies, likely due to different locus histories. We developed two sets of barcoded adapters that reliably amplify a single and likely functional locus in all screened species using both 454 and Illumina based sequencing methods. These primers provide a resource for multiplexing and directly sequencing hundreds of samples in a single sequencing run, avoiding the labour and chimeric sequences associated with cloning, and enabling MHC population genetic analyses. Although the primers are currently limited to the 17 species we tested, these sequences and protocols provide a useful genetic resource and can serve as a starting point for future disease, adaptation and conservation studies across a range of anuran taxa. Copyright © 2017 Elsevier Ltd. All rights reserved.
Five Complete Chloroplast Genome Sequences from Diospyros: Genome Organization and Comparative Analysis.

PubMed

Fu, Jianmin; Liu, Huimin; Hu, Jingjing; Liang, Yuqin; Liang, Jinjun; Wuyun, Tana; Tan, Xiaofeng

2016-01-01

Diospyros is the largest genus in Ebenaceae, comprising more than 500 species with remarkable economic value, especially Diospyros kaki Thunb., which has traditionally been an important food resource in China, Korea, and Japan. Complete chloroplast (cp) genomes from D. kaki, D. lotus L., D. oleifera Cheng., D. glaucifolia Metc., and Diospyros 'Jinzaoshi' were sequenced using Illumina sequencing technology. This is the first cp genome reported in Ebenaceae. The cp genome sequences of Diospyros ranged from 157,300 to 157,784 bp in length, presenting a typical quadripartite structure with two inverted repeats each separated by one large and one small single-copy region. For each cp genome, 134 genes were annotated, including 80 protein-coding, 31 tRNA, and 4 rRNA unique genes. In all, 179 repeats and 283 single sequence repeats were identified. Four hypervariable regions, namely, intergenic region of trnQ_rps16, trnV_ndhC, and psbD_trnT, and intron of ndhA, were identified in the Diospyros genomes. Phylogenetic analyses based on the whole cp genome, protein-coding, and intergenic and intron sequences indicated that D. oleifera is closely related to D. kaki and could be used as a model plant for future research on D. kaki; to our knowledge, this is proposed for the first time. Further, these analyses together with two large deletions (301 and 140 bp) in the cp genome of D. 'Jinzaoshi', support its placement as a new species in Diospyros. Both maximum parsimony and likelihood analyses for 19 taxa indicated the basal position of Ericales in asterids and suggested that Ebenaceae is monophyletic in Ericales.
Five Complete Chloroplast Genome Sequences from Diospyros: Genome Organization and Comparative Analysis

PubMed Central

Hu, Jingjing; Liang, Yuqin; Liang, Jinjun; Wuyun, Tana; Tan, Xiaofeng

2016-01-01

Diospyros is the largest genus in Ebenaceae, comprising more than 500 species with remarkable economic value, especially Diospyros kaki Thunb., which has traditionally been an important food resource in China, Korea, and Japan. Complete chloroplast (cp) genomes from D. kaki, D. lotus L., D. oleifera Cheng., D. glaucifolia Metc., and Diospyros ‘Jinzaoshi’ were sequenced using Illumina sequencing technology. This is the first cp genome reported in Ebenaceae. The cp genome sequences of Diospyros ranged from 157,300 to 157,784 bp in length, presenting a typical quadripartite structure with two inverted repeats each separated by one large and one small single-copy region. For each cp genome, 134 genes were annotated, including 80 protein-coding, 31 tRNA, and 4 rRNA unique genes. In all, 179 repeats and 283 single sequence repeats were identified. Four hypervariable regions, namely, intergenic region of trnQ_rps16, trnV_ndhC, and psbD_trnT, and intron of ndhA, were identified in the Diospyros genomes. Phylogenetic analyses based on the whole cp genome, protein-coding, and intergenic and intron sequences indicated that D. oleifera is closely related to D. kaki and could be used as a model plant for future research on D. kaki; to our knowledge, this is proposed for the first time. Further, these analyses together with two large deletions (301 and 140 bp) in the cp genome of D. ‘Jinzaoshi’, support its placement as a new species in Diospyros. Both maximum parsimony and likelihood analyses for 19 taxa indicated the basal position of Ericales in asterids and suggested that Ebenaceae is monophyletic in Ericales. PMID:27442423

Evaluations of Different Hypervariable Regions of Archaeal 16S rRNA Genes in Profiling of Methanogens by Archaea-Specific PCR and Denaturing Gradient Gel Electrophoresis▿

PubMed Central

Yu, Zhongtang; García-González, Rubén; Schanbacher, Floyd L.; Morrison, Mark

2008-01-01

Different hypervariable (V) regions of the archaeal 16S rRNA gene (rrs) were compared systematically to establish a preferred V region(s) for use in Archaea-specific PCR-denaturing gradient gel electrophoresis (DGGE). The PCR products of the V3 region produced the most informative DGGE profiles and permitted identification of common methanogens from rumen samples from sheep. This study also showed that different methanogens might be detected when different V regions are targeted by PCR-DGGE. Dietary fat appeared to transiently stimulate Methanosphaera stadtmanae but inhibit Methanobrevibacter sp. strain AbM4 in rumen samples. PMID:18083874
A Cleavable N-Terminal Signal Peptide Promotes Widespread Olfactory Receptor Surface Expression in HEK293T Cells

PubMed Central

Shepard, Blythe D.; Natarajan, Niranjana; Protzko, Ryan J.; Acres, Omar W.; Pluznick, Jennifer L.

2013-01-01

Olfactory receptors (ORs) are G protein-coupled receptors that detect odorants in the olfactory epithelium, and comprise the largest gene family in the genome. Identification of OR ligands typically requires OR surface expression in heterologous cells; however, ORs rarely traffic to the cell surface when exogenously expressed. Therefore, most ORs are orphan receptors with no known ligands. To date, studies have utilized non-cleavable rhodopsin (Rho) tags and/or chaperones (i.e. Receptor Transporting Protein, RTP1S, Ric8b and Gαolf) to improve surface expression. However, even with these tools, many ORs still fail to reach the cell surface. We used a test set of fifteen ORs to examine the effect of a cleavable leucine-rich signal peptide sequence (Lucy tag) on OR surface expression in HEK293T cells. We report here that the addition of the Lucy tag to the N-terminus increases the number of ORs reaching the cell surface to 7 of the 15 ORs (as compared to 3/15 without Rho or Lucy tags). Moreover, when ORs tagged with both Lucy and Rho were co-expressed with previously reported chaperones (RTP1S, Ric8b and Gαolf), we observed surface expression for all 15 receptors examined. In fact, two-thirds of Lucy-tagged ORs are able to reach the cell surface synergistically with chaperones even when the Rho tag is removed (10/15 ORs), allowing for the potential assessment of OR function with only an 8-amino acid Flag tag on the mature protein. As expected for a signal peptide, the Lucy tag was cleaved from the mature protein and did not alter OR-ligand binding and signaling. Our studies demonstrate that widespread surface expression of ORs can be achieved in HEK293T cells, providing promise for future large-scale deorphanization studies. PMID:23840901
Preparative SDS PAGE as an Alternative to His-Tag Purification of Recombinant Amelogenin

PubMed Central

Gabe, Claire M.; Brookes, Steven J.; Kirkham, Jennifer

2017-01-01

Recombinant protein technology provides an invaluable source of proteins for use in structure-function studies, as immunogens, and in the development of therapeutics. Recombinant proteins are typically engineered with “tags” that allow the protein to be purified from crude host cell extracts using affinity based chromatography techniques. Amelogenin is the principal component of the developing enamel matrix and a frequent focus for biomineralization researchers. Several groups have reported the successful production of recombinant amelogenins but the production of recombinant amelogenin free of any tags, and at single band purity on silver stained SDS PAGE is technically challenging. This is important, as rigorous structure-function research frequently demands a high degree of protein purity and fidelity of protein sequence. Our aim was to generate His-tagged recombinant amelogenin at single band purity on silver stained SDS PAGE for use in functionality studies after His-tag cleavage. An acetic acid extraction technique (previously reported to produce recombinant amelogenin at 95% purity directly from E. coli) followed by repeated rounds of nickel column affinity chromatography, failed to generate recombinant amelogenin at single band purity. This was because following an initial round of nickel column affinity chromatography, subsequent cleavage of the His-tag was not 100% efficient. A second round of nickel column affinity chromatography, used in attempts to separate the cleaved His-tag free recombinant from uncleaved His-tagged contaminants, was still unsatisfactory as cleaved recombinant amelogenin exhibited significant affinity for the nickel column. To solve this problem, we used preparative SDS PAGE to successfully purify cleaved recombinant amelogenins to single band purity on silver stained SDS PAGE. The resolving power of preparative SDS PAGE was such that His-tag based purification of recombinant amelogenin becomes redundant. We suggest that acetic acid extraction of recombinant amelogenin and subsequent purification using preparative SDS PAGE provides a simple route to highly purified His-tag free amelogenin for use in structure-function experiments and beyond. PMID:28670287
Comparative and Functional Genomics of Rhodococcus opacus PD630 for Biofuels Development

PubMed Central

Holder, Jason W.; Ulrich, Jil C.; DeBono, Anthony C.; Godfrey, Paul A.; Desjardins, Christopher A.; Zucker, Jeremy; Zeng, Qiandong; Leach, Alex L. B.; Ghiviriga, Ion; Dancel, Christine; Abeel, Thomas; Gevers, Dirk; Kodira, Chinnappa D.; Desany, Brian; Affourtit, Jason P.; Birren, Bruce W.; Sinskey, Anthony J.

2011-01-01

The Actinomycetales bacteria Rhodococcus opacus PD630 and Rhodococcus jostii RHA1 bioconvert a diverse range of organic substrates through lipid biosynthesis into large quantities of energy-rich triacylglycerols (TAGs). To describe the genetic basis of the Rhodococcus oleaginous metabolism, we sequenced and performed comparative analysis of the 9.27 Mb R. opacus PD630 genome. Metabolic-reconstruction assigned 2017 enzymatic reactions to the 8632 R. opacus PD630 genes we identified. Of these, 261 genes were implicated in the R. opacus PD630 TAGs cycle by metabolic reconstruction and gene family analysis. Rhodococcus synthesizes uncommon straight-chain odd-carbon fatty acids in high abundance and stores them as TAGs. We have identified these to be pentadecanoic, heptadecanoic, and cis-heptadecenoic acids. To identify bioconversion pathways, we screened R. opacus PD630, R. jostii RHA1, Ralstonia eutropha H16, and C. glutamicum 13032 for growth on 190 compounds. The results of the catabolic screen, phylogenetic analysis of the TAGs cycle enzymes, and metabolic product characterizations were integrated into a working model of prokaryotic oleaginy. PMID:21931557
Intein-mediated one-step purification of Escherichia coli secreted human antibody fragments.

DOE Office of Scientific and Technical Information (OSTI.GOV)

Wu, Wan-Yi; Miller, Keith D.; Coolbaugh, Michael

In this work, we apply self-cleaving affinity tag technology to several target proteins secreted into the Escherichia coli periplasm, including two with disulfide bonds. The target proteins were genetically fused to a self-cleaving chitin-binding domain intein tag for purification via a chitin agarose affinity resin. By attaching the intein-tagged fusion genes to the PelB secretion leader sequence, the tagged target proteins were secreted to the periplasmic space and could be recovered in active form by simple osmotic shock. After chitin-affinity purification, the target proteins were released from the chitin-binding domain tag via intein self-cleaving. This was induced by a smallmore » change in pH from 8.5 to 6.5 at room temperature, allowing direct elution of the cleaved target protein from the chitin affinity resin. The target proteins include the E. coli maltose-binding protein and b-lactamase enzyme, as well as two human antibody fragments that contain disulfide bonds. In all cases, the target proteins were purified with good activity and yield, without the need for refolding. Overall, this work demonstrates the compatibility of the DI-CM intein with the PelB secretion system in E. coli, greatly expanding its potential to more complex proteins.« less
Genetics Home Reference: 3-M syndrome

MedlinePlus

... complex, interfering with the process of tagging unneeded proteins for degradation. The body's response to growth hormones may be ... Kirk J, Chandler K, Kingston H, Donnai D, Clayton PE, Black GC. Exome sequencing identifies CCDC8 mutations in 3-M syndrome, suggesting ...
Characterization of PhPRP1, a histidine domain arabinogalactan protein from Petunia hybrida pistils.

PubMed

Twomey, Megan C; Brooks, Jenna K; Corey, Jillaine M; Singh-Cundy, Anu

2013-10-15

An arabinogalactan protein, PhPRP1, was purified from Petunia hybrida pistils and shown to be orthologous to TTS-1 and TTS-2 from Nicotiana tabacum and NaTTS from Nicotiana alata. Sequence comparisons among these proteins, and CaPRP1 from Capsicum annuum, reveal a conserved histidine-rich domain and two hypervariable domains. Immunoblots show that TTS-1 and PhPRP1 are also expressed in vegetative tissues of tobacco and petunia respectively. In contrast to the molecular mass heterogeneity displayed by the pistil proteins, the different isoforms found in seedlings, roots, and leaves each has a discrete size (37, 80, 160, and 200 kDa) on SDS-PAGE gels. On the basis of their chemistry, distinctive domain architecture, and the unique pattern of expression, we have named this group of proteins HD-AGPs (histidine domain-arabinogalactan proteins). Copyright © 2013 Elsevier GmbH. All rights reserved.
Changes in Abundance of Oral Microbiota Associated with Oral Cancer

PubMed Central

Schmidt, Brian L.; Kuczynski, Justin; Bhattacharya, Aditi; Huey, Bing; Corby, Patricia M.; Queiroz, Erica L. S.; Nightingale, Kira; Kerr, A. Ross; DeLacure, Mark D.; Veeramachaneni, Ratna; Olshen, Adam B.; Albertson, Donna G.

2014-01-01

Individual bacteria and shifts in the composition of the microbiome have been associated with human diseases including cancer. To investigate changes in the microbiome associated with oral cancers, we profiled cancers and anatomically matched contralateral normal tissue from the same patient by sequencing 16S rDNA hypervariable region amplicons. In cancer samples from both a discovery and a subsequent confirmation cohort, abundance of Firmicutes (especially Streptococcus) and Actinobacteria (especially Rothia) was significantly decreased relative to contralateral normal samples from the same patient. Significant decreases in abundance of these phyla were observed for pre-cancers, but not when comparing samples from contralateral sites (tongue and floor of mouth) from healthy individuals. Weighted UniFrac principal coordinates analysis based on 12 taxa separated most cancers from other samples with greatest separation of node positive cases. These studies begin to develop a framework for exploiting the oral microbiome for monitoring oral cancer development, progression and recurrence. PMID:24887397
Association between mitochondrial DNA variations and schizophrenia in the northern Chinese Han population.

PubMed

Xu, Feng-Ling; Ding, Mei; Yao, Jun; Shi, Zhang-Sen; Wu, Xue; Zhang, Jing-Jing; Pang, Hao; Xing, Jia-Xin; Xuan, Jin-Feng; Wang, Bao-Jie

2017-01-01

To determine whether mitochondrial DNA (mtDNA) variations are associated with schizophrenia, 313 patients with schizophrenia and 326 unaffected participants of the northern Chinese Han population were included in a prospective study. Single-nucleotide polymorphisms (SNPs) including C5178A, A10398G, G13708A, and C13928G were analyzed by polymerase chain reaction-restriction fragment length polymorphism (PCR-RFLP). Hypervariable regions I and II (HVSI and HVSII) were analyzed by sequencing. The results showed that the 4 SNPs and 11 haplotypes, composed of the 4 SNPs, did not differ significantly between patient and control groups. No significant association between haplogroups and the risk of schizophrenia was ascertained after Bonferroni correction. Drawing a conclusion, there was no evidence of an association between mtDNA (the 4 SNPs and the control region) and schizophrenia in the northern Chinese Han population.
DNA analyses of the remains of the Prince Branciforte Barresi family.

PubMed

Rickards, O; Martínez-Labarga, C; Favaro, M; Frezza, D; Mallegni, F

2001-01-01

The five skeletons found buried in the church of Militello di Catania, Sicily, were tentatively identified by morphological analysis and historical reports as the remains of Prince Branciforte Barresi, two of his children, his brother and another juvenile member of the family (sixteenth and seventeenth centuries). In order to attempt to clarify the degree of relationships of the five skeletons, sex testing and mitochondrial DNA (mtDNA) sequence analysis of the hypervariable segments I and II (HV1 and HV2) of control region were performed. Moreover, the 9 bp-deletion marker of region V (COII/tRNAlys) was examined. Molecular genetic analyses were consistent with historical expectations, although they did not directly demonstrate that these are in fact the remains of the Prince and his relatives, due to the impossibility of obtaining DNA from living maternal relatives of the Prince.
Effect of migration patterns on maternal genetic structure: a case of Tai-Kadai migration from China to Thailand.

PubMed

Kampuansai, Jatupol; Kutanan, Wibhu; Tassi, Francesca; Kaewgahya, Massupa; Ghirotto, Silvia; Kangwanpong, Daoroong

2017-02-01

The migration of the Tai-Kadai speaking people from southern China to northern Thailand over the past hundreds of years has revealed numerous patterns that have likely been influenced by routes, purposes and periods of time. To study the effects of different migration patterns on Tai-Kadai maternal genetic structure, mitochondrial DNA hypervariable region I sequences from the Yong and the Lue people having well-documented histories in northern Thailand were analyzed. Although the Yong and Lue people were historically close relatives who shared Xishuangbanna Dai ancestors, significant genetic differences have been observed among them. The Yong people who have been known to practice mass migration have exhibited a closer genetic affinity to their Dai ancestors than have the Lue people. Genetic heterogeneity and a sudden reduced effective population size within the Lue group is likely a direct result of the circumstances of the founder effect.
Comparison of next generation sequencing technologies for transcriptome characterization

PubMed Central

2009-01-01

Background We have developed a simulation approach to help determine the optimal mixture of sequencing methods for most complete and cost effective transcriptome sequencing. We compared simulation results for traditional capillary sequencing with "Next Generation" (NG) ultra high-throughput technologies. The simulation model was parameterized using mappings of 130,000 cDNA sequence reads to the Arabidopsis genome (NCBI Accession SRA008180.19). We also generated 454-GS20 sequences and de novo assemblies for the basal eudicot California poppy (Eschscholzia californica) and the magnoliid avocado (Persea americana) using a variety of methods for cDNA synthesis. Results The Arabidopsis reads tagged more than 15,000 genes, including new splice variants and extended UTR regions. Of the total 134,791 reads (13.8 MB), 119,518 (88.7%) mapped exactly to known exons, while 1,117 (0.8%) mapped to introns, 11,524 (8.6%) spanned annotated intron/exon boundaries, and 3,066 (2.3%) extended beyond the end of annotated UTRs. Sequence-based inference of relative gene expression levels correlated significantly with microarray data. As expected, NG sequencing of normalized libraries tagged more genes than non-normalized libraries, although non-normalized libraries yielded more full-length cDNA sequences. The Arabidopsis data were used to simulate additional rounds of NG and traditional EST sequencing, and various combinations of each. Our simulations suggest a combination of FLX and Solexa sequencing for optimal transcriptome coverage at modest cost. We have also developed ESTcalc http://fgp.huck.psu.edu/NG_Sims/ngsim.pl, an online webtool, which allows users to explore the results of this study by specifying individualized costs and sequencing characteristics. Conclusion NG sequencing technologies are a highly flexible set of platforms that can be scaled to suit different project goals. In terms of sequence coverage alone, the NG sequencing is a dramatic advance over capillary-based sequencing, but NG sequencing also presents significant challenges in assembly and sequence accuracy due to short read lengths, method-specific sequencing errors, and the absence of physical clones. These problems may be overcome by hybrid sequencing strategies using a mixture of sequencing methodologies, by new assemblers, and by sequencing more deeply. Sequencing and microarray outcomes from multiple experiments suggest that our simulator will be useful for guiding NG transcriptome sequencing projects in a wide range of organisms. PMID:19646272
Complex sputum microbial composition in patients with pulmonary tuberculosis

PubMed Central

2012-01-01

Background An increasing number of studies have implicated the microbiome in certain diseases, especially chronic diseases. In this study, the bacterial communities in the sputum of pulmonary tuberculosis patients were explored. Total DNA was extracted from sputum samples from 31 pulmonary tuberculosis patients and respiratory secretions of 24 healthy participants. The 16S rRNA V3 hyper-variable regions were amplified using bar-coded primers and pyro-sequenced using Roche 454 FLX. Results The results showed that the microbiota in the sputum of pulmonary tuberculosis patients were more diverse than those of healthy participants (p<0.05). The sequences were classified into 24 phyla, all of which were found in pulmonary tuberculosis patients and 17 of which were found in healthy participants. Furthermore, many foreign bacteria, such as Stenotrophomonas, Cupriavidus, Pseudomonas, Thermus, Sphingomonas, Methylobacterium, Diaphorobacter, Comamonas, and Mobilicoccus, were unique to pulmonary tuberculosis patients. Conclusions This study concluded that the microbial composition of the respiratory tract of pulmonary tuberculosis patients is more complicated than that of healthy participants, and many foreign bacteria were found in the sputum of pulmonary tuberculosis patients. The roles of these foreign bacteria in the onset or development of pulmonary tuberculosis shoud be considered by clinicians. PMID:23176186
Whole-loop mitochondrial DNA D-loop sequence variability in Egyptian Arabian equine matrilines

PubMed Central

Hudson, William

2017-01-01

Background Egyptian Arabian horses have been maintained in a state of genetic isolation for over a hundred years. There is only limited genetic proof that the studbook records of female lines of Egyptian Arabian pedigrees are reliable. This study characterized the mitochondrial DNA (mtDNA) signatures of 126 horses representing 14 matrilines in the Egyptian Agricultural Organization (EAO) horse-breeding program. Findings Analysis of the whole D-loop sequence yielded additional information compared to hypervariable region-1 (HVR1) analysis alone, with 42 polymorphic sites representing ten haplotypes compared to 16 polymorphic sites representing nine haplotypes, respectively. Most EAO haplotypes belonged to ancient haplogroups, suggesting origin from a wide geographical area over many thousands of years, although one haplotype was novel. Conclusions Historical families share haplotypes and some individuals from different strains belonged to the same haplogroup: the classical EAO strain designation is not equivalent to modern monophyletic matrilineal groups. Phylogenetic inference showed that the foundation mares of the historical haplotypes were highly likely to have the same haplotypes as the animals studied (p > 0.998 in all cases), confirming the reliability of EAO studbook records and providing the opportunity for breeders to confirm the ancestry of their horses. PMID:28859174
Evidence for a genetic discontinuity between Neandertals and 24,000-year-old anatomically modern Europeans.

PubMed

Caramelli, David; Lalueza-Fox, Carles; Vernesi, Cristiano; Lari, Martina; Casoli, Antonella; Mallegni, Francesco; Chiarelli, Brunetto; Dupanloup, Isabelle; Bertranpetit, Jaume; Barbujani, Guido; Bertorelle, Giorgio

2003-05-27

During the late Pleistocene, early anatomically modern humans coexisted in Europe with the anatomically archaic Neandertals for some thousand years. Under the recent variants of the multiregional model of human evolution, modern and archaic forms were different but related populations within a single evolving species, and both have contributed to the gene pool of current humans. Conversely, the Out-of-Africa model considers the transition between Neandertals and anatomically modern humans as the result of a demographic replacement, and hence it predicts a genetic discontinuity between them. Following the most stringent current standards for validation of ancient DNA sequences, we typed the mtDNA hypervariable region I of two anatomically modern Homo sapiens sapiens individuals of the Cro-Magnon type dated at about 23 and 25 thousand years ago. Here we show that the mtDNAs of these individuals fall well within the range of variation of today's humans, but differ sharply from the available sequences of the chronologically closer Neandertals. This discontinuity is difficult to reconcile with the hypothesis that both Neandertals and early anatomically modern humans contributed to the current European gene pool.
Changes in community structure of active protistan assemblages from the lower Pearl River to coastal Waters of the South China Sea.

PubMed

Li, Ran; Jiao, Nianzhi; Warren, Alan; Xu, Dapeng

2018-04-01

Protists make up an important component of aquatic ecosystems, playing crucial roles in biogeochemical processes on local and global scales. To reveal the changes of diversity and community structure of protists along the salinity gradients, community compositions of active protistan assemblages were characterized along a transect from the lower Pearl River estuary to the open waters of the South China Sea (SCS), using high-throughput sequencing of the hyper-variable V9 regions of 18S rRNA. This study showed that the alpha diversity of protists, both in the freshwater and in the coastal SCS stations was higher than that in the estuary. The protist community structure also changed along the salinity gradient. The relative sequence abundance of Stramenopiles was highest at stations with lower salinity and decreased with the increasing of salinity. By contrast, the contributions of Alveolata, Hacrobia and Rhizaria to the protistan communities generally increased with the increasing of salinity. The composition of the active protistan community was strongly correlated with salinity, indicating that salinity was the dominant factor among measured environmental parameters affecting protistan community composition and structure. Copyright © 2018 Elsevier GmbH. All rights reserved.
Forensic and phylogeographic characterisation of mtDNA lineages from Somalia.

PubMed

Mikkelsen, Martin; Fendt, Liane; Röck, Alexander W; Zimmermann, Bettina; Rockenbauer, Eszter; Hansen, Anders J; Parson, Walther; Morling, Niels

2012-07-01

The African mitochondrial (mt) phylogeny is coarsely resolved but the majority of population data generated so far is limited to the analysis of the first hypervariable segment (HVS-1) of the control region (CR). Therefore, this study aimed on the investigation of the entire CR of 190 unrelated Somali individuals to enrich the severely underrepresented African mtDNA pool. The majority (60.5 %) of the haplotypes were of sub-Saharan origin with L0a1d, L2a1h and L3f being the most frequently observed haplogroups. This is in sharp contrast to previous data reported from the Y-chromosome, where only about 5 % of the observed haplogroups were of sub-Saharan provenance. We compared the genetic distances based on population pairwise F (st) values between 11 published East, Central and North African as well as western Asian populations and the Somali sequences and displayed them in a multi-dimensional scaling plot. Genetic proximity evidenced by clustering roughly reflected the relative geographic location of the populations. The sequences will be included in the EMPOP database ( www.empop.org ) under accession number EMP00397 upon publication (Parson and Dür Forensic Sci Int Genet 1:88-92, 2007).
Structured oligonucleotides for target indexing to allow single-vessel PCR amplification and solid support microarray hybridization

PubMed Central

Girard, Laurie D.; Boissinot, Karel; Peytavi, Régis; Boissinot, Maurice; Bergeron, Michel G.

2014-01-01

The combination of molecular diagnostic technologies is increasingly used to overcome limitations on sensitivity, specificity or multiplexing capabilities, and provide efficient lab-on-chip devices. Two such techniques, PCR amplification and microarray hybridization are used serially to take advantage of the high sensitivity and specificity of the former combined with high multiplexing capacities of the latter. These methods are usually performed in different buffers and reaction chambers. However, these elaborate methods have a high complexity cost related to reagent requirements, liquid storage and the number of reaction chambers to integrate into automated devices. Furthermore, microarray hybridizations have a sequence dependent efficiency not always predictable. In this work, we have developed the concept of a structured oligonucleotide probe which is activated by cleavage from polymerase exonuclease activity. This technology is called SCISSOHR for Structured Cleavage Induced Single-Stranded Oligonucleotide Hybridization Reaction. The SCISSOHR probes enable indexing the target sequence to a tag sequence. The SCISSOHR technology also allows the combination of nucleic acid amplification and microarray hybridization in a single vessel in presence of the PCR buffer only. The SCISSOHR technology uses an amplification probe that is irreversibly modified in presence of the target, releasing a single-stranded DNA tag for microarray hybridization. Each tag is composed of a 3-nucleotidesequence-dependent segment and a unique “target sequence-independent” 14-nucleotide segment allowing for optimal hybridization with minimal cross-hybridization. We evaluated the performance of five (5) PCR buffers to support microarray hybridization, compared to a conventional hybridization buffer. Finally, as a proof of concept, we developed a multiplexed assay for the amplification, detection, and identification of three (3) DNA targets. This new technology will facilitate the design of lab-on-chip microfluidic devices, while also reducing consumable costs. At term, it will allow the cost-effective automation of highly multiplexed assays for detection and identification of genetic targets. PMID:25489607
The effectiveness of three regions in mitochondrial genome for aphid DNA barcoding: a case in Lachininae.

PubMed

Chen, Rui; Jiang, Li-Yun; Qiao, Ge-Xia

2012-01-01

The mitochondrial gene COI has been widely used by taxonomists as a standard DNA barcode sequence for the identification of many animal species. However, the COI region is of limited use for identifying certain species and is not efficiently amplified by PCR in all animal taxa. To evaluate the utility of COI as a DNA barcode and to identify other barcode genes, we chose the aphid subfamily Lachninae (Hemiptera: Aphididae) as the focus of our study. We compared the results obtained using COI with two other mitochondrial genes, COII and Cytb. In addition, we propose a new method to improve the efficiency of species identification using DNA barcoding. Three mitochondrial genes (COI, COII and Cytb) were sequenced and were used in the identification of over 80 species of Lachninae. The COI and COII genes demonstrated a greater PCR amplification efficiency than Cytb. Species identification using COII sequences had a higher frequency of success (96.9% in "best match" and 90.8% in "best close match") and yielded lower intra- and higher interspecific genetic divergence values than the other two markers. The use of "tag barcodes" is a new approach that involves attaching a species-specific tag to the standard DNA barcode. With this method, the "barcoding overlap" can be nearly eliminated. As a result, we were able to increase the identification success rate from 83.9% to 95.2% by using COI and the "best close match" technique. A COII-based identification system should be more effective in identifying lachnine species than COI or Cytb. However, the Cytb gene is an effective marker for the study of aphid population genetics due to its high sequence diversity. Furthermore, the use of "tag barcodes" can improve the accuracy of DNA barcoding identification by reducing or removing the overlap between intra- and inter-specific genetic divergence values.
Expression and purification of recombinant proteins in Escherichia coli tagged with the metal-binding protein CusF.

PubMed

Cantu-Bustos, J Enrique; Vargas-Cortez, Teresa; Morones-Ramirez, Jose Ruben; Balderas-Renteria, Isaias; Galbraith, David W; McEvoy, Megan M; Zarate, Xristo

2016-05-01

Production of recombinant proteins in Escherichia coli has been improved considerably through the use of fusion proteins, because they increase protein solubility and facilitate purification via affinity chromatography. In this article, we propose the use of CusF as a new fusion partner for expression and purification of recombinant proteins in E. coli. Using a cell-free protein expression system, based on the E. coli S30 extract, Green Fluorescent Protein (GFP) was expressed with a series of different N-terminal tags, immobilized on self-assembled protein microarrays, and its fluorescence quantified. GFP tagged with CusF showed the highest fluorescence intensity, and this was greater than the intensities from corresponding GFP constructs that contained MBP or GST tags. Analysis of protein production in vivo showed that CusF produces large amounts of soluble protein with low levels of inclusion bodies. Furthermore, fusion proteins can be exported to the cellular periplasm, if CusF contains the signal sequence. Taking advantage of its ability to bind copper ions, recombinant proteins can be purified with readily available IMAC resins charged with this metal ion, producing pure proteins after purification and tag removal. We therefore recommend the use of CusF as a viable alternative to MBP or GST as a fusion protein/affinity tag for the production of soluble recombinant proteins in E. coli. Copyright © 2016 Elsevier Inc. All rights reserved.

Control of very low-density lipoprotein secretion by N-ethylmaleimide-sensitive factor and miR-33.

PubMed

Allen, Ryan M; Marquart, Tyler J; Jesse, Jordan J; Baldán, Angel

2014-06-20

Several reports suggest that antisense oligonucleotides against miR-33 might reduce cardiovascular risk in patients by accelerating the reverse cholesterol transport pathway. However, conflicting reports exist about the impact of anti-miR-33 therapy on the levels of very low-density lipoprotein-triglycerides (VLDL-TAG). We test the hypothesis that miR-33 controls hepatic VLDL-TAG secretion. Using therapeutic silencing of miR-33 and adenoviral overexpression of miR-33, we show that miR-33 limits hepatic secretion of VLDL-TAG by targeting N-ethylmaleimide-sensitive factor (NSF), both in vivo and in primary hepatocytes. We identify conserved sequences in the 3'UTR of NSF as miR-33 responsive elements and show that Nsf is specifically recruited to the RNA-induced silencing complex following induction of miR-33. In pulse-chase experiments, either miR-33 overexpression or knock-down of Nsf lead to decreased secretion of apolipoproteins and TAG in primary hepatocytes, compared with control cells. Importantly, Nsf rescues miR-33-dependent reduced secretion. Finally, we show that overexpression of Nsf in vivo increases global hepatic secretion and raises plasma VLDL-TAG. Together, our data reveal key roles for the miR-33-NSF axis during hepatic secretion and suggest that caution should be taken with anti-miR-33-based therapies because they might raise proatherogenic VLDL-TAG levels. © 2014 American Heart Association, Inc.
Mining candidate genes associated with powdery mildew resistance in cucumber via super-BSA by specific length amplified fragment (SLAF) sequencing.

PubMed

Zhang, Peng; Zhu, Yuqiang; Wang, Lili; Chen, Liping; Zhou, Shengjun

2015-12-14

Powdery mildew (PM) is the most common fungal disease of cucumber and other cucurbit crops, while breeding the PM-resistant materials is the effective way to defense this disease, and the recent development of modern genetics and genomics make us aware of that studying the resistance genes is the essential way to breed the PM high-resistance plant. With the ever increasing throughput of next-generation sequencing (NGS), the development of specific length amplified fragment sequencing (SLAF-seq) as a high-resolution strategy for large-scale de novo SNP discovery is gradually applied for functional gene mining. Here we combined the bulked segregant analysis (BSA) with SLAF-seq to identify candidate genes associated with PM resistance in cucumber. A segregating population comprising 251 F2 individuals was developed using H136 (female parent) as susceptible parent and BK2 (male parent) as resistance donor. After PMR test, total genomic DNA was prepared from each plant. Systemic genomic analysis of the GC content, repeat sequence, etc. was carried out by prediction software SLAF_Predict to establish condition to ensure the uniformity and density of the molecular markers. After samples were gel purified, SLAFs were generated at Biomarker Technologies Corporation in Beijing. Based on SLAF tags and the PMR test result, the hot region were annotated. A total of 73,100 high-quality SLAF tags with an average depth of 99.11× were sequenced. Among these, 5,355 polymorphic tags were identified with a polymorphism rate of 7.34 %, including 7.09 % SNPs and other polymorphism types. Finally, 140 associated SLAFs were identified, and two main Hot Regions were detected on chromosome 1 and 6, which contained five genes invovled in defense response, toxin metabolism, cell stress response, and injury response in cucumber. Associated markers identified by super-BSA in this study, could not only speed up the study of the PMR genes, but also provide a feasible solution for breeding the marker-assisted PMR cucumber. Moreover, this study could also be extended to any other species with reference genome.
Live single-cell laser tag.

PubMed

Binan, Loïc; Mazzaferri, Javier; Choquet, Karine; Lorenzo, Louis-Etienne; Wang, Yu Chang; Affar, El Bachir; De Koninck, Yves; Ragoussis, Jiannis; Kleinman, Claudia L; Costantino, Santiago

2016-05-20

The ability to conduct image-based, non-invasive cell tagging, independent of genetic engineering, is key to cell biology applications. Here we introduce cell labelling via photobleaching (CLaP), a method that enables instant, specific tagging of individual cells based on a wide array of criteria such as shape, behaviour or positional information. CLaP uses laser illumination to crosslink biotin onto the plasma membrane, coupled with streptavidin conjugates to label individual cells for genomic, cell-tracking, flow cytometry or ultra-microscopy applications. We show that the incorporated mark is stable, non-toxic, retained for several days, and transferred by cell division but not to adjacent cells in culture. To demonstrate the potential of CLaP for genomic applications, we combine CLaP with microfluidics-based single-cell capture followed by transcriptome-wide next-generation sequencing. Finally, we show that CLaP can also be exploited for inducing transient cell adhesion to substrates for microengineering cultures with spatially patterned cell types.
Community Structures of Fecal Bacteria in Cattle from Different Animal Feeding Operations▿†

PubMed Central

Shanks, Orin C.; Kelty, Catherine A.; Archibeque, Shawn; Jenkins, Michael; Newton, Ryan J.; McLellan, Sandra L.; Huse, Susan M.; Sogin, Mitchell L.

2011-01-01

The fecal microbiome of cattle plays a critical role not only in animal health and productivity but also in food safety, pathogen shedding, and the performance of fecal pollution detection methods. Unfortunately, most published molecular surveys fail to provide adequate detail about variability in the community structures of fecal bacteria within and across cattle populations. Using massively parallel pyrosequencing of a hypervariable region of the rRNA coding region, we profiled the fecal microbial communities of cattle from six different feeding operations where cattle were subjected to consistent management practices for a minimum of 90 days. We obtained a total of 633,877 high-quality sequences from the fecal samples of 30 adult beef cattle (5 individuals per operation). Sequence-based clustering and taxonomic analyses indicate less variability within a population than between populations. Overall, bacterial community composition correlated significantly with fecal starch concentrations, largely reflected in changes in the Bacteroidetes, Proteobacteria, and Firmicutes populations. In addition, network analysis demonstrated that annotated sequences clustered by management practice and fecal starch concentration, suggesting that the structures of bovine fecal bacterial communities can be dramatically different in different animal feeding operations, even at the phylum and family taxonomic levels, and that the feeding operation is a more important determinant of the cattle microbiome than is the geographic location of the feedlot. PMID:21378055
Phylogeographic Differentiation of Mitochondrial DNA in Han Chinese

PubMed Central

Yao, Yong-Gang; Kong, Qing-Peng; Bandelt, Hans-Jürgen; Kivisild, Toomas; Zhang, Ya-Ping

2002-01-01

To characterize the mitochondrial DNA (mtDNA) variation in Han Chinese from several provinces of China, we have sequenced the two hypervariable segments of the control region and the segment spanning nucleotide positions 10171–10659 of the coding region, and we have identified a number of specific coding-region mutations by direct sequencing or restriction-fragment–length–polymorphism tests. This allows us to define new haplogroups (clades of the mtDNA phylogeny) and to dissect the Han mtDNA pool on a phylogenetic basis, which is a prerequisite for any fine-grained phylogeographic analysis, the interpretation of ancient mtDNA, or future complete mtDNA sequencing efforts. Some of the haplogroups under study differ considerably in frequencies across different provinces. The southernmost provinces show more pronounced contrasts in their regional Han mtDNA pools than the central and northern provinces. These and other features of the geographical distribution of the mtDNA haplogroups observed in the Han Chinese make an initial Paleolithic colonization from south to north plausible but would suggest subsequent migration events in China that mainly proceeded from north to south and east to west. Lumping together all regional Han mtDNA pools into one fictive general mtDNA pool or choosing one or two regional Han populations to represent all Han Chinese is inappropriate for prehistoric considerations as well as for forensic purposes or medical disease studies. PMID:11836649
ITSoneDB: a comprehensive collection of eukaryotic ribosomal RNA Internal Transcribed Spacer 1 (ITS1) sequences

PubMed Central

Santamaria, Monica; Fosso, Bruno; Licciulli, Flavio; Balech, Bachir; Larini, Ilaria; Grillo, Giorgio; De Caro, Giorgio; Liuni, Sabino

2018-01-01

Abstract A holistic understanding of environmental communities is the new challenge of metagenomics. Accordingly, the amplicon-based or metabarcoding approach, largely applied to investigate bacterial microbiomes, is moving to the eukaryotic world too. Indeed, the analysis of metabarcoding data may provide a comprehensive assessment of both bacterial and eukaryotic composition in a variety of environments, including human body. In this respect, whereas hypervariable regions of the 16S rRNA are the de facto standard barcode for bacteria, the Internal Transcribed Spacer 1 (ITS1) of ribosomal RNA gene cluster has shown a high potential in discriminating eukaryotes at deep taxonomic levels. As metabarcoding data analysis rely on the availability of a well-curated barcode reference resource, a comprehensive collection of ITS1 sequences supplied with robust taxonomies, is highly needed. To address this issue, we created ITSoneDB (available at http://itsonedb.cloud.ba.infn.it/) which in its current version hosts 985 240 ITS1 sequences spanning over 134 000 eukaryotic species. Each ITS1 is mapped on the NCBI reference taxonomy with its start and end positions precisely annotated. ITSoneDB has been developed in agreement to the FAIR guidelines by enabling the users to query and download its content through a simple web-interface and access relevant metadata by cross-linking to European Nucleotide Archive. PMID:29036529
Comparison of the gut microbial community between obese and lean peoples using 16S gene sequencing in a Japanese population.

PubMed

Andoh, Akira; Nishida, Atsushi; Takahashi, Kenichiro; Inatomi, Osamu; Imaeda, Hirotsugu; Bamba, Shigeki; Kito, Katsuyuki; Sugimoto, Mitsushige; Kobayashi, Toshio

2016-07-01

Altered gut microbial ecology contributes to the development of metabolic diseases including obesity. In this study, we performed 16S rRNA sequence analysis of the gut microbiota profiles of obese and lean Japanese populations. The V3-V4 hypervariable regions of 16S rRNA of fecal samples from 10 obese and 10 lean volunteers were sequenced using the Illumina MiSeq(TM)II system. The average body mass index of the obese and lean group were 38.1 and 16.6 kg/m(2), respectively (p<0.01). The Shannon diversity index was significantly higher in the lean group than in the obese group (p<0.01). The phyla Firmicutes and Fusobacteria were significantly more abundant in obese people than in lean people. The abundance of the phylum Bacteroidetes and the Bacteroidetes/Firmicutes ratio were not different between the obese and lean groups. The genera Alistipes, Anaerococcus, Corpococcus, Fusobacterium and Parvimonas increased significantly in obese people, and the genera Bacteroides, Desulfovibrio, Faecalibacterium, Lachnoanaerobaculum and Olsenella increased significantly in lean people. Bacteria species possessing anti-inflammatory properties, such as Faecalibacterium prausnitzii, increased significantly in lean people, but bacteria species possessing pro-inflammatory properties increased in obese people. Obesity-associated gut microbiota in the Japanese population was different from that in Western people.
Next-Generation Sequencing Analyses of Bacterial Community Structures in Soybean Pastes Produced in Northeast China.

PubMed

Lee, Mi-Hwa; Li, Fan-Zhu; Lee, Jiyeon; Kang, Jisu; Lim, Seong-Il; Nam, Young-Do

2017-04-01

Fermented soybean foods contain nutritional components including easily digestible peptides, cholesterol-free oils, minerals, and vitamins. Various fermented soybean foods have been developed and are consumed as flavoring condiments in Asian regions. While the quality of fermented soybean foods is largely affected by microorganisms that participate in the fermentation process, our knowledge about the microorganisms in soybean pastes manufactured in Northeast China is limited. The current study used a culture-independent barcoded pyrosequencing method targeting hypervariable V1/V2 regions of the 16S rRNA gene to evaluate Korean doenjang and soybean pastes prepared by the Hun Chinese (SPHC) and Korean minority (SPKM) populations in Northeast China. In total, 63399 high-quality sequences were derived from 16 soybean paste samples collected in Northeast China. Each bacterial species-level taxon of SPHC, SPKM, and Korean doenjang was clustered separately. Each paste contained representative bacterial species that could be distinguished from each other: Bacillus subtilis in SPKM, Tetragenococcus halophilus in SPHC, and Enterococcus durans in Korean doenjang. This is the 1st massive sequencing-based study analyzing microbial communities in soybean pastes manufactured in Northeast China, compared to Korean doenjang. Our results clearly showed that each soybean paste contained unique microbial communities that varied depending on the manufacturing process and location. © 2017 Institute of Food Technologists®.
Analysis of new isolates reveals new genome organization and a hypervariable region in infectious myonecrosis virus (IMNV).

PubMed

Dantas, Márcia Danielle A; Chavante, Suely F; Teixeira, Dárlio Inácio A; Lima, João Paulo M S; Lanza, Daniel C F

2015-05-04

Infectious myonecrosis virus (IMNV) has been the cause of many losses in shrimp farming since 2002, when the first myonecrosis outbreak was reported at Brazilian's northeast coast. Two additional genomes of Brazilian IMNV isolates collected in 2009 and 2013 were sequenced and analyzed in the present study. The sequencing revealed extra 643 bp and 22 bp, at 5' and 3' ends of IMNV genome respectively, confirming that its actual size is at least 8226 bp long. Considering these additional sequences in genome extremities, ORF1 can starts at nt 470, encoding a 1708 aa polyprotein. Computational predictions reveal two stem loops and two pseudoknots in the 5' end and a putative stem loop and a slippery motif located at 3' end, indicating that these regions can be involved in the start and termination of translation. Through a careful phylogenetic analysis, a higher genetic variability among Brazilian isolates could be observed, comparing with Indonesian IMNV isolates. It was also observed that the most variable region of IMNV genome is located in the first half of ORF1, coinciding with a region which probably encodes the capsid protrusions. The results presented here are a starting point to elucidate the viral's translational regulation and the mechanisms involved in virulence. Copyright © 2015 Elsevier B.V. All rights reserved.
Oligotrophic lagoons of the South Pacific Ocean are home to a surprising number of novel eukaryotic microorganisms.

PubMed

Kim, Eunsoo; Sprung, Ben; Duhamel, Solange; Filardi, Christopher; Kyoon Shin, Mann

2016-12-01

The diversity of microbial eukaryotes was surveyed by environmental sequencing from tropical lagoon sites of the South Pacific, collected through the American Museum of Natural History (AMNH)'s Explore21 expedition to the Solomon Islands in September 2013. The sampled lagoons presented low nutrient concentrations typical of oligotrophic waters, but contained levels of chlorophyll a, a proxy for phytoplankton biomass, characteristic of meso- to eutrophic waters. Two 18S rDNA hypervariable sites, the V4 and V8-V9 regions, were amplified from the total of eight lagoon samples and sequenced on the MiSeq system. After assembly, clustering at 97% similarity, and removal of singletons and chimeras, a total of 2741 (V4) and 2606 (V8-V9) operational taxonomic units (OTUs) were identified. Taxonomic annotation of these reads, including phylogeny, was based on a combination of automated pipeline and manual inspection. About 18.4% (V4) and 13.8% (V8-V9) of the OTUs could not be assigned to any of the known eukaryotic groups. Of these, we focused on OTUs that were not divergent and possessed multiple sources of evidence for their existence. Phylogenetic analyses of these sequences revealed more than ten branches that might represent new deeply-branching lineages of microbial eukaryotes, currently without any cultured representatives or morphological information. © 2016 Society for Applied Microbiology and John Wiley & Sons Ltd.
Annotation and sequence diversity of transposable elements in common bean (Phaseolus vulgaris).

PubMed

Gao, Dongying; Abernathy, Brian; Rohksar, Daniel; Schmutz, Jeremy; Jackson, Scott A

2014-01-01

Common bean (Phaseolus vulgaris) is an important legume crop grown and consumed worldwide. With the availability of the common bean genome sequence, the next challenge is to annotate the genome and characterize functional DNA elements. Transposable elements (TEs) are the most abundant component of plant genomes and can dramatically affect genome evolution and genetic variation. Thus, it is pivotal to identify TEs in the common bean genome. In this study, we performed a genome-wide transposon annotation in common bean using a combination of homology and sequence structure-based methods. We developed a 2.12-Mb transposon database which includes 791 representative transposon sequences and is available upon request or from www.phytozome.org. Of note, nearly all transposons in the database are previously unrecognized TEs. More than 5,000 transposon-related expressed sequence tags (ESTs) were detected which indicates that some transposons may be transcriptionally active. Two Ty1-copia retrotransposon families were found to encode the envelope-like protein which has rarely been identified in plant genomes. Also, we identified an extra open reading frame (ORF) termed ORF2 from 15 Ty3-gypsy families that was located between the ORF encoding the retrotransposase and the 3'LTR. The ORF2 was in opposite transcriptional orientation to retrotransposase. Sequence homology searches and phylogenetic analysis suggested that the ORF2 may have an ancient origin, but its function is not clear. These transposon data provide a useful resource for understanding the genome organization and evolution and may be used to identify active TEs for developing transposon-tagging system in common bean and other related genomes.
The Transcriptome of Compatible and Incompatible Interactions of Potato (Solanum tuberosum) with Phytophthora infestans Revealed by DeepSAGE Analysis

PubMed Central

Gyetvai, Gabor; Sønderkær, Mads; Göbel, Ulrike; Basekow, Rico; Ballvora, Agim; Imhoff, Maren; Kersten, Birgit; Nielsen, Kåre-Lehman; Gebhardt, Christiane

2012-01-01

Late blight, caused by the oomycete Phytophthora infestans, is the most important disease of potato (Solanum tuberosum). Understanding the molecular basis of resistance and susceptibility to late blight is therefore highly relevant for developing resistant cultivars, either by marker-assissted selection or by transgenic approaches. Specific P. infestans races having the Avr1 effector gene trigger a hypersensitive resistance response in potato plants carrying the R1 resistance gene (incompatible interaction) and cause disease in plants lacking R1 (compatible interaction). The transcriptomes of the compatible and incompatible interaction were captured by DeepSAGE analysis of 44 biological samples comprising five genotypes, differing only by the presence or absence of the R1 transgene, three infection time points and three biological replicates. 30.859 unique 21 base pair sequence tags were obtained, one third of which did not match any known potato transcript sequence. Two third of the tags were expressed at low frequency (<10 tag counts/million). 20.470 unitags matched to approximately twelve thousand potato transcribed genes. Tag frequencies were compared between compatible and incompatible interactions over the infection time course and between compatible and incompatible genotypes. Transcriptional changes were more numerous in compatible than in incompatible interactions. In contrast to incompatible interactions, transcriptional changes in the compatible interaction were observed predominantly for multigene families encoding defense response genes and genes functional in photosynthesis and CO2 fixation. Numerous transcriptional differences were also observed between near isogenic genotypes prior to infection with P. infestans. Our DeepSAGE transcriptome analysis uncovered novel candidate genes for plant host pathogen interactions, examples of which are discussed with respect to possible function. PMID:22328937
Rapid large-scale purification of myofilament proteins using a cleavable His6-tag.

PubMed

Zhang, Mengjie; Martin, Jody L; Kumar, Mohit; Khairallah, Ramzi J; de Tombe, Pieter P

2015-11-01

With the advent of high-throughput DNA sequencing, the number of identified cardiomyopathy-causing mutations has increased tremendously. As the majority of these mutations affect myofilament proteins, there is a need to understand their functional consequence on contraction. Permeabilized myofilament preparations coupled with protein exchange protocols are a common method for examining into contractile mechanics. However, producing large quantities of myofilament proteins can be time consuming and requires different approaches for each protein of interest. In the present study, we describe a unified automated method to produce troponin C, troponin T, and troponin I as well as myosin light chain 2 fused to a His6-tag followed by a tobacco etch virus (TEV) protease site. TEV protease has the advantage of a relaxed P1' cleavage site specificity, allowing for no residues left after proteolysis and preservation of the native sequence of the protein of interest. After expression in Esherichia coli, cells were lysed by sonication in imidazole-containing buffer. The His6-tagged protein was then purified using a HisTrap nickel metal affinity column, and the His6-tag was removed by His6-TEV protease digestion for 4 h at 30°C. The protease was then removed using a HisTrap column, and complex assembly was performed via column-assisted sequential desalting. This mostly automated method allows for the purification of protein in 1 day and can be adapted to most soluble proteins. It has the advantage of greatly increasing yield while reducing the time and cost of purification. Therefore, production and purification of mutant proteins can be accelerated and functional data collected in a faster, less expensive manner. Copyright © 2015 the American Physiological Society.
Rapid large-scale purification of myofilament proteins using a cleavable His6-tag

PubMed Central

Zhang, Mengjie; Martin, Jody L.; Kumar, Mohit; de Tombe, Pieter P.

2015-01-01

With the advent of high-throughput DNA sequencing, the number of identified cardiomyopathy-causing mutations has increased tremendously. As the majority of these mutations affect myofilament proteins, there is a need to understand their functional consequence on contraction. Permeabilized myofilament preparations coupled with protein exchange protocols are a common method for examining into contractile mechanics. However, producing large quantities of myofilament proteins can be time consuming and requires different approaches for each protein of interest. In the present study, we describe a unified automated method to produce troponin C, troponin T, and troponin I as well as myosin light chain 2 fused to a His6-tag followed by a tobacco etch virus (TEV) protease site. TEV protease has the advantage of a relaxed P1′ cleavage site specificity, allowing for no residues left after proteolysis and preservation of the native sequence of the protein of interest. After expression in Esherichia coli, cells were lysed by sonication in imidazole-containing buffer. The His6-tagged protein was then purified using a HisTrap nickel metal affinity column, and the His6-tag was removed by His6-TEV protease digestion for 4 h at 30°C. The protease was then removed using a HisTrap column, and complex assembly was performed via column-assisted sequential desalting. This mostly automated method allows for the purification of protein in 1 day and can be adapted to most soluble proteins. It has the advantage of greatly increasing yield while reducing the time and cost of purification. Therefore, production and purification of mutant proteins can be accelerated and functional data collected in a faster, less expensive manner. PMID:26386113
How did nature engineer the highest surface lipid accumulation among plants? Exceptional expression of acyl-lipid-associated genes for the assembly of extracellular triacylglycerol by Bayberry ( Myrica pensylvanica ) fruits

DOE PAGES

Simpson, Jeffrey P.; Thrower, Nicholas; Ohlrogge, John B.

2016-02-09

Bayberry (Myrica pensylvanica) fruits are covered with a remarkably thick layer of crystalline wax consisting of triacylglycerol (TAG) and diacylglycerol (DAG) esterified exclusively with saturated fatty acids. As the only plant known to accumulate soluble glycerolipids as a major component of surface waxes, Bayberry represents a novel system to investigate neutral lipid biosynthesis and lipid secretion by vegetative plant cells. The assembly of Bayberry wax is distinct from conventional TAG and other surface waxes, and instead proceeds through a pathway related to cutin synthesis (Simpson and Ohlrogge, 2016). In this study, microscopic examination revealed that the fruit tissue that producesmore » and secretes wax (Bayberry knobs) is fully developed before wax accumulates and that wax is secreted to the surface without cell disruption. Comparison of transcript expression to genetically related tissues (Bayberry leaves, M. rubra fruits), cutin-rich tomato and cherry fruit epidermis, and to oil-rich mesocarp and seeds, revealed exceptionally high expression of 13 transcripts for acyl-lipid metabolism together with down-regulation of fatty acid oxidases and desaturases. The predicted protein sequences of the most highly expressed lipid-related enzyme-encoding transcripts in Bayberry knobs are 100% identical to the sequences from Bayberry leaves,which do not produce surface DAG or TAG. Together, these results indicate that TAG biosynthesis and secretion in Bayberry is achieved by both up and down-regulation of a small subset of genes related to the biosynthesis of cutin and saturated fatty acids, and also implies that modifications in gene expression, rather than evolution of new gene functions, was the major mechanism by which Bayberry evolved its specialized lipid metabolism.« less
Quantitative Experimental Determination of Primer-Dimer Formation Risk by Free-Solution Conjugate Electrophoresis

PubMed Central

Desmarais, Samantha M.; Leitner, Thomas; Barron, Annelise E.

2012-01-01

DNA barcodes are short, unique ssDNA primers that “mark” individual biomolecules. To gain better understanding of biophysical parameters constraining primer-dimer formation between primers that incorporate barcode sequences, we have developed a capillary electrophoresis method that utilizes drag-tag-DNA conjugates to quantify dimerization risk between primer-barcode pairs. Results obtained with this unique free-solution conjugate electrophoresis (FSCE) approach are useful as quantitatively precise input data to parameterize computation models of dimerization risk. A set of fluorescently labeled, model primer-barcode conjugates were designed with complementary regions of differing lengths to quantify heterodimerization as a function of temperature. Primer-dimer cases comprised two 30-mer primers, one of which was covalently conjugated to a lab-made, chemically synthesized poly-N-methoxyethylglycine drag-tag, which reduced electrophoretic mobility of ssDNA to distinguish it from ds primer-dimers. The drag-tags also provided a shift in mobility for the dsDNA species, which allowed us to quantitate primer-dimer formation. In the experimental studies, pairs of oligonucleotide primer-barcodes with fully or partially complementary sequences were annealed, and then separated by free-solution conjugate CE at different temperatures, to assess effects on primer-dimer formation. When less than 30 out of 30 basepairs were bonded, dimerization was inversely correlated to temperature. Dimerization occurred when more than 15 consecutive basepairs formed, yet non-consecutive basepairs did not create stable dimers even when 20 out of 30 possible basepairs bonded. The use of free-solution electrophoresis in combination with a peptoid drag-tag and different fluorophores enabled precise separation of short DNA fragments to establish a new mobility shift assay for detection of primer-dimer formation. PMID:22331820
Sequence, overproduction and purification of Vibrio proteolyticus ribosomal protein L18 for in vitro and in vivo studies

NASA Technical Reports Server (NTRS)

Setterquist, R. A.; Smith, G. K.; Oakley, T. H.; Lee, Y. H.; Fox, G. E.

1996-01-01

A strategy suggested by comparative genomic studies was used to amplify the entire Vibrio proteolyticus (Vp) gene for ribosomal protein L18. Vp L18 and its flanking regions were sequenced and compared with the deduced amino acid (aa) sequences of other known L18 proteins. A 26-aa residue segment at the carboxy terminus contains many strongly conserved residues and may be critical for the L18 interaction with 5S rRNA. This approach should allow rapid characterization of L18 from large numbers of bacteria. Both Vp L18 and Escherichia coli (Ec) L18 were overproduced and purified using a T7 expression vector which fuses an N-terminal peptide segment (His-tag) containing 6 histidine residues to the recombinant protein. The purified fusion proteins, Vp His::L18 and Ec His::L18, were both found to bind to either the Vp 5S or Ec 5S rRNAs in vitro. Vp His::L18 protein was also shown to incorporate into Ec ribosomes in vivo. This His-tag strategy likely will have general applicability for the study of ribosomal proteins in vitro and in vivo.
Sequence Determinants of Compaction in Intrinsically Disordered Proteins

PubMed Central

Marsh, Joseph A.; Forman-Kay, Julie D.

2010-01-01

Abstract Intrinsically disordered proteins (IDPs), which lack folded structure and are disordered under nondenaturing conditions, have been shown to perform important functions in a large number of cellular processes. These proteins have interesting structural properties that deviate from the random-coil-like behavior exhibited by chemically denatured proteins. In particular, IDPs are often observed to exhibit significant compaction. In this study, we have analyzed the hydrodynamic radii of a number of IDPs to investigate the sequence determinants of this compaction. Net charge and proline content are observed to be strongly correlated with increased hydrodynamic radii, suggesting that these are the dominant contributors to compaction. Hydrophobicity and secondary structure, on the other hand, appear to have negligible effects on compaction, which implies that the determinants of structure in folded and intrinsically disordered proteins are profoundly different. Finally, we observe that polyhistidine tags seem to increase IDP compaction, which suggests that these tags have significant perturbing effects and thus should be removed before any structural characterizations of IDPs. Using the relationships observed in this analysis, we have developed a sequence-based predictor of hydrodynamic radius for IDPs that shows substantial improvement over a simple model based upon chain length alone. PMID:20483348
DOE Office of Scientific and Technical Information (OSTI.GOV)

Vyatkina, Kira; Wu, Si; Dekker, Lennard J. M.

De novo sequencing of proteins and peptides is one of the most important problems in mass spectrometry-driven proteomics. A variety of methods have been developed to accomplish this task from a set of bottom-up tandem (MS/MS) mass spectra. However, a more recently emerged top-down technology, now gaining more and more popularity, opens new perspectives for protein analysis and characterization, implying a need in efficient algorithms for processing this kind of MS/MS data. Here we describe a method that allows to retrieve from a set of top-down MS/MS spectra long and accurate sequence fragments of the proteins contained in a sample.more » To this end, we outline a strategy for generating high-quality sequence tags from top-down spectra, and introduce the concept of a T-Bruijn graph by adapting to the case of tags the notion of an A-Bruijn graph widely used in genomics. The output of the proposed approach represents the set of amino acid strings spelled out by optimal paths in the connected components of a T-Bruijn graph. We illustrate its performance on top-down datasets acquired from carbonic anhydrase 2 (CAH2) and the Fab region of alemtuzumab.« less
Versatile Gene-Specific Sequence Tags for Arabidopsis Functional Genomics: Transcript Profiling and Reverse Genetics Applications

PubMed Central

Hilson, Pierre; Allemeersch, Joke; Altmann, Thomas; Aubourg, Sébastien; Avon, Alexandra; Beynon, Jim; Bhalerao, Rishikesh P.; Bitton, Frédérique; Caboche, Michel; Cannoot, Bernard; Chardakov, Vasil; Cognet-Holliger, Cécile; Colot, Vincent; Crowe, Mark; Darimont, Caroline; Durinck, Steffen; Eickhoff, Holger; de Longevialle, Andéol Falcon; Farmer, Edward E.; Grant, Murray; Kuiper, Martin T.R.; Lehrach, Hans; Léon, Céline; Leyva, Antonio; Lundeberg, Joakim; Lurin, Claire; Moreau, Yves; Nietfeld, Wilfried; Paz-Ares, Javier; Reymond, Philippe; Rouzé, Pierre; Sandberg, Goran; Segura, Maria Dolores; Serizet, Carine; Tabrett, Alexandra; Taconnat, Ludivine; Thareau, Vincent; Van Hummelen, Paul; Vercruysse, Steven; Vuylsteke, Marnik; Weingartner, Magdalena; Weisbeek, Peter J.; Wirta, Valtteri; Wittink, Floyd R.A.; Zabeau, Marc; Small, Ian

2004-01-01

Microarray transcript profiling and RNA interference are two new technologies crucial for large-scale gene function studies in multicellular eukaryotes. Both rely on sequence-specific hybridization between complementary nucleic acid strands, inciting us to create a collection of gene-specific sequence tags (GSTs) representing at least 21,500 Arabidopsis genes and which are compatible with both approaches. The GSTs were carefully selected to ensure that each of them shared no significant similarity with any other region in the Arabidopsis genome. They were synthesized by PCR amplification from genomic DNA. Spotted microarrays fabricated from the GSTs show good dynamic range, specificity, and sensitivity in transcript profiling experiments. The GSTs have also been transferred to bacterial plasmid vectors via recombinational cloning protocols. These cloned GSTs constitute the ideal starting point for a variety of functional approaches, including reverse genetics. We have subcloned GSTs on a large scale into vectors designed for gene silencing in plant cells. We show that in planta expression of GST hairpin RNA results in the expected phenotypes in silenced Arabidopsis lines. These versatile GST resources provide novel and powerful tools for functional genomics. PMID:15489341

Some links on this page may take you to non-federal websites. Their policies may differ from this site.