Oh, Hye-Seon; Kwon, Hyemi; Park, Suyeon; Kim, Mijin; Jeon, Min Ji; Kim, Tae Yong; Shong, Young Kee; Kim, Won Bae; Choi, Jene
2018-01-01
Background The BRAFV600E mutation is the most common genetic alteration identified in papillary thyroid carcinoma (PTC). Because of its costs effectiveness and sensitivity, direct Sanger sequencing has several limitations. The aim of this study was to evaluate the efficiency of immunohistochemistry (IHC) as an alternative method to detect the BRAFV600E mutation in preoperative and postoperative tissue samples. Methods We evaluated 71 patients who underwent thyroid surgery with the result of direct sequencing of the BRAFV600E mutation. IHC staining of the BRAFV600E mutation was performed in 49 preoperative and 23 postoperative thyroid specimens. Results Sixty-two patients (87.3%) had PTC, and of these, BRAFV600E was confirmed by direct sequencing in 57 patients (91.9%). In 23 postoperative tissue samples, the BRAFV600E mutation was detected in 16 samples (70%) by direct sequencing and 18 samples (78%) by IHC. In 24 fine needle aspiration (FNA) samples, BRAFV600E was detected in 18 samples (75%) by direct sequencing and 16 samples (67%) by IHC. In 25 core needle biopsy (CNB) samples, the BRAFV600E mutation was detected in 15 samples (60%) by direct sequencing and 16 samples (64%) by IHC. The sensitivity and specificity of IHC for detecting the BRAFV600E mutation were 77.8% and 66.7% in FNA samples and 99.3% and 80.0% in CNB samples. Conclusion IHC could be an alternative method to direct Sanger sequencing for BRAFV600E mutation detection both in postoperative and preoperative samples. However, application of IHC to detect the BRAFV600E mutation in FNA samples is of limited value compared with direct sequencing. PMID:29388401
Hamond, C; Pestana, C P; Medeiros, M A; Lilenbaum, W
2016-01-01
The aim of this study was to identify Leptospira in urine samples of cattle by direct sequencing of the secY gene. The validity of this approach was assessed using ten Leptospira strains obtained from cattle in Brazil and 77 DNA samples previously extracted from cattle urine, that were positive by PCR for the genus-specific lipL32 gene of Leptospira. Direct sequencing identified 24 (31·1%) interpretable secY sequences and these were identical to those obtained from direct DNA sequencing of the urine samples from which they were recovered. Phylogenetic analyses identified four species: L. interrogans, L. borgpetersenii, L. noguchii, and L. santarosai with the most prevalent genotypes being associated with L. borgpetersenii. While direct sequencing cannot, as yet, replace culturing of leptospires, it is a valid additional tool for epidemiological studies. An unexpected finding from this study was the genetic diversity of Leptospira infecting Brazilian cattle.
V, Pavana Jyothi; S, Akila; Selvan, Malini K; Naidu, Hariprasad; Raghunathan, Shwethaa; Kota, Sathish; Sundaram, R C Raja; Rana, Samir Kumar; Raj, G Dhinakar; Srinivasan, V A; Mohana Subramanian, B
2016-12-01
Canine parvovirus (CPV) is a non-enveloped single stranded DNA virus with an icosahedral capsid. Mini-sequencing based CPV typing was developed earlier to detect and differentiate all the CPV types and FPV in a single reaction. This technique was further evaluated in the present study by performing the mini-sequencing directly from fecal samples which avoided tedious virus isolation steps by cell culture system. Fecal swab samples were collected from 84 dogs with enteritis symptoms, suggestive of parvoviral infection from different locations across India. Seventy six of these samples were positive by PCR; the subsequent mini-sequencing reaction typed 74 of them as type 2a virus, and 2 samples as type 2b. Additionally, 25 of the positive samples were typed by cycle sequencing of PCR products. Direct CPV typing from fecal samples using mini-sequencing showed 100% correlation with CPV typing by cycle sequencing. Moreover, CPV typing was achieved by mini-sequencing even with faintly positive PCR amplicons which was not possible by cycle sequencing. Therefore, the mini-sequencing technique is recommended for regular epidemiological follow up of CPV types, since the technique is rapid, highly sensitive and high capacity method for CPV typing. Copyright © 2016. Published by Elsevier B.V.
Piri, Fahimeh; Zarei Mahmoudabadi, Ali; Ronagh, Ali; Ahmadi, Bahram; Makimura, Koichi; Rezaei-Matehkolaei, Ali
2018-06-26
Conventional direct microscopy with potassium hydroxide (KOH) and culture were found to lack the ability to establish a fast and specific diagnosis of dermatophytosis. A pan-dermatophyte nested-PCR assay was developed using a novel primer pair targeting the translation elongation factor 1-α (Tef-1α) sequences for direct detection and identification of most veterinary relevant dermatophytes in animal samples suspected to dermatophytosis. A total of 140 animal skin and hair samples were subjected to direct microscopy, culture, and ITS-RFLP/ITS-sequencing of culture isolates for the detection and identification of dermatophytosis agents. Nested-PCR sequencing was performed on all the extracted DNAs using a commercial kit after dissolving the specimens by mechanical beating. Nested-PCR was positive in 90% of samples, followed by direct microscopy (85.7%) and culture (75%). The degree of agreement between nested-PCR and direct microscopy (94.4%) was higher than with culture (83.3%). In 105 culture positive cases, the measures of agreement for the identification of dermatophytosis agents were as follows: 100% between nested-PCR sequencing and ITS-RFLP/ITS-sequencing and 63.8% between nested-PCR sequencing and culture. The developed nested-PCR was faster as well as more sensitive and specific than conventional methods for detection and identification of dermatophytes in clinical samples, which was particularly suitable for epidemiological studies. This article is protected by copyright. All rights reserved. This article is protected by copyright. All rights reserved.
Jakupciak, John P; Wells, Jeffrey M; Karalus, Richard J; Pawlowski, David R; Lin, Jeffrey S; Feldman, Andrew B
2013-01-01
Large-scale genomics projects are identifying biomarkers to detect human disease. B. pseudomallei and B. mallei are two closely related select agents that cause melioidosis and glanders. Accurate characterization of metagenomic samples is dependent on accurate measurements of genetic variation between isolates with resolution down to strain level. Often single biomarker sensitivity is augmented by use of multiple or panels of biomarkers. In parallel with single biomarker validation, advances in DNA sequencing enable analysis of entire genomes in a single run: population-sequencing. Potentially, direct sequencing could be used to analyze an entire genome to serve as the biomarker for genome identification. However, genome variation and population diversity complicate use of direct sequencing, as well as differences caused by sample preparation protocols including sequencing artifacts and mistakes. As part of a Department of Homeland Security program in bacterial forensics, we examined how to implement whole genome sequencing (WGS) analysis as a judicially defensible forensic method for attributing microbial sample relatedness; and also to determine the strengths and limitations of whole genome sequence analysis in a forensics context. Herein, we demonstrate use of sequencing to provide genetic characterization of populations: direct sequencing of populations.
Jakupciak, John P.; Wells, Jeffrey M.; Karalus, Richard J.; Pawlowski, David R.; Lin, Jeffrey S.; Feldman, Andrew B.
2013-01-01
Large-scale genomics projects are identifying biomarkers to detect human disease. B. pseudomallei and B. mallei are two closely related select agents that cause melioidosis and glanders. Accurate characterization of metagenomic samples is dependent on accurate measurements of genetic variation between isolates with resolution down to strain level. Often single biomarker sensitivity is augmented by use of multiple or panels of biomarkers. In parallel with single biomarker validation, advances in DNA sequencing enable analysis of entire genomes in a single run: population-sequencing. Potentially, direct sequencing could be used to analyze an entire genome to serve as the biomarker for genome identification. However, genome variation and population diversity complicate use of direct sequencing, as well as differences caused by sample preparation protocols including sequencing artifacts and mistakes. As part of a Department of Homeland Security program in bacterial forensics, we examined how to implement whole genome sequencing (WGS) analysis as a judicially defensible forensic method for attributing microbial sample relatedness; and also to determine the strengths and limitations of whole genome sequence analysis in a forensics context. Herein, we demonstrate use of sequencing to provide genetic characterization of populations: direct sequencing of populations. PMID:24455204
Lavania, Surabhi; Anthwal, Divya; Bhalla, Manpreet; Singh, Nagendra; Haldar, Sagarika; Tyagi, Jaya Sivaswami
2017-01-01
Direct smear microscopy of sputum forms the mainstay of TB diagnosis in resource-limited settings. Stained sputum smear slides can serve as a ready-made resource to transport sputum for molecular drug susceptibility testing. However, bio-safety is a major concern during transport of sputum/stained slides and for laboratory workers engaged in processing Mycobacterium tuberculosis infected sputum specimens. In this study, a bio-safe USP (Universal Sample Processing) concentration-based sputum processing method (Bio-safe method) was assessed on 87 M. tuberculosis culture positive sputum samples. Samples were processed for Ziehl-Neelsen (ZN) smear, liquid culture and DNA isolation. DNA isolated directly from sputum was subjected to an IS6110 PCR assay. Both sputum DNA and DNA extracted from bio-safe ZN concentrated smear slides were subjected to rpoB PCR and simultaneously assessed by DNA sequencing for determining rifampin (RIF) resistance. All sputum samples were rendered sterile by Bio-safe method. Bio-safe smears exhibited a 5% increment in positivity over direct smear with a 14% increment in smear grade status. All samples were positive for IS6110 and rpoB PCR. Thirty four percent samples were RIF resistant by rpoB PCR product sequencing. A 100% concordance (κ value = 1) was obtained between sequencing results derived from bio-safe smear slides and bio-safe sputum. This study demonstrates that Bio-safe method can address safety issues associated with sputum processing, provide an efficient alternative to sample transport in the form of bio-safe stained concentrated smear slides and can also provide information on drug (RIF) resistance by direct DNA sequencing.
Direct detection of Mycobacterium tuberculosis rifampin resistance in bio-safe stained sputum smears
Lavania, Surabhi; Anthwal, Divya; Bhalla, Manpreet; Singh, Nagendra; Haldar, Sagarika; Tyagi, Jaya Sivaswami
2017-01-01
Direct smear microscopy of sputum forms the mainstay of TB diagnosis in resource-limited settings. Stained sputum smear slides can serve as a ready-made resource to transport sputum for molecular drug susceptibility testing. However, bio-safety is a major concern during transport of sputum/stained slides and for laboratory workers engaged in processing Mycobacterium tuberculosis infected sputum specimens. In this study, a bio-safe USP (Universal Sample Processing) concentration-based sputum processing method (Bio-safe method) was assessed on 87 M. tuberculosis culture positive sputum samples. Samples were processed for Ziehl-Neelsen (ZN) smear, liquid culture and DNA isolation. DNA isolated directly from sputum was subjected to an IS6110 PCR assay. Both sputum DNA and DNA extracted from bio-safe ZN concentrated smear slides were subjected to rpoB PCR and simultaneously assessed by DNA sequencing for determining rifampin (RIF) resistance. All sputum samples were rendered sterile by Bio-safe method. Bio-safe smears exhibited a 5% increment in positivity over direct smear with a 14% increment in smear grade status. All samples were positive for IS6110 and rpoB PCR. Thirty four percent samples were RIF resistant by rpoB PCR product sequencing. A 100% concordance (κ value = 1) was obtained between sequencing results derived from bio-safe smear slides and bio-safe sputum. This study demonstrates that Bio-safe method can address safety issues associated with sputum processing, provide an efficient alternative to sample transport in the form of bio-safe stained concentrated smear slides and can also provide information on drug (RIF) resistance by direct DNA sequencing. PMID:29216262
Quick, Josh; Grubaugh, Nathan D; Pullan, Steven T; Claro, Ingra M; Smith, Andrew D; Gangavarapu, Karthik; Oliveira, Glenn; Robles-Sikisaka, Refugio; Rogers, Thomas F; Beutler, Nathan A; Burton, Dennis R; Lewis-Ximenez, Lia Laura; de Jesus, Jaqueline Goes; Giovanetti, Marta; Hill, Sarah; Black, Allison; Bedford, Trevor; Carroll, Miles W; Nunes, Marcio; Alcantara, Luiz Carlos; Sabino, Ester C; Baylis, Sally A; Faria, Nuno; Loose, Matthew; Simpson, Jared T; Pybus, Oliver G; Andersen, Kristian G; Loman, Nicholas J
2018-01-01
Genome sequencing has become a powerful tool for studying emerging infectious diseases; however, genome sequencing directly from clinical samples without isolation remains challenging for viruses such as Zika, where metagenomic sequencing methods may generate insufficient numbers of viral reads. Here we present a protocol for generating coding-sequence complete genomes comprising an online primer design tool, a novel multiplex PCR enrichment protocol, optimised library preparation methods for the portable MinION sequencer (Oxford Nanopore Technologies) and the Illumina range of instruments, and a bioinformatics pipeline for generating consensus sequences. The MinION protocol does not require an internet connection for analysis, making it suitable for field applications with limited connectivity. Our method relies on multiplex PCR for targeted enrichment of viral genomes from samples containing as few as 50 genome copies per reaction. Viral consensus sequences can be achieved starting with clinical samples in 1-2 days following a simple laboratory workflow. This method has been successfully used by several groups studying Zika virus evolution and is facilitating an understanding of the spread of the virus in the Americas. PMID:28538739
Assessing the Relationship of Ancient and Modern Populations
Schraiber, Joshua G.
2018-01-01
Genetic material sequenced from ancient samples is revolutionizing our understanding of the recent evolutionary past. However, ancient DNA is often degraded, resulting in low coverage, error-prone sequencing. Several solutions exist to this problem, ranging from simple approach, such as selecting a read at random for each site, to more complicated approaches involving genotype likelihoods. In this work, we present a novel method for assessing the relationship of an ancient sample with a modern population, while accounting for sequencing error and postmortem damage by analyzing raw reads from multiple ancient individuals simultaneously. We show that, when analyzing SNP data, it is better to sequence more ancient samples to low coverage: two samples sequenced to 0.5× coverage provide better resolution than a single sample sequenced to 2× coverage. We also examined the power to detect whether an ancient sample is directly ancestral to a modern population, finding that, with even a few high coverage individuals, even ancient samples that are very slightly diverged from the modern population can be detected with ease. When we applied our approach to European samples, we found that no ancient samples represent direct ancestors of modern Europeans. We also found that, as shown previously, the most ancient Europeans appear to have had the smallest effective population sizes, indicating a role for agriculture in modern population growth. PMID:29167200
Videvall, Elin; Strandh, Maria; Engelbrecht, Anel; Cloete, Schalk; Cornwallis, Charlie K
2017-01-01
The gut microbiome of animals is emerging as an important factor influencing ecological and evolutionary processes. A major bottleneck in obtaining microbiome data from large numbers of samples is the time-consuming laboratory procedures required, specifically the isolation of DNA and generation of amplicon libraries. Recently, direct PCR kits have been developed that circumvent conventional DNA extraction steps, thereby streamlining the laboratory process by reducing preparation time and costs. However, the reliability and efficacy of direct PCR for measuring host microbiomes have not yet been investigated other than in humans with 454 sequencing. Here, we conduct a comprehensive evaluation of the microbial communities obtained with direct PCR and the widely used Mo Bio PowerSoil DNA extraction kit in five distinct gut sample types (ileum, cecum, colon, feces, and cloaca) from 20 juvenile ostriches, using 16S rRNA Illumina MiSeq sequencing. We found that direct PCR was highly comparable over a range of measures to the DNA extraction method in cecal, colon, and fecal samples. However, the two methods significantly differed in samples with comparably low bacterial biomass: cloacal and especially ileal samples. We also sequenced 100 replicate sample pairs to evaluate repeatability during both extraction and PCR stages and found that both methods were highly consistent for cecal, colon, and fecal samples ( r s > 0.7) but had low repeatability for cloacal ( r s = 0.39) and ileal ( r s = -0.24) samples. This study indicates that direct PCR provides a fast, cheap, and reliable alternative to conventional DNA extraction methods for retrieving 16S rRNA data, which can aid future gut microbiome studies. IMPORTANCE The microbial communities of animals can have large impacts on their hosts, and the number of studies using high-throughput sequencing to measure gut microbiomes is rapidly increasing. However, the library preparation procedure in microbiome research is both costly and time-consuming, especially for large numbers of samples. We investigated a cheaper and faster direct PCR method designed to bypass the DNA isolation steps during 16S rRNA library preparation and compared it with a standard DNA extraction method. We used both techniques on five different gut sample types collected from 20 juvenile ostriches and sequenced samples with Illumina MiSeq. The methods were highly comparable and highly repeatable in three sample types with high microbial biomass (cecum, colon, and feces), but larger differences and low repeatability were found in the microbiomes obtained from the ileum and cloaca. These results will help microbiome researchers assess library preparation procedures and plan their studies accordingly.
To Clone or Not To Clone: Method Analysis for Retrieving Consensus Sequences In Ancient DNA Samples
Winters, Misa; Barta, Jodi Lynn; Monroe, Cara; Kemp, Brian M.
2011-01-01
The challenges associated with the retrieval and authentication of ancient DNA (aDNA) evidence are principally due to post-mortem damage which makes ancient samples particularly prone to contamination from “modern” DNA sources. The necessity for authentication of results has led many aDNA researchers to adopt methods considered to be “gold standards” in the field, including cloning aDNA amplicons as opposed to directly sequencing them. However, no standardized protocol has emerged regarding the necessary number of clones to sequence, how a consensus sequence is most appropriately derived, or how results should be reported in the literature. In addition, there has been no systematic demonstration of the degree to which direct sequences are affected by damage or whether direct sequencing would provide disparate results from a consensus of clones. To address this issue, a comparative study was designed to examine both cloned and direct sequences amplified from ∼3,500 year-old ancient northern fur seal DNA extracts. Majority rules and the Consensus Confidence Program were used to generate consensus sequences for each individual from the cloned sequences, which exhibited damage at 31 of 139 base pairs across all clones. In no instance did the consensus of clones differ from the direct sequence. This study demonstrates that, when appropriate, cloning need not be the default method, but instead, should be used as a measure of authentication on a case-by-case basis, especially when this practice adds time and cost to studies where it may be superfluous. PMID:21738625
[Identification of hepatitis B virus YMDD point mutation using peptide nucleic acid clamping PCR].
Zhang, Yingying; He, Haitang; Yang, Jie; Hou, Jinlin
2013-06-01
To establish a peptide nucleic acid clamping PCR assay for detecting hepatitis B virus (HBV) drug resistance mutation. RtM204I (ATT) mutant, rtM204V (GTG) mutant and rtM204 (ATG) wild-type plasmids mixed at different ratios were detected for mutations by PNA clamping PCR assay and direct sequencing, and the sensitivity and specificity of the two methods were compared. Serum samples from 85 patients with chronic HBV infection were detected for drug resistance using the two methods. The sensitivity of PNA-PCR assay was 0.001% in a 10(5)-fold excess of wild-type HBV DNA with a detection limit of 10(1) copies. The sensitivity of direct sequencing was 10% with a detection limit of 10(4) copies. Mutants were detected in 73 of the 85 serum samples (85.9%), including YIDD in 40 samples, YVDD in 23 samples, and YIDD+YVDD in 10 samples. The agreement of PNA-PCR assay with direct sequencing was only 40% (34/85, YIDD in 21 samples, YVDD in 11 samples, and YIDD+YVDD in 2 samples). Neither of the two methods yielded positive results for the negative control samples, suggesting their good specificity. PNA-PCR assay appears to be a more sensitive and rapid assay for detection of HBV genotypic resistance.
Quick, Joshua; Grubaugh, Nathan D; Pullan, Steven T; Claro, Ingra M; Smith, Andrew D; Gangavarapu, Karthik; Oliveira, Glenn; Robles-Sikisaka, Refugio; Rogers, Thomas F; Beutler, Nathan A; Burton, Dennis R; Lewis-Ximenez, Lia Laura; de Jesus, Jaqueline Goes; Giovanetti, Marta; Hill, Sarah C; Black, Allison; Bedford, Trevor; Carroll, Miles W; Nunes, Marcio; Alcantara, Luiz Carlos; Sabino, Ester C; Baylis, Sally A; Faria, Nuno R; Loose, Matthew; Simpson, Jared T; Pybus, Oliver G; Andersen, Kristian G; Loman, Nicholas J
2017-06-01
Genome sequencing has become a powerful tool for studying emerging infectious diseases; however, genome sequencing directly from clinical samples (i.e., without isolation and culture) remains challenging for viruses such as Zika, for which metagenomic sequencing methods may generate insufficient numbers of viral reads. Here we present a protocol for generating coding-sequence-complete genomes, comprising an online primer design tool, a novel multiplex PCR enrichment protocol, optimized library preparation methods for the portable MinION sequencer (Oxford Nanopore Technologies) and the Illumina range of instruments, and a bioinformatics pipeline for generating consensus sequences. The MinION protocol does not require an Internet connection for analysis, making it suitable for field applications with limited connectivity. Our method relies on multiplex PCR for targeted enrichment of viral genomes from samples containing as few as 50 genome copies per reaction. Viral consensus sequences can be achieved in 1-2 d by starting with clinical samples and following a simple laboratory workflow. This method has been successfully used by several groups studying Zika virus evolution and is facilitating an understanding of the spread of the virus in the Americas. The protocol can be used to sequence other viral genomes using the online Primal Scheme primer designer software. It is suitable for sequencing either RNA or DNA viruses in the field during outbreaks or as an inexpensive, convenient method for use in the lab.
Sanderson, Nicholas D.; Atkins, Bridget L.; Brent, Andrew J.; Cole, Kevin; Foster, Dona; McNally, Martin A.; Oakley, Sarah; Peto, Leon; Taylor, Adrian; Peto, Tim E. A.; Crook, Derrick W.; Eyre, David W.
2017-01-01
ABSTRACT Culture of multiple periprosthetic tissue samples is the current gold standard for microbiological diagnosis of prosthetic joint infections (PJI). Additional diagnostic information may be obtained through culture of sonication fluid from explants. However, current techniques can have relatively low sensitivity, with prior antimicrobial therapy and infection by fastidious organisms influencing results. We assessed if metagenomic sequencing of total DNA extracts obtained direct from sonication fluid can provide an alternative rapid and sensitive tool for diagnosis of PJI. We compared metagenomic sequencing with standard aerobic and anaerobic culture in 97 sonication fluid samples from prosthetic joint and other orthopedic device infections. Reads from Illumina MiSeq sequencing were taxonomically classified using Kraken. Using 50 derivation samples, we determined optimal thresholds for the number and proportion of bacterial reads required to identify an infection and confirmed our findings in 47 independent validation samples. Compared to results from sonication fluid culture, the species-level sensitivity of metagenomic sequencing was 61/69 (88%; 95% confidence interval [CI], 77 to 94%; for derivation samples 35/38 [92%; 95% CI, 79 to 98%]; for validation samples, 26/31 [84%; 95% CI, 66 to 95%]), and genus-level sensitivity was 64/69 (93%; 95% CI, 84 to 98%). Species-level specificity, adjusting for plausible fastidious causes of infection, species found in concurrently obtained tissue samples, and prior antibiotics, was 85/97 (88%; 95% CI, 79 to 93%; for derivation samples, 43/50 [86%; 95% CI, 73 to 94%]; for validation samples, 42/47 [89%; 95% CI, 77 to 96%]). High levels of human DNA contamination were seen despite the use of laboratory methods to remove it. Rigorous laboratory good practice was required to minimize bacterial DNA contamination. We demonstrate that metagenomic sequencing can provide accurate diagnostic information in PJI. Our findings, combined with the increasing availability of portable, random-access sequencing technology, offer the potential to translate metagenomic sequencing into a rapid diagnostic tool in PJI. PMID:28490492
Impact of cultivation on characterisation of species composition of soil bacterial communities.
McCaig, A E.; Grayston, S J.; Prosser, J I.; Glover, L A.
2001-03-01
The species composition of culturable bacteria in Scottish grassland soils was investigated using a combination of Biolog and 16S rDNA analysis for characterisation of isolates. The inclusion of a molecular approach allowed direct comparison of sequences from culturable bacteria with sequences obtained during analysis of DNA extracted directly from the same soil samples. Bacterial strains were isolated on Pseudomonas isolation agar (PIA), a selective medium, and on tryptone soya agar (TSA), a general laboratory medium. In total, 12 and 21 morphologically different bacterial cultures were isolated on PIA and TSA, respectively. Biolog and sequencing placed PIA isolates in the same taxonomic groups, the majority of cultures belonging to the Pseudomonas (sensu stricto) group. However, analysis of 16S rDNA sequences proved more efficient than Biolog for characterising TSA isolates due to limitations of the Microlog database for identifying environmental bacteria. In general, 16S rDNA sequences from TSA isolates showed high similarities to cultured species represented in sequence databases, although TSA-8 showed only 92.5% similarity to the nearest relative, Bacillus insolitus. In general, there was very little overlap between the culturable and uncultured bacterial communities, although two sequences, PIA-2 and TSA-13, showed >99% similarity to soil clones. A cloning step was included prior to sequence analysis of two isolates, TSA-5 and TSA-14, and analysis of several clones confirmed that these cultures comprised at least four and three sequence types, respectively. All isolate clones were most closely related to uncultured bacteria, with clone TSA-5.1 showing 99.8% similarity to a sequence amplified directly from the same soil sample. Interestingly, one clone, TSA-5.4, clustered within a novel group comprising only uncultured sequences. This group, which is associated with the novel, deep-branching Acidobacterium capsulatum lineage, also included clones isolated during direct analysis of the same soil and from a wide range of other sample types studied elsewhere. The study demonstrates the value of fine-scale molecular analysis for identification of laboratory isolates and indicates the culturability of approximately 1% of the total population but under a restricted range of media and cultivation conditions.
Anthwal, Divya; Gupta, Rakesh Kumar; Bhalla, Manpreet; Bhatnagar, Shinjini
2017-01-01
ABSTRACT Drug-resistant tuberculosis (TB) is a major threat to TB control worldwide. Globally, only 40% of the 340,000 notified TB patients estimated to have multidrug-resistant-TB (MDR-TB) were detected in 2015. This study was carried out to evaluate the utility of high-resolution melt curve analysis (HRM) for the rapid and direct detection of MDR-TB in Mycobacterium tuberculosis in sputum samples. A reference plasmid library was first generated of the most frequently observed mutations in the resistance-determining regions of rpoB, katG, and an inhA promoter and used as positive controls in HRM. The assay was first validated in 25 MDR M. tuberculosis clinical isolates. The assay was evaluated on DNA isolated from 99 M. tuberculosis culture-positive sputum samples that included 84 smear-negative sputum samples, using DNA sequencing as gold standard. Mutants were discriminated from the wild type by comparing melting-curve patterns with those of control plasmids using HRM software. Rifampin (RIF) and isoniazid (INH) monoresistance were detected in 11 and 21 specimens, respectively, by HRM. Six samples were classified as MDR-TB by sequencing, one of which was missed by HRM. The HRM-RIF, INH-katG, and INH-inhA assays had 89% (95% confidence interval [CI], 52, 100%), 85% (95% CI, 62, 97%), and 100% (95% CI, 74, 100%) sensitivity, respectively, in smear-negative samples, while all assays had 100% sensitivity in smear-positive samples. All assays had 100% specificity. Concordance of 97% to 100% (κ value, 0.9 to 1) was noted between sequencing and HRM. Heteroresistance was observed in 5 of 99 samples by sequencing. In conclusion, the HRM assay was a cost-effective (Indian rupee [INR]400/US$6), rapid, and closed-tube method for the direct detection of MDR-TB in sputum, especially for direct smear-negative cases. PMID:28330890
Saingam, Prakit; Li, Bo; Yan, Tao
2018-06-01
DNA-based molecular detection of microbial pathogens in complex environments is still plagued by sensitivity, specificity and robustness issues. We propose to address these issues by viewing them as inadvertent consequences of requiring specific and adequate amplification (SAA) of target DNA molecules by current PCR methods. Using the invA gene of Salmonella as the model system, we investigated if next generation sequencing (NGS) can be used to directly detect target sequences in false-negative PCR reaction (PCR-NGS) in order to remove the SAA requirement from PCR. False-negative PCR and qPCR reactions were first created using serial dilutions of laboratory-prepared Salmonella genomic DNA and then analyzed directly by NGS. Target invA sequences were detected in all false-negative PCR and qPCR reactions, which lowered the method detection limits near the theoretical minimum of single gene copy detection. The capability of the PCR-NGS approach in correcting false negativity was further tested and confirmed under more environmentally relevant conditions using Salmonella-spiked stream water and sediment samples. Finally, the PCR-NGS approach was applied to ten urban stream water samples and detected invA sequences in eight samples that would be otherwise deemed Salmonella negative. Analysis of the non-target sequences in the false-negative reactions helped to identify primer dime-like short sequences as the main cause of the false negativity. Together, the results demonstrated that the PCR-NGS approach can significantly improve method sensitivity, correct false-negative detections, and enable sequence-based analysis for failure diagnostics in complex environmental samples. Copyright © 2018 Elsevier B.V. All rights reserved.
Fariña Sarasqueta, Arantza; Moerland, Elna; de Bruyne, Hanneke; de Graaf, Henk; Vrancken, Tamara; van Lijnschoten, Gesina; van den Brule, Adriaan J.C.
2011-01-01
Although direct sequencing is the gold standard for KRAS mutation detection in routine diagnostics, it remains laborious, time consuming, and not very sensitive. Our objective was to evaluate SNaPshot and the KRAS StripAssay as alternatives to sequencing for KRAS mutation detection in daily practice. KRAS exon 2–specific PCR followed by sequencing or by a SNaPshot reaction was performed. For the StripAssay, a mutant-enriched PCR was followed by hybridization to KRAS-specific probes bound to a nitrocellulose strip. To test sensitivities, dilution series of mutated DNA in wild-type DNA were made. Additionally, direct sequencing and SNaPshot were evaluated in 296 colon cancer samples. Detection limits of direct sequencing, SNaPshot, and StripAssay were 20%, 10%, and 1% tumor cells, respectively. Direct sequencing and SNaPshot can detect all 12 mutations in KRAS codons 12 and 13, whereas the StripAssay detects 10 of the most frequent ones. Workload and time to results are comparable for SNaPshot and direct sequencing. SNaPshot is flexible and easy to multiplex. The StripAssay is less time consuming for daily laboratory practice. SNaPshot is more flexible and slightly more sensitive than direct sequencing. The clinical evaluation showed comparable performances between direct sequencing and SNaPshot. The StripAssay is rapid and an extremely sensitive assay that could be considered when few tumor cells are available. However, found mutants should be confirmed to avoid risk of false positives. PMID:21354055
Koo, Bonhan; Lee, Tae Yoon; Lee, Jeong Hoon; Shin, Yong; Lim, Seok-Byung
2017-01-01
Although KRAS mutational status testing is becoming a companion diagnostic tool for managing patients with colorectal cancer (CRC), there are still several difficulties when analyzing KRAS mutations using the existing assays, particularly with regard to low sensitivity, its time-consuming, and the need for large instruments. We developed a rapid, sensitive, and specific mutation detection assay based on the bio-photonic sensor termed ISAD (isothermal solid-phase amplification/detection), and used it to analyze KRAS gene mutations in human clinical samples. To validate the ISAD-KRAS assay for use in clinical diagnostics, we examined for hotspot KRAS mutations (codon 12 and codon 13) in 70 CRC specimens using PCR and direct sequencing methods. In a serial dilution study, ISAD-KRAS could detect mutations in a sample containing only 1% of the mutant allele in a mixture of wild-type DNA, whereas both PCR and direct sequencing methods could detect mutations in a sample containing approximately 30% of mutant cells. The results of the ISAD-KRAS assay from 70 clinical samples matched those from PCR and direct sequencing, except in 5 cases, wherein ISAD-KRAS could detect mutations that were not detected by PCR and direct sequencing. We also found that the sensitivity and specificity of ISAD-KRAS were 100% within 30 min. The ISAD-KRAS assay provides a rapid, highly sensitive, and label-free method for KRAS mutation testing, and can serve as a robust and near patient testing approach for the rapid detection of patients most likely to respond to anti-EGFR drugs. PMID:29137388
D. Lee Taylor; Michael G. Booth; Jack W. McFarland; Ian C. Herriott; Niall J. Lennon; Chad Nusbaum; Thomas G. Marr
2008-01-01
High throughput sequencing methods are widely used in analyses of microbial diversity but are generally applied to small numbers of samples, which precludes charaterization of patterns of microbial diversity across space and time. We have designed a primer-tagging approach that allows pooling and subsequent sorting of numerous samples, which is directed to...
[Study on ITS sequences of Aconitum vilmorinianum and its medicinal adulterant].
Zhang, Xiao-nan; Du, Chun-hua; Fu, De-huan; Gao, Li; Zhou, Pei-jun; Wang, Li
2012-09-01
To analyze and compare the ITS sequences of Aconitum vilmorinianum and its medicinal adulterant Aconitum austroyunnanense. Total genomic DNA were extracted from sample materials by improved CTAB method, ITS sequences of samples were amplified using PCR systems, directly sequenced and analyzed using software DNAStar, ClustalX1.81 and MEGA 4.0. 299 consistent sites, 19 variable sites and 13 informative sites were found in ITS1 sequences, 162 consistent sites, 2 variable sites and 1 informative sites were found in 5.8S sequences, 217 consistent sites, 3 variable sites and 1 informative site were found in ITS2 sequences. Base transition and transversion was not found only in 5.8S sequences, 2 sites transition and 1 site transversion were found in ITS1 sequences, only 1 site transversion was found in ITS2 sequences comparting the ITS sequences data matrix. By analyzing the ITS sequences data matrix from 2 population of Aconitum vilmorinianum and 3 population of Aconitum austroyunnanense, we found a stable informative site at the 596th base in ITS2 sequences, in all the samples of Aconitum vilmorinianum the base was C, and in all the samples of Aconitum austroyunnanense the base was A. Aconitum vilmorinianum and Aconitum austroyunnanense can be identified by their characters of ITS sequences, and the variable sites in ITS1 sequences are more than in ITS2 sequences.
Migration pattern of hepatitis A virus genotype IA in North-Central Tunisia.
Beji-Hamza, Abir; Taffon, Stefania; Mhalla, Salma; Lo Presti, Alessandra; Equestre, Michele; Chionne, Paola; Madonna, Elisabetta; Cella, Eleonora; Bruni, Roberto; Ciccozzi, Massimo; Aouni, Mahjoub; Ciccaglione, Anna Rita
2015-02-08
Hepatitis A virus (HAV) epidemiology in Tunisia has changed from high to intermediate endemicity in the last decades. However, several outbreaks continue to occur. The last reported sequences from Tunisian HAV strains date back to 2006. In order to provide an updated overview of the strains currently circulating in Tunisia, a large-scale molecular analysis of samples from hepatitis A cases was performed, the first in Tunisia. Biological samples were collected from patients with laboratory confirmed hepatitis A: 145 sera samples in Tunis, Monastir, Sousse and Kairouan from 2008 to 2013 and 45 stool samples in Mahdia in 2009. HAV isolates were characterised by nested RT-PCR (VP1/2A region) and sequencing. The sequences finally obtained from 81 samples showed 78 genotype IA and 3 genotype IB isolates. A Tunisian genotype IA sequence dataset, including both the 78 newly obtained IA sequences and 51 sequences retrieved from GenBank, was used for phylogenetic investigation, including analysis of migration pattern among six towns. Virus gene flow from Sfax and Monastir was directed to all other towns; in contrast, the gene flows from Sousse, Tunis, Mahdia and Kairouan were directed to three, two, one and no towns, respectively. Several different HAV strains co-circulate in Tunisia, but the predominant genotype still continues to be IA (78/81, 96% isolates). A complex gene flow (migration) of HAV genotype IA was observed, with Sfax and Monastir showing gene flows to all other investigated towns. This approach coupled to a wider sampling can prove useful to investigate the factors underlying the spread of HAV in Tunisia and, thus, to implement appropriate preventing measures.
Smith, Rick W A; Monroe, Cara; Bolnick, Deborah A
2015-01-01
While cytosine methylation has been widely studied in extant populations, relatively few studies have analyzed methylation in ancient DNA. Most existing studies of epigenetic marks in ancient DNA have inferred patterns of methylation in highly degraded samples using post-mortem damage to cytosines as a proxy for cytosine methylation levels. However, this approach limits the inference of methylation compared with direct bisulfite sequencing, the current gold standard for analyzing cytosine methylation at single nucleotide resolution. In this study, we used direct bisulfite sequencing to assess cytosine methylation in ancient DNA from the skeletal remains of 30 Native Americans ranging in age from approximately 230 to 4500 years before present. Unmethylated cytosines were converted to uracils by treatment with sodium bisulfite, bisulfite products of a CpG-rich retrotransposon were pyrosequenced, and C-to-T ratios were quantified for a single CpG position. We found that cytosine methylation is readily recoverable from most samples, given adequate preservation of endogenous nuclear DNA. In addition, our results indicate that the precision of cytosine methylation estimates is inversely correlated with aDNA preservation, such that samples of low DNA concentration show higher variability in measures of percent methylation than samples of high DNA concentration. In particular, samples in this study with a DNA concentration above 0.015 ng/μL generated the most consistent measures of cytosine methylation. This study presents evidence of cytosine methylation in a large collection of ancient human remains, and indicates that it is possible to analyze epigenetic patterns in ancient populations using direct bisulfite sequencing approaches.
Hagger, Martin S; Chatzisarantis, Nikos L D; Harris, Jemma
2006-02-01
The present study tested a motivational sequence in which global-level psychological need satisfaction from self-determination theory influenced intentions and behavior directly and indirectly through contextual-level motivation and situational-level decision-making constructs from the theory of planned behavior. Two samples of university students (N = 511) completed measures of global-level psychological need satisfaction, contextual-level autonomous motivation, and situational-level attitudes, subjective norms, perceived behavioral control, intentions, and behavior in two behavioral contexts: exercise and dieting. A structural equation model supported the proposed sequence in both samples. The indirect effect was present for exercise behavior, whereas both direct and indirect effects were found for dieting behavior. Findings independently supported the component theories and provided a comprehensive integrated explanation of volitional behavior.
Noninvasive diagnosis of fetal aneuploidy by shotgun sequencing DNA from maternal blood
Fan, H. Christina; Blumenfeld, Yair J.; Chitkara, Usha; Hudgins, Louanne; Quake, Stephen R.
2008-01-01
We directly sequenced cell-free DNA with high-throughput shotgun sequencing technology from plasma of pregnant women, obtaining, on average, 5 million sequence tags per patient sample. This enabled us to measure the over- and underrepresentation of chromosomes from an aneuploid fetus. The sequencing approach is polymorphism-independent and therefore universally applicable for the noninvasive detection of fetal aneuploidy. Using this method, we successfully identified all nine cases of trisomy 21 (Down syndrome), two cases of trisomy 18 (Edward syndrome), and one case of trisomy 13 (Patau syndrome) in a cohort of 18 normal and aneuploid pregnancies; trisomy was detected at gestational ages as early as the 14th week. Direct sequencing also allowed us to study the characteristics of cell-free plasma DNA, and we found evidence that this DNA is enriched for sequences from nucleosomes. PMID:18838674
Detection of a divergent variant of grapevine virus F by next-generation sequencing.
Molenaar, Nicholas; Burger, Johan T; Maree, Hans J
2015-08-01
The complete genome sequence of a South African isolate of grapevine virus F (GVF) is presented. It was first detected by metagenomic next-generation sequencing of field samples and validated through direct Sanger sequencing. The genome sequence of GVF isolate V5 consists of 7539 nucleotides and contains a poly(A) tail. It has a typical vitivirus genome arrangement that comprises five open reading frames (ORFs), which share only 88.96 % nucleotide sequence identity with the existing complete GVF genome sequence (JX105428).
Metatranscriptomics of Soil Eukaryotic Communities.
Yadav, Rajiv K; Bragalini, Claudia; Fraissinet-Tachet, Laurence; Marmeisse, Roland; Luis, Patricia
2016-01-01
Functions expressed by eukaryotic organisms in soil can be specifically studied by analyzing the pool of eukaryotic-specific polyadenylated mRNA directly extracted from environmental samples. In this chapter, we describe two alternative protocols for the extraction of high-quality RNA from soil samples. Total soil RNA or mRNA can be converted to cDNA for direct high-throughput sequencing. Polyadenylated mRNA-derived full-length cDNAs can also be cloned in expression plasmid vectors to constitute soil cDNA libraries, which can be subsequently screened for functional gene categories. Alternatively, the diversity of specific gene families can also be explored following cDNA sequence capture using exploratory oligonucleotide probes.
DOE Office of Scientific and Technical Information (OSTI.GOV)
Tan, H.
1999-03-31
The purpose of this research is to develop a multiplexed sample processing system in conjunction with multiplexed capillary electrophoresis for high-throughput DNA sequencing. The concept from DNA template to called bases was first demonstrated with a manually operated single capillary system. Later, an automated microfluidic system with 8 channels based on the same principle was successfully constructed. The instrument automatically processes 8 templates through reaction, purification, denaturation, pre-concentration, injection, separation and detection in a parallel fashion. A multiplexed freeze/thaw switching principle and a distribution network were implemented to manage flow direction and sample transportation. Dye-labeled terminator cycle-sequencing reactions are performedmore » in an 8-capillary array in a hot air thermal cycler. Subsequently, the sequencing ladders are directly loaded into a corresponding size-exclusion chromatographic column operated at {approximately} 60 C for purification. On-line denaturation and stacking injection for capillary electrophoresis is simultaneously accomplished at a cross assembly set at {approximately} 70 C. Not only the separation capillary array but also the reaction capillary array and purification columns can be regenerated after every run. DNA sequencing data from this system allow base calling up to 460 bases with accuracy of 98%.« less
Xu, Yi-Hua; Manoharan, Herbert T; Pitot, Henry C
2007-09-01
The bisulfite genomic sequencing technique is one of the most widely used techniques to study sequence-specific DNA methylation because of its unambiguous ability to reveal DNA methylation status to the order of a single nucleotide. One characteristic feature of the bisulfite genomic sequencing technique is that a number of sample sequence files will be produced from a single DNA sample. The PCR products of bisulfite-treated DNA samples cannot be sequenced directly because they are heterogeneous in nature; therefore they should be cloned into suitable plasmids and then sequenced. This procedure generates an enormous number of sample DNA sequence files as well as adding extra bases belonging to the plasmids to the sequence, which will cause problems in the final sequence comparison. Finding the methylation status for each CpG in each sample sequence is not an easy job. As a result CpG PatternFinder was developed for this purpose. The main functions of the CpG PatternFinder are: (i) to analyze the reference sequence to obtain CpG and non-CpG-C residue position information. (ii) To tailor sample sequence files (delete insertions and mark deletions from the sample sequence files) based on a configuration of ClustalW multiple alignment. (iii) To align sample sequence files with a reference file to obtain bisulfite conversion efficiency and CpG methylation status. And, (iv) to produce graphics, highlighted aligned sequence text and a summary report which can be easily exported to Microsoft Office suite. CpG PatternFinder is designed to operate cooperatively with BioEdit, a freeware on the internet. It can handle up to 100 files of sample DNA sequences simultaneously, and the total CpG pattern analysis process can be finished in minutes. CpG PatternFinder is an ideal software tool for DNA methylation studies to determine the differential methylation pattern in a large number of individuals in a population. Previously we developed the CpG Analyzer program; CpG PatternFinder is our further effort to create software tools for DNA methylation studies.
Mitsui, Jun; Fukuda, Yoko; Azuma, Kyo; Tozaki, Hirokazu; Ishiura, Hiroyuki; Takahashi, Yuji; Goto, Jun; Tsuji, Shoji
2010-07-01
We have recently found that multiple rare variants of the glucocerebrosidase gene (GBA) confer a robust risk for Parkinson disease, supporting the 'common disease-multiple rare variants' hypothesis. To develop an efficient method of identifying rare variants in a large number of samples, we applied multiplexed resequencing using a next-generation sequencer to identification of rare variants of GBA. Sixteen sets of pooled DNAs from six pooled DNA samples were prepared. Each set of pooled DNAs was subjected to polymerase chain reaction to amplify the target gene (GBA) covering 6.5 kb, pooled into one tube with barcode indexing, and then subjected to extensive sequence analysis using the SOLiD System. Individual samples were also subjected to direct nucleotide sequence analysis. With the optimization of data processing, we were able to extract all the variants from 96 samples with acceptable rates of false-positive single-nucleotide variants.
A novel directional asymmetric sampling search algorithm for fast block-matching motion estimation
NASA Astrophysics Data System (ADS)
Li, Yue-e.; Wang, Qiang
2011-11-01
This paper proposes a novel directional asymmetric sampling search (DASS) algorithm for video compression. Making full use of the error information (block distortions) of the search patterns, eight different direction search patterns are designed for various situations. The strategy of local sampling search is employed for the search of big-motion vector. In order to further speed up the search, early termination strategy is adopted in procedure of DASS. Compared to conventional fast algorithms, the proposed method has the most satisfactory PSNR values for all test sequences.
Fantin, Yuri S.; Neverov, Alexey D.; Favorov, Alexander V.; Alvarez-Figueroa, Maria V.; Braslavskaya, Svetlana I.; Gordukova, Maria A.; Karandashova, Inga V.; Kuleshov, Konstantin V.; Myznikova, Anna I.; Polishchuk, Maya S.; Reshetov, Denis A.; Voiciehovskaya, Yana A.; Mironov, Andrei A.; Chulanov, Vladimir P.
2013-01-01
Sanger sequencing is a common method of reading DNA sequences. It is less expensive than high-throughput methods, and it is appropriate for numerous applications including molecular diagnostics. However, sequencing mixtures of similar DNA of pathogens with this method is challenging. This is important because most clinical samples contain such mixtures, rather than pure single strains. The traditional solution is to sequence selected clones of PCR products, a complicated, time-consuming, and expensive procedure. Here, we propose the base-calling with vocabulary (BCV) method that computationally deciphers Sanger chromatograms obtained from mixed DNA samples. The inputs to the BCV algorithm are a chromatogram and a dictionary of sequences that are similar to those we expect to obtain. We apply the base-calling function on a test dataset of chromatograms without ambiguous positions, as well as one with 3–14% sequence degeneracy. Furthermore, we use BCV to assemble a consensus sequence for an HIV genome fragment in a sample containing a mixture of viral DNA variants and to determine the positions of the indels. Finally, we detect drug-resistant Mycobacterium tuberculosis strains carrying frameshift mutations mixed with wild-type bacteria in the pncA gene, and roughly characterize bacterial communities in clinical samples by direct 16S rRNA sequencing. PMID:23382983
Adachi, Noboru; Umetsu, Kazuo; Shojo, Hideki
2014-01-01
Mitochondrial DNA (mtDNA) is widely used for DNA analysis of highly degraded samples because of its polymorphic nature and high number of copies in a cell. However, as endogenous mtDNA in deteriorated samples is scarce and highly fragmented, it is not easy to obtain reliable data. In the current study, we report the risks of direct sequencing mtDNA in highly degraded material, and suggest a strategy to ensure the quality of sequencing data. It was observed that direct sequencing data of the hypervariable segment (HVS) 1 by using primer sets that generate an amplicon of 407 bp (long-primer sets) was different from results obtained by using newly designed primer sets that produce an amplicon of 120-139 bp (mini-primer sets). The data aligned with the results of mini-primer sets analysis in an amplicon length-dependent manner; the shorter the amplicon, the more evident the endogenous sequence became. Coding region analysis using multiplex amplified product-length polymorphisms revealed the incongruence of single nucleotide polymorphisms between the coding region and HVS 1 caused by contamination with exogenous mtDNA. Although the sequencing data obtained using long-primer sets turned out to be erroneous, it was unambiguous and reproducible. These findings suggest that PCR primers that produce amplicons shorter than those currently recognized should be used for mtDNA analysis in highly degraded samples. Haplogroup motif analysis of the coding region and HVS should also be performed to improve the reliability of forensic mtDNA data. Copyright © 2013 Elsevier Ireland Ltd. All rights reserved.
DOE Office of Scientific and Technical Information (OSTI.GOV)
Aw, Tiong Gim; Howe, Adina; Rose, Joan B.
2014-12-01
Genomic-based molecular techniques are emerging as powerful tools that allow a comprehensive characterization of water and wastewater microbiomes. Most recently, next generation sequencing (NGS) technologies which produce large amounts of sequence data are beginning to impact the field of environmental virology. In this study, NGS and bioinformatics have been employed for the direct detection and characterization of viruses in wastewater and of viruses isolated after cell culture. Viral particles were concentrated and purified from sewage samples by polyethylene glycol precipitation. Viral nucleic acid was extracted and randomly amplified prior to sequencing using Illumina technology, yielding a total of 18 millionmore » sequence reads. Most of the viral sequences detected could not be characterized, indicating the great viral diversity that is yet to be discovered. This sewage virome was dominated by bacteriophages and contained sequences related to known human pathogenic viruses such as adenoviruses (species B, C and F), polyomaviruses JC and BK and enteroviruses (type B). An array of other animal viruses was also found, suggesting unknown zoonotic viruses. This study demonstrated the feasibility of metagenomic approaches to characterize viruses in complex environmental water samples.« less
Effect of Next-Generation Exome Sequencing Depth for Discovery of Diagnostic Variants.
Kim, Kyung; Seong, Moon-Woo; Chung, Won-Hyong; Park, Sung Sup; Leem, Sangseob; Park, Won; Kim, Jihyun; Lee, KiYoung; Park, Rae Woong; Kim, Namshin
2015-06-01
Sequencing depth, which is directly related to the cost and time required for the generation, processing, and maintenance of next-generation sequencing data, is an important factor in the practical utilization of such data in clinical fields. Unfortunately, identifying an exome sequencing depth adequate for clinical use is a challenge that has not been addressed extensively. Here, we investigate the effect of exome sequencing depth on the discovery of sequence variants for clinical use. Toward this, we sequenced ten germ-line blood samples from breast cancer patients on the Illumina platform GAII(x) at a high depth of ~200×. We observed that most function-related diverse variants in the human exonic regions could be detected at a sequencing depth of 120×. Furthermore, investigation using a diagnostic gene set showed that the number of clinical variants identified using exome sequencing reached a plateau at an average sequencing depth of about 120×. Moreover, the phenomena were consistent across the breast cancer samples.
A model of directional selection applied to the evolution of drug resistance in HIV-1.
Seoighe, Cathal; Ketwaroo, Farahnaz; Pillay, Visva; Scheffler, Konrad; Wood, Natasha; Duffet, Rodger; Zvelebil, Marketa; Martinson, Neil; McIntyre, James; Morris, Lynn; Hide, Winston
2007-04-01
Understanding how pathogens acquire resistance to drugs is important for the design of treatment strategies, particularly for rapidly evolving viruses such as HIV-1. Drug treatment can exert strong selective pressures and sites within targeted genes that confer resistance frequently evolve far more rapidly than the neutral rate. Rapid evolution at sites that confer resistance to drugs can be used to help elucidate the mechanisms of evolution of drug resistance and to discover or corroborate novel resistance mutations. We have implemented standard maximum likelihood methods that are used to detect diversifying selection and adapted them for use with serially sampled reverse transcriptase (RT) coding sequences isolated from a group of 300 HIV-1 subtype C-infected women before and after single-dose nevirapine (sdNVP) to prevent mother-to-child transmission. We have also extended the standard models of codon evolution for application to the detection of directional selection. Through simulation, we show that the directional selection model can provide a substantial improvement in sensitivity over models of diversifying selection. Five of the sites within the RT gene that are known to harbor mutations that confer resistance to nevirapine (NVP) strongly supported the directional selection model. There was no evidence that other mutations that are known to confer NVP resistance were selected in this cohort. The directional selection model, applied to serially sampled sequences, also had more power than the diversifying selection model to detect selection resulting from factors other than drug resistance. Because inference of selection from serial samples is unlikely to be adversely affected by recombination, the methods we describe may have general applicability to the analysis of positive selection affecting recombining coding sequences when serially sampled data are available.
Lee, Sejoon; Lee, Soohyun; Ouellette, Scott; Park, Woong-Yang; Lee, Eunjung A; Park, Peter J
2017-06-20
In many next-generation sequencing (NGS) studies, multiple samples or data types are profiled for each individual. An important quality control (QC) step in these studies is to ensure that datasets from the same subject are properly paired. Given the heterogeneity of data types, file types and sequencing depths in a multi-dimensional study, a robust program that provides a standardized metric for genotype comparisons would be useful. Here, we describe NGSCheckMate, a user-friendly software package for verifying sample identities from FASTQ, BAM or VCF files. This tool uses a model-based method to compare allele read fractions at known single-nucleotide polymorphisms, considering depth-dependent behavior of similarity metrics for identical and unrelated samples. Our evaluation shows that NGSCheckMate is effective for a variety of data types, including exome sequencing, whole-genome sequencing, RNA-seq, ChIP-seq, targeted sequencing and single-cell whole-genome sequencing, with a minimal requirement for sequencing depth (>0.5X). An alignment-free module can be run directly on FASTQ files for a quick initial check. We recommend using this software as a QC step in NGS studies. https://github.com/parklab/NGSCheckMate. © The Author(s) 2017. Published by Oxford University Press on behalf of Nucleic Acids Research.
Time-of-flight radio location system
McEwan, T.E.
1996-04-23
A bi-static radar configuration measures the direct time-of-flight of a transmitted RF pulse and is capable of measuring this time-of-flight with a jitter on the order of about one pico-second, or about 0.01 inch of free space distance for an electromagnetic pulse over a range of about one to ten feet. A transmitter transmits a sequence of electromagnetic pulses in response to a transmit timing signal, and a receiver samples the sequence of electromagnetic pulses with controlled timing in response to a receive timing signal, and generates a sample signal in response to the samples. A timing circuit supplies the transmit timing signal to the transmitter and supplies the receive timing signal to the receiver. The receive timing signal causes the receiver to sample the sequence of electromagnetic pulses such that the time between transmission of pulses in the sequence and sampling by the receiver sweeps over a range of delays. The receive timing signal sweeps over the range of delays in a sweep cycle such that pulses in the sequence are sampled at the pulse repetition rate, and with different delays in the range of delays to produce a sample signal representing magnitude of a received pulse in equivalent time. Automatic gain control circuitry in the receiver controls the magnitude of the equivalent time sample signal. A signal processor analyzes the sample signal to indicate the time-of-flight of the electromagnetic pulses in the sequence. 7 figs.
Time-of-flight radio location system
McEwan, Thomas E.
1996-01-01
A bi-static radar configuration measures the direct time-of-flight of a transmitted RF pulse and is capable of measuring this time-of-flight with a jitter on the order of about one pico-second, or about 0.01 inch of free space distance for an electromagnetic pulse over a range of about one to ten feet. A transmitter transmits a sequence of electromagnetic pulses in response to a transmit timing signal, and a receiver samples the sequence of electromagnetic pulses with controlled timing in response to a receive timing signal, and generates a sample signal in response to the samples. A timing circuit supplies the transmit timing signal to the transmitter and supplies the receive timing signal to the receiver. The receive timing signal causes the receiver to sample the sequence of electromagnetic pulses such that the time between transmission of pulses in the sequence and sampling by the receiver sweeps over a range of delays. The receive timing signal sweeps over the range of delays in a sweep cycle such that pulses in the sequence are sampled at the pulse repetition rate, and with different delays in the range of delays to produce a sample signal representing magnitude of a received pulse in equivalent time. Automatic gain control circuitry in the receiver controls the magnitude of the equivalent time sample signal. A signal processor analyzes the sample signal to indicate the time-of-flight of the electromagnetic pulses in the sequence.
Luo, Chengwei; Tsementzi, Despina; Kyrpides, Nikos; Read, Timothy; Konstantinidis, Konstantinos T
2012-01-01
Next-generation sequencing (NGS) is commonly used in metagenomic studies of complex microbial communities but whether or not different NGS platforms recover the same diversity from a sample and their assembled sequences are of comparable quality remain unclear. We compared the two most frequently used platforms, the Roche 454 FLX Titanium and the Illumina Genome Analyzer (GA) II, on the same DNA sample obtained from a complex freshwater planktonic community. Despite the substantial differences in read length and sequencing protocols, the platforms provided a comparable view of the community sampled. For instance, derived assemblies overlapped in ~90% of their total sequences and in situ abundances of genes and genotypes (estimated based on sequence coverage) correlated highly between the two platforms (R(2)>0.9). Evaluation of base-call error, frameshift frequency, and contig length suggested that Illumina offered equivalent, if not better, assemblies than Roche 454. The results from metagenomic samples were further validated against DNA samples of eighteen isolate genomes, which showed a range of genome sizes and G+C% content. We also provide quantitative estimates of the errors in gene and contig sequences assembled from datasets characterized by different levels of complexity and G+C% content. For instance, we noted that homopolymer-associated, single-base errors affected ~1% of the protein sequences recovered in Illumina contigs of 10× coverage and 50% G+C; this frequency increased to ~3% when non-homopolymer errors were also considered. Collectively, our results should serve as a useful practical guide for choosing proper sampling strategies and data possessing protocols for future metagenomic studies.
Mitochondrial signature sequences have frequently been used to study the demographics of many different populations around the world. Traditionally, this requires obtaining samples directly from individuals which is cumbersome, time consuming and limited to the number of individu...
Marks, Michael; Fookes, Maria; Wagner, Josef; Butcher, Robert; Ghinai, Rosanna; Sokana, Oliver; Sarkodie, Yaw-Adu; Lukehart, Sheila A; Solomon, Anthony W; Mabey, David C W; Thomson, Nicholas
2018-01-01
Abstract Background Yaws-like chronic ulcers can be caused by Treponema pallidum subspecies pertenue, Haemophilus ducreyi, or other, still-undefined bacteria. To permit accurate evaluation of yaws elimination efforts, programmatic use of molecular diagnostics is required. The accuracy and sensitivity of current tools remain unclear because our understanding of T. pallidum diversity is limited by the low number of sequenced genomes. Methods We tested samples from patients with suspected yaws collected in the Solomon Islands and Ghana. All samples were from patients whose lesions had previously tested negative using the Centers for Disease Control and Prevention (CDC) diagnostic assay in widespread use. However, some of these patients had positive serological assays for yaws on blood. We used direct whole-genome sequencing to identify T. pallidum subsp pertenue strains missed by the current assay. Results From 45 Solomon Islands and 27 Ghanaian samples, 11 were positive for T. pallidum DNA using the species-wide quantitative polymerase chain reaction (PCR) assay, from which we obtained 6 previously undetected T. pallidum subsp pertenue whole-genome sequences. These show that Solomon Islands sequences represent distinct T. pallidum subsp pertenue clades. These isolates were invisible to the CDC diagnostic PCR assay, due to sequence variation in the primer binding site. Conclusions Our data double the number of published T. pallidum subsp pertenue genomes. We show that Solomon Islands strains are undetectable by the PCR used in many studies and by health ministries. This assay is therefore not adequate for the eradication program. Next-generation genome sequence data are essential for these efforts. PMID:29045605
Study design requirements for RNA sequencing-based breast cancer diagnostics.
Mer, Arvind Singh; Klevebring, Daniel; Grönberg, Henrik; Rantalainen, Mattias
2016-02-01
Sequencing-based molecular characterization of tumors provides information required for individualized cancer treatment. There are well-defined molecular subtypes of breast cancer that provide improved prognostication compared to routine biomarkers. However, molecular subtyping is not yet implemented in routine breast cancer care. Clinical translation is dependent on subtype prediction models providing high sensitivity and specificity. In this study we evaluate sample size and RNA-sequencing read requirements for breast cancer subtyping to facilitate rational design of translational studies. We applied subsampling to ascertain the effect of training sample size and the number of RNA sequencing reads on classification accuracy of molecular subtype and routine biomarker prediction models (unsupervised and supervised). Subtype classification accuracy improved with increasing sample size up to N = 750 (accuracy = 0.93), although with a modest improvement beyond N = 350 (accuracy = 0.92). Prediction of routine biomarkers achieved accuracy of 0.94 (ER) and 0.92 (Her2) at N = 200. Subtype classification improved with RNA-sequencing library size up to 5 million reads. Development of molecular subtyping models for cancer diagnostics requires well-designed studies. Sample size and the number of RNA sequencing reads directly influence accuracy of molecular subtyping. Results in this study provide key information for rational design of translational studies aiming to bring sequencing-based diagnostics to the clinic.
Time-of-flight radio location system
McEwan, T.E.
1997-08-26
A bi-static radar configuration measures the direct time-of-flight of a transmitted RF pulse and is capable of measuring this time-of-flight with a jitter on the order of about one pico-second, or about 0.01 inch of free space distance for an electromagnetic pulse over a range of about one to ten feet. A transmitter transmits a sequence of electromagnetic pulses in response to a transmit timing signal, and a receiver samples the sequence of electromagnetic pulses with controlled timing in response to a receive timing signal, and generates a sample signal in response to the samples. A timing circuit supplies the transmit timing signal to the transmitter and supplies the receive timing signal to the receiver. The receive timing signal causes the receiver to sample the sequence of electromagnetic pulses such that the time between transmission of pulses in the sequence and sampling by the receiver sweeps over a range of delays. The receive timing signal sweeps over the range of delays in a sweep cycle such that pulses in the sequence are sampled at the pulse repetition rate, and with different delays in the range of delays to produce a sample signal representing magnitude of a received pulse in equivalent time. Automatic gain control circuitry in the receiver controls the magnitude of the equivalent time sample signal. A signal processor analyzes the sample signal to indicate the time-of-flight of the electromagnetic pulses in the sequence. The sample signal in equivalent time is passed through an envelope detection circuit, formed of an absolute value circuit followed by a low pass filter, to convert the sample signal to a unipolar signal to eliminate effects of antenna misorientation. 8 figs.
Time-of-flight radio location system
McEwan, Thomas E.
1997-01-01
A bi-static radar configuration measures the direct time-of-flight of a transmitted RF pulse and is capable of measuring this time-of-flight with a jitter on the order of about one pico-second, or about 0.01 inch of free space distance for an electromagnetic pulse over a range of about one to ten feet. A transmitter transmits a sequence of electromagnetic pulses in response to a transmit timing signal, and a receiver samples the sequence of electromagnetic pulses with controlled timing in response to a receive timing signal, and generates a sample signal in response to the samples. A timing circuit supplies the transmit timing signal to the transmitter and supplies the receive timing signal to the receiver. The receive timing signal causes the receiver to sample the sequence of electromagnetic pulses such that the time between transmission of pulses in the sequence and sampling by the receiver sweeps over a range of delays. The receive timing signal sweeps over the range of delays in a sweep cycle such that pulses in the sequence are sampled at the pulse repetition rate, and with different delays in the range of delays to produce a sample signal representing magnitude of a received pulse in equivalent time. Automatic gain control circuitry in the receiver controls the magnitude of the equivalent time sample signal. A signal processor analyzes the sample signal to indicate the time-of-flight of the electromagnetic pulses in the sequence. The sample signal in equivalent time is passed through an envelope detection circuit, formed of an absolute value circuit followed by a low pass filter, to convert the sample signal to a unipolar signal to eliminate effects of antenna misorientation.
DOE Office of Scientific and Technical Information (OSTI.GOV)
Allen, J; Velsko, S
This report explores the question of whether meaningful conclusions can be drawn regarding the transmission relationship between two microbial samples on the basis of differences observed between the two sample's respective genomes. Unlike similar forensic applications using human DNA, the rapid rate of microbial genome evolution combined with the dynamics of infectious disease require a shift in thinking on what it means for two samples to 'match' in support of a forensic hypothesis. Previous outbreaks for SARS-CoV, FMDV and HIV were examined to investigate the question of how microbial sequence data can be used to draw inferences that link twomore » infected individuals by direct transmission. The results are counter intuitive with respect to human DNA forensic applications in that some genetic change rather than exact matching improve confidence in inferring direct transmission links, however, too much genetic change poses challenges, which can weaken confidence in inferred links. High rates of infection coupled with relatively weak selective pressure observed in the SARS-CoV and FMDV data lead to fairly low confidence for direct transmission links. Confidence values for forensic hypotheses increased when testing for the possibility that samples are separated by at most a few intermediate hosts. Moreover, the observed outbreak conditions support the potential to provide high confidence values for hypothesis that exclude direct transmission links. Transmission inferences are based on the total number of observed or inferred genetic changes separating two sequences rather than uniquely weighing the importance of any one genetic mismatch. Thus, inferences are surprisingly robust in the presence of sequencing errors provided the error rates are randomly distributed across all samples in the reference outbreak database and the novel sequence samples in question. When the number of observed nucleotide mutations are limited due to characteristics of the outbreak or the availability of only partial rather than whole genome sequencing, indel information was shown to have the potential to improve performance but only for select outbreak conditions. In examined HIV transmission cases, extended evolution proved to be the limiting factor in assigning high confidence to transmission links, however, the potential to correct for extended evolution not associated with transmission events is demonstrated. Outbreak specific conditions such as selective pressure (in the form of varying mutation rate), are shown to impact the strength of inference made and a Monte Carlo simulation tool is introduced, which is used to provide upper and lower bounds on the confidence values associated with a forensic hypothesis.« less
Marks, Michael; Fookes, Maria; Wagner, Josef; Butcher, Robert; Ghinai, Rosanna; Sokana, Oliver; Sarkodie, Yaw-Adu; Lukehart, Sheila A; Solomon, Anthony W; Mabey, David C W; Thomson, Nicholas
2018-03-05
Yaws-like chronic ulcers can be caused by Treponema pallidum subspecies pertenue, Haemophilus ducreyi, or other, still-undefined bacteria. To permit accurate evaluation of yaws elimination efforts, programmatic use of molecular diagnostics is required. The accuracy and sensitivity of current tools remain unclear because our understanding of T. pallidum diversity is limited by the low number of sequenced genomes. We tested samples from patients with suspected yaws collected in the Solomon Islands and Ghana. All samples were from patients whose lesions had previously tested negative using the Centers for Disease Control and Prevention (CDC) diagnostic assay in widespread use. However, some of these patients had positive serological assays for yaws on blood. We used direct whole-genome sequencing to identify T. pallidum subsp pertenue strains missed by the current assay. From 45 Solomon Islands and 27 Ghanaian samples, 11 were positive for T. pallidum DNA using the species-wide quantitative polymerase chain reaction (PCR) assay, from which we obtained 6 previously undetected T. pallidum subsp pertenue whole-genome sequences. These show that Solomon Islands sequences represent distinct T. pallidum subsp pertenue clades. These isolates were invisible to the CDC diagnostic PCR assay, due to sequence variation in the primer binding site. Our data double the number of published T. pallidum subsp pertenue genomes. We show that Solomon Islands strains are undetectable by the PCR used in many studies and by health ministries. This assay is therefore not adequate for the eradication program. Next-generation genome sequence data are essential for these efforts. © The Author 2017. Published by Oxford University Press for the Infectious Diseases Society of America.
Chaban, Bonnie; Chu, Shirley; Hendrick, Steven; Waldner, Cheryl; Hill, Janet E.
2012-01-01
The detection and subspeciation of Campylobacter fetus subsp. venerealis (CFV) from veterinary samples is important for both clinical and economic reasons. Campylobacter fetus subsp. venerealis is the causative agent of bovine genital campylobacteriosis, a venereal disease that can lead to serious reproductive problems in cattle, and strict international regulations require animals and animal products to be CFV-free for trade. This study evaluated methods reported in the literature for CFV detection and reports the translation of an extensively tested CFV-specific polymerase chain reaction (PCR) primer set; including the VenSF/VenSR primers and a real-time, quantitative PCR (qPCR) platform using SYBR Green chemistry. Three methods of preputial sample preparation for direct qPCR were evaluated and a heat lysis DNA extraction method was shown to allow for CFV detection at the level of approximately one cell equivalent per reaction (or 1.0 × 103 CFU/mL) from prepuce. The optimized sample preparation and qPCR protocols were then used to evaluate 3 western Canadian bull cohorts, which included 377 bulls, for CFV. The qPCR assay detected 11 positive bulls for the CFV-specific parA gene target. DNA sequence data confirmed the identity of the amplified product and revealed that positive samples were comprised of 2 sequence types; one identical to previously reported CFV parA gene sequences and one with a 9% sequence divergence. These results add valuable information towards our understanding of an important CFV subspeciation target and offer a significantly improved format for an internationally recognized PCR test. PMID:23277694
Engineering of a DNA Polymerase for Direct m6 A Sequencing.
Aschenbrenner, Joos; Werner, Stephan; Marchand, Virginie; Adam, Martina; Motorin, Yuri; Helm, Mark; Marx, Andreas
2018-01-08
Methods for the detection of RNA modifications are of fundamental importance for advancing epitranscriptomics. N 6 -methyladenosine (m 6 A) is the most abundant RNA modification in mammalian mRNA and is involved in the regulation of gene expression. Current detection techniques are laborious and rely on antibody-based enrichment of m 6 A-containing RNA prior to sequencing, since m 6 A modifications are generally "erased" during reverse transcription (RT). To overcome the drawbacks associated with indirect detection, we aimed to generate novel DNA polymerase variants for direct m 6 A sequencing. Therefore, we developed a screen to evolve an RT-active KlenTaq DNA polymerase variant that sets a mark for N 6 -methylation. We identified a mutant that exhibits increased misincorporation opposite m 6 A compared to unmodified A. Application of the generated DNA polymerase in next-generation sequencing allowed the identification of m 6 A sites directly from the sequencing data of untreated RNA samples. © 2017 The Authors. Published by Wiley-VCH Verlag GmbH & Co. KGaA.
Methods for determining the genetic affinity of microorganisms and viruses
NASA Technical Reports Server (NTRS)
Fox, George E. (Inventor); Willson, III, Richard C. (Inventor); Zhang, Zhengdong (Inventor)
2012-01-01
Selecting which sub-sequences in a database of nucleic acid such as 16S rRNA are highly characteristic of particular groupings of bacteria, microorganisms, fungi, etc. on a substantially phylogenetic tree. Also applicable to viruses comprising viral genomic RNA or DNA. A catalogue of highly characteristic sequences identified by this method is assembled to establish the genetic identity of an unknown organism. The characteristic sequences are used to design nucleic acid hybridization probes that include the characteristic sequence or its complement, or are derived from one or more characteristic sequences. A plurality of these characteristic sequences is used in hybridization to determine the phylogenetic tree position of the organism(s) in a sample. Those target organisms represented in the original sequence database and sufficient characteristic sequences can identify to the species or subspecies level. Oligonucleotide arrays of many probes are especially preferred. A hybridization signal can comprise fluorescence, chemiluminescence, or isotopic labeling, etc.; or sequences in a sample can be detected by direct means, e.g. mass spectrometry. The method's characteristic sequences can also be used to design specific PCR primers. The method uniquely identifies the phylogenetic affinity of an unknown organism without requiring prior knowledge of what is present in the sample. Even if the organism has not been previously encountered, the method still provides useful information about which phylogenetic tree bifurcation nodes encompass the organism.
Using single cell sequencing data to model the evolutionary history of a tumor.
Kim, Kyung In; Simon, Richard
2014-01-24
The introduction of next-generation sequencing (NGS) technology has made it possible to detect genomic alterations within tumor cells on a large scale. However, most applications of NGS show the genetic content of mixtures of cells. Recently developed single cell sequencing technology can identify variation within a single cell. Characterization of multiple samples from a tumor using single cell sequencing can potentially provide information on the evolutionary history of that tumor. This may facilitate understanding how key mutations accumulate and evolve in lineages to form a heterogeneous tumor. We provide a computational method to infer an evolutionary mutation tree based on single cell sequencing data. Our approach differs from traditional phylogenetic tree approaches in that our mutation tree directly describes temporal order relationships among mutation sites. Our method also accommodates sequencing errors. Furthermore, we provide a method for estimating the proportion of time from the earliest mutation event of the sample to the most recent common ancestor of the sample of cells. Finally, we discuss current limitations on modeling with single cell sequencing data and possible improvements under those limitations. Inferring the temporal ordering of mutational sites using current single cell sequencing data is a challenge. Our proposed method may help elucidate relationships among key mutations and their role in tumor progression.
Meinel, Dominik M; Heinzinger, Susanne; Eberle, Ute; Ackermann, Nikolaus; Schönberger, Katharina; Sing, Andreas
2018-02-01
Influenza with its annual epidemic waves is a major cause of morbidity and mortality worldwide. However, only little whole genome data are available regarding the molecular epidemiology promoting our understanding of viral spread in human populations. We implemented a RT-PCR strategy starting from patient material to generate influenza A whole genome sequences for molecular epidemiological surveillance. Samples were obtained within the Bavarian Influenza Sentinel. The complete influenza virus genome was amplified by a one-tube multiplex RT-PCR and sequenced on an Illumina MiSeq. We report whole genomic sequences for 50 influenza A H3N2 viruses, which was the predominating virus in the season 2014/15, directly from patient specimens. The dataset included random samples from Bavaria (Germany) throughout the influenza season and samples from three suspected transmission clusters. We identified the outbreak samples based on sequence identity. Whole genome sequencing (WGS) was superior in resolution compared to analysis of single segments or partial segment analysis. Additionally, we detected manifestation of substantial amounts of viral quasispecies in several patients, carrying mutations varying from the dominant virus in each patient. Our rapid whole genome sequencing approach for influenza A virus shows that WGS can effectively be used to detect and understand outbreaks in large communities. Additionally, the genomic data provide in-depth details about the circulating virus within one season.
Pedersen, M S; Fahnøe, U; Hansen, T A; Pedersen, A G; Jenssen, H; Bukh, J; Schønning, K
2018-06-01
The current treatment options for hepatitis C virus (HCV), based on direct acting antivirals (DAA), are dependent on virus genotype and previous treatment experience. Treatment failures have been associated with detection of resistance-associated substitutions (RASs) in the DAA targets of HCV, the NS3, NS5A and NS5 B proteins. To develop a next generation sequencing based method that provides genotype and detection of HCV NS3, NS5A, and NS5 B RASs without prior knowledge of sample genotype. In total, 101 residual plasma samples from patients with HCV covering 10 different viral subtypes across 4 genotypes with viral loads of 3.84-7.61 Log IU/mL were included. All samples were de-identified and consequently prior treatment status for patients was unknown. Almost full open reading frame amplicons (∼ 9 kb) were generated using RT-PCR with a single primer set. The resulting amplicons were sequenced with high throughput sequencing and analysed using an in-house developed script for detecting RASs. The method successfully amplified and sequenced 94% (95/101) of samples with an average coverage of 14,035; four of six failed samples were genotype 4a. Samples analysed twice yielded reproducible nucleotide frequencies across all sites. RASs were detected in 21/95 (22%) samples at a 15% threshold. The method identified one patient infected with two genotype 2b variants, and the presence of subgenomic deletion variants in 8 (8.4%) of 95 successfully sequenced samples. The presented method may provide identification of HCV genotype, RASs detection, and detect multiple HCV infection without prior knowledge of sample genotype. Copyright © 2018 Elsevier B.V. All rights reserved.
Almeida, Jonatas Campos; Martins, Felippe Danyel Cardoso; Ferreira Neto, José Maurício; Santos, Maíra Moreira Dos; Garcia, João Luis; Navarro, Italmar Teodorico; Kuroda, Emília Kiyomi; Freire, Roberta Lemos
2015-01-01
The purpose of this study was to investigate the occurrence of Cryptosporidium spp. and Giardia spp. in a public water-treatment system. Samples of raw and treated water were collected and concentrated using the membrane filtration technique. Direct Immunofluorescence Test was performed on the samples. DNA extraction using a commercial kit was performed and the DNA extracted was submitted to a nested-PCR reaction (n-PCR) and sequencing. In the immunofluorescence, 2/24 (8.33%) samples of raw water were positive for Giardia spp.. In n-PCR and sequencing, 2/24 (8.33%) samples of raw water were positive for Giardia spp., and 2/24 (8.33%) samples were positive for Cryptosporidium spp.. The sequencing showed Cryptosporidium parvum and Giardia duodenalis DNA. In raw water, there was moderate correlation among turbidity, color and Cryptosporidium spp. and between turbidity and Giardia spp.. The presence of these protozoans in the water indicates the need for monitoring for water-treatment companies.
Do, Hongdo; Dobrovic, Alexander
2009-01-01
Background Mutation detection in clinical tumour samples is challenging when the proportion of tumour cells, and thus mutant alleles, is low. The limited sensitivity of conventional sequencing necessitates the adoption of more sensitive approaches. High resolution melting (HRM) is more sensitive than sequencing but identification of the mutation is desirable, particularly when it is important to discriminate false positives due to PCR errors or template degradation from true mutations. We thus developed limited copy number - high resolution melting (LCN-HRM) which applies limiting dilution to HRM. Multiple replicate reactions with a limited number of target sequences per reaction allow low level mutations to be detected. The dilutions used (based on Ct values) are chosen such that mutations, if present, can be detected by the direct sequencing of amplicons with aberrant melting patterns. Results Using cell lines heterozygous for mutations, we found that the mutations were not readily detected when they comprised 10% of total alleles (20% tumour cells) by sequencing, whereas they were readily detectable at 5% total alleles by standard HRM. LCN-HRM allowed these mutations to be identified by direct sequencing of those positive reactions. LCN-HRM was then used to review formalin-fixed paraffin-embedded (FFPE) clinical samples showing discordant findings between sequencing and HRM for KRAS exon 2 and EGFR exons 19 and 21. Both true mutations present at low levels and sequence changes due to artefacts were detected by LCN-HRM. The use of high fidelity polymerases showed that the majority of the artefacts were derived from the damaged template rather than replication errors during amplification. Conclusion LCN-HRM bridges the sensitivity gap between HRM and sequencing and is effective in distinguishing between artefacts and true mutations. PMID:19811662
Do, Hongdo; Dobrovic, Alexander
2009-10-08
Mutation detection in clinical tumour samples is challenging when the proportion of tumour cells, and thus mutant alleles, is low. The limited sensitivity of conventional sequencing necessitates the adoption of more sensitive approaches. High resolution melting (HRM) is more sensitive than sequencing but identification of the mutation is desirable, particularly when it is important to discriminate false positives due to PCR errors or template degradation from true mutations.We thus developed limited copy number - high resolution melting (LCN-HRM) which applies limiting dilution to HRM. Multiple replicate reactions with a limited number of target sequences per reaction allow low level mutations to be detected. The dilutions used (based on Ct values) are chosen such that mutations, if present, can be detected by the direct sequencing of amplicons with aberrant melting patterns. Using cell lines heterozygous for mutations, we found that the mutations were not readily detected when they comprised 10% of total alleles (20% tumour cells) by sequencing, whereas they were readily detectable at 5% total alleles by standard HRM. LCN-HRM allowed these mutations to be identified by direct sequencing of those positive reactions.LCN-HRM was then used to review formalin-fixed paraffin-embedded (FFPE) clinical samples showing discordant findings between sequencing and HRM for KRAS exon 2 and EGFR exons 19 and 21. Both true mutations present at low levels and sequence changes due to artefacts were detected by LCN-HRM. The use of high fidelity polymerases showed that the majority of the artefacts were derived from the damaged template rather than replication errors during amplification. LCN-HRM bridges the sensitivity gap between HRM and sequencing and is effective in distinguishing between artefacts and true mutations.
Russell, Joseph A; Campos, Brittany; Stone, Jennifer; Blosser, Erik M; Burkett-Cadena, Nathan; Jacobs, Jonathan L
2018-04-03
The future of infectious disease surveillance and outbreak response is trending towards smaller hand-held solutions for point-of-need pathogen detection. Here, samples of Culex cedecei mosquitoes collected in Southern Florida, USA were tested for Venezuelan Equine Encephalitis Virus (VEEV), a previously-weaponized arthropod-borne RNA-virus capable of causing acute and fatal encephalitis in animal and human hosts. A single 20-mosquito pool tested positive for VEEV by quantitative reverse transcription polymerase chain reaction (RT-qPCR) on the Biomeme two3. The virus-positive sample was subjected to unbiased metatranscriptome sequencing on the Oxford Nanopore MinION and shown to contain Everglades Virus (EVEV), an alphavirus in the VEEV serocomplex. Our results demonstrate, for the first time, the use of unbiased sequence-based detection and subtyping of a high-consequence biothreat pathogen directly from an environmental sample using field-forward protocols. The development and validation of methods designed for field-based diagnostic metagenomics and pathogen discovery, such as those suitable for use in mobile "pocket laboratories", will address a growing demand for public health teams to carry out their mission where it is most urgent: at the point-of-need.
Photothermal method of determining calorific properties of coal
Amer, N.M.
1983-05-16
Predetermined amounts of heat are generated within a coal sample by directing pump light pulses of predetermined energy content into a small surface region of the sample. A beam of probe light is directed along the sample surface and deflection of the probe beam from thermally induced changes of index of refraction in the fluid medium adjacent the heated region are detected. Deflection amplitude and the phase lag of the deflection, relative to the initiating pump light pulse, are indicative of the calorific value and the porosity of the sample. The method provides rapid, accurate and nondestructive analysis of the heat producing capabilities of coal samples. In the preferred form, sequences of pump light pulses of increasing durations are directed into the sample at each of a series of minute regions situated along a raster scan path enabling detailed analysis of variations of thermal properties at different areas of the sample and at different depths.
Shu, Fan-Fan; Lv, Rui-Qing; Zhang, Yi-Fang; Duan, Gang; Wu, Ding-Yu; Li, Bi-Feng; Yang, Jian-Fa; Zou, Feng-Cai
2012-08-01
On mainland China, liver flukes of Fasciola spp. (Digenea: Fasciolidae) can cause serious acute and chronic morbidity in numerous species of mammals such as sheep, goats, cattle, and humans. The objective of the present study was to examine the taxonomic identity of Fasciola species in Yunnan province by sequences of the first and second internal transcribed spacers (ITS-1 and ITS-2) of nuclear ribosomal DNA (rDNA). The ITS rDNA was amplified from 10 samples representing Fasciola species in cattle from 2 geographical locations in Yunnan Province, by polymerase chain reaction (PCR), and the products were sequenced directly. The lengths of the ITS-1 and ITS-2 sequences were 422 and 361-362 base pairs, respectively, for all samples sequenced. Using ITS sequences, 2 Fasciola species were revealed, namely Fasciola hepatica and Fasciola gigantica. This is the first demonstration of F. gigantica in cattle in Yunnan Province, China using a molecular approach; our findings have implications for studying the population genetic characterization of the Chinese Fasciola species and for the prevention and control of Fasciola spp. in this province.
Harnessing Whole Genome Sequencing in Medical Mycology.
Cuomo, Christina A
2017-01-01
Comparative genome sequencing studies of human fungal pathogens enable identification of genes and variants associated with virulence and drug resistance. This review describes current approaches, resources, and advances in applying whole genome sequencing to study clinically important fungal pathogens. Genomes for some important fungal pathogens were only recently assembled, revealing gene family expansions in many species and extreme gene loss in one obligate species. The scale and scope of species sequenced is rapidly expanding, leveraging technological advances to assemble and annotate genomes with higher precision. By using iteratively improved reference assemblies or those generated de novo for new species, recent studies have compared the sequence of isolates representing populations or clinical cohorts. Whole genome approaches provide the resolution necessary for comparison of closely related isolates, for example, in the analysis of outbreaks or sampled across time within a single host. Genomic analysis of fungal pathogens has enabled both basic research and diagnostic studies. The increased scale of sequencing can be applied across populations, and new metagenomic methods allow direct analysis of complex samples.
Single sample resolution of rare microbial dark matter in a marine invertebrate metagenome
DOE Office of Scientific and Technical Information (OSTI.GOV)
Miller, Ian J.; Weyna, Theodore R.; Fong, Stephen S.
Direct, untargeted sequencing of environmental samples (metagenomics) and de novo genome assembly enable the study of uncultured and phylogenetically divergent organisms. However, separating individual genomes from a mixed community has often relied on the differential-coverage analysis of multiple, deeply sequenced samples. In the metagenomic investigation of the marine bryozoan Bugula neritina, we uncovered seven bacterial genomes associated with a single B. neritina individual that appeared to be transient associates, two of which were unique to one individual and undetectable using certain “universal” 16S rRNA primers and probes. We recovered high quality genome assemblies for several rare instances of “microbial darkmore » matter,” or phylogenetically divergent bacteria lacking genomes in reference databases, from a single tissue sample that was not subjected to any physical or chemical pre-treatment. One of these rare, divergent organisms has a small (593 kbp), poorly annotated genome with low GC content (20.9%) and a 16S rRNA gene with just 65% sequence similarity to the closest reference sequence. Lastly, our findings illustrate the importance of sampling strategy and de novo assembly of metagenomic reads to understand the extent and function of bacterial biodiversity.« less
Single sample resolution of rare microbial dark matter in a marine invertebrate metagenome
Miller, Ian J.; Weyna, Theodore R.; Fong, Stephen S.; ...
2016-09-29
Direct, untargeted sequencing of environmental samples (metagenomics) and de novo genome assembly enable the study of uncultured and phylogenetically divergent organisms. However, separating individual genomes from a mixed community has often relied on the differential-coverage analysis of multiple, deeply sequenced samples. In the metagenomic investigation of the marine bryozoan Bugula neritina, we uncovered seven bacterial genomes associated with a single B. neritina individual that appeared to be transient associates, two of which were unique to one individual and undetectable using certain “universal” 16S rRNA primers and probes. We recovered high quality genome assemblies for several rare instances of “microbial darkmore » matter,” or phylogenetically divergent bacteria lacking genomes in reference databases, from a single tissue sample that was not subjected to any physical or chemical pre-treatment. One of these rare, divergent organisms has a small (593 kbp), poorly annotated genome with low GC content (20.9%) and a 16S rRNA gene with just 65% sequence similarity to the closest reference sequence. Lastly, our findings illustrate the importance of sampling strategy and de novo assembly of metagenomic reads to understand the extent and function of bacterial biodiversity.« less
A generic assay for whole-genome amplification and deep sequencing of enterovirus A71
Tan, Le Van; Tuyen, Nguyen Thi Kim; Thanh, Tran Tan; Ngan, Tran Thuy; Van, Hoang Minh Tu; Sabanathan, Saraswathy; Van, Tran Thi My; Thanh, Le Thi My; Nguyet, Lam Anh; Geoghegan, Jemma L.; Ong, Kien Chai; Perera, David; Hang, Vu Thi Ty; Ny, Nguyen Thi Han; Anh, Nguyen To; Ha, Do Quang; Qui, Phan Tu; Viet, Do Chau; Tuan, Ha Manh; Wong, Kum Thong; Holmes, Edward C.; Chau, Nguyen Van Vinh; Thwaites, Guy; van Doorn, H. Rogier
2015-01-01
Enterovirus A71 (EV-A71) has emerged as the most important cause of large outbreaks of severe and sometimes fatal hand, foot and mouth disease (HFMD) across the Asia-Pacific region. EV-A71 outbreaks have been associated with (sub)genogroup switches, sometimes accompanied by recombination events. Understanding EV-A71 population dynamics is therefore essential for understanding this emerging infection, and may provide pivotal information for vaccine development. Despite the public health burden of EV-A71, relatively few EV-A71 complete-genome sequences are available for analysis and from limited geographical localities. The availability of an efficient procedure for whole-genome sequencing would stimulate effort to generate more viral sequence data. Herein, we report for the first time the development of a next-generation sequencing based protocol for whole-genome sequencing of EV-A71 directly from clinical specimens. We were able to sequence viruses of subgenogroup C4 and B5, while RNA from culture materials of diverse EV-A71 subgenogroups belonging to both genogroup B and C was successfully amplified. The nature of intra-host genetic diversity was explored in 22 clinical samples, revealing 107 positions carrying minor variants (ranging from 0 to 15 variants per sample). Our analysis of EV-A71 strains sampled in 2013 showed that they all belonged to subgenogroup B5, representing the first report of this subgenogroup in Vietnam. In conclusion, we have successfully developed a high-throughput next-generation sequencing-based assay for whole-genome sequencing of EV-A71 from clinical samples. PMID:25704598
Zhou, Shuntai; Jones, Corbin; Mieczkowski, Piotr
2015-01-01
ABSTRACT Validating the sampling depth and reducing sequencing errors are critical for studies of viral populations using next-generation sequencing (NGS). We previously described the use of Primer ID to tag each viral RNA template with a block of degenerate nucleotides in the cDNA primer. We now show that low-abundance Primer IDs (offspring Primer IDs) are generated due to PCR/sequencing errors. These artifactual Primer IDs can be removed using a cutoff model for the number of reads required to make a template consensus sequence. We have modeled the fraction of sequences lost due to Primer ID resampling. For a typical sequencing run, less than 10% of the raw reads are lost to offspring Primer ID filtering and resampling. The remaining raw reads are used to correct for PCR resampling and sequencing errors. We also demonstrate that Primer ID reveals bias intrinsic to PCR, especially at low template input or utilization. cDNA synthesis and PCR convert ca. 20% of RNA templates into recoverable sequences, and 30-fold sequence coverage recovers most of these template sequences. We have directly measured the residual error rate to be around 1 in 10,000 nucleotides. We use this error rate and the Poisson distribution to define the cutoff to identify preexisting drug resistance mutations at low abundance in an HIV-infected subject. Collectively, these studies show that >90% of the raw sequence reads can be used to validate template sampling depth and to dramatically reduce the error rate in assessing a genetically diverse viral population using NGS. IMPORTANCE Although next-generation sequencing (NGS) has revolutionized sequencing strategies, it suffers from serious limitations in defining sequence heterogeneity in a genetically diverse population, such as HIV-1 due to PCR resampling and PCR/sequencing errors. The Primer ID approach reveals the true sampling depth and greatly reduces errors. Knowing the sampling depth allows the construction of a model of how to maximize the recovery of sequences from input templates and to reduce resampling of the Primer ID so that appropriate multiplexing can be included in the experimental design. With the defined sampling depth and measured error rate, we are able to assign cutoffs for the accurate detection of minority variants in viral populations. This approach allows the power of NGS to be realized without having to guess about sampling depth or to ignore the problem of PCR resampling, while also being able to correct most of the errors in the data set. PMID:26041299
Renzette, Nicholas; Somasundaran, Mohan; Brewster, Frank; Coderre, James; Weiss, Eric R.; McManus, Margaret; Greenough, Thomas; Tabak, Barbara; Garber, Manuel; Kowalik, Timothy F.
2014-01-01
ABSTRACT We report the diversity of latent membrane protein 1 (LMP1) gene founder sequences and the level of Epstein-Barr virus (EBV) genome variability over time and across anatomic compartments by using virus genomes amplified directly from oropharyngeal wash specimens and peripheral blood B cells during acute infection and convalescence. The intrahost nucleotide variability of the founder virus was 0.02% across the region sequences, and diversity increased significantly over time in the oropharyngeal compartment (P = 0.004). The LMP1 region showing the greatest level of variability in both compartments, and over time, was concentrated within the functional carboxyl-terminal activating regions 2 and 3 (CTAR2 and CTAR3). Interestingly, a deletion in a proline-rich repeat region (amino acids 274 to 289) of EBV commonly reported in EBV sequenced from cancer specimens was not observed in acute infectious mononucleosis (AIM) patients. Taken together, these data highlight the diversity in circulating EBV genomes and its potential importance in disease pathogenesis and vaccine design. IMPORTANCE This study is among the first to leverage an improved high-throughput deep-sequencing methodology to investigate directly from patient samples the degree of diversity in Epstein-Barr virus (EBV) populations and the extent to which viral genome diversity develops over time in the infected host. Significant variability of circulating EBV latent membrane protein 1 (LMP1) gene sequences was observed between cellular and oral wash samples, and this variability increased over time in oral wash samples. The significance of EBV genetic diversity in transmission and disease pathogenesis are discussed. PMID:24429365
Renzette, Nicholas; Somasundaran, Mohan; Brewster, Frank; Coderre, James; Weiss, Eric R; McManus, Margaret; Greenough, Thomas; Tabak, Barbara; Garber, Manuel; Kowalik, Timothy F; Luzuriaga, Katherine
2014-04-01
We report the diversity of latent membrane protein 1 (LMP1) gene founder sequences and the level of Epstein-Barr virus (EBV) genome variability over time and across anatomic compartments by using virus genomes amplified directly from oropharyngeal wash specimens and peripheral blood B cells during acute infection and convalescence. The intrahost nucleotide variability of the founder virus was 0.02% across the region sequences, and diversity increased significantly over time in the oropharyngeal compartment (P = 0.004). The LMP1 region showing the greatest level of variability in both compartments, and over time, was concentrated within the functional carboxyl-terminal activating regions 2 and 3 (CTAR2 and CTAR3). Interestingly, a deletion in a proline-rich repeat region (amino acids 274 to 289) of EBV commonly reported in EBV sequenced from cancer specimens was not observed in acute infectious mononucleosis (AIM) patients. Taken together, these data highlight the diversity in circulating EBV genomes and its potential importance in disease pathogenesis and vaccine design. This study is among the first to leverage an improved high-throughput deep-sequencing methodology to investigate directly from patient samples the degree of diversity in Epstein-Barr virus (EBV) populations and the extent to which viral genome diversity develops over time in the infected host. Significant variability of circulating EBV latent membrane protein 1 (LMP1) gene sequences was observed between cellular and oral wash samples, and this variability increased over time in oral wash samples. The significance of EBV genetic diversity in transmission and disease pathogenesis are discussed.
Environmental surveillance of viruses by tangential flow filtration and metagenomic reconstruction.
Furtak, Vyacheslav; Roivainen, Merja; Mirochnichenko, Olga; Zagorodnyaya, Tatiana; Laassri, Majid; Zaidi, Sohail Z; Rehman, Lubna; Alam, Muhammad M; Chizhikov, Vladimir; Chumakov, Konstantin
2016-04-14
An approach is proposed for environmental surveillance of poliovirus by concentrating sewage samples with tangential flow filtration (TFF) followed by deep sequencing of viral RNA. Subsequent to testing the method with samples from Finland, samples from Pakistan, a country endemic for poliovirus, were investigated. Genomic sequencing was either performed directly, for unbiased identification of viruses regardless of their ability to grow in cell cultures, or after virus enrichment by cell culture or immunoprecipitation. Bioinformatics enabled separation and determination of individual consensus sequences. Overall, deep sequencing of the entire viral population identified polioviruses, non-polio enteroviruses, and other viruses. In Pakistani sewage samples, adeno-associated virus, unable to replicate autonomously in cell cultures, was the most abundant human virus. The presence of recombinants of wild polioviruses of serotype 1 (WPV1) was also inferred, whereby currently circulating WPV1 of south-Asian (SOAS) lineage comprised two sub-lineages depending on their non-capsid region origin. Complete genome analyses additionally identified point mutants and intertypic recombinants between attenuated Sabin strains in the Pakistani samples, and in one Finnish sample. The approach could allow rapid environmental surveillance of viruses causing human infections. It creates a permanent digital repository of the entire virome potentially useful for retrospective screening of future discovered viruses.
Günthard, H F; Wong, J K; Ignacio, C C; Havlir, D V; Richman, D D
1998-07-01
The performance of the high-density oligonucleotide array methodology (GeneChip) in detecting drug resistance mutations in HIV-1 pol was compared with that of automated dideoxynucleotide sequencing (ABI) of clinical samples, viral stocks, and plasmid-derived NL4-3 clones. Sequences from 29 clinical samples (plasma RNA, n = 17; lymph node RNA, n = 5; lymph node DNA, n = 7) from 12 patients, from 6 viral stock RNA samples, and from 13 NL4-3 clones were generated by both methods. Editing was done independently by a different investigator for each method before comparing the sequences. In addition, NL4-3 wild type (WT) and mutants were mixed in varying concentrations and sequenced by both methods. Overall, a concordance of 99.1% was found for a total of 30,865 bases compared. The comparison of clinical samples (plasma RNA and lymph node RNA and DNA) showed a slightly lower match of base calls, 98.8% for 19,831 nucleotides compared (protease region, 99.5%, n = 8272; RT region, 98.3%, n = 11,316), than for viral stocks and NL4-3 clones (protease region, 99.8%; RT region, 99.5%). Artificial mixing experiments showed a bias toward calling wild-type bases by GeneChip. Discordant base calls are most likely due to differential detection of mixtures. The concordance between GeneChip and ABI was high and appeared dependent on the nature of the templates (directly amplified versus cloned) and the complexity of mixes.
Williams-Woods, Jacquelina; González-Escalona, Narjol; Burkhardt, William
2011-12-01
Human norovirus (HuNoV) and hepatitis A (HAV) are recognized as leading causes of non-bacterial foodborne associated illnesses in the United States. DNA sequencing is generally considered the standard for accurate viral genotyping in support of epidemiological investigations. Due to the genetic diversity of noroviruses (NoV), degenerate primer sets are often used in conventional reverse transcription (RT) PCR and real-time RT-quantitative PCR (RT-qPCR) for the detection of these viruses and cDNA fragments are generally cloned prior to sequencing. HAV detection methods that are sensitive and specific for real-time RT-qPCR yields small fragments sizes of 89-150bp, which can be difficult to sequence. In order to overcome these obstacles, norovirus and HAV primers were tailed with M13 forward and reverse primers. This modification increases the sequenced product size and allows for direct sequencing of the amplicons utilizing complementary M13 primers. HuNoV and HAV cDNA products from environmentally contaminated oysters were analyzed using this method. Alignments of the sequenced samples revealed ≥95% nucleotide identities. Tailing NoV and HAV primers with M13 sequence increases the cDNA product size, offers an alternative to cloning, and allows for rapid, accurate and direct sequencing of cDNA products produced by conventional or real time RT-qPCR assays. Published by Elsevier B.V.
Importance Sampling of Word Patterns in DNA and Protein Sequences
Chan, Hock Peng; Chen, Louis H.Y.
2010-01-01
Abstract Monte Carlo methods can provide accurate p-value estimates of word counting test statistics and are easy to implement. They are especially attractive when an asymptotic theory is absent or when either the search sequence or the word pattern is too short for the application of asymptotic formulae. Naive direct Monte Carlo is undesirable for the estimation of small probabilities because the associated rare events of interest are seldom generated. We propose instead efficient importance sampling algorithms that use controlled insertion of the desired word patterns on randomly generated sequences. The implementation is illustrated on word patterns of biological interest: palindromes and inverted repeats, patterns arising from position-specific weight matrices (PSWMs), and co-occurrences of pairs of motifs. PMID:21128856
Radioresistance of GGG Sequences to Prompt Strand Break Formation from Direct-Type Radiation Damage
Black, Paul J.; Miller, Adam S.; Hayes, Jeffrey J.
2016-01-01
Purpose As humans, we are constantly exposed to ionizing radiation from natural, man-made and cosmic sources which can damage DNA, leading to deleterious effects including cancer incidence. In this work we introduce a method to monitor strand breaks resulting from damage due to the direct effect of ionizing radiation and provide evidence for sequence-dependent effects leading to strand breaks. Materials and methods To analyze only DNA strand breaks caused by radiation damage due to the direct effect of ionizing radiation, we combined an established technique to generate dehydrated DNA samples with a technique to analyze single strand breaks on short oligonucleotide sequences via denaturing gel electrophoresis. Results We find that direct damage primarily results in a reduced number of strand breaks in guanine triplet regions (GGG) when compared to isolated guanine (G) bases with identical flanking base context. In addition, we observe strand break behavior possibly indicative of protection of guanine bases when flanked by pyrimidines, and sensitization of guanine to strand break when flanked by adenine (A) bases in both isolated G and GGG cases. Conclusions These observations provide insight into the strand break behavior in GGG regions damaged via the direct effect of ionizing radiation. In addition, this could be indicative of DNA sequences that are naturally more susceptible to strand break due to the direct effect of ionizing radiation. PMID:27349757
Jehee, Ivo; van der Veer, Charlotte; Himschoot, Michelle; Hermans, Mirjam; Bruisten, Sylvia
2017-12-01
Trichomonas vaginalis is the most common sexually transmitted parasitical infection worldwide. T. vaginalis can carry a virus: Trichomonas vaginalis virus (TVV). To date, four TVV species have been described. Few studies have investigated TVV prevalence and its clinical importance. We have developed a nested reverse-transcriptase PCR, with novel, type specific primers to directly detect TVV RNA in T. vaginalis positive clinical samples. A total of 119T. vaginalis positive clinical samples were collected in Amsterdam and "s-Hertogenbosch, the Netherlands, from 2012 to 2016. For all samples T. vaginalis was genotyped using multi-locus sequence typing. The T. vaginalis positive samples segregated into a two-genotype population: type I (n=64) and type II (n=55). All were tested for TVV with the new TVV PCR. We detected 3 of the 4 TVV species. Sequencing of the amplified products showed high homology with published TVV genomes (82-100%). Half of the T. vaginalis clinical samples (n=60, 50.4%) were infected with one or more TVV species, with a preponderance for TVV infections in T. vaginalis type I (n=44, 73.3%). Clinical data was available for a subset of samples (n=34) and we observed an association between testing positive for (any) TVV and reporting urogenital symptoms (p=0.023). The nested RT-PCR allowed for direct detection of TVV in T. vaginalis positive clinical samples. This may be helpful in studies and clinical settings, since T. vaginalis disease and/or treatment outcome may be influenced by the protozoa"s virus. Copyright © 2017 Elsevier B.V. All rights reserved.
Real-Time DNA Sequencing in the Antarctic Dry Valleys Using the Oxford Nanopore Sequencer
Johnson, Sarah S.; Zaikova, Elena; Goerlitz, David S.; Bai, Yu; Tighe, Scott W.
2017-01-01
The ability to sequence DNA outside of the laboratory setting has enabled novel research questions to be addressed in the field in diverse areas, ranging from environmental microbiology to viral epidemics. Here, we demonstrate the application of offline DNA sequencing of environmental samples using a hand-held nanopore sequencer in a remote field location: the McMurdo Dry Valleys, Antarctica. Sequencing was performed using a MK1B MinION sequencer from Oxford Nanopore Technologies (ONT; Oxford, United Kingdom) that was equipped with software to operate without internet connectivity. One-direction (1D) genomic libraries were prepared using portable field techniques on DNA isolated from desiccated microbial mats. By adequately insulating the sequencer and laptop, it was possible to run the sequencing protocol for up to 2½ h under arduous conditions. PMID:28337073
van Hal, Sebastiaan J.; Steen, Jason A.; Espedido, Björn A.; Grimmond, Sean M.; Cooper, Matthew A.; Holden, Matthew T. G.; Bentley, Stephen D.; Gosbell, Iain B.; Jensen, Slade O.
2014-01-01
Objectives To obtain an expanded understanding of antibiotic resistance evolution in vivo, particularly in the context of vancomycin exposure. Methods The whole genomes of six consecutive methicillin-resistant Staphylococcus aureus blood culture isolates (ST239-MRSA-III) from a single patient exposed to various antimicrobials (over a 77 day period) were sequenced and analysed. Results Variant analysis revealed the existence of non-susceptible sub-populations derived from a common susceptible ancestor, with the predominant circulating clone(s) selected for by type and duration of antimicrobial exposure. Conclusions This study highlights the dynamic nature of bacterial evolution and that non-susceptible sub-populations can emerge from clouds of variation upon antimicrobial exposure. Diagnostically, this has direct implications for sample selection when using whole-genome sequencing as a tool to guide clinical therapy. In the context of bacteraemia, deep sequencing of bacterial DNA directly from patient blood samples would avoid culture ‘bias’ and identify mutations associated with circulating non-susceptible sub-populations, some of which may confer cross-resistance to alternate therapies. PMID:24047554
van Hal, Sebastiaan J; Steen, Jason A; Espedido, Björn A; Grimmond, Sean M; Cooper, Matthew A; Holden, Matthew T G; Bentley, Stephen D; Gosbell, Iain B; Jensen, Slade O
2014-02-01
To obtain an expanded understanding of antibiotic resistance evolution in vivo, particularly in the context of vancomycin exposure. The whole genomes of six consecutive methicillin-resistant Staphylococcus aureus blood culture isolates (ST239-MRSA-III) from a single patient exposed to various antimicrobials (over a 77 day period) were sequenced and analysed. Variant analysis revealed the existence of non-susceptible sub-populations derived from a common susceptible ancestor, with the predominant circulating clone(s) selected for by type and duration of antimicrobial exposure. This study highlights the dynamic nature of bacterial evolution and that non-susceptible sub-populations can emerge from clouds of variation upon antimicrobial exposure. Diagnostically, this has direct implications for sample selection when using whole-genome sequencing as a tool to guide clinical therapy. In the context of bacteraemia, deep sequencing of bacterial DNA directly from patient blood samples would avoid culture 'bias' and identify mutations associated with circulating non-susceptible sub-populations, some of which may confer cross-resistance to alternate therapies.
DOE Office of Scientific and Technical Information (OSTI.GOV)
Gihring, Thomas; Green, Stefan; Schadt, Christopher Warren
2011-01-01
Technologies for massively parallel sequencing are revolutionizing microbial ecology and are vastly increasing the scale of ribosomal RNA (rRNA) gene studies. Although pyrosequencing has increased the breadth and depth of possible rRNA gene sampling, one drawback is that the number of reads obtained per sample is difficult to control. Pyrosequencing libraries typically vary widely in the number of sequences per sample, even within individual studies, and there is a need to revisit the behaviour of richness estimators and diversity indices with variable gene sequence library sizes. Multiple reports and review papers have demonstrated the bias in non-parametric richness estimators (e.g.more » Chao1 and ACE) and diversity indices when using clone libraries. However, we found that biased community comparisons are accumulating in the literature. Here we demonstrate the effects of sample size on Chao1, ACE, CatchAll, Shannon, Chao-Shen and Simpson's estimations specifically using pyrosequencing libraries. The need to equalize the number of reads being compared across libraries is reiterated, and investigators are directed towards available tools for making unbiased diversity comparisons.« less
Chen, Guiqian; Qiu, Yuan; Zhuang, Qingye; Wang, Suchun; Wang, Tong; Chen, Jiming; Wang, Kaicheng
2018-05-09
Next generation sequencing (NGS) is a powerful tool for the characterization, discovery, and molecular identification of RNA viruses. There were multiple NGS library preparation methods published for strand-specific RNA-seq, but some methods are not suitable for identifying and characterizing RNA viruses. In this study, we report a NGS library preparation method to identify RNA viruses using the Ion Torrent PGM platform. The NGS sequencing adapters were directly inserted into the sequencing library through reverse transcription and polymerase chain reaction, without fragmentation and ligation of nucleic acids. The results show that this method is simple to perform, able to identify multiple species of RNA viruses in clinical samples.
Identification of Bacterial Species in Kuwaiti Waters Through DNA Sequencing
NASA Astrophysics Data System (ADS)
Chen, K.
2017-01-01
With an objective of identifying the bacterial diversity associated with ecosystem of various Kuwaiti Seas, bacteria were cultured and isolated from 3 water samples. Due to the difficulties for cultured and isolated fecal coliforms on the selective agar plates, bacterial isolates from marine agar plates were selected for molecular identification. 16S rRNA genes were successfully amplified from the genome of the selected isolates using Universal Eubacterial 16S rRNA primers. The resulted amplification products were subjected to automated DNA sequencing. Partial 16S rDNA sequences obtained were compared directly with sequences in the NCBI database using BLAST as well as with the sequences available with Ribosomal Database Project (RDP).
Photothermal method of determining calorific properties of coal
Amer, Nabil M.
1985-01-01
Predetermined amounts of heat are generated within a coal sample (11) by directing pump light pulses (14) of predetermined energy content into a small surface region (16) of the sample (11). A beam (18) of probe light is directed along the sample surface (19) and deflection of the probe beam (18) from thermally induced changes of index of refraction in the fluid medium adjacent the heated region (16) are detected. Deflection amplitude and the phase lag of the deflection, relative to the initiating pump light pulse (14), are indicative of the calorific value and the porosity of the sample (11). The method provides rapid, accurate and non-destructive analysis of the heat producing capabilities of coal samples (11). In the preferred form, sequences of pump light pulses (14) of increasing durations are directed into the sample (11) at each of a series of minute regions (16) situated along a raster scan path (21) enabling detailed analysis of variations of thermal properties at different areas of the sample (11) and at different depths.
Analysis of SNP rs16754 of WT1 gene in a series of de novo acute myeloid leukemia patients.
Luna, Irene; Such, Esperanza; Cervera, Jose; Barragán, Eva; Jiménez-Velasco, Antonio; Dolz, Sandra; Ibáñez, Mariam; Gómez-Seguí, Inés; López-Pavía, María; Llop, Marta; Fuster, Óscar; Oltra, Silvestre; Moscardó, Federico; Martínez-Cuadrón, David; Senent, M Leonor; Gascón, Adriana; Montesinos, Pau; Martín, Guillermo; Bolufer, Pascual; Sanz, Miguel A
2012-12-01
The single nucleotide polymorphism (SNP) rs16754 of the WT1 gene has been previously described as a possible prognostic marker in normal karyotype acute myeloid leukemia (AML) patients. Nevertheless, the findings in this field are not always reproducible in different series. One hundred and seventy-five adult de novo AML patients were screened with two different methods for the detection of SNP rs16754: high-resolution melting (HRM) and FRET hybridization probes. Direct sequencing was used to validate both techniques. The SNP was detected in 52 out of 175 patients (30 %), both by HRM and hybridization probes. Direct sequencing confirmed that every positive sample in the screening methods had a variation in the DNA sequence. Patients with the wild-type genotype (WT1(AA)) for the SNP rs16754 were significantly younger than those with the heterozygous WT1(AG) genotype. No other difference was observed for baseline characteristic or outcome between patients with or without the SNP. Both techniques are equally reliable and reproducible as screening methods for the detection of the SNP rs16754, allowing for the selection of those samples that will need to be sequenced. We were unable to confirm the suggested favorable outcome of SNP rs16754 in de novo AML.
Korber, B T; Osmanov, S; Esparza, J; Myers, G
1994-11-01
The World Health Organization Global Programme on AIDS (WHO/GPA) is conducting a large-scale collaborative study of human immunodeficiency virus type 1 (HIV-1) variation, based in four potential vaccine-trial site countries: Brazil, Rwanda, Thailand, and Uganda. Through the course of this study, it was crucial to keep track of certain attributes of the samples from which the viral nucleotide sequences were derived (e.g., country of origin and viral culture characterization), so that meaningful sequence comparisons could be made. Here we describe a system developed in the context of the WHO/GPA study that summarizes such critical attributes by representing them as standardized characters directly incorporated into sequence names. This nomenclature allows linkage of clinical, phenotypic, and geographic information with molecular data. We propose that other investigators involved in human immunodeficiency virus (HIV) nucleotide sequencing efforts adopt a similar standardized sequence nomenclature to facilitate cross-study sequence comparison. HIV sequence data are being generated at an ever-increasing rate; directly coupled to this increase is our deepening understanding of biological parameters that influence or result from sequence variability. A standardized sequence nomenclature that includes relevant biological information would enable researchers to better utilize the growing body of sequence data, and enhance their ability to interpret the biological implications of their own data through facilitating comparisons with previously published work.
Turnbaugh, Peter J.; Quince, Christopher; Faith, Jeremiah J.; McHardy, Alice C.; Yatsunenko, Tanya; Niazi, Faheem; Affourtit, Jason; Egholm, Michael; Henrissat, Bernard; Knight, Rob; Gordon, Jeffrey I.
2010-01-01
We deeply sampled the organismal, genetic, and transcriptional diversity in fecal samples collected from a monozygotic (MZ) twin pair and compared the results to 1,095 communities from the gut and other body habitats of related and unrelated individuals. Using a new scheme for noise reduction in pyrosequencing data, we estimated the total diversity of species-level bacterial phylotypes in the 1.2-1.5 million bacterial 16S rRNA reads obtained from each deeply sampled cotwin to be ~800 (35.9%, 49.1% detected in both). A combined 1.1 million read 16S rRNA dataset representing 281 shallowly sequenced fecal samples from 54 twin pairs and their mothers contained an estimated 4,018 species-level phylotypes, with each sample having a unique species assemblage (53.4 ± 0.6% and 50.3 ± 0.5% overlap with the deeply sampled cotwins). Of the 134 phylotypes with a relative abundance of >0.1% in the combined dataset, only 37 appeared in >50% of the samples, with one phylotype in the Lachnospiraceae family present in 99%. Nongut communities had significantly reduced overlap with the deeply sequenced twins’ fecal microbiota (18.3 ± 0.3%, 15.3 ± 0.3%). The MZ cotwins’ fecal DNA was deeply sequenced (3.8-6.3 Gbp/sample) and assembled reads were assigned to 25 genus-level phylogenetic bins. Only 17% of the genes in these bins were shared between the cotwins. Bins exhibited differences in their degree of sequence variation, gene content including the repertoire of carbohydrate active enzymes present within and between twins (e.g., predicted cellulases, dockerins), and transcriptional activities. These results provide an expanded perspective about features that make each of us unique life forms and directions for future characterization of our gut ecosystems. PMID:20363958
PANGEA: pipeline for analysis of next generation amplicons
Giongo, Adriana; Crabb, David B; Davis-Richardson, Austin G; Chauliac, Diane; Mobberley, Jennifer M; Gano, Kelsey A; Mukherjee, Nabanita; Casella, George; Roesch, Luiz FW; Walts, Brandon; Riva, Alberto; King, Gary; Triplett, Eric W
2010-01-01
High-throughput DNA sequencing can identify organisms and describe population structures in many environmental and clinical samples. Current technologies generate millions of reads in a single run, requiring extensive computational strategies to organize, analyze and interpret those sequences. A series of bioinformatics tools for high-throughput sequencing analysis, including preprocessing, clustering, database matching and classification, have been compiled into a pipeline called PANGEA. The PANGEA pipeline was written in Perl and can be run on Mac OSX, Windows or Linux. With PANGEA, sequences obtained directly from the sequencer can be processed quickly to provide the files needed for sequence identification by BLAST and for comparison of microbial communities. Two different sets of bacterial 16S rRNA sequences were used to show the efficiency of this workflow. The first set of 16S rRNA sequences is derived from various soils from Hawaii Volcanoes National Park. The second set is derived from stool samples collected from diabetes-resistant and diabetes-prone rats. The workflow described here allows the investigator to quickly assess libraries of sequences on personal computers with customized databases. PANGEA is provided for users as individual scripts for each step in the process or as a single script where all processes, except the χ2 step, are joined into one program called the ‘backbone’. PMID:20182525
PANGEA: pipeline for analysis of next generation amplicons.
Giongo, Adriana; Crabb, David B; Davis-Richardson, Austin G; Chauliac, Diane; Mobberley, Jennifer M; Gano, Kelsey A; Mukherjee, Nabanita; Casella, George; Roesch, Luiz F W; Walts, Brandon; Riva, Alberto; King, Gary; Triplett, Eric W
2010-07-01
High-throughput DNA sequencing can identify organisms and describe population structures in many environmental and clinical samples. Current technologies generate millions of reads in a single run, requiring extensive computational strategies to organize, analyze and interpret those sequences. A series of bioinformatics tools for high-throughput sequencing analysis, including pre-processing, clustering, database matching and classification, have been compiled into a pipeline called PANGEA. The PANGEA pipeline was written in Perl and can be run on Mac OSX, Windows or Linux. With PANGEA, sequences obtained directly from the sequencer can be processed quickly to provide the files needed for sequence identification by BLAST and for comparison of microbial communities. Two different sets of bacterial 16S rRNA sequences were used to show the efficiency of this workflow. The first set of 16S rRNA sequences is derived from various soils from Hawaii Volcanoes National Park. The second set is derived from stool samples collected from diabetes-resistant and diabetes-prone rats. The workflow described here allows the investigator to quickly assess libraries of sequences on personal computers with customized databases. PANGEA is provided for users as individual scripts for each step in the process or as a single script where all processes, except the chi(2) step, are joined into one program called the 'backbone'.
Does RecA have a role in Borrelia recurrentis?
Cutler, S J; Rinky, I J; Bonilla, E M
2011-02-01
Genomic sequencing of two relapsing fever spirochaetes showed truncation of recA in Borrelia recurrentis, but not in Borrelia duttonii. RecA has an important role among bacteria; we investigated whether this characteristic was representative of B. recurrentis, or an artefact following in vitro cultivation. We sequenced recA directly from samples of patient with louse-borne relapsing fever (B. recurrentis) or tick-borne relapsing fever (B. duttonii). We confirmed the premature stop codon in seven louse-borne relapsing fever samples, and its absence from three tick-borne relapsing fever samples. Furthermore, specific signature polymorphisms were found that could differentiate between these highly similar spirochaetes. © 2010 The Authors. Journal Compilation © 2010 European Society of Clinical Microbiology and Infectious Diseases.
Chen, Hai-Hua; Yang, Ji-Long; Lu, Hui-Fang; Zhou, Wei-Jun; Yao, Fei; Deng, Lan
2014-02-01
This study was purposed to investigate the feasibility of high resolution melting (HRM) in the detection of JAK2V617F mutation in patients with myeloproliferative neoplasm (MPN). The 29 marrow samples randomly selected from patients with clinically diagnosed MPN from January 2008 to January 2011 were detected by HRM method. The results of HRM analysis were compared with that detected by allele specific polymerase chain reaction (AS-PCR) and DNA direct sequencing. The results showed that the JAK2V617F mutations were detected in 11 (37.9%, 11/29) cases by HRM, and its comparability with the direct sequencing result was 100%. While the consistency of AS-PCR with the direct sequencing was moderate (Kappa = 0.179, P = 0.316). It is concluded that the HRM analysis may be an optimal method for clinical screening of JAK2V617F mutation due to its simplicity and promptness with a high specificity.
Novel and canine genotypes of Giardia duodenalis in harbor seals ( Phoca vitulina richardsi).
Gaydos, J K; Miller, W A; Johnson, C; Zornetzer, H; Melli, A; Packham, A; Jeffries, S J; Lance, M M; Conrad, P A
2008-12-01
Feces of harbor seals (Phoca vitulina richardsi) and hybrid glaucous-winged/western gulls (Larus glaucescens / occidentalis) from Washington State's inland marine waters were examined for Giardia and Cryptosporidium spp. to determine if genotypes carried by these wildlife species were the same genotypes that commonly infect humans and domestic animals. Using immunomagnetic separation followed by direct fluorescent antibody detection, Giardia spp. cysts were detected in 42% of seal fecal samples (41/97). Giardia-positive samples came from 90% of the sites (9/10) and the prevalence of positive seal fecal samples differed significantly among study sites. Fecal samples collected from seal haulout sites with over 400 animals were 4.7 times more likely to have Giardia spp. cysts than samples collected at smaller haulout sites. In gulls, a single Giardia sp. cyst was detected in 4% of fecal samples (3/78). Cryptosporidium spp. oocysts were not detected in any of the seals or gulls tested. Sequence analysis of a 398 bp segment of G. duodenalis DNA at the glutamate dehydrogenase locus suggested that 11 isolates originating from seals throughout the region were a novel genotype and 3 isolates obtained from a single site in south Puget Sound were the G. duodenalis canine genotype D. Real-time TaqMan PCR amplification and subsequent sequencing of a 52 bp small subunit ribosomal DNA region from novel harbor seal genotype isolates showed sequence homology to canine genotypes C and D. Sequence analysis of the 52 bp small subunit ribosomal DNA products from the 3 canine genotype isolates from seals produced mixed sequences at could not be evaluated.
Ahn, Sang Hoon; Chun, Ji-Yong; Shin, Soo-Kyung; Park, Jun Yong; Yoo, Wangdon; Hong, Sun Pyo; Kim, Soo-Ok; Han, Kwang-Hyub
2013-12-01
Molecular diagnostic methods have enabled the rapid diagnosis of drug-resistant mutations in hepatitis B virus (HBV) and have reduced both unnecessary therapeutic interventions and medical costs. In this study we evaluated the analytical and clinical performances of the HepB Typer-Entecavir kit (GeneMatrix, Korea) in detecting entecavir-resistance-associated mutations. The HepB Typer-Entecavir kit was evaluated for its limit of detection, interference, cross-reactivity, and precision using HBV reference standards made by diluting high-titer viral stocks in HBV-negative human serum. The performance of the HepB Typer-Entecavir kit for detecting mutations related to entecavir resistance was compared with direct sequencing for 396 clinical samples from 108 patients. Using the reference standards, the detection limit of the HepB Typer-Entecavir kit was found to be as low as 500 copies/mL. No cross-reactivity was observed, and elevated levels of various interfering substances did not adversely affect its analytical performance. The precision test conducted by repetitive analysis of 2,400 replicates with reference standards at various concentrations showed 99.9% agreement (2398/2400). The overall concordance rate between the HepB Typer-Entecavir kit and direct sequencing assays in 396 clinical samples was 99.5%. The HepB Typer-Entecavir kit showed high reliability and precision, and comparable sensitivity and specificity for detecting mutant virus populations in reference and clinical samples in comparison with direct sequencing. Therefore, this assay would be clinically useful in the diagnosis of entecavir-resistance-associated mutations in chronic hepatitis B.
Shang, Zhi-Yuan; Wang, Jian; Zhang, Wen; Li, Yan-Yan; Cui, Ming-Xing; Chen, Zhen-Ju; Zhao, Xing-Yun
2013-01-01
A measurement was made on the vertical direction tree ring stable carbon isotope ratio (delta13C) and tree ring width of Pinus sylvestris var. mongolica in northern Daxing' an Mountains of Northeast China, with the relationship between the vertical direction variations of the tree ring delta13C and tree ring width analyzed. In the whole ring of xylem, earlywood (EW) and bark endodermis, the delta13C all exhibited an increasing trend from the top to the base at first, with the maximum at the bottom of tree crown, and then, decreased rapidly to the minimum downward. The EW and late-wood (LW) had an increasing ratio of average tree ring width from the base to the top. The average annual sequence of the delta13C in vertical direction had an obvious reverse correspondence with the average annual sequence of tree ring width, and had a trend comparatively in line with the average annual sequence of the tree ring width ratio of EW to LW above tree crown. The variance analysis showed that there existed significant differences in the sequences of tree ring delta13C and ring width in vertical direction, and the magnitude of vertical delta13C variability was basically the same as that of the inter-annual delta13C variability. The year-to-year variation trend of the vertical delta13C sequence was approximately identical. For each sample, the delta13C sequence at the same heights was negatively correlated with the ring width sequence, but the statistical significance differed with tree height.
Mistri, S K; Sultana, M; Kamal, S M M; Alam, M M; Irin, F; Nessa, J; Ahsan, C R; Yasmin, M
2016-05-01
For an effective control of tuberculosis, rapid detection of multidrug resistant tuberculosis (MDR-TB) is necessary. Therefore, we developed a modified nested multiplex allele-specific polymerase chain reaction (MAS-PCR) method that enables rapid MDR-TB detection directly from sputum samples. The efficacy of this method was evaluated using 79 sputum samples collected from suspected tuberculosis patients. The performance of nested MAS-PCR method was compared with other MDR-TB detection methods like drug susceptibility testing (DST) and DNA sequencing. As rifampicin (RIF) resistance conforms to MDR-TB in greater than 90% cases, only the presence of RIF-associated mutations in rpoB gene was determined by DNA sequencing and nested MAS-PCR to detect MDR-TB. The concordance between nested MAS-PCR and DNA sequencing results was found to be 96·3%. When compared with DST, the sensitivity and specificity of nested MAS-PCR for RIF-resistance detection were determined to be 92·9 and 100% respectively. For developing- and high-TB burden countries, molecular-based tests have been recommended by the World Health Organization for rapid detection of MDR-TB. The results of this study indicate that, nested MAS-PCR assay might be a practical and relatively cost effective molecular method for rapid detection of MDR-TB from suspected sputum samples in developing countries with resource poor settings. © 2016 The Society for Applied Microbiology.
Arrays of probes for positional sequencing by hybridization
Cantor, Charles R [Boston, MA; Prezetakiewiczr, Marek [East Boston, MA; Smith, Cassandra L [Boston, MA; Sano, Takeshi [Waltham, MA
2008-01-15
This invention is directed to methods and reagents useful for sequencing nucleic acid targets utilizing sequencing by hybridization technology comprising probes, arrays of probes and methods whereby sequence information is obtained rapidly and efficiently in discrete packages. That information can be used for the detection, identification, purification and complete or partial sequencing of a particular target nucleic acid. When coupled with a ligation step, these methods can be performed under a single set of hybridization conditions. The invention also relates to the replication of probe arrays and methods for making and replicating arrays of probes which are useful for the large scale manufacture of diagnostic aids used to screen biological samples for specific target sequences. Arrays created using PCR technology may comprise probes with 5'- and/or 3'-overhangs.
Vipie: web pipeline for parallel characterization of viral populations from multiple NGS samples.
Lin, Jake; Kramna, Lenka; Autio, Reija; Hyöty, Heikki; Nykter, Matti; Cinek, Ondrej
2017-05-15
Next generation sequencing (NGS) technology allows laboratories to investigate virome composition in clinical and environmental samples in a culture-independent way. There is a need for bioinformatic tools capable of parallel processing of virome sequencing data by exactly identical methods: this is especially important in studies of multifactorial diseases, or in parallel comparison of laboratory protocols. We have developed a web-based application allowing direct upload of sequences from multiple virome samples using custom parameters. The samples are then processed in parallel using an identical protocol, and can be easily reanalyzed. The pipeline performs de-novo assembly, taxonomic classification of viruses as well as sample analyses based on user-defined grouping categories. Tables of virus abundance are produced from cross-validation by remapping the sequencing reads to a union of all observed reference viruses. In addition, read sets and reports are created after processing unmapped reads against known human and bacterial ribosome references. Secured interactive results are dynamically plotted with population and diversity charts, clustered heatmaps and a sortable and searchable abundance table. The Vipie web application is a unique tool for multi-sample metagenomic analysis of viral data, producing searchable hits tables, interactive population maps, alpha diversity measures and clustered heatmaps that are grouped in applicable custom sample categories. Known references such as human genome and bacterial ribosomal genes are optionally removed from unmapped ('dark matter') reads. Secured results are accessible and shareable on modern browsers. Vipie is a freely available web-based tool whose code is open source.
Lagares, Antonio; Agaras, Betina; Bettiol, Marisa P; Gatti, Blanca M; Valverde, Claudio
2015-07-01
Species-specific genetic markers are crucial to develop faithful and sensitive molecular methods for the detection and identification of Pseudomonas aeruginosa (Pa). We have previously set up a PCR-RFLP protocol targeting oprF, the gene encoding the genus-specific outer membrane porin F, whose strong conservation and marked sequence diversity allowed detection and differentiation of environmental isolates (Agaras et al., 2012). Here, we evaluated the ability of the PCR-RFLP assay to genotype clinical isolates previously identified as Pa by conventional microbiological methods within a collection of 62 presumptive Pa isolates from different pediatric clinical samples and different sections of the Hospital de Niños "Sor María Ludovica" from La Plata, Argentina. All isolates, but one, gave an oprF amplicon consistent with that from reference Pa strains. The sequence of the smaller-sized amplicon revealed that the isolate was in fact a mendocina Pseudomonas strain. The oprF RFLP pattern generated with TaqI or HaeIII nucleases matched those of reference Pa strains for 59 isolates (96%). The other two Pa isolates (4%) revealed a different RFLP pattern based on HaeIII digestion, although oprF sequencing confirmed that Pa identification was correct. We next tested the effectiveness of the PCR-RFLP to detect pseudomonads on clinical samples of pediatric fibrocystic patients directly without sample cultivation. The expected amplicon and its cognate RFLP profile were obtained for all samples in which Pa was previously detected by cultivation-dependent methods. Altogether, these results provide the basis for the application of the oprF PCR-RFLP protocol to directly detect and identify Pa and other non-Pa pseudomonads in fibrocystic clinical samples. Copyright © 2015 Elsevier B.V. All rights reserved.
Direct Formalin Fixation Induces Widespread Genomic Effects in Archival Tissues
Recent advances in next generation sequencing have dramatically improved transcriptional analysis of degraded RNA from formalin-fixed paraffin-embedded (FFPE) samples. However, little is known about potential genomic artifacts induced by formalin fixation, which could affect toxi...
Tabb, Michelle M; Batterman, Hollis J
2016-01-01
The Simplexa™ Group A Strep Direct assay is intended for use on the Integrated Cycler for detection of Group A Streptococcus (GAS) directly from throat swabs that have not undergone nucleic acid extraction. A prospective study of 1352 samples in 4 geographically diverse sites showed an overall prevalence of GAS of 15.4%. The assay demonstrated 97.4% sensitivity and 95.2% specificity versus culture. The positive predictive value compared to culture was 72.7%. However, 46 out of 57 discrepant samples were Group A Strep positive when tested using a bi-directional sequencing method illustrating the increased sensitivity of the assay compared to culture for detection of GAS. Rapid and accurate diagnosis of GAS allows for timely treatment to decrease complications of this prevalent organism that continues to cause substantial morbidity and mortality worldwide.
Spatial distribution of marine airborne bacterial communities
Seifried, Jasmin S; Wichels, Antje; Gerdts, Gunnar
2015-01-01
The spatial distribution of bacterial populations in marine bioaerosol samples was investigated during a cruise from the North Sea to the Baltic Sea via Skagerrak and Kattegat. The analysis of the sampled bacterial communities with a pyrosequencing approach revealed that the most abundant phyla were represented by the Proteobacteria (49.3%), Bacteroidetes (22.9%), Actinobacteria (16.3%), and Firmicutes (8.3%). Cyanobacteria were assigned to 1.5% of all bacterial reads. A core of 37 bacterial OTUs made up more than 75% of all bacterial sequences. The most abundant OTU was Sphingomonas sp. which comprised 17% of all bacterial sequences. The most abundant bacterial genera were attributed to distinctly different areas of origin, suggesting highly heterogeneous sources for bioaerosols of marine and coastal environments. Furthermore, the bacterial community was clearly affected by two environmental parameters – temperature as a function of wind direction and the sampling location itself. However, a comparison of the wind directions during the sampling and calculated backward trajectories underlined the need for more detailed information on environmental parameters for bioaerosol investigations. The current findings support the assumption of a bacterial core community in the atmosphere. They may be emitted from strong aerosolizing sources, probably being mixed and dispersed over long distances. PMID:25800495
Townsley, Brad T; Covington, Michael F; Ichihashi, Yasunori; Zumstein, Kristina; Sinha, Neelima R
2015-01-01
Next Generation Sequencing (NGS) is driving rapid advancement in biological understanding and RNA-sequencing (RNA-seq) has become an indispensable tool for biology and medicine. There is a growing need for access to these technologies although preparation of NGS libraries remains a bottleneck to wider adoption. Here we report a novel method for the production of strand specific RNA-seq libraries utilizing the terminal breathing of double-stranded cDNA to capture and incorporate a sequencing adapter. Breath Adapter Directional sequencing (BrAD-seq) reduces sample handling and requires far fewer enzymatic steps than most available methods to produce high quality strand-specific RNA-seq libraries. The method we present is optimized for 3-prime Digital Gene Expression (DGE) libraries and can easily extend to full transcript coverage shotgun (SHO) type strand-specific libraries and is modularized to accommodate a diversity of RNA and DNA input materials. BrAD-seq offers a highly streamlined and inexpensive option for RNA-seq libraries.
Rapid detection of Mannheimia haemolytica in lung tissues of sheep and from bacterial culture.
Kumar, Jyoti; Dixit, Shivendra Kumar; Kumar, Rajiv
2015-09-01
This study was aimed to detect Mannheimia haemolytica in lung tissues of sheep and from a bacterial culture. M. haemolytica is one of the most important and well-established etiological agents of pneumonia in sheep and other ruminants throughout the world. Accurate diagnosis of M. haemolytica primarily relies on bacteriological examination, biochemical characteristics and, biotyping and serotyping of the isolates. In an effort to facilitate rapid M. haemolytica detection, polymerase chain reaction assay targeting Pasteurella haemolytica serotype-1 specific antigens (PHSSA), Rpt2 and 12S ribosomal RNA (rRNA) genes were used to detect M. haemolytica directly from lung tissues and from bacterial culture. A total of 12 archived lung tissues from sheep that died of pneumonia on an organized farm were used. A multiplex polymerase chain reaction (mPCR) based on two-amplicons targeted PHSSA and Rpt2 genes of M. haemolytica were used for identification of M. haemolytica isolates in culture from the lung samples. All the 12 lung tissue samples were tested for the presence M. haemolytica by PHSSA and Rpt2 genes based PCR and its confirmation by sequencing of the amplicons. All the 12 lung tissue samples tested for the presence of PHSSA and Rpt2 genes of M. haemolytica by mPCR were found to be positive. Amplification of 12S rRNA gene fragment as internal amplification control was obtained with each mPCR reaction performed from DNA extracted directly from lung tissue samples. All the M. haemolytica were also positive for mPCR. No amplified DNA bands were observed for negative control reactions. All the three nucleotide sequences were deposited in NCBI GenBank (Accession No. KJ534629, KJ534630 and KJ534631). Sequencing of the amplified products revealed the identity of 99-100%, with published sequence of PHSSA and Rpt2 genes of M. haemolytica available in the NCBI database. Sheep specific mitochondrial 12S rRNA gene sequence also revealed the identity of 98% with published sequences in the NCBI database. The present study emphasized the PCR as a valuable tool for rapid detection of M. haemolytica in clinical samples from animals. In addition, it offers the opportunity to perform large-scale epidemiological studies regarding the role of M. haemolytica in clinical cases of pneumonia and other disease manifestations in sheep and other ruminants, thereby providing the basis for effective preventive strategies.
Whole-genome sequencing to determine Neisseria gonorrhoeae transmission: an observational study
Cole, Kevin; Cole, Michelle J; Cresswell, Fiona; Dean, Gillian; Dave, Jayshree; Thomas, Daniel Rh; Foster, Kirsty; Waldram, Alison; Wilson, Daniel J; Didelot, Xavier; Grad, Yonatan H; Crook, Derrick W; Peto, Tim EA; Walker, A Sarah
2016-01-01
Background New approaches are urgently required to address increasing rates of gonorrhoea and the emergence and global spread of antibiotic-resistant Neisseria gonorrhoeae. Whole genome sequencing (WGS) can be applied to study transmission and track resistance. Methods We performed WGS on 1659 isolates from Brighton, UK, and 217 additional isolates from other UK locations. We included WGS data (n=196) from the USA. Estimated mutation rates, plus diversity observed within patients across anatomical sites and probable transmission pairs, were used to fit a coalescent model to determine the number of single nucleotide polymorphisms (SNPs) expected between sequences related by direct/indirect transmission, depending on the time between samples. Findings We detected extensive local transmission. 281/1061(26%) Brighton cases were indistinguishable (0 SNPs) to ≥1 previous case(s), and 786(74%) had evidence of a sampled direct or indirect Brighton source. There was evidence of sustained transmission of some lineages. We observed multiple related samples across geographic locations. Of 1273 infections in Brighton, 225(18%) were linked to another case from elsewhere in the UK, and 115(9%) to a case from the USA. Four lineages initially identified in Brighton could be linked to 70 USA sequences, including 61 from a lineage carrying the mosaic penA XXXIV associated with reduced cefixime susceptibility. Interpretation We present a WGS-based tool for genomic contact tracing of N. gonorrhoeae and demonstrate local, national and international transmission. WGS can be applied across geographical boundaries to investigate gonorrhoea transmission and to track antimicrobial resistance. Funding Oxford NIHR Health Protection Research Unit and Biomedical Research Centre. PMID:27427203
Direct Calculation of Protein Fitness Landscapes through Computational Protein Design
Au, Loretta; Green, David F.
2016-01-01
Naturally selected amino-acid sequences or experimentally derived ones are often the basis for understanding how protein three-dimensional conformation and function are determined by primary structure. Such sequences for a protein family comprise only a small fraction of all possible variants, however, representing the fitness landscape with limited scope. Explicitly sampling and characterizing alternative, unexplored protein sequences would directly identify fundamental reasons for sequence robustness (or variability), and we demonstrate that computational methods offer an efficient mechanism toward this end, on a large scale. The dead-end elimination and A∗ search algorithms were used here to find all low-energy single mutant variants, and corresponding structures of a G-protein heterotrimer, to measure changes in structural stability and binding interactions to define a protein fitness landscape. We established consistency between these algorithms with known biophysical and evolutionary trends for amino-acid substitutions, and could thus recapitulate known protein side-chain interactions and predict novel ones. PMID:26745411
Kim, Suk Kyeong; Kim, Dong-Lim; Han, Hye Seung; Kim, Wan Seop; Kim, Seung Ja; Moon, Won Jin; Oh, Seo Young; Hwang, Tae Sook
2008-06-01
Fine-needle aspiration biopsy (FNAB) is the primary means of distinguishing benign from malignant and of guiding therapeutic intervention in thyroid nodules. However, 10% to 30% of cases with indeterminate cytology in FNAB need other diagnostic tools to refine diagnosis. We compared the pyrosequencing method with the conventional direct DNA sequencing analysis and investigated the usefulness of preoperative BRAF mutation analysis as an adjunct diagnostic tool with routine FNAB. A total of 103 surgically confirmed patients' FNA slides were recruited and DNA was extracted after atypical cells were scraped from the slides. BRAF mutation was analyzed by pyrosequencing and direct DNA sequencing. Sixty-three (77.8%) of 81 histopathologically diagnosed malignant nodules revealed positive BRAF mutation on pyrosequencing analysis. In detail, 63 (84.0%) of 75 papillary thyroid carcinoma (PTC) samples showed positive BRAF mutation, whereas 3 follicular thyroid carcinomas, 1 anaplastic carcinoma, 1 medullary thyroid carcinoma, and 1 metastatic lung carcinoma did not show BRAF mutation. None of 22 benign nodules had BRAF mutation in both pyrosequencing and direct DNA sequencing. Out of 27 thyroid nodules classified as 'indeterminate' on cytologic examination preoperatively, 21 (77.8%) cases turned out to be malignant: 18 PTCs (including 2 follicular variant types) and 3 follicular thyroid carcinomas. Among these, 13 (61.9%) classic PTCs had BRAF mutation. None of 6 benign nodules, including 3 follicular adenomas and 3 nodular hyperplasias, had BRAF mutation. Among 63 PTCs with positive BRAF mutation detected by pyrosequencing analysis, 3 cases did not show BRAF mutation by direct DNA sequencing. Although it was not statistically significant, pyrosequencing was superior to direct DNA sequencing in detecting the BRAF mutation of thyroid nodules (P=0.25). Detecting BRAF mutation by pyrosequencing is more sensitive, faster, and less expensive than direct DNA sequencing and is proposed as an adjunct diagnostic tool in evaluating thyroid nodules of indeterminate cytology.
ssHMM: extracting intuitive sequence-structure motifs from high-throughput RNA-binding protein data
Krestel, Ralf; Ohler, Uwe; Vingron, Martin; Marsico, Annalisa
2017-01-01
Abstract RNA-binding proteins (RBPs) play an important role in RNA post-transcriptional regulation and recognize target RNAs via sequence-structure motifs. The extent to which RNA structure influences protein binding in the presence or absence of a sequence motif is still poorly understood. Existing RNA motif finders either take the structure of the RNA only partially into account, or employ models which are not directly interpretable as sequence-structure motifs. We developed ssHMM, an RNA motif finder based on a hidden Markov model (HMM) and Gibbs sampling which fully captures the relationship between RNA sequence and secondary structure preference of a given RBP. Compared to previous methods which output separate logos for sequence and structure, it directly produces a combined sequence-structure motif when trained on a large set of sequences. ssHMM’s model is visualized intuitively as a graph and facilitates biological interpretation. ssHMM can be used to find novel bona fide sequence-structure motifs of uncharacterized RBPs, such as the one presented here for the YY1 protein. ssHMM reaches a high motif recovery rate on synthetic data, it recovers known RBP motifs from CLIP-Seq data, and scales linearly on the input size, being considerably faster than MEMERIS and RNAcontext on large datasets while being on par with GraphProt. It is freely available on Github and as a Docker image. PMID:28977546
Cimino, Matthew T
2010-03-01
Twenty-four herbal dietary supplement powder and extract reference standards provided by the National Institute of Standards and Technology (NIST) were investigated using three different commercially available DNA extraction kits to evaluate DNA availability for downstream nucleotide-based applications. The material included samples of Camellia, Citrus, Ephedra, Ginkgo, Hypericum, Serenoa, And Vaccinium. Protocols from Qiagen, MoBio, and Phytopure were used to isolate and purify DNA from the NIST standards. The resulting DNA concentration was quantified using SYBR Green fluorometry. Each of the 24 samples yielded DNA, though the concentration of DNA from each approach was notably different. The Phytopure method consistently yielded more DNA. The average yield ratio was 22 : 3 : 1 (ng/microL; Phytopure : Qiagen : MoBio). Amplification of the internal transcribed spacer II region using PCR was ultimately successful in 22 of the 24 samples. Direct sequencing chromatograms of the amplified material suggested that most of the samples were comprised of mixtures. However, the sequencing chromatograms of 12 of the 24 samples were sufficient to confirm the identity of the target material. The successful extraction, amplification, and sequencing of DNA from these herbal dietary supplement extracts and powders supports a continued effort to explore nucleotide sequence-based tools for the authentication and identification of plants in dietary supplements. (c) Georg Thieme Verlag KG Stuttgart . New York.
Integrated circuit layer image segmentation
NASA Astrophysics Data System (ADS)
Masalskis, Giedrius; Petrauskas, Romas
2010-09-01
In this paper we present IC layer image segmentation techniques which are specifically created for precise metal layer feature extraction. During our research we used many samples of real-life de-processed IC metal layer images which were obtained using optical light microscope. We have created sequence of various image processing filters which provides segmentation results of good enough precision for our application. Filter sequences were fine tuned to provide best possible results depending on properties of IC manufacturing process and imaging technology. Proposed IC image segmentation filter sequences were experimentally tested and compared with conventional direct segmentation algorithms.
Nonpareil 3: Fast Estimation of Metagenomic Coverage and Sequence Diversity.
Rodriguez-R, Luis M; Gunturu, Santosh; Tiedje, James M; Cole, James R; Konstantinidis, Konstantinos T
2018-01-01
Estimations of microbial community diversity based on metagenomic data sets are affected, often to an unknown degree, by biases derived from insufficient coverage and reference database-dependent estimations of diversity. For instance, the completeness of reference databases cannot be generally estimated since it depends on the extant diversity sampled to date, which, with the exception of a few habitats such as the human gut, remains severely undersampled. Further, estimation of the degree of coverage of a microbial community by a metagenomic data set is prohibitively time-consuming for large data sets, and coverage values may not be directly comparable between data sets obtained with different sequencing technologies. Here, we extend Nonpareil, a database-independent tool for the estimation of coverage in metagenomic data sets, to a high-performance computing implementation that scales up to hundreds of cores and includes, in addition, a k -mer-based estimation as sensitive as the original alignment-based version but about three hundred times as fast. Further, we propose a metric of sequence diversity ( N d ) derived directly from Nonpareil curves that correlates well with alpha diversity assessed by traditional metrics. We use this metric in different experiments demonstrating the correlation with the Shannon index estimated on 16S rRNA gene profiles and show that N d additionally reveals seasonal patterns in marine samples that are not captured by the Shannon index and more precise rankings of the magnitude of diversity of microbial communities in different habitats. Therefore, the new version of Nonpareil, called Nonpareil 3, advances the toolbox for metagenomic analyses of microbiomes. IMPORTANCE Estimation of the coverage provided by a metagenomic data set, i.e., what fraction of the microbial community was sampled by DNA sequencing, represents an essential first step of every culture-independent genomic study that aims to robustly assess the sequence diversity present in a sample. However, estimation of coverage remains elusive because of several technical limitations associated with high computational requirements and limiting statistical approaches to quantify diversity. Here we described Nonpareil 3, a new bioinformatics algorithm that circumvents several of these limitations and thus can facilitate culture-independent studies in clinical or environmental settings, independent of the sequencing platform employed. In addition, we present a new metric of sequence diversity based on rarefied coverage and demonstrate its use in communities from diverse ecosystems.
Daniel, Hubert D-J; David, Joel; Raghuraman, Sukanya; Gnanamony, Manu; Chandy, George M; Sridharan, Gopalan; Abraham, Priya
2017-05-01
Based on genetic heterogeneity, hepatitis C virus (HCV) is classified into seven major genotypes and 64 subtypes. In spite of the sequence heterogeneity, all genotypes share an identical complement of colinear genes within the large open reading frame. The genetic interrelationships between these genes are consistent among genotypes. Due to this property, complete sequencing of the HCV genome is not required. HCV genotypes along with subtypes are critical for planning antiviral therapy. Certain genotypes are also associated with higher progression to liver cirrhosis. In this study, 100 blood samples were collected from individuals who came for routine HCV genotype identification. These samples were used for the comparison of two different genotyping methods (5'NCR PCR-RFLP and HCV core type-specific PCR) with NS5b sequencing. Of the 100 samples genotyped using 5'NCR PCR-RFLP and HCV core type-specific PCR, 90% (κ = 0.913, P < 0.00) and 96% (κ = 0.794, P < 0.00) correlated with NS5b sequencing, respectively. Sixty percent and 75% of discordant samples by 5'NCR PCR-RFLP and HCV core type-specific PCR, respectively, belonged to genotype 6. All the HCV genotype 1 subtypes were classified accurately by both the methods. This study shows that the 5'NCR-based PCR-RFLP and the HCV core type-specific PCR-based assays correctly identified HCV genotypes except genotype 6 from this region. Direct sequencing of the HCV core region was able to identify all the genotype 6 from this region and serves as an alternative to NS5b sequencing. © 2016 Wiley Periodicals, Inc.
A Pan-HIV Strategy for Complete Genome Sequencing
Yamaguchi, Julie; Alessandri-Gradt, Elodie; Tell, Robert W.; Brennan, Catherine A.
2015-01-01
Molecular surveillance is essential to monitor HIV diversity and track emerging strains. We have developed a universal library preparation method (HIV-SMART [i.e., switching mechanism at 5′ end of RNA transcript]) for next-generation sequencing that harnesses the specificity of HIV-directed priming to enable full genome characterization of all HIV-1 groups (M, N, O, and P) and HIV-2. Broad application of the HIV-SMART approach was demonstrated using a panel of diverse cell-cultured virus isolates. HIV-1 non-subtype B-infected clinical specimens from Cameroon were then used to optimize the protocol to sequence directly from plasma. When multiplexing 8 or more libraries per MiSeq run, full genome coverage at a median ∼2,000× depth was routinely obtained for either sample type. The method reproducibly generated the same consensus sequence, consistently identified viral sequence heterogeneity present in specimens, and at viral loads of ≤4.5 log copies/ml yielded sufficient coverage to permit strain classification. HIV-SMART provides an unparalleled opportunity to identify diverse HIV strains in patient specimens and to determine phylogenetic classification based on the entire viral genome. Easily adapted to sequence any RNA virus, this technology illustrates the utility of next-generation sequencing (NGS) for viral characterization and surveillance. PMID:26699702
1982-12-01
Sequence dj Estimate of the Desired Signal DEL Sampling Time Interval DS Direct Sequence c Sufficient Statistic E/T Signal Power Erfc Complimentary Error...Namely, a white Gaussian noise (WGN) generator was added. Also, a statistical subroutine was added in order to assess performance improvement at the...reference code and then passed through a correlation detector whose output is the sufficient 1 statistic , e . Using a threshold device and the sufficient
Library Design-Facilitated High-Throughput Sequencing of Synthetic Peptide Libraries.
Vinogradov, Alexander A; Gates, Zachary P; Zhang, Chi; Quartararo, Anthony J; Halloran, Kathryn H; Pentelute, Bradley L
2017-11-13
A methodology to achieve high-throughput de novo sequencing of synthetic peptide mixtures is reported. The approach leverages shotgun nanoliquid chromatography coupled with tandem mass spectrometry-based de novo sequencing of library mixtures (up to 2000 peptides) as well as automated data analysis protocols to filter away incorrect assignments, noise, and synthetic side-products. For increasing the confidence in the sequencing results, mass spectrometry-friendly library designs were developed that enabled unambiguous decoding of up to 600 peptide sequences per hour while maintaining greater than 85% sequence identification rates in most cases. The reliability of the reported decoding strategy was additionally confirmed by matching fragmentation spectra for select authentic peptides identified from library sequencing samples. The methods reported here are directly applicable to screening techniques that yield mixtures of active compounds, including particle sorting of one-bead one-compound libraries and affinity enrichment of synthetic library mixtures performed in solution.
Direct Sequence Spread Spectrum (DSSS) Receiver, User Manual
2008-01-01
sampled data is clocked in to correlator data registers and a comparison is made between the code and data register contents, producing a correlation ...symbol (equal to the processing gain Gp ) but need not be otherwise synchronised with the spreading codes . This allows a very long and noise- like PRBS...and Q channels are independently but synchronously sampled . Complex Real ADC FIR Filter Interpolator Acquisition Correlators
NASA Astrophysics Data System (ADS)
Ruf, B.; Erdnuess, B.; Weinmann, M.
2017-08-01
With the emergence of small consumer Unmanned Aerial Vehicles (UAVs), the importance and interest of image-based depth estimation and model generation from aerial images has greatly increased in the photogrammetric society. In our work, we focus on algorithms that allow an online image-based dense depth estimation from video sequences, which enables the direct and live structural analysis of the depicted scene. Therefore, we use a multi-view plane-sweep algorithm with a semi-global matching (SGM) optimization which is parallelized for general purpose computation on a GPU (GPGPU), reaching sufficient performance to keep up with the key-frames of input sequences. One important aspect to reach good performance is the way to sample the scene space, creating plane hypotheses. A small step size between consecutive planes, which is needed to reconstruct details in the near vicinity of the camera may lead to ambiguities in distant regions, due to the perspective projection of the camera. Furthermore, an equidistant sampling with a small step size produces a large number of plane hypotheses, leading to high computational effort. To overcome these problems, we present a novel methodology to directly determine the sampling points of plane-sweep algorithms in image space. The use of the perspective invariant cross-ratio allows us to derive the location of the sampling planes directly from the image data. With this, we efficiently sample the scene space, achieving higher sampling density in areas which are close to the camera and a lower density in distant regions. We evaluate our approach on a synthetic benchmark dataset for quantitative evaluation and on a real-image dataset consisting of aerial imagery. The experiments reveal that an inverse sampling achieves equal and better results than a linear sampling, with less sampling points and thus less runtime. Our algorithm allows an online computation of depth maps for subsequences of five frames, provided that the relative poses between all frames are given.
Castoe, Todd A.; Poole, Alexander W.; de Koning, A. P. Jason; Jones, Kenneth L.; Tomback, Diana F.; Oyler-McCance, Sara J.; Fike, Jennifer A.; Lance, Stacey L.; Streicher, Jeffrey W.; Smith, Eric N.; Pollock, David D.
2012-01-01
Identification of microsatellites, or simple sequence repeats (SSRs), can be a time-consuming and costly investment requiring enrichment, cloning, and sequencing of candidate loci. Recently, however, high throughput sequencing (with or without prior enrichment for specific SSR loci) has been utilized to identify SSR loci. The direct "Seq-to-SSR" approach has an advantage over enrichment-based strategies in that it does not require a priori selection of particular motifs, or prior knowledge of genomic SSR content. It has been more expensive per SSR locus recovered, however, particularly for genomes with few SSR loci, such as bird genomes. The longer but relatively more expensive 454 reads have been preferred over less expensive Illumina reads. Here, we use Illumina paired-end sequence data to identify potentially amplifiable SSR loci (PALs) from a snake (the Burmese python, Python molurus bivittatus), and directly compare these results to those from 454 data. We also compare the python results to results from Illumina sequencing of two bird genomes (Gunnison Sage-grouse, Centrocercus minimus, and Clark's Nutcracker, Nucifraga columbiana), which have considerably fewer SSRs than the python. We show that direct Illumina Seq-to-SSR can identify and characterize thousands of potentially amplifiable SSR loci for as little as $10 per sample – a fraction of the cost of 454 sequencing. Given that Illumina Seq-to-SSR is effective, inexpensive, and reliable even for species such as birds that have few SSR loci, it seems that there are now few situations for which prior hybridization is justifiable.
Castoe, T.A.; Poole, A.W.; de Koning, A. P. J.; Jones, K.L.; Tomback, D.F.; Oyler-McCance, S.J.; Fike, J.A.; Lance, S.L.; Streicher, J.W.; Smith, E.N.; Pollock, D.D.
2012-01-01
Identification of microsatellites, or simple sequence repeats (SSRs), can be a time-consuming and costly investment requiring enrichment, cloning, and sequencing of candidate loci. Recently, however, high throughput sequencing (with or without prior enrichment for specific SSR loci) has been utilized to identify SSR loci. The direct "Seq-to-SSR" approach has an advantage over enrichment-based strategies in that it does not require a priori selection of particular motifs, or prior knowledge of genomic SSR content. It has been more expensive per SSR locus recovered, however, particularly for genomes with few SSR loci, such as bird genomes. The longer but relatively more expensive 454 reads have been preferred over less expensive Illumina reads. Here, we use Illumina paired-end sequence data to identify potentially amplifiable SSR loci (PALs) from a snake (the Burmese python, Python molurus bivittatus), and directly compare these results to those from 454 data. We also compare the python results to results from Illumina sequencing of two bird genomes (Gunnison Sage-grouse, Centrocercus minimus, and Clark's Nutcracker, Nucifraga columbiana), which have considerably fewer SSRs than the python. We show that direct Illumina Seq-to-SSR can identify and characterize thousands of potentially amplifiable SSR loci for as little as $10 per sample - a fraction of the cost of 454 sequencing. Given that Illumina Seq-to-SSR is effective, inexpensive, and reliable even for species such as birds that have few SSR loci, it seems that there are now few situations for which prior hybridization is justifiable. ?? 2012 Castoe et al.
Castoe, Todd A; Poole, Alexander W; de Koning, A P Jason; Jones, Kenneth L; Tomback, Diana F; Oyler-McCance, Sara J; Fike, Jennifer A; Lance, Stacey L; Streicher, Jeffrey W; Smith, Eric N; Pollock, David D
2012-01-01
Identification of microsatellites, or simple sequence repeats (SSRs), can be a time-consuming and costly investment requiring enrichment, cloning, and sequencing of candidate loci. Recently, however, high throughput sequencing (with or without prior enrichment for specific SSR loci) has been utilized to identify SSR loci. The direct "Seq-to-SSR" approach has an advantage over enrichment-based strategies in that it does not require a priori selection of particular motifs, or prior knowledge of genomic SSR content. It has been more expensive per SSR locus recovered, however, particularly for genomes with few SSR loci, such as bird genomes. The longer but relatively more expensive 454 reads have been preferred over less expensive Illumina reads. Here, we use Illumina paired-end sequence data to identify potentially amplifiable SSR loci (PALs) from a snake (the Burmese python, Python molurus bivittatus), and directly compare these results to those from 454 data. We also compare the python results to results from Illumina sequencing of two bird genomes (Gunnison Sage-grouse, Centrocercus minimus, and Clark's Nutcracker, Nucifraga columbiana), which have considerably fewer SSRs than the python. We show that direct Illumina Seq-to-SSR can identify and characterize thousands of potentially amplifiable SSR loci for as little as $10 per sample--a fraction of the cost of 454 sequencing. Given that Illumina Seq-to-SSR is effective, inexpensive, and reliable even for species such as birds that have few SSR loci, it seems that there are now few situations for which prior hybridization is justifiable.
Anderson, Julia; Lemmer, Darrin; Lehmkuhl, Erik; Georghiou, Sophia B.; Heaton, Hannah; Wiggins, Kristin; Gillece, John D.; Schupp, James M.; Catanzaro, Donald G.; Crudu, Valeriu; Cohen, Ted; Rodwell, Timothy C.; Engelthaler, David M.
2016-01-01
Increasingly complex drug-resistant tuberculosis (DR-TB) is a major global health concern and one of the primary reasons why TB is now the leading infectious cause of death worldwide. Rapid characterization of a DR-TB patient's complete drug resistance profile would facilitate individualized treatment in place of empirical treatment, improve treatment outcomes, prevent amplification of resistance, and reduce the transmission of DR-TB. The use of targeted next-generation sequencing (NGS) to obtain drug resistance profiles directly from patient sputum samples has the potential to enable comprehensive evidence-based treatment plans to be implemented quickly, rather than in weeks to months, which is currently needed for phenotypic drug susceptibility testing (DST) results. In this pilot study, we evaluated the performance of amplicon sequencing of Mycobacterium tuberculosis DNA from patient sputum samples using a tabletop NGS technology and automated data analysis to provide a rapid DST solution (the Next Gen-RDST assay). One hundred sixty-six out of 176 (94.3%) sputum samples from the Republic of Moldova yielded complete Next Gen-RDST assay profiles for 7 drugs of interest. We found a high level of concordance of our Next Gen-RDST assay results with phenotypic DST (97.0%) and pyrosequencing (97.8%) results from the same clinical samples. Our Next Gen-RDST assay was also able to estimate the proportion of resistant-to-wild-type alleles down to mixtures of ≤1%, which demonstrates the ability to detect very low levels of resistant variants not detected by pyrosequencing and possibly below the threshold for phenotypic growth methods. The assay as described here could be used as a clinical or surveillance tool. PMID:27225403
Li, De-Zhu; Gao, Lian-Ming; Li, Hong-Tao; Wang, Hong; Ge, Xue-Jun; Liu, Jian-Quan; Chen, Zhi-Duan; Zhou, Shi-Liang; Chen, Shi-Lin; Yang, Jun-Bo; Fu, Cheng-Xin; Zeng, Chun-Xia; Yan, Hai-Fei; Zhu, Ying-Jie; Sun, Yong-Shuai; Chen, Si-Yun; Zhao, Lei; Wang, Kun; Yang, Tuo; Duan, Guang-Wen
2011-12-06
A two-marker combination of plastid rbcL and matK has previously been recommended as the core plant barcode, to be supplemented with additional markers such as plastid trnH-psbA and nuclear ribosomal internal transcribed spacer (ITS). To assess the effectiveness and universality of these barcode markers in seed plants, we sampled 6,286 individuals representing 1,757 species in 141 genera of 75 families (42 orders) by using four different methods of data analysis. These analyses indicate that (i) the three plastid markers showed high levels of universality (87.1-92.7%), whereas ITS performed relatively well (79%) in angiosperms but not so well in gymnosperms; (ii) in taxonomic groups for which direct sequencing of the marker is possible, ITS showed the highest discriminatory power of the four markers, and a combination of ITS and any plastid DNA marker was able to discriminate 69.9-79.1% of species, compared with only 49.7% with rbcL + matK; and (iii) where multiple individuals of a single species were tested, ascriptions based on ITS and plastid DNA barcodes were incongruent in some samples for 45.2% of the sampled genera (for genera with more than one species sampled). This finding highlights the importance of both sampling multiple individuals and using markers with different modes of inheritance. In cases where it is difficult to amplify and directly sequence ITS in its entirety, just using ITS2 is a useful backup because it is easier to amplify and sequence this subset of the marker. We therefore propose that ITS/ITS2 should be incorporated into the core barcode for seed plants.
Ford, Michael J.; Hempelmann, Jennifer; Hanson, M. Bradley; Ayres, Katherine L.; Baird, Robin W.; Emmons, Candice K.; Lundin, Jessica I.; Schorr, Gregory S.; Wasser, Samuel K.; Park, Linda K.
2016-01-01
Estimating diet composition is important for understanding interactions between predators and prey and thus illuminating ecosystem function. The diet of many species, however, is difficult to observe directly. Genetic analysis of fecal material collected in the field is therefore a useful tool for gaining insight into wild animal diets. In this study, we used high-throughput DNA sequencing to quantitatively estimate the diet composition of an endangered population of wild killer whales (Orcinus orca) in their summer range in the Salish Sea. We combined 175 fecal samples collected between May and September from five years between 2006 and 2011 into 13 sample groups. Two known DNA composition control groups were also created. Each group was sequenced at a ~330bp segment of the 16s gene in the mitochondrial genome using an Illumina MiSeq sequencing system. After several quality controls steps, 4,987,107 individual sequences were aligned to a custom sequence database containing 19 potential fish prey species and the most likely species of each fecal-derived sequence was determined. Based on these alignments, salmonids made up >98.6% of the total sequences and thus of the inferred diet. Of the six salmonid species, Chinook salmon made up 79.5% of the sequences, followed by coho salmon (15%). Over all years, a clear pattern emerged with Chinook salmon dominating the estimated diet early in the summer, and coho salmon contributing an average of >40% of the diet in late summer. Sockeye salmon appeared to be occasionally important, at >18% in some sample groups. Non-salmonids were rarely observed. Our results are consistent with earlier results based on surface prey remains, and confirm the importance of Chinook salmon in this population’s summer diet. PMID:26735849
Ford, Michael J; Hempelmann, Jennifer; Hanson, M Bradley; Ayres, Katherine L; Baird, Robin W; Emmons, Candice K; Lundin, Jessica I; Schorr, Gregory S; Wasser, Samuel K; Park, Linda K
2016-01-01
Estimating diet composition is important for understanding interactions between predators and prey and thus illuminating ecosystem function. The diet of many species, however, is difficult to observe directly. Genetic analysis of fecal material collected in the field is therefore a useful tool for gaining insight into wild animal diets. In this study, we used high-throughput DNA sequencing to quantitatively estimate the diet composition of an endangered population of wild killer whales (Orcinus orca) in their summer range in the Salish Sea. We combined 175 fecal samples collected between May and September from five years between 2006 and 2011 into 13 sample groups. Two known DNA composition control groups were also created. Each group was sequenced at a ~330bp segment of the 16s gene in the mitochondrial genome using an Illumina MiSeq sequencing system. After several quality controls steps, 4,987,107 individual sequences were aligned to a custom sequence database containing 19 potential fish prey species and the most likely species of each fecal-derived sequence was determined. Based on these alignments, salmonids made up >98.6% of the total sequences and thus of the inferred diet. Of the six salmonid species, Chinook salmon made up 79.5% of the sequences, followed by coho salmon (15%). Over all years, a clear pattern emerged with Chinook salmon dominating the estimated diet early in the summer, and coho salmon contributing an average of >40% of the diet in late summer. Sockeye salmon appeared to be occasionally important, at >18% in some sample groups. Non-salmonids were rarely observed. Our results are consistent with earlier results based on surface prey remains, and confirm the importance of Chinook salmon in this population's summer diet.
Analysis of human herpesvirus-6 IE1 sequence variation in clinical samples.
Stanton, Richard; Wilkinson, Gavin W G; Fox, Julie D
2003-12-01
Herpesvirus immediate early (IE) proteins are known to play key roles in establishing productive infections, regulating reactivation from latency, and creating a cellular environment favourable to viral replication. Human herpesvirus-6 (HHV-6) IE genes have not been studied as intensively as their homologues in the prototype betaherpesvirus human cytomegalovirus (HCMV). Whilst the HCMV IE1 gene is relatively conserved, early studies indicated that HHV-6 IE1 exhibited a high level of sequence variation between HHV-6A and HHV-6B isolates, although the observation was based primarily on virus stocks that had been isolated and propagated in vitro. In this study, we investigated the level of HHV-6 IE1 sequence variation in vivo by direct sequencing of circulating virus in clinical samples without prior in vitro culture. Sequences exactly matching those reported for reference HHV-6 isolates were identified in clinical samples, thus the HHV-6 laboratory strains used in the majority of in vitro studies appear to be representative of virus circulating in vivo with respect to the IE1 gene. The HHV-6 IE1 sequence is also conserved in reference strains that had been passaged extensively in vitro. The high degree of divergence between variant A and B type IE1 sequences was confirmed, but interestingly HHV-6B IE1 sequences were observed to further segregate into two distinct subgroups, with the laboratory strains Z29 and HST representative of these two subgroups. Within each HHV-6B subgroup, a remarkably high level of homology was observed. Thus the HHV-6 IE1 sequence appears highly stable, underlining its potential importance to the viral life cycle. Copyright 2003 Wiley-Liss, Inc.
2013-01-01
Background BRAF mutation is an important diagnostic and prognostic marker in patients with papillary thyroid carcinoma (PTC). To be applicable in clinical laboratories with limited equipment, diverse testing methods are required to detect BRAF mutation. Methods A shifted termination assay (STA) fragment analysis was used to detect common V600 BRAF mutations in 159 PTCs with DNAs extracted from formalin-fixed paraffin-embedded tumor tissue. The results of STA fragment analysis were compared to those of direct sequencing. Serial dilutions of BRAF mutant cell line (SNU-790) were used to calculate limit of detection (LOD). Results BRAF mutations were detected in 119 (74.8%) PTCs by STA fragment analysis. In direct sequencing, BRAF mutations were observed in 118 (74.2%) cases. The results of STA fragment analysis had high correlation with those of direct sequencing (p < 0.00001, κ = 0.98). The LOD of STA fragment analysis and direct sequencing was 6% and 12.5%, respectively. In PTCs with pT3/T4 stages, BRAF mutation was observed in 83.8% of cases. In pT1/T2 carcinomas, BRAF mutation was detected in 65.9% and this difference was statistically significant (p = 0.007). Moreover, BRAF mutation was more frequent in PTCs with extrathyroidal invasion than tumors without extrathyroidal invasion (84.7% versus 62.2%, p = 0.001). To prepare and run the reactions, direct sequencing required 450 minutes while STA fragment analysis needed 290 minutes. Conclusions STA fragment analysis is a simple and sensitive method to detect BRAF V600 mutations in formalin-fixed paraffin-embedded clinical samples. Virtual Slides The virtual slide(s) for this article can be found here: http://www.diagnosticpathology.diagnomx.eu/vs/5684057089135749 PMID:23883275
No need to replace an "anomalous" primate (Primates) with an "anomalous" bear (Carnivora, Ursidae).
Gutiérrez, Eliécer E; Pine, Ronald H
2015-01-01
By means of mitochondrial 12S rRNA sequencing of putative "yeti", "bigfoot", and other "anomalous primate" hair samples, a recent study concluded that two samples, presented as from the Himalayas, do not belong to an "anomalous primate", but to an unknown, anomalous type of ursid. That is, that they match 12S rRNA sequences of a fossil Polar Bear (Ursusmaritimus), but neither of modern Polar Bears, nor of Brown Bears (Ursusarctos), the closest relative of Polar Bears, and one that occurs today in the Himalayas. We have undertaken direct comparison of sequences; replication of the original comparative study; inference of phylogenetic relationships of the two samples with respect to those from all extant species of Ursidae (except for the Giant Panda, Ailuropodamelanoleuca) and two extinct Pleistocene species; and application of a non-tree-based population aggregation approach for species diagnosis and identification. Our results demonstrate that the very short fragment of the 12S rRNA gene sequenced by Sykes et al. is not sufficiently informative to support the hypotheses provided by these authors with respect to the taxonomic identity of the individuals from which these sequences were obtained. We have concluded that there is no reason to believe that the two samples came from anything other than Brown Bears. These analyses afforded an opportunity to test the monophyly of morphologically defined species and to comment on both their phylogenetic relationships and future efforts necessary to advance our understanding of ursid systematics.
Ahn, Sang Hoon; Chun, Ji-Yong; Shin, Soo-Kyung; Park, Jun Yong; Yoo, Wangdon; Hong, Sun Pyo; Han, Kwang-Hyub
2013-01-01
Background/Aims Molecular diagnostic methods have enabled the rapid diagnosis of drug-resistant mutations in hepatitis B virus (HBV) and have reduced both unnecessary therapeutic interventions and medical costs. In this study we evaluated the analytical and clinical performances of the HepB Typer-Entecavir kit (GeneMatrix, Korea) in detecting entecavir-resistance-associated mutations. Methods The HepB Typer-Entecavir kit was evaluated for its limit of detection, interference, cross-reactivity, and precision using HBV reference standards made by diluting high-titer viral stocks in HBV-negative human serum. The performance of the HepB Typer-Entecavir kit for detecting mutations related to entecavir resistance was compared with direct sequencing for 396 clinical samples from 108 patients. Results Using the reference standards, the detection limit of the HepB Typer-Entecavir kit was found to be as low as 500 copies/mL. No cross-reactivity was observed, and elevated levels of various interfering substances did not adversely affect its analytical performance. The precision test conducted by repetitive analysis of 2,400 replicates with reference standards at various concentrations showed 99.9% agreement (2398/2400). The overall concordance rate between the HepB Typer-Entecavir kit and direct sequencing assays in 396 clinical samples was 99.5%. Conclusions The HepB Typer-Entecavir kit showed high reliability and precision, and comparable sensitivity and specificity for detecting mutant virus populations in reference and clinical samples in comparison with direct sequencing. Therefore, this assay would be clinically useful in the diagnosis of entecavir-resistance-associated mutations in chronic hepatitis B. PMID:24459645
Payne, Matthew S; Tabone, Tania; Kemp, Matthew W; Keelan, Jeffrey A; Spiller, O Brad; Newnham, John P
2014-02-01
Ureaplasma sp. infection in neonates and adults underlies a variety of disease pathologies. Of the two human Ureaplasma spp., Ureaplasma parvum is clinically the most common. We have developed a high-resolution melt (HRM) PCR assay for the differentiation of the four serovars of U. parvum in a single step. Currently U. parvum strains are separated into four serovars by sequencing the promoter and coding region of the multiple-banded antigen (MBA) gene. We designed primers to conserved sequences within this region for PCR amplification and HRM analysis to generate reproducible and distinct melt profiles that distinguish clonal representatives of serovars 1, 3, 6, and 14. Furthermore, our HRM PCR assay could classify DNA extracted from 74 known (MBA-sequenced) test strains with 100% accuracy. Importantly, HRM PCR was also able to identify U. parvum serovars directly from 16 clinical swabs. HRM PCR performed with DNA consisting of mixtures of combined known serovars yielded profiles that were easily distinguished from those for single-serovar controls. These profiles mirrored clinical samples that contained mixed serovars. Unfortunately, melt curve analysis software is not yet robust enough to identify the composition of mixed serovar samples, only that more than one serovar is present. HRM PCR provides a single-step, rapid, cost-effective means to differentiate the four serovars of U. parvum that did not amplify any of the known 10 serovars of Ureaplasma urealyticum tested in parallel. Choice of reaction reagents was found to be crucial to allow sufficient sensitivity to differentiate U. parvum serovars directly from clinical swabs rather than requiring cell enrichment using microbial culture techniques.
Direct generation of all-optical random numbers from optical pulse amplitude chaos.
Li, Pu; Wang, Yun-Cai; Wang, An-Bang; Yang, Ling-Zhen; Zhang, Ming-Jiang; Zhang, Jian-Zhong
2012-02-13
We propose and theoretically demonstrate an all-optical method for directly generating all-optical random numbers from pulse amplitude chaos produced by a mode-locked fiber ring laser. Under an appropriate pump intensity, the mode-locked laser can experience a quasi-periodic route to chaos. Such a chaos consists of a stream of pulses with a fixed repetition frequency but random intensities. In this method, we do not require sampling procedure and external triggered clocks but directly quantize the chaotic pulses stream into random number sequence via an all-optical flip-flop. Moreover, our simulation results show that the pulse amplitude chaos has no periodicity and possesses a highly symmetric distribution of amplitude. Thus, in theory, the obtained random number sequence without post-processing has a high-quality randomness verified by industry-standard statistical tests.
Deucher, Anne; Chiang, Tsoyu; Schrijver, Iris
2010-01-01
Typing of STR (short tandem repeat) alleles is used in a variety of applications in clinical molecular pathology, including evaluations for maternal cell contamination. Using a commercially available STR typing assay for maternal cell contamination performed in conjunction with prenatal diagnostic testing, we were posed with apparent nonmaternity when the two fetal samples did not demonstrate the expected maternal allele at one locus. By designing primers external to the region amplified by the primers from the commercial assay and by performing direct sequencing of the resulting amplicon, we were able to determine that a guanine to adenine sequence variation led to primer mismatch and allele dropout. This explained the apparent null allele shared between the maternal and fetal samples. Therefore, although rare, allele dropout must be considered whenever unexplained homozygosity at an STR locus is observed. PMID:20203001
Yazdani, D; Zainal Abidin, M A; Tan, Y H; Kamaruzaman, S
2011-01-01
Thirty milled rice samples were collected from retailers in 4 provinces of Malaysia. These samples were evaluated for Aspergillus spp. infection by direct plating on malt extract salt agar (MESA). All Aspergillus holomorphs were isolated and identified using nucleotide sequences of ITS 1 and ITS 2 of rDNA. Five anamorphs (Aspergillus flavus, A. oryzae, A. tamarii, A. fumigatus and A. niger) and 5 teleomorphs (Eurotium rubrum, E. amstelodami, E. chevalieri, E. cristatum and E. tonophilum) were identified. The PCR-sequencing based technique for sequences of ITS 1 and ITS 2 is a fast technique for identification of Aspergillus and Eurotium species, although it doesn't work flawlessly for differentiation of Eurotium species. All Aspergillus and Eurotium isolates were screened for their ability to produce aflatoxin and ochratoxin A (OTA) by HPLC and TLC techniques. Only A. flavus isolate UPM 89 was able to produce aflatoxins B1 and B2.
Borsu, Laetitia; Intrieri, Julie; Thampi, Linta; Yu, Helena; Riely, Gregory; Nafa, Khedoudja; Chandramohan, Raghu; Ladanyi, Marc; Arcila, Maria E
2016-11-01
Although next-generation sequencing (NGS) is a robust technology for comprehensive assessment of EGFR-mutant lung adenocarcinomas with acquired resistance to tyrosine kinase inhibitors, it may not provide sufficiently rapid and sensitive detection of the EGFR T790M mutation, the most clinically relevant resistance biomarker. Here, we describe a digital PCR (dPCR) assay for rapid T790M detection on aliquots of NGS libraries prepared for comprehensive profiling, fully maximizing broad genomic analysis on limited samples. Tumor DNAs from patients with EGFR-mutant lung adenocarcinomas and acquired resistance to epidermal growth factor receptor inhibitors were prepared for Memorial Sloan-Kettering-Integrated Mutation Profiling of Actionable Cancer Targets sequencing, a hybrid capture-based assay interrogating 410 cancer-related genes. Precapture library aliquots were used for rapid EGFR T790M testing by dPCR, and results were compared with NGS and locked nucleic acid-PCR Sanger sequencing (reference high sensitivity method). Seventy resistance samples showed 99% concordance with the reference high sensitivity method in accuracy studies. Input as low as 2.5 ng provided a sensitivity of 1% and improved further with increasing DNA input. dPCR on libraries required less DNA and showed better performance than direct genomic DNA. dPCR on NGS libraries is a robust and rapid approach to EGFR T790M testing, allowing most economical utilization of limited material for comprehensive assessment. The same assay can also be performed directly on any limited DNA source and cell-free DNA. Copyright © 2016 American Society for Investigative Pathology and the Association for Molecular Pathology. Published by Elsevier Inc. All rights reserved.
Swallow Event Sequencing: Comparing Healthy Older and Younger Adults.
Herzberg, Erica G; Lazarus, Cathy L; Steele, Catriona M; Molfenter, Sonja M
2018-04-23
Previous research has established that a great deal of variation exists in the temporal sequence of swallowing events for healthy adults. Yet, the impact of aging on swallow event sequence is not well understood. Kendall et al. (Dysphagia 18(2):85-91, 2003) suggested there are 4 obligatory paired-event sequences in swallowing. We directly compared adherence to these sequences, as well as event latencies, and quantified the percentage of unique sequences in two samples of healthy adults: young (< 45) and old (> 65). The 8 swallowing events that contribute to the sequences were reliably identified from videofluoroscopy in a sample of 23 healthy seniors (10 male, mean age 74.7) and 20 healthy young adults (10 male, mean age 31.5) with no evidence of penetration-aspiration or post-swallow residue. Chi-square analyses compared the proportions of obligatory pairs and unique sequences by age group. Compared to the older subjects, younger subjects had significantly lower adherence to two obligatory sequences: Upper Esophageal Sphincter (UES) opening occurs before (or simultaneous with) the bolus arriving at the UES and UES maximum distention occurs before maximum pharyngeal constriction. The associated latencies were significantly different between age groups as well. Further, significantly fewer unique swallow sequences were observed in the older group (61%) compared with the young (82%) (χ 2 = 31.8; p < 0.001). Our findings suggest that paired swallow event sequences may not be robust across the age continuum and that variation in swallow sequences appears to decrease with aging. These findings provide normative references for comparisons to older individuals with dysphagia.
2014-01-01
Background Although it is possible to recover the complete mitogenome directly from shotgun sequencing data, currently reported methods and pipelines are still relatively time consuming and costly. Using a sample of the Australian freshwater crayfish Engaeus lengana, we demonstrate that it is possible to achieve three-day turnaround time (four hours hands-on time) from tissue sample to NCBI-ready submission file through the integration of MiSeq sequencing platform, Nextera sample preparation protocol, MITObim assembly algorithm and MITOS annotation pipeline. Results The complete mitochondrial genome of the parastacid freshwater crayfish, Engaeus lengana, was recovered by modest shotgun sequencing (1.2 giga bases) using the Illumina MiSeq benchtop sequencing platform. Genome assembly using the MITObim mitogenome assembler recovered the mitochondrial genome as a single contig with a 97-fold mean coverage (min. = 17; max. = 138). The mitogenome consists of 15,934 base pairs and contains the typical 37 mitochondrial genes and a non-coding AT-rich region. The genome arrangement is similar to the only other published parastacid mitogenome from the Australian genus Cherax. Conclusions We infer that the gene order arrangement found in Cherax destructor is common to Australian crayfish and may be a derived feature of the southern hemisphere family Parastacidae. Further, we report to our knowledge, the simplest and fastest protocol for the recovery and assembly of complete mitochondrial genomes using the MiSeq benchtop sequencer. PMID:24484414
2013-01-01
Background Next-generation-sequencing (NGS) technologies combined with a classic DNA barcoding approach have enabled fast and credible measurement for biodiversity of mixed environmental samples. However, the PCR amplification involved in nearly all existing NGS protocols inevitably introduces taxonomic biases. In the present study, we developed new Illumina pipelines without PCR amplifications to analyze terrestrial arthropod communities. Results Mitochondrial enrichment directly followed by Illumina shotgun sequencing, at an ultra-high sequence volume, enabled the recovery of Cytochrome c Oxidase subunit 1 (COI) barcode sequences, which allowed for the estimation of species composition at high fidelity for a terrestrial insect community. With 15.5 Gbp Illumina data, approximately 97% and 92% were detected out of the 37 input Operational Taxonomic Units (OTUs), whether the reference barcode library was used or not, respectively, while only 1 novel OTU was found for the latter. Additionally, relatively strong correlation between the sequencing volume and the total biomass was observed for species from the bulk sample, suggesting a potential solution to reveal relative abundance. Conclusions The ability of the new Illumina PCR-free pipeline for DNA metabarcoding to detect small arthropod specimens and its tendency to avoid most, if not all, false positives suggests its great potential in biodiversity-related surveillance, such as in biomonitoring programs. However, further improvement for mitochondrial enrichment is likely needed for the application of the new pipeline in analyzing arthropod communities at higher diversity. PMID:23587339
Severe chronic osteomyelitis caused by Morganella morganii with high population diversity.
Zhu, Jialiang; Li, Haifeng; Feng, Li; Yang, Min; Yang, Ronggong; Yang, Lin; Li, Li; Li, Ruoyan; Liu, Minshan; Hou, Shuxun; Ke, Yuehua; Li, Wenfeng; Bai, Fan
2016-09-01
A case of chronic osteomyelitis probably caused by Morganella morganii, occurring over a period of 30 years, is reported. The organism was identified through a combination of sample culture, direct sequencing, and 16S RNA gene amplicon sequencing. Further whole-genome sequencing and population structure analysis of the isolates from the patient showed the bacterial population to be highly diverse. This case provides a valuable example of a long-term infection caused by an opportunistic pathogen, M. morganii, with high diversity, which might evolve during replication within the host. Copyright © 2016 The Authors. Published by Elsevier Ltd.. All rights reserved.
Salehi, Sohrab; Steif, Adi; Roth, Andrew; Aparicio, Samuel; Bouchard-Côté, Alexandre; Shah, Sohrab P
2017-03-01
Next-generation sequencing (NGS) of bulk tumour tissue can identify constituent cell populations in cancers and measure their abundance. This requires computational deconvolution of allelic counts from somatic mutations, which may be incapable of fully resolving the underlying population structure. Single cell sequencing (SCS) is a more direct method, although its replacement of NGS is impeded by technical noise and sampling limitations. We propose ddClone, which analytically integrates NGS and SCS data, leveraging their complementary attributes through joint statistical inference. We show on real and simulated datasets that ddClone produces more accurate results than can be achieved by either method alone.
Saleh, Mona; El-Matbouli, Mansour
2015-06-01
Cyprinid herpesvirus-3 (CyHV-3) is a highly infectious pathogen that causes fatal disease in common and koi carp Cyprinus carpio L. CyHV-3 detection is usually based on virus propagation or amplification of the viral DNA using the PCR or LAMP techniques. However, due to the limited susceptibility of cells used for propagation, it is not always possible to successfully isolate CyHV-3 even from tissue samples that have high virus titres. All previously described detection methods including PCR-based assays are time consuming, laborious and require specialized equipment. To overcome these limitations, gold nanoparticles (AuNPs) have been explored for direct and sensitive detection of DNA. In this study, a label-free colorimetric nanodiagnostic method for direct detection of unamplified CyHV-3 DNA using gold nanoparticles is introduced. Under appropriate conditions, DNA probes hybridize with their complementary target sequences in the sample DNA, which results in aggregation of the gold nanoparticles and a concomitant colour change from red to blue, whereas test samples with non complementary DNA sequences remain red. In this study, gold nanoparticles were used to develop and evaluate a specific and sensitive hybridization assay for direct and rapid detection of the highly infectious pathogen termed Cyprinid herpesvirus-3. Copyright © 2015 Elsevier B.V. All rights reserved.
Bergroth, T; Ekici, H; Gisslén, M; Hagberg, L; Sönnerborg, A
2009-02-01
The aim of the study was to determine to what extent unique drug resistance patterns appear in minor and major HIV-1 quasispecies in cerebrospinal fluid (CSF) as compared with blood. Forty-four plasma and CSF samples from 13 multi-treatment-experienced patients, seven of whom provided longitudinal samples, were included in the study. The subjects had failed antiretroviral therapy including lamivudine. The reverse transcriptase (RT) gene was examined by selective real-time polymerase chain reaction (SPCR), which can detect M184I/V mutants down to 0.2% of the viral population. SPCR revealed differences at amino acid position 184 in the plasma/CSF populations in 12 paired samples from eight patients. One plasma sample was positive by SPCR where direct sequencing showed wild-type M184. The other 11 paired samples showed quantitative differences in the mixed populations of the mutant or wild-type M184 quasispecies. Differences in other resistance-associated mutations between plasma and CSF viruses were also found by direct sequencing. In multi-treatment-experienced patients with therapy failure, differences in drug resistance patterns were found frequently between plasma and CSF in both minor and major viral populations. To what extent this was a true biological phenomenon remains to be established, and the clinical relevance of these findings is yet to be determined.
NASA Astrophysics Data System (ADS)
Ejembi, J. I.; Ferre, E. C.; Potter-McIntyre, S. L.
2017-12-01
The Middle-Upper Jurassic sedimentary strata in the southwestern Colorado Plateau recorded pervasive eolian to fluvio-lacustrine deposition in the Paradox Basin. While paleocurrents preserved in the Entrada Sandstone, an eolian deposition in the Middle Jurassic, has been well constrained and show a northwesterly to northeasterly migration of ergs from the south onto the Colorado Plateau, there is yet no clear resolution of the paleocurrents preserved in the Wanakah Formation and Tidwell Member of the Morrison Formation, both of which are important sedimentary sequences in the paleogeographic framework of the Colorado Plateau. New U-Pb detrital zircon geochronology of sandstones from these sequences suggests that an abrupt change in provenance occurred in the early Late Jurassic, with sediments largely sourced from eroding highlands in central Colorado. We measured the anisotropy of magnetic susceptibility (AMS) of sediments in oriented sandstone samples from these three successive sequences; first, to determine the paleocurrents from the orientations of the AMS fabrics in order to delineate the source area and sediments dispersal pattern and second, to determine the depositional mechanisms of the sediments. Preliminary AMS data from two study sites show consistency and clustering of the AMS axes in all the sedimentary sequences. The orientations of the Kmin - Kint planes in the Entrada Sandstone sample point to a NNE-NNW paleocurrent directions, which is in agreement with earlier studies. The orientations of the Kmin - Kint planes in the Wanakah Formation and Tidwell Member samples show W-SW trending paleocurrent directions, corroborating our hypothesis of a shift in provenance to the eroding Ancestral Front Range Mountain, located northeast of the Paradox Basin, during the Late Jurassic. Isothermal remanence magnetization (IRM) of the samples indicate that the primary AMS carriers are detrital, syndepositional ferromagnetic minerals. Thus, we contend that AMS can be successfully deployed in constraining paleocurrents in lacustrine sedimentary strata, which lacks traditional sedimentary structures for paleocurrent analyses.
Bacterial diversity characterization in petroleum samples from Brazilian reservoirs
de Oliveira, Valéria Maia; Sette, Lara Durães; Simioni, Karen Christina Marques; dos Santos Neto, Eugênio Vaz
2008-01-01
This study aimed at evaluating potential differences among the bacterial communities from formation water and oil samples originated from biodegraded and non-biodegraded Brazilian petroleum reservoirs by using a PCR-DGGE based approach. Environmental DNA was isolated and used in PCR reactions with bacterial primers, followed by separation of 16S rDNA fragments in the DGGE. PCR products were also cloned and sequenced, aiming at the taxonomic affiliation of the community members. The fingerprints obtained allowed the direct comparison among the bacterial communities from oil samples presenting distinct degrees of biodegradation, as well as between the communities of formation water and oil sample from the non-biodegraded reservoir. Very similar DGGE band profiles were observed for all samples, and the diversity of the predominant bacterial phylotypes was shown to be low. Cloning and sequencing results revealed major differences between formation water and oil samples from the non-biodegraded reservoir. Bacillus sp. and Halanaerobium sp. were shown to be the predominant components of the bacterial community from the formation water sample, whereas the oil sample also included Alicyclobacillus acidoterrestris, Rhodococcus sp., Streptomyces sp. and Acidithiobacillus ferrooxidans. The PCR-DGGE technique, combined with cloning and sequencing of PCR products, revealed the presence of taxonomic groups not found previously in these samples when using cultivation-based methods and 16S rRNA gene library assembly, confirming the need of a polyphasic study in order to improve the knowledge of the extent of microbial diversity in such extreme environments. PMID:24031244
Thomas, Austen C; Jarman, Simon N; Haman, Katherine H; Trites, Andrew W; Deagle, Bruce E
2014-08-01
Ecologists are increasingly interested in quantifying consumer diets based on food DNA in dietary samples and high-throughput sequencing of marker genes. It is tempting to assume that food DNA sequence proportions recovered from diet samples are representative of consumer's diet proportions, despite the fact that captive feeding studies do not support that assumption. Here, we examine the idea of sequencing control materials of known composition along with dietary samples in order to correct for technical biases introduced during amplicon sequencing and biological biases such as variable gene copy number. Using the Ion Torrent PGM(©) , we sequenced prey DNA amplified from scats of captive harbour seals (Phoca vitulina) fed a constant diet including three fish species in known proportions. Alongside, we sequenced a prey tissue mix matching the seals' diet to generate tissue correction factors (TCFs). TCFs improved the diet estimates (based on sequence proportions) for all species and reduced the average estimate error from 28 ± 15% (uncorrected) to 14 ± 9% (TCF-corrected). The experimental design also allowed us to infer the magnitude of prey-specific digestion biases and calculate digestion correction factors (DCFs). The DCFs were compared with possible proxies for differential digestion (e.g. fish protein%, fish lipid%) revealing a strong relationship between the DCFs and percent lipid of the fish prey, suggesting prey-specific corrections based on lipid content would produce accurate diet estimates in this study system. These findings demonstrate the value of parallel sequencing of food tissue mixtures in diet studies and offer new directions for future research in quantitative DNA diet analysis. © 2013 John Wiley & Sons Ltd.
Identifying the core seed bank of a complex boreal bacterial metacommunity.
Ruiz-González, Clara; Niño-García, Juan Pablo; Kembel, Steven W; Del Giorgio, Paul A
2017-09-01
Seed banks are believed to contribute to compositional changes within and across microbial assemblages, but the application of this concept to natural communities remains challenging. Here we describe the core seed bank of a bacterial metacommunity from a boreal watershed, using the spatial distribution of bacterial operational taxonomic units (OTUs) across 223 heterogeneous terrestrial, aquatic and phyllosphere bacterial assemblages. Taxa were considered potential seeds if they transitioned from rare to abundant somewhere within the metacommunity and if they were ubiquitous and able to persist under unfavorable conditions, the latter assessed by checking their presence in three deeply sequenced samples (one soil, one river and one lake, 2.2-3 million reads per sample). We show that only a small fraction (13%) of all detected OTUs constitute a metacommunity seed bank that is shared between all terrestrial and aquatic communities, but not by phyllosphere assemblages, which seem to recruit from a different taxa pool. Our results suggest directional recruitment driven by the flow of water in the landscape, since most aquatic sequences were associated to OTUs found in a single deeply-sequenced soil sample, but only 45% of terrestrial sequences belonged to OTUs found in the two deeply-sequenced aquatic communities. Finally, we hypothesize that extreme rarity, and its interplay with water residence time and growth rates, may further constrain the size of the potential seed bank.
Dridi, M; Rosseel, T; Orton, R; Johnson, P; Lecollinet, S; Muylkens, B; Lambrecht, B; Van Borm, S
2015-10-01
West Nile virus (WNV) occurs as a population of genetic variants (quasispecies) infecting a single animal. Previous low-resolution viral genetic diversity estimates in sampled wild birds and mosquitoes, and in multiple-passage adaptation studies in vivo or in cell culture, suggest that WNV genetic diversification is mostly limited to the mosquito vector. This study investigated genetic diversification of WNV in avian hosts during a single passage using next-generation sequencing. Wild-captured carrion crows were subcutaneously infected using a clonal Middle-East WNV. Blood samples were collected 2 and 4 days post-infection. A reverse-transcription (RT)-PCR approach was used to amplify the WNV genome directly from serum samples prior to next-generation sequencing resulting in an average depth of at least 700 × in each sample. Appropriate controls were sequenced to discriminate biologically relevant low-frequency variants from experimentally introduced errors. The WNV populations in the wild crows showed significant diversification away from the inoculum virus quasispecies structure. By contrast, WNV populations in intracerebrally infected day-old chickens did not diversify from that of the inoculum. Where previous studies concluded that WNV genetic diversification is only experimentally demonstrated in its permissive insect vector species, we have experimentally shown significant diversification of WNV populations in a wild bird reservoir species.
Sonnenberg, Avery; Marciniak, Jennifer Y.; Skowronski, Elaine A.; Manouchehri, Sareh; Rassenti, Laura; Ghia, Emanuela M.; Widhopf, George F.; Kipps, Thomas J.; Heller, Michael J.
2014-01-01
Conventional methods for the isolation of cancer-related circulating cell-free (ccf) DNA from patient blood (plasma) are time consuming and laborious. A DEP approach utilizing a microarray device now allows rapid isolation of ccf-DNA directly from a small volume of unprocessed blood. In this study, the DEP device is used to compare the ccf-DNA isolated directly from whole blood and plasma from 11 chronic lymphocytic leukemia (CLL) patients and one normal individual. Ccf-DNA from both blood and plasma samples was separated into DEP high-field regions, after which cells (blood), proteins, and other biomolecules were removed by a fluidic wash. The concentrated ccf-DNA was detected on-chip by fluorescence, and then eluted for PCR and DNA sequencing. The complete process from blood to PCR required less than 10 min; an additional 15 min was required to obtain plasma from whole blood. Ccf-DNA from the equivalent of 5 µL of CLL blood and 5 µL of plasma was amplified by PCR using Ig heavy-chain variable (IGHV) specific primers to identify the unique IGHV gene expressed by the leukemic B-cell clone. The PCR and DNA sequencing results obtained by DEP from all 11 CLL blood samples and from 8 of the 11 CLL plasma samples were exactly comparable to the DNA sequencing results obtained from genomic DNA isolated from CLL patient leukemic B cells (gold standard). PMID:24723219
Sonnenberg, Avery; Marciniak, Jennifer Y; Skowronski, Elaine A; Manouchehri, Sareh; Rassenti, Laura; Ghia, Emanuela M; Widhopf, George F; Kipps, Thomas J; Heller, Michael J
2014-07-01
Conventional methods for the isolation of cancer-related circulating cell-free (ccf) DNA from patient blood (plasma) are time consuming and laborious. A DEP approach utilizing a microarray device now allows rapid isolation of ccf-DNA directly from a small volume of unprocessed blood. In this study, the DEP device is used to compare the ccf-DNA isolated directly from whole blood and plasma from 11 chronic lymphocytic leukemia (CLL) patients and one normal individual. Ccf-DNA from both blood and plasma samples was separated into DEP high-field regions, after which cells (blood), proteins, and other biomolecules were removed by a fluidic wash. The concentrated ccf-DNA was detected on-chip by fluorescence, and then eluted for PCR and DNA sequencing. The complete process from blood to PCR required less than 10 min; an additional 15 min was required to obtain plasma from whole blood. Ccf-DNA from the equivalent of 5 μL of CLL blood and 5 μL of plasma was amplified by PCR using Ig heavy-chain variable (IGHV) specific primers to identify the unique IGHV gene expressed by the leukemic B-cell clone. The PCR and DNA sequencing results obtained by DEP from all 11 CLL blood samples and from 8 of the 11 CLL plasma samples were exactly comparable to the DNA sequencing results obtained from genomic DNA isolated from CLL patient leukemic B cells (gold standard). © 2014 WILEY-VCH Verlag GmbH & Co. KGaA, Weinheim.
Molecular detection of Sarcocystis lutrae in the European badger (Meles meles) in Scotland.
Lepore, T; Bartley, P M; Chianini, F; Macrae, A I; Innes, E A; Katzer, F
2017-09-01
Neck samples from 54 badgers and 32 tongue samples of the same badgers (Meles meles), collected in the Lothians and Borders regions of Scotland, were tested using polymerase chain reactions (PCRs) directed against the 18S ribosomal DNA and the internal transcribed spacer (ITS1) region of protozoan parasites of the family Sarcocystidae. Positive results were obtained from 36/54 (67%) neck and 24/32 (75%) tongue samples using an 18S rDNA PCR. A 468 base pair consensus sequence that was generated from the 18S rDNA PCR amplicons (KX229728) showed 100% identity to Sarcocystis lutrae. The ITS1 PCR results revealed that 12/20 (60%) neck and 10/20 (50%) tongue samples were positive for Sarcocystidae DNA. A 1074 bp consensus sequence was generated from the ITS1 PCR amplicons (KX431307) and showed 100% identity to S. lutrae. Multiple sequence alignments and phylogenetic analysis support the finding that the rDNA found in badgers is identical to that of S. lutrae. This parasite has not been previously reported in badgers or in the UK. Sarcocystis lutrae has previously only been detected in tongue, skeletal muscle and diaphragm samples of the Eurasian otter (Lutra lutra) in Norway and potentially in the Arctic fox (Vulpes lagopus).
Multilocus sequence typing of Trichomonas vaginalis clinical samples from Amsterdam, the Netherlands
van der Veer, C; Himschoot, M; Bruisten, S M
2016-01-01
Objectives In this cross-sectional epidemiological study we aimed to identify molecular profiles for Trichomonas vaginalis and to determine how these molecular profiles were related to patient demographic and clinical characteristics. Setting Molecular typing methods previously identified two genetically distinct subpopulations for T. vaginalis; however, few molecular epidemiological studies have been performed. We now increased the sensitivity of a previously described multilocus sequence typing (MLST) tool for T. vaginalis by using nested PCR. This enabled the typing of direct patient samples. Participants From January to December 2014, we collected all T. vaginalis positive samples as detected by routine laboratory testing. Samples from patients either came from general practitioners offices or from the sexually transmitted infections (STI) clinic in Amsterdam. Epidemiological data for the STI clinic patients were retrieved from electronic patient files. Primary and secondary outcome measures The primary outcome was the success rate of genotyping direct T. vaginalis positive samples. The secondary outcome was the relation between T. vaginalis genotypes and risk factors for STI. Results All 7 MLST loci were successfully typed for 71/87 clinical samples. The 71 typed samples came from 69 patients, the majority of whom were women (n=62; 90%) and half (n=34; 49%) were STI clinic patients. Samples segregated into a two population structure for T. vaginalis representing genotypes I and II. Genotype I was most common (n=40; 59.7%). STI clinic patients infected with genotype II reported more sexual partners in the preceding 6 months than patients infected with genotype I (p=0.028). No other associations for gender, age, ethnicity, urogenital discharge or co-occurring STIs with T. vaginalis genotype were found. Conclusions MLST with nested PCR is a sensitive typing method that allows typing of direct (uncultured) patient material. Genotype II is possibly more prevalent in high-risk sexual networks. PMID:27737887
van der Veer, C; Himschoot, M; Bruisten, S M
2016-10-13
In this cross-sectional epidemiological study we aimed to identify molecular profiles for Trichomonas vaginalis and to determine how these molecular profiles were related to patient demographic and clinical characteristics. Molecular typing methods previously identified two genetically distinct subpopulations for T. vaginalis; however, few molecular epidemiological studies have been performed. We now increased the sensitivity of a previously described multilocus sequence typing (MLST) tool for T. vaginalis by using nested PCR. This enabled the typing of direct patient samples. From January to December 2014, we collected all T. vaginalis positive samples as detected by routine laboratory testing. Samples from patients either came from general practitioners offices or from the sexually transmitted infections (STI) clinic in Amsterdam. Epidemiological data for the STI clinic patients were retrieved from electronic patient files. The primary outcome was the success rate of genotyping direct T. vaginalis positive samples. The secondary outcome was the relation between T. vaginalis genotypes and risk factors for STI. All 7 MLST loci were successfully typed for 71/87 clinical samples. The 71 typed samples came from 69 patients, the majority of whom were women (n=62; 90%) and half (n=34; 49%) were STI clinic patients. Samples segregated into a two population structure for T. vaginalis representing genotypes I and II. Genotype I was most common (n=40; 59.7%). STI clinic patients infected with genotype II reported more sexual partners in the preceding 6 months than patients infected with genotype I (p=0.028). No other associations for gender, age, ethnicity, urogenital discharge or co-occurring STIs with T. vaginalis genotype were found. MLST with nested PCR is a sensitive typing method that allows typing of direct (uncultured) patient material. Genotype II is possibly more prevalent in high-risk sexual networks. Published by the BMJ Publishing Group Limited. For permission to use (where not already granted under a licence) please go to http://www.bmj.com/company/products-services/rights-and-licensing/.
Abdelrahman, Tamer; Hughes, Joseph; Main, Janice; McLauchlan, John; Thursz, Mark; Thomson, Emma
2015-01-01
High rates of sexually transmitted infection and reinfection with hepatitis C virus (HCV) have recently been reported in human immunodeficiency virus (HIV)-infected men who have sex with men and reinfection has also been described in monoinfected injecting drug users. The diagnosis of reinfection has traditionally been based on direct Sanger sequencing of samples pre- and posttreatment, but not on more sensitive deep sequencing techniques. We studied viral quasispecies dynamics in patients who failed standard of care therapy in a high-risk HIV-infected cohort of patients with early HCV infection to determine whether treatment failure was associated with reinfection or recrudescence of preexisting infection. Paired sequences (pre- and posttreatment) were analyzed. The HCV E2 hypervariable region-1 was amplified using nested reverse-transcription polymerase chain reaction (RT-PCR) with indexed genotype-specific primers and the same products were sequenced using both Sanger and 454 pyrosequencing approaches. Of 99 HIV-infected patients with acute HCV treated with 24-48 weeks of pegylated interferon alpha and ribavirin, 15 failed to achieve a sustained virological response (six relapsed, six had a null response, and three had a partial response). Using direct sequencing, 10/15 patients (66%) had evidence of a previously undetected strain posttreatment; in many studies, this is interpreted as reinfection. However, pyrosequencing revealed that 15/15 (100%) of patients had evidence of persisting infection; 6/15 (40%) patients had evidence of a previously undetected variant present in the posttreatment sample in addition to a variant that was detected at baseline. This could represent superinfection or a limitation of the sensitivity of pyrosequencing. In this high-risk group, the emergence of new viral strains following treatment failure is most commonly associated with emerging dominance of preexisting minority variants rather than reinfection. Superinfection may occur in this cohort but reinfection is overestimated by Sanger sequencing. © 2014 The Authors. Hepatology published by Wiley on behalf of the American Association for the Study of Liver Diseases.
Zhu, X Q; Gasser, R B
1998-06-01
In this study, we assessed single-strand conformation polymorphism (SSCP)-based approaches for their capacity to fingerprint sequence variation in ribosomal DNA (rDNA) of ascaridoid nematodes of veterinary and/or human health significance. The second internal transcribed spacer region (ITS-2) of rDNA was utilised as the target region because it is known to provide species-specific markers for this group of parasites. ITS-2 was amplified by PCR from genomic DNA derived from individual parasites and subjected to analysis. Direct SSCP analysis of amplicons from seven taxa (Toxocara vitulorum, Toxocara cati, Toxocara canis, Toxascaris leonina, Baylisascaris procyonis, Ascaris suum and Parascaris equorum) showed that the single-strand (ss) ITS-2 patterns produced allowed their unequivocal identification to species. While no variation in SSCP patterns was detected in the ITS-2 within four species for which multiple samples were available, the method allowed the direct display of four distinct sequence types of ITS-2 among individual worms of T. cati. Comparison of SSCP/sequencing with the methods of dideoxy fingerprinting (ddF) and restriction endonuclease fingerprinting (REF) revealed that also ddF allowed the definition of the four sequence types, whereas REF displayed three of four. The findings indicate the usefulness of the SSCP-based approaches for the identification of ascaridoid nematodes to species, the direct display of sequence variation in rDNA and the detection of population variation. The ability to fingerprint microheterogeneity in ITS-2 rDNA using such approaches also has implications for studying fundamental aspects relating to mutational change in rDNA.
No need to replace an “anomalous” primate (Primates) with an “anomalous” bear (Carnivora, Ursidae)
Gutiérrez, Eliécer E.; Pine, Ronald H.
2015-01-01
Abstract By means of mitochondrial 12S rRNA sequencing of putative “yeti”, “bigfoot”, and other “anomalous primate” hair samples, a recent study concluded that two samples, presented as from the Himalayas, do not belong to an “anomalous primate”, but to an unknown, anomalous type of ursid. That is, that they match 12S rRNA sequences of a fossil Polar Bear (Ursus maritimus), but neither of modern Polar Bears, nor of Brown Bears (Ursus arctos), the closest relative of Polar Bears, and one that occurs today in the Himalayas. We have undertaken direct comparison of sequences; replication of the original comparative study; inference of phylogenetic relationships of the two samples with respect to those from all extant species of Ursidae (except for the Giant Panda, Ailuropoda melanoleuca) and two extinct Pleistocene species; and application of a non-tree-based population aggregation approach for species diagnosis and identification. Our results demonstrate that the very short fragment of the 12S rRNA gene sequenced by Sykes et al. is not sufficiently informative to support the hypotheses provided by these authors with respect to the taxonomic identity of the individuals from which these sequences were obtained. We have concluded that there is no reason to believe that the two samples came from anything other than Brown Bears. These analyses afforded an opportunity to test the monophyly of morphologically defined species and to comment on both their phylogenetic relationships and future efforts necessary to advance our understanding of ursid systematics. PMID:25829853
Identification of a novel circular DNA virus in pig feces
USDA-ARS?s Scientific Manuscript database
Metagenomic analysis of fecal samples collected from a swine with diarrhea detected sequences encoding a replicase (Rep) protein typically found in small circular Rep-encoding ssDNA (CRESS-DNA) viruses. The complete 3,062 nucleotide genome was generated and found to encode two bi-directionally trans...
1982-08-23
LUBRICATION, FAILURE PROGRESSION WNITORING OIL-ANALYSIS, FAILURE ANALYSIS, TRIBOLOGY WEAR DEBRIS ANALYSIS, WEAR REGIMS DIAGNOSTICS, BENCH TESTING, FERROGRApHy ...Spectrometric Oil Analysis . ............... 400 G. Analytical Ferrography ............................. 411 3 NAEC-92-153 TABLE OF CONTENTS (Continued...of ferrography entry deposit mnicrographs of these sequences, which can be directly related to sample debris concentration levels. These micrographs
LOPES, Estela Gallucci; GERALDO, Carlos Alberto; MARCILI, Arlei; SILVA, Ricardo Duarte; KEID, Lara Borges; OLIVEIRA, Trícia Maria Ferreira da Silva; SOARES, Rodrigo Martins
2016-01-01
In visceral leishmaniasis, the detection of the agent is of paramount importance to identify reservoirs of infection. Here, we evaluated the diagnostic attributes of PCRs based on primers directed to cytochrome-B (cytB), cytochrome-oxidase-subunit II (coxII), cytochrome-C (cytC), and the minicircle-kDNA. Although PCRs directed to cytB, coxII, cytC were able to detect different species of Leishmania, and the nucleotide sequence of their amplicons allowed the unequivocal differentiation of species, the analytical and diagnostic sensitivity of these PCRs were much lower than the analytical and diagnostic sensitivity of the kDNA-PCR. Among the 73 seropositive animals, the asymptomatic dogs had spleen and bone marrow samples collected and tested; only two animals were positive by PCRs based on cytB, coxII, and cytC, whereas 18 were positive by the kDNA-PCR. Considering the kDNA-PCR results, six dogs had positive spleen and bone marrow samples, eight dogs had positive bone marrow results but negative results in spleen samples and, in four dogs, the reverse situation occurred. We concluded that PCRs based on cytB, coxII, and cytC can be useful tools to identify Leishmania species when used in combination with automated sequencing. The discordance between the results of the kDNA-PCR in bone marrow and spleen samples may indicate that conventional PCR lacks sensitivity for the detection of infected dogs. Thus, primers based on the kDNA should be preferred for the screening of infected dogs. PMID:27253743
Alvarado, David M; Yang, Ping; Druley, Todd E; Lovett, Michael; Gurnett, Christina A
2014-06-01
Despite declining sequencing costs, few methods are available for cost-effective single-nucleotide polymorphism (SNP), insertion/deletion (INDEL) and copy number variation (CNV) discovery in a single assay. Commercially available methods require a high investment to a specific region and are only cost-effective for large samples. Here, we introduce a novel, flexible approach for multiplexed targeted sequencing and CNV analysis of large genomic regions called multiplexed direct genomic selection (MDiGS). MDiGS combines biotinylated bacterial artificial chromosome (BAC) capture and multiplexed pooled capture for SNP/INDEL and CNV detection of 96 multiplexed samples on a single MiSeq run. MDiGS is advantageous over other methods for CNV detection because pooled sample capture and hybridization to large contiguous BAC baits reduces sample and probe hybridization variability inherent in other methods. We performed MDiGS capture for three chromosomal regions consisting of ∼ 550 kb of coding and non-coding sequence with DNA from 253 patients with congenital lower limb disorders. PITX1 nonsense and HOXC11 S191F missense mutations were identified that segregate in clubfoot families. Using a novel pooled-capture reference strategy, we identified recurrent chromosome chr17q23.1q23.2 duplications and small HOXC 5' cluster deletions (51 kb and 12 kb). Given the current interest in coding and non-coding variants in human disease, MDiGS fulfills a niche for comprehensive and low-cost evaluation of CNVs, coding, and non-coding variants across candidate regions of interest. © The Author(s) 2014. Published by Oxford University Press on behalf of Nucleic Acids Research.
Isolation and clinical sample typing of human leptospirosis cases in Argentina.
Chiani, Yosena; Jacob, Paulina; Varni, Vanina; Landolt, Noelia; Schmeling, María Fernanda; Pujato, Nazarena; Caimi, Karina; Vanasco, Bibiana
2016-01-01
Leptospira typing is carried out using isolated strains. Because of difficulties in obtaining them, direct identification of infective Leptospira in clinical samples is a high priority. Multilocus sequence typing (MLST) proved highly discriminatory for seven pathogenic species of Leptospira, allowing isolate characterization and robust assignment to species, in addition to phylogenetic evidence for the relatedness between species. In this study we characterized Leptospira strains circulating in Argentina, using typing methods applied to human clinical samples and isolates. Phylogenetic studies based on 16S ribosomal RNA gene sequences enabled typing of 8 isolates (6 Leptospira interrogans, one Leptospira wolffii and one Leptospira broomii) and 58 out of 85 (68.2%) clinical samples (55 L. interrogans, 2 Leptospira meyeri, and one Leptospira kirschneri). MLST results for the L. interrogans isolates indicated that five were probably Canicola serogroup (ST37) and one was probably Icterohaemorrhagiae serogroup (ST17). Eleven clinical samples (21.6%), provided MLST interpretable data: five were probably Pyrogenes serogroup (ST13), four Sejroe (ST20), one Autumnalis (ST22) and one Canicola (ST37). To the best of our knowledge this study is the first report of the use of an MLST typing scheme with seven loci to identify Leptospira directly from clinical samples in Argentina. The use of clinical samples presents the advantage of the possibility of knowing the infecting strain without resorting to isolates. This study also allowed, for the first time, the characterization of isolates of intermediate pathogenicity species (L. wolffii and L. broomii) from symptomatic patients. Copyright © 2015 Elsevier B.V. All rights reserved.
Peng, Xin; Yu, Ke-Qiang; Deng, Guan-Hua; Jiang, Yun-Xia; Wang, Yu; Zhang, Guo-Xia; Zhou, Hong-Wei
2013-12-01
Low cost and high throughput capacity are major advantages of using next generation sequencing (NGS) techniques to determine metagenomic 16S rRNA tag sequences. These methods have significantly changed our view of microorganisms in the fields of human health and environmental science. However, DNA extraction using commercial kits has shortcomings of high cost and time constraint. In the present study, we evaluated the determination of fecal microbiomes using a direct boiling method compared with 5 different commercial extraction methods, e.g., Qiagen and MO BIO kits. Principal coordinate analysis (PCoA) using UniFrac distances and clustering showed that direct boiling of a wide range of feces concentrations gave a similar pattern of bacterial communities as those obtained from most of the commercial kits, with the exception of the MO BIO method. Fecal concentration by boiling method affected the estimation of α-diversity indices, otherwise results were generally comparable between boiling and commercial methods. The operational taxonomic units (OTUs) determined through direct boiling showed highly consistent frequencies with those determined through most of the commercial methods. Even those for the MO BIO kit were also obtained by the direct boiling method with high confidence. The present study suggested that direct boiling could be used to determine the fecal microbiome and using this method would significantly reduce the cost and improve the efficiency of the sample preparation for studying gut microbiome diversity. © 2013 Elsevier B.V. All rights reserved.
Flores, María D; Gonzalez, Luis M; Hurtado, Carolina; Motta, Yamileth Monje; Domínguez-Hidalgo, Cristina; Merino, Francisco Jesús; Perteguer, María J; Gárate, Teresa
2018-02-27
Taenia solium, T. asiatica and T. saginata tapeworms cause human taeniasis and are the origin of porcine and bovine cysticercosis. Furthermore, T. solium eggs can cause human cysticercosis, with neurocysticercosis being the most serious form of the disease. These helminth infections are neglected tropical diseases and are endemic in several countries in the Americas, Asia and Africa. As a result of globalization, migration in particular, the infections have been extending to non-endemic territories. Species-specific diagnosis of taeniasis is subject to drawbacks that could be resolved using molecular approaches. In the present study, conventional and real-time amplification protocols (cPCR and qPCR) based on the T. saginata HDP2 sequence were applied in the differential diagnosis of taeniasis (T. saginata, T. solium) in both fecal samples and proglottids expelled by patients. The HDP2 homolog in T. solium was cloned and characterized. Semi-nested cPCR and qPCR (Sn-HDP2 cPCR and Sn-HDP2 qPCR) amplified T. saginata and T. solium DNA, with an analytical sensitivity of 40 and 400 fg, respectively, and identically in both protocols. Eighteen taeniasis patients were diagnosed directly with T. saginata or T. solium, either from proglottids or fecal samples with/without eggs (detected using microscopy), based on the optimized Sn-HDP2 qPCR. After cloning, the T. solium HDP2 homolog sequence was confirmed to be a ribosomal sequence. The HDP2 fragment corresponded to a non-transcribed sequence/external transcribed repeat (NTS/ETS) of ribosomal DNA. Compared with the T. saginata HDP2 homolog, the T solium HDP2 sequence lacked the first 900 nt at the 5' end and showed nucleotide substitutions and small deletions. Sn-HDP2 cPCR and Sn-HDP2 qPCR were set up for the diagnosis of human taeniasis, using proglottids and fecal samples from affected patients. The new Sn-HDP2 qPCR protocol was the best option, as it directly differentiated T. saginata from T. solium. The diagnosis of an imported T. solium-taeniasis case and nine European T. saginata cases was relevant. Finally, the cloning and sequencing of the T. solium HDP2 fragment confirmed that HDP2 was part of a ribosomal unit.
Rocher, Solen; Jean, Martine; Castonguay, Yves; Belzile, François
2015-01-01
Genotyping-by-sequencing (GBS) is a relatively low-cost high throughput genotyping technology based on next generation sequencing and is applicable to orphan species with no reference genome. A combination of genome complexity reduction and multiplexing with DNA barcoding provides a simple and affordable way to resolve allelic variation between plant samples or populations. GBS was performed on ApeKI libraries using DNA from 48 genotypes each of two heterogeneous populations of tetraploid alfalfa (Medicago sativa spp. sativa): the synthetic cultivar Apica (ATF0) and a derived population (ATF5) obtained after five cycles of recurrent selection for superior tolerance to freezing (TF). Nearly 400 million reads were obtained from two lanes of an Illumina HiSeq 2000 sequencer and analyzed with the Universal Network-Enabled Analysis Kit (UNEAK) pipeline designed for species with no reference genome. Following the application of whole dataset-level filters, 11,694 single nucleotide polymorphism (SNP) loci were obtained. About 60% had a significant match on the Medicago truncatula syntenic genome. The accuracy of allelic ratios and genotype calls based on GBS data was directly assessed using 454 sequencing on a subset of SNP loci scored in eight plant samples. Sequencing depth in this study was not sufficient for accurate tetraploid allelic dosage, but reliable genotype calls based on diploid allelic dosage were obtained when using additional quality filtering. Principal Component Analysis of SNP loci in plant samples revealed that a small proportion (<5%) of the genetic variability assessed by GBS is able to differentiate ATF0 and ATF5. Our results confirm that analysis of GBS data using UNEAK is a reliable approach for genome-wide discovery of SNP loci in outcrossed polyploids. PMID:26115486
Extremophiles in Household Water Heaters
NASA Astrophysics Data System (ADS)
Wilpiszeski, R.; House, C. H.
2016-12-01
A significant fraction of Earth's microbial diversity comes from species living in extreme environments, but natural extreme environments can be difficult to access. Manmade systems like household water heaters serve as an effective proxy for thermophilic environments that are otherwise difficult to sample directly. As such, we are investigating the biogeography, taxonomic distribution, and evolution of thermophiles growing in domestic water heaters. Citizen scientists collected hot tap water culture- and filter- samples from 101 homes across the United States. We recovered a single species of thermophilic heterotroph from culture samples inoculated from water heaters across the United States, Thermus scotoductus. Whole-genome sequencing was conducted to better understand the distribution and evolution of this single species. We have also sequenced hyper-variable regions of the 16S rRNA gene from whole-community filter samples to identify the broad diversity and distribution of microbial cells captured from each water heater. These results shed light on the processes that shape thermophilic populations and genomes at a spatial resolution that is difficult to access in naturally occurring extreme ecosystems.
Schmitz, Ralf W.; Serre, David; Bonani, Georges; Feine, Susanne; Hillgruber, Felix; Krainitzki, Heike; Pääbo, Svante; Smith, Fred H.
2002-01-01
The 1856 discovery of the Neandertal type specimen (Neandertal 1) in western Germany marked the beginning of human paleontology and initiated the longest-standing debate in the discipline: the role of Neandertals in human evolutionary history. We report excavations of cave sediments that were removed from the Feldhofer caves in 1856. These deposits have yielded over 60 human skeletal fragments, along with a large series of Paleolithic artifacts and faunal material. Our analysis of this material represents the first interdisciplinary analysis of Neandertal remains incorporating genetic, direct dating, and morphological dimensions simultaneously. Three of these skeletal fragments fit directly on Neandertal 1, whereas several others have distinctively Neandertal features. At least three individuals are represented in the skeletal sample. Radiocarbon dates for Neandertal 1, from which a mtDNA sequence was determined in 1997, and a second individual indicate an age of ≈40,000 yr for both. mtDNA analysis on the same second individual yields a sequence that clusters with other published Neandertal sequences. PMID:12232049
NASA Astrophysics Data System (ADS)
Bakker, Eric J.; Eiroa, Carlos
2003-10-01
With our minds focussed on the direct detection of planets using the space interferometry mission DARWIN/TPF, we have made an attempt to identify how the set of ESO Very Large Telescope Interferometer instruments available now, and in the near future (VINCI, MIDI, AMBER, GENIE, FINITO and PRIMA) could contribute to the DARWIN/TPF precursory science program. In particular related to the identification of a short list of science stars to be observed with DARWIN/TPF. We have identified two research projects which can be viewed as DARWIN/TPF precursory science and can be embarked upon shortly using the available VLTI instruments: (1) the direct measurement of stellar angular diameters of a statistically meaningful sample of main-sequence stars with AMBER; (2) an interferometric study of those main-sequence stars that exhibit an infrared excess with either AMBER or MIDI. On the longer run, VLTI can obviously make a significant impact through the exploitation of the infrared nuller GENIE and the astrometric facility PRIMA.
Laser mass spectrometry for DNA sequencing, disease diagnosis, and fingerprinting
NASA Astrophysics Data System (ADS)
Chen, C. H. Winston; Taranenko, N. I.; Zhu, Y. F.; Chung, C. N.; Allman, S. L.
1997-05-01
Since laser mass spectrometry has the potential for achieving very fast DNA analysis, we recently applied it to DNA sequencing, DNA typing for fingerprinting, and DNA screening for disease diagnosis. Two different approaches for sequencing DNA have been successfully demonstrated. One is to sequence DNA with DNA ladders produced from Sanger's enzymatic method. The other is to do direct sequencing without DNA ladders. The need for quick DNA typing for identification purposes is critical for forensic application. Our preliminary results indicate laser mass spectrometry can possible be used for rapid DNA fingerprinting applications at a much lower cost than gel electrophoresis. Population screening for certain genetic disease can be a very efficient step to reducing medical costs through prevention. Since laser mass spectrometry can provide very fast DNA analysis, we applied laser mass spectrometry to disease diagnosis. Clinical samples with both base deletion and point mutation have been tested with complete success.
Application of binomial-edited CPMG to shale characterization
Washburn, Kathryn E.; Birdwell, Justin E.
2014-01-01
Unconventional shale resources may contain a significant amount of hydrogen in organic solids such as kerogen, but it is not possible to directly detect these solids with many NMR systems. Binomial-edited pulse sequences capitalize on magnetization transfer between solids, semi-solids, and liquids to provide an indirect method of detecting solid organic materials in shales. When the organic solids can be directly measured, binomial-editing helps distinguish between different phases. We applied a binomial-edited CPMG pulse sequence to a range of natural and experimentally-altered shale samples. The most substantial signal loss is seen in shales rich in organic solids while fluids associated with inorganic pores seem essentially unaffected. This suggests that binomial-editing is a potential method for determining fluid locations, solid organic content, and kerogen–bitumen discrimination.
NASA Astrophysics Data System (ADS)
Calvo-Rathert, Manuel; Goguitchaichvili, Avto; Bógalo, María-Felicidad; Vegas-Tubía, Néstor; Carrancho, Ángel; Sologashvili, Jemal
2011-08-01
New paleomagnetic, rock-magnetic and paleointensity results obtained on samples from 23 basaltic lava flows belonging to four different flow sequences (Mashavera, Kvemo Orozmani, Zemo Karabulaki and Diliska) of Pleistocene and Pliocene age from the eastern Djavakheti Highland, in southern Georgia, are presented. Radiometric dating of these sequences yields ages between 1.8 and 2.18 Ma for Mashavera, 2.07 and 2.58 Ma for Zemo-Karabulakhi and 2.12 and 3.27 for Diliska. No radiometric ages are available for the Kvemo Orozmani sequence, which is considered to be coeval to the Mashavera sequence. Rock-magnetic experiments including measurement of thermomagnetic, hysteresis and IRM-acquisition curves suggest low-Ti titanomagnetite as main carrier of remanence, although a lower Curie-temperature component was also observed in several cases. Reversible and non-reversible curves were recorded in thermomagnetic experiments. Paleomagnetic analysis generally indicated the presence of a single component (mainly in the Mashavera sequence), but also two more or less superimposed components in some other cases. In 21 sites a characteristic component could be determined and all except one were characterised by normal-polarity directions. Flows from the Mashavera sequence had a rather steep inclination (73.1°). Nevertheless, a mean paleomagnetic direction of all four sequences is obtained ( D = 8.5°, I = 60.8°, N = 4, α95 = 11.7°, k = 62.7) which agrees with the Plio-Quaternary directions obtained in previous studies in Georgia. The paleomagnetic pole obtained (latitude ϕ = 82.1°, longitude λ = 118.2°, A95 = 8.0°, k = 240.7) agrees with the pole values of both the 0 Ma and the 5 Ma windows of the synthetic Eurasian polar wander path from Besse and Courtillot (2002). In order to analyse the behaviour of secular variation, the scatter of paleosecular variation of virtual geomagnetic poles of both the Mashavera flow and all 18 studied flows of Pleistocene age was calculated. It could be observed that both data-sets seem to fit well the expected scatter at latitude 41°N. Paleointensity experiments were carried out with the Coe modification of the Thellier method. Twenty-five out of 84 samples (30%) provided reliable paleointensity results. These successful results were mainly obtained in the Mashavera sequence. Most flows yielded paleointensity results in the 30-45 μT range, in accordance with expected Pliocene to present day intensities. Two flows, however, located near the top of the Mashavera sequence yield high paleointensity values around 60 μT. Anomalous paleointensity results in the upper-lying Mashavera flows together with the steep inclinations observed in that sequence, could perhaps signal the near onset of the Olduvai-Matuyama reversal.
Diverse tulasnelloid fungi form mycorrhizas with epiphytic orchids in an Andean cloud forest.
Suárez, Juan Pablo; Weiss, Michael; Abele, Andrea; Garnica, Sigisfredo; Oberwinkler, Franz; Kottke, Ingrid
2006-11-01
The mycorrhizal state of epiphytic orchids has been controversially discussed, and the state and mycobionts of the pleurothallid orchids, occurring abundantly and with a high number of species on stems of trees in the Andean cloud forest, were unknown. Root samples of 77 adult individuals of the epiphytic orchids Stelis hallii, S. superbiens, S. concinna and Pleurothallis lilijae were collected in a tropical mountain rainforest of southern Ecuador. Ultrastructural evidence of symbiotic interaction was combined with molecular sequencing of fungi directly from the mycorrhizas and isolation of mycobionts. Ultrastructural analyses displayed vital orchid mycorrhizas formed by fungi with an imperforate parenthesome and cell wall slime bodies typical for the genus Tulasnella. Three different Tulasnella isolates were obtained in pure culture. Phylogenetic analysis of nuclear rDNA sequences from coding regions of the ribosomal large subunit (nucLSU) and the 5.8S subunit, including parts of the internal transcribed spacers, obtained directly from the roots and from the fungal isolates, yielded seven distinct Tulasnella clades. Tulasnella mycobionts in Stelis concinna were restricted to two Tulasnella sequence types while the other orchids were associated with up to six Tulasnella sequence types. All Tulasnella sequences are new to science and distinct from known sequences of mycobionts of terrestrial orchids. The results indicate that tulasnelloid fungi, adapted to the conditions on tree stems, might be important for orchid growth and maintenance in the Andean cloud forest.
Equine behavioral enrichment toys as tools for non-invasive recovery of viral and host DNA.
Seeber, Peter A; Soilemetzidou, Sanatana E; East, Marion L; Walzer, Chris; Greenwood, Alex D
2017-09-01
Direct collection of samples from wildlife can be difficult and sometimes impossible. Non-invasive remote sampling for the purpose of DNA extraction is a potential tool for monitoring the presence of wildlife at the individual level, and for identifying the pathogens shed by wildlife. Equine herpesviruses (EHV) are common pathogens of equids that can be fatal if transmitted to other mammals. Transmission usually occurs by nasal aerosol discharge from virus-shedding individuals. The aim of this study was to validate a simple, non-invasive method to track EHV shedding in zebras and to establish an efficient protocol for genotyping individual zebras from environmental DNA (eDNA). A commercially available horse enrichment toy was deployed in captive Grévy's, mountain, and plains zebra enclosures and swabbed after 4-24 hr. Using eDNA extracted from these swabs four EHV strains (EHV-1, EHV-7, wild ass herpesvirus and zebra herpesvirus) were detected by PCR and confirmed by sequencing, and 12 of 16 zebras present in the enclosures were identified as having interacted with the enrichment toy by mitochondrial DNA amplification and sequencing. We conclude that, when direct sampling is difficult or prohibited, non-invasive sampling of eDNA can be a useful tool to determine the genetics of individuals or populations and for detecting pathogen shedding in captive wildlife. © 2017 Wiley Periodicals, Inc.
Rôças, I N; Siqueira, J F
2005-12-01
Recent evidence from molecular genetic studies has revealed that oral Treponema species are involved in infections of endodontic origin. This study assessed the occurrence of two newly named oral treponemes - Treponema parvum and Treponema putidum - in primary endodontic infections using a culture-independent identification technique. Genomic DNA was isolated directly from clinical samples, and a 16S rRNA gene-based nested polymerase chain reaction (PCR) assay was used to determine the presence of T. parvum and T. putidum. Species-specific primer pairs were developed by aligning closely related 16S rRNA gene sequences. The specificity for each primer pair was validated by running PCR against a panel of oral bacteria and by sequence analysis of PCR products from positive clinical samples. T. parvum was detected in 52% of the root canals associated with chronic apical periodontitis, in 20% of the cases diagnosed as acute apical periodontitis, and in no abscessed case. In general, T. parvum was detected in 26% of the samples from primary endodontic infections. T. putidum was found in only one case of acute apical periodontitis (2% of the total number of cases investigated). The devised nested PCR protocol was able to identify both T. parvum and T. putidum directly in clinical samples and demonstrated that these two treponemes can take part in endodontic infections.
Profiling cellular protein complexes by proximity ligation with dual tag microarray readout.
Hammond, Maria; Nong, Rachel Yuan; Ericsson, Olle; Pardali, Katerina; Landegren, Ulf
2012-01-01
Patterns of protein interactions provide important insights in basic biology, and their analysis plays an increasing role in drug development and diagnostics of disease. We have established a scalable technique to compare two biological samples for the levels of all pairwise interactions among a set of targeted protein molecules. The technique is a combination of the proximity ligation assay with readout via dual tag microarrays. In the proximity ligation assay protein identities are encoded as DNA sequences by attaching DNA oligonucleotides to antibodies directed against the proteins of interest. Upon binding by pairs of antibodies to proteins present in the same molecular complexes, ligation reactions give rise to reporter DNA molecules that contain the combined sequence information from the two DNA strands. The ligation reactions also serve to incorporate a sample barcode in the reporter molecules to allow for direct comparison between pairs of samples. The samples are evaluated using a dual tag microarray where information is decoded, revealing which pairs of tags that have become joined. As a proof-of-concept we demonstrate that this approach can be used to detect a set of five proteins and their pairwise interactions both in cellular lysates and in fixed tissue culture cells. This paper provides a general strategy to analyze the extent of any pairwise interactions in large sets of molecules by decoding reporter DNA strands that identify the interacting molecules.
Genomic Investigation of a Legionellosis Outbreak in a Persistently Colonized Hotel.
Sánchez-Busó, Leonor; Guiral, Silvia; Crespi, Sebastián; Moya, Víctor; Camaró, María L; Olmos, María P; Adrián, Francisco; Morera, Vicente; González-Morán, Francisco; Vanaclocha, Hermelinda; González-Candelas, Fernando
2015-01-01
A long-lasting legionellosis outbreak was reported between November 2011 and July 2012 in a hotel in Calpe (Spain) affecting 44 patients including six deaths. Intensive epidemiological and microbiological investigations were performed in order to detect the reservoirs. Clinical and environmental samples were tested for the presence and genetic characterization of Legionella pneumophila. Six of the isolates were subjected to whole-genome sequencing. Sequencing of 14 clinical and 260 environmental samples revealed sequence type (ST) 23 as the main responsible strain for the infections. This ST was found in the spa pool, from where it spread to other hotel public spaces, explaining the ST23 clinical cases, including guests who had not visited the spa. Uncultured clinical specimens showed profiles compatible with ST23, ST578, and mixed patterns. Profiles compatible with ST578 were obtained by direct sequencing from biofilm samples collected from the domestic water system, which provided evidence for the source of infection for non ST23 patients. Whole genome data from five ST23 strains and the identification of different STs and Legionella species showed that different hotel premises were likely colonized since the hotel opening thus explaining how different patients had been infected by distinct STs. Both epidemiological and molecular data are essential in the investigation of legionellosis outbreaks. Whole-genome sequencing data revealed significant intra-ST variability and allowed to make further inference on the short-term evolution of a local colonization of L. pneumophila.
Wang, Chunxiao; García-Fernández, David; Mas, Albert; Esteve-Zarzoso, Braulio
2015-01-01
The diversity of fungi in grape must and during wine fermentation was investigated in this study by culture-dependent and culture-independent techniques. Carignan and Grenache grapes were harvested from three vineyards in the Priorat region (Spain) in 2012, and nine samples were selected from the grape must after crushing and during wine fermentation. From culture-dependent techniques, 362 isolates were randomly selected and identified by 5.8S-ITS-RFLP and 26S-D1/D2 sequencing. Meanwhile, genomic DNA was extracted directly from the nine samples and analyzed by qPCR, DGGE and massive sequencing. The results indicated that grape must after crushing harbored a high species richness of fungi with Aspergillus tubingensis, Aureobasidium pullulans, or Starmerella bacillaris as the dominant species. As fermentation proceeded, the species richness decreased, and yeasts such as Hanseniaspora uvarum, Starmerella bacillaris and Saccharomyces cerevisiae successively occupied the must samples. The “terroir” characteristics of the fungus population are more related to the location of the vineyard than to grape variety. Sulfur dioxide treatment caused a low effect on yeast diversity by similarity analysis. Because of the existence of large population of fungi on grape berries, massive sequencing was more appropriate to understand the fungal community in grape must after crushing than the other techniques used in this study. Suitable target sequences and databases were necessary for accurate evaluation of the community and the identification of species by the 454 pyrosequencing of amplicons. PMID:26557110
Genomic Investigation of a Legionellosis Outbreak in a Persistently Colonized Hotel
Sánchez-Busó, Leonor; Guiral, Silvia; Crespi, Sebastián; Moya, Víctor; Camaró, María L.; Olmos, María P.; Adrián, Francisco; Morera, Vicente; González-Morán, Francisco; Vanaclocha, Hermelinda; González-Candelas, Fernando
2016-01-01
Objectives: A long-lasting legionellosis outbreak was reported between November 2011 and July 2012 in a hotel in Calpe (Spain) affecting 44 patients including six deaths. Intensive epidemiological and microbiological investigations were performed in order to detect the reservoirs. Methods: Clinical and environmental samples were tested for the presence and genetic characterization of Legionella pneumophila. Six of the isolates were subjected to whole-genome sequencing. Results: Sequencing of 14 clinical and 260 environmental samples revealed sequence type (ST) 23 as the main responsible strain for the infections. This ST was found in the spa pool, from where it spread to other hotel public spaces, explaining the ST23 clinical cases, including guests who had not visited the spa. Uncultured clinical specimens showed profiles compatible with ST23, ST578, and mixed patterns. Profiles compatible with ST578 were obtained by direct sequencing from biofilm samples collected from the domestic water system, which provided evidence for the source of infection for non ST23 patients. Whole genome data from five ST23 strains and the identification of different STs and Legionella species showed that different hotel premises were likely colonized since the hotel opening thus explaining how different patients had been infected by distinct STs. Conclusions: Both epidemiological and molecular data are essential in the investigation of legionellosis outbreaks. Whole-genome sequencing data revealed significant intra-ST variability and allowed to make further inference on the short-term evolution of a local colonization of L. pneumophila. PMID:26834713
Sun, Yanqing; Qi, Li; Yang, Guangren; Gilbert, Peter B
2018-05-01
This article develops hypothesis testing procedures for the stratified mark-specific proportional hazards model with missing covariates where the baseline functions may vary with strata. The mark-specific proportional hazards model has been studied to evaluate mark-specific relative risks where the mark is the genetic distance of an infecting HIV sequence to an HIV sequence represented inside the vaccine. This research is motivated by analyzing the RV144 phase 3 HIV vaccine efficacy trial, to understand associations of immune response biomarkers on the mark-specific hazard of HIV infection, where the biomarkers are sampled via a two-phase sampling nested case-control design. We test whether the mark-specific relative risks are unity and how they change with the mark. The developed procedures enable assessment of whether risk of HIV infection with HIV variants close or far from the vaccine sequence are modified by immune responses induced by the HIV vaccine; this question is interesting because vaccine protection occurs through immune responses directed at specific HIV sequences. The test statistics are constructed based on augmented inverse probability weighted complete-case estimators. The asymptotic properties and finite-sample performances of the testing procedures are investigated, demonstrating double-robustness and effectiveness of the predictive auxiliaries to recover efficiency. The finite-sample performance of the proposed tests are examined through a comprehensive simulation study. The methods are applied to the RV144 trial. © 2018 WILEY-VCH Verlag GmbH & Co. KGaA, Weinheim.
Hammoumi, Saliha; Vallaeys, Tatiana; Santika, Ayi; Leleux, Philippe; Borzym, Ewa; Klopp, Christophe; Avarre, Jean-Christophe
2016-01-01
Koi herpesvirus disease (KHVD) is an emerging disease that causes mass mortality in koi and common carp, Cyprinus carpio L. Its causative agent is Cyprinid herpesvirus 3 (CyHV-3), also known as koi herpesvirus (KHV). Although data on the pathogenesis of this deadly virus is relatively abundant in the literature, still little is known about its genomic diversity and about the molecular mechanisms that lead to such a high virulence. In this context, we developed a new strategy for sequencing full-length CyHV-3 genomes directly from infected fish tissues. Total genomic DNA extracted from carp gill tissue was specifically enriched with CyHV-3 sequences through hybridization to a set of nearly 2 million overlapping probes designed to cover the entire genome length, using KHV-J sequence (GenBank accession number AP008984) as reference. Applied to 7 CyHV-3 specimens from Poland and Indonesia, this targeted genomic enrichment enabled recovery of the full genomes with >99.9% reference coverage. The enrichment rate was directly correlated to the estimated number of viral copies contained in the DNA extracts used for library preparation, which varied between ∼5000 and ∼2×10 7 . The average sequencing depth was >200 for all samples, thus allowing the search for variants with high confidence. Sequence analyses highlighted a significant proportion of intra-specimen sequence heterogeneity, suggesting the presence of mixed infections in all investigated fish. They also showed that inter-specimen genetic diversity at the genome scale was very low (>99.95% of sequence identity). By enabling full genome comparisons directly from infected fish tissues, this new method will be valuable to trace outbreaks rapidly and at a reasonable cost, and in turn to understand the transmission routes of CyHV-3.
Hammoumi, Saliha; Vallaeys, Tatiana; Santika, Ayi; Leleux, Philippe; Borzym, Ewa; Klopp, Christophe
2016-01-01
Koi herpesvirus disease (KHVD) is an emerging disease that causes mass mortality in koi and common carp, Cyprinus carpio L. Its causative agent is Cyprinid herpesvirus 3 (CyHV-3), also known as koi herpesvirus (KHV). Although data on the pathogenesis of this deadly virus is relatively abundant in the literature, still little is known about its genomic diversity and about the molecular mechanisms that lead to such a high virulence. In this context, we developed a new strategy for sequencing full-length CyHV-3 genomes directly from infected fish tissues. Total genomic DNA extracted from carp gill tissue was specifically enriched with CyHV-3 sequences through hybridization to a set of nearly 2 million overlapping probes designed to cover the entire genome length, using KHV-J sequence (GenBank accession number AP008984) as reference. Applied to 7 CyHV-3 specimens from Poland and Indonesia, this targeted genomic enrichment enabled recovery of the full genomes with >99.9% reference coverage. The enrichment rate was directly correlated to the estimated number of viral copies contained in the DNA extracts used for library preparation, which varied between ∼5000 and ∼2×107. The average sequencing depth was >200 for all samples, thus allowing the search for variants with high confidence. Sequence analyses highlighted a significant proportion of intra-specimen sequence heterogeneity, suggesting the presence of mixed infections in all investigated fish. They also showed that inter-specimen genetic diversity at the genome scale was very low (>99.95% of sequence identity). By enabling full genome comparisons directly from infected fish tissues, this new method will be valuable to trace outbreaks rapidly and at a reasonable cost, and in turn to understand the transmission routes of CyHV-3. PMID:27703859
Takaesu, Azusa; Watanabe, Kiyotaka; Takai, Shinji; Sasaki, Yukako; Orino, Koichi
2008-01-01
Background Iron-storage protein, ferritin plays a central role in iron metabolism. Ferritin has dual function to store iron and segregate iron for protection of iron-catalyzed reactive oxygen species. Tissue ferritin is composed of two kinds of subunits (H: heavy chain or heart-type subunit; L: light chain or liver-type subunit). Ferritin gene expression is controlled at translational level in iron-dependent manner or at transcriptional level in iron-independent manner. However, sequencing analysis of marine mammalian ferritin subunits has not yet been performed fully. The purpose of this study is to reveal cDNA-derived amino acid sequences of cetacean ferritin H and L subunits, and demonstrate the possibility of expression of these subunits, especially H subunit, by iron. Methods Sequence analyses of cetacean ferritin H and L subunits were performed by direct sequencing of polymerase chain reaction (PCR) fragments from cDNAs generated via reverse transcription-PCR of leukocyte total RNA prepared from blood samples of six different dolphin species (Pseudorca crassidens, Lagenorhynchus obliquidens, Grampus griseus, Globicephala macrorhynchus, Tursiops truncatus, and Delphinapterus leucas). The putative iron-responsive element sequence in the 5'-untranslated region of the six different dolphin species was revealed by direct sequencing of PCR fragments obtained using leukocyte genomic DNA. Results Dolphin H and L subunits consist of 182 and 174 amino acids, respectively, and amino acid sequence identities of ferritin subunits among these dolphins are highly conserved (H: 99–100%, (99→98) ; L: 98–100%). The conserved 28 bp IRE sequence was located -144 bp upstream from the initiation codon in the six different dolphin species. Conclusion These results indicate that six different dolphin species have conserved ferritin sequences, and suggest that these genes are iron-dependently expressed. PMID:18954429
Rusch, Douglas B; Halpern, Aaron L; Sutton, Granger; Heidelberg, Karla B; Williamson, Shannon; Yooseph, Shibu; Wu, Dongying; Eisen, Jonathan A; Hoffman, Jeff M; Remington, Karin; Beeson, Karen; Tran, Bao; Smith, Hamilton; Baden-Tillson, Holly; Stewart, Clare; Thorpe, Joyce; Freeman, Jason; Andrews-Pfannkoch, Cynthia; Venter, Joseph E; Li, Kelvin; Kravitz, Saul; Heidelberg, John F; Utterback, Terry; Rogers, Yu-Hui; Falcón, Luisa I; Souza, Valeria; Bonilla-Rosso, Germán; Eguiarte, Luis E; Karl, David M; Sathyendranath, Shubha; Platt, Trevor; Bermingham, Eldredge; Gallardo, Victor; Tamayo-Castillo, Giselle; Ferrari, Michael R; Strausberg, Robert L; Nealson, Kenneth; Friedman, Robert; Frazier, Marvin; Venter, J. Craig
2007-01-01
The world's oceans contain a complex mixture of micro-organisms that are for the most part, uncharacterized both genetically and biochemically. We report here a metagenomic study of the marine planktonic microbiota in which surface (mostly marine) water samples were analyzed as part of the Sorcerer II Global Ocean Sampling expedition. These samples, collected across a several-thousand km transect from the North Atlantic through the Panama Canal and ending in the South Pacific yielded an extensive dataset consisting of 7.7 million sequencing reads (6.3 billion bp). Though a few major microbial clades dominate the planktonic marine niche, the dataset contains great diversity with 85% of the assembled sequence and 57% of the unassembled data being unique at a 98% sequence identity cutoff. Using the metadata associated with each sample and sequencing library, we developed new comparative genomic and assembly methods. One comparative genomic method, termed “fragment recruitment,” addressed questions of genome structure, evolution, and taxonomic or phylogenetic diversity, as well as the biochemical diversity of genes and gene families. A second method, termed “extreme assembly,” made possible the assembly and reconstruction of large segments of abundant but clearly nonclonal organisms. Within all abundant populations analyzed, we found extensive intra-ribotype diversity in several forms: (1) extensive sequence variation within orthologous regions throughout a given genome; despite coverage of individual ribotypes approaching 500-fold, most individual sequencing reads are unique; (2) numerous changes in gene content some with direct adaptive implications; and (3) hypervariable genomic islands that are too variable to assemble. The intra-ribotype diversity is organized into genetically isolated populations that have overlapping but independent distributions, implying distinct environmental preference. We present novel methods for measuring the genomic similarity between metagenomic samples and show how they may be grouped into several community types. Specific functional adaptations can be identified both within individual ribotypes and across the entire community, including proteorhodopsin spectral tuning and the presence or absence of the phosphate-binding gene PstS. PMID:17355176
Genotyping of ancient Mycobacterium tuberculosis strains reveals historic genetic diversity.
Müller, Romy; Roberts, Charlotte A; Brown, Terence A
2014-04-22
The evolutionary history of the Mycobacterium tuberculosis complex (MTBC) has previously been studied by analysis of sequence diversity in extant strains, but not addressed by direct examination of strain genotypes in archaeological remains. Here, we use ancient DNA sequencing to type 11 single nucleotide polymorphisms and two large sequence polymorphisms in the MTBC strains present in 10 archaeological samples from skeletons from Britain and Europe dating to the second-nineteenth centuries AD. The results enable us to assign the strains to groupings and lineages recognized in the extant MTBC. We show that at least during the eighteenth-nineteenth centuries AD, strains of M. tuberculosis belonging to different genetic groups were present in Britain at the same time, possibly even at a single location, and we present evidence for a mixed infection in at least one individual. Our study shows that ancient DNA typing applied to multiple samples can provide sufficiently detailed information to contribute to both archaeological and evolutionary knowledge of the history of tuberculosis.
Chemical-biogeographic survey of secondary metabolism in soil.
Charlop-Powers, Zachary; Owen, Jeremy G; Reddy, Boojala Vijay B; Ternei, Melinda A; Brady, Sean F
2014-03-11
In this study, we compare biosynthetic gene richness and diversity of 96 soil microbiomes from diverse environments found throughout the southwestern and northeastern regions of the United States. The 454-pyroseqencing of nonribosomal peptide adenylation (AD) and polyketide ketosynthase (KS) domain fragments amplified from these microbiomes provide a means to evaluate the variation of secondary metabolite biosynthetic diversity in different soil environments. Through soil composition and AD- and KS-amplicon richness analysis, we identify soil types with elevated biosynthetic potential. In general, arid soils show the richest observed biosynthetic diversity, whereas brackish sediments and pine forest soils show the least. By mapping individual environmental amplicon sequences to sequences derived from functionally characterized biosynthetic gene clusters, we identified conserved soil type-specific secondary metabolome enrichment patterns despite significant sample-to-sample sequence variation. These data are used to create chemical biogeographic distribution maps for biomedically valuable families of natural products in the environment that should prove useful for directing the discovery of bioactive natural products in the future.
Wang, Qiuyan; Wu, Huili; Wang, Anming; Du, Pengfei; Pei, Xiaolin; Li, Haifeng; Yin, Xiaopu; Huang, Lifeng; Xiong, Xiaolong
2010-01-01
DNA family shuffling is a powerful method for enzyme engineering, which utilizes recombination of naturally occurring functional diversity to accelerate laboratory-directed evolution. However, the use of this technique has been hindered by the scarcity of family genes with the required level of sequence identity in the genome database. We describe here a strategy for collecting metagenomic homologous genes for DNA shuffling from environmental samples by truncated metagenomic gene-specific PCR (TMGS-PCR). Using identified metagenomic gene-specific primers, twenty-three 921-bp truncated lipase gene fragments, which shared 64–99% identity with each other and formed a distinct subfamily of lipases, were retrieved from 60 metagenomic samples. These lipase genes were shuffled, and selected active clones were characterized. The chimeric clones show extensive functional and genetic diversity, as demonstrated by functional characterization and sequence analysis. Our results indicate that homologous sequences of genes captured by TMGS-PCR can be used as suitable genetic material for DNA family shuffling with broad applications in enzyme engineering. PMID:20962349
Genealogical analyses of rabies virus strains from Brazil based on N gene alleles.
Heinemann, M. B.; Fernandes-Matioli, F. M. C.; Cortez, A.; Soares, R. M.; Sakamoto, S. M.; Bernardi, F.; Ito, F. H.; Madeira, A. M. B. N.; Richtzenhain, L. J.
2002-01-01
Thirty rabies virus isolates from cows and vampire bats from different regions of São Paulo State, Southeastern Brazil and three rabies vaccines were studied genetically. The analysis was based on direct sequencing of PCR-amplified products of 600 nucleotides coding for the amino terminus of nucleoprotein gene. The sequences were checked to verify their genealogical and evolutionary relationships and possible implication for health programmes. Statistical data indicated that there were no significant genetic differences between samples isolated from distinct hosts, from different geographical regions and between samples collected in the last two decades. According to the HKA test, the variability observed in the sequences is probably due to genetic drift. Since changes in genetic material may produce modifications in the protein responsible for immunogenicity of virus, which may eventually cause vaccine failure in herds, we suggest that continuous efforts in monitoring genetic diversity in rabies virus field strains, in relation to vaccine strains, must be conducted. PMID:12113496
Palæomagnetism of Hawaiian lava flows
Doell, Richard R.; Cox, Allan
1961-01-01
PALÆOMAGNETIC investigations of volcanic rocks extruded in various parts of the world during the past several million years have generally revealed a younger sequence of lava flows magnetized nearly parallel to the field of a theoretical geocentric axial dipole, underlain by a sequence of older flows with exactly the opposite direction of remanent magnetization. A 180-degree reversal of the geomagnetic field, occurring near the middle of the Pleistocene epoch, has been inferred by many workers from such results1–3. This is a preliminary report of an investigation of 755 oriented samples collected from 152 lava flows on the island of Hawaii, selected to represent as many stratigraphic horizons as possible. (Sampling details are indicated in Table 1.) This work was undertaken because Hawaii's numerous thick sequences of lava flows, previously mapped as Pliocene to Historic by Stearns and Macdonald4, and afterwards assigned ages ranging from later Tertiary to Recent, by Macdonald and Davis5, appeared to offer an ideal opportunity to examine the most recent reversal of Earth's field.
Genetic analysis of 7 medieval skeletons from Aragonese Pyrenees
Núńez, Carolina; Sosa, Cecilia; Baeta, Miriam; Geppert, Maria; Turnbough, Meredith; Phillips, Nicole; Casalod, Yolanda; Bolea, Miguel; Roby, Rhonda; Budowle, Bruce; Martínez-Jarreta, Begońa
2011-01-01
Aim To perform a genetic characterization of 7 skeletons from medieval age found in a burial site in the Aragonese Pyrenees. Methods Allele frequencies of autosomal short tandem repeats (STR) loci were determined by 3 different STR systems. Mitochondrial DNA (mtDNA) and Y-chromosome haplogroups were determined by sequencing of the hypervariable segment 1 of mtDNA and typing of phylogenetic Y chromosome single nucleotide polymorphisms (Y-SNP) markers, respectively. Possible familial relationships were also investigated. Results Complete or partial STR profiles were obtained in 3 of the 7 samples. Mitochondrial DNA haplogroup was determined in 6 samples, with 5 of them corresponding to the haplogroup H and 1 to the haplogroup U5a. Y-chromosome haplogroup was determined in 2 samples, corresponding to the haplogroup R. In one of them, the sub-branch R1b1b2 was determined. mtDNA sequences indicated that some of the individuals could be maternally related, while STR profiles indicated no direct family relationships. Conclusions Despite the antiquity of the samples and great difficulty that genetic analyses entail, the combined use of autosomal STR markers, Y-chromosome informative SNPs, and mtDNA sequences allowed us to genotype a group of skeletons from the medieval age. PMID:21674829
How Do Deep Saline Aquifer Microbial Communities Respond to Supercritical CO2 Injection?
NASA Astrophysics Data System (ADS)
Mu, A.; Billman-Jacobe, H.; Boreham, C.; Schacht, U.; Moreau, J. W.
2011-12-01
Carbon Capture and Storage (CCS) is currently seen as a viable strategy for mitigating anthropogenic carbon dioxide pollution. The Cooperative Research Centre for Greenhouse Gas Technologies (CO2CRC) is currently conducting a field experiment in the Otway Basin (Australia) studying residual gas saturation in the water-saturated reservoir of the Paaratte Formation. As part of this study, a suite of pre-CO2 injection water samples were collected from approximately 1400 meters depth (60°C, 13.8 MPa) via an in situ sampling system. The in situ sampling system isolates aquifer water from sources of contamination while maintaining the formation pressure. Whole community DNA was extracted from these samples to investigate the prokaryotic biodiversity of the saline Paaratte aquifer (EC = 1509.6 uS/cm). Bioinformatic analysis of preliminary 16S ribosomal gene data revealed Thermincola, Acinetobacter, Sphingobium, and Dechloromonas amongst the closest related genera to environmental clone sequences obtained from a subset of pre-CO2 injection groundwater samples. Epifluorescent microscopy with 4',6-diamidino-2-phenylindole (DAPI) highlighted an abundance of filamentous cells ranging from 5 to 45 μM. Efforts are currently directed towards utilising a high throughput sequencing approach to capture an exhaustive profile of the microbial diversity of the Paaratte aquifer CO2 injection site, and to understand better the response of in situ microbial populations to the injection of large volumes (e.g. many kilotonnes) of supercritical CO2 (sc-CO2). Sequencing results will be used to direct cultivation efforts towards enrichment of a CO2-tolerant microorganism. Understanding the microbial response to sc-CO2 is an integral aspect of carbon dioxide storage, for which very little information exists in the literature. This study aims to elucidate molecular mechanisms, through genomic and cultivation-based methods, for CO2 tolerance with the prospect of engineering biofilms to enhance trapping of CO2 in saline aquifers.
PIÑEYRO-NELSON, A; VAN HEERWAARDEN, J; PERALES, H R; SERRATOS-HERNÁNDEZ, J A; RANGEL, A; HUFFORD, M B; GEPTS, P; GARAY-ARROYO, A; RIVERA-BUSTAMANTE, R; ÁLVAREZ-BUYLLA, E R
2009-01-01
A possible consequence of planting genetically modified organisms (GMOs) in centres of crop origin is unintended gene flow into traditional landraces. In 2001, a study reported the presence of the transgenic 35S promoter in maize landraces sampled in 2000 from the Sierra Juarez of Oaxaca, Mexico. Analysis of a large sample taken from the same region in 2003 and 2004 could not confirm the existence of transgenes, thereby casting doubt on the earlier results. These two studies were based on different sampling and analytical procedures and are thus hard to compare. Here, we present new molecular data for this region that confirm the presence of transgenes in three of 23 localities sampled in 2001. Transgene sequences were not detected in samples taken in 2002 from nine localities, while directed samples taken in 2004 from two of the positive 2001 localities were again found to contain transgenic sequences. These findings suggest the persistence or re-introduction of transgenes up until 2004 in this area. We address variability in recombinant sequence detection by analyzing the consistency of current molecular assays. We also present theoretical results on the limitations of estimating the probability of transgene detection in samples taken from landraces. The inclusion of a limited number of female gametes and, more importantly, aggregated transgene distributions may significantly lower detection probabilities. Our analytical and sampling considerations help explain discrepancies among different detection efforts, including the one presented here, and provide considerations for the establishment of monitoring protocols to detect the presence of transgenes among structured populations of landraces. PMID:19143938
Piñeyro-Nelson, A; Van Heerwaarden, J; Perales, H R; Serratos-Hernández, J A; Rangel, A; Hufford, M B; Gepts, P; Garay-Arroyo, A; Rivera-Bustamante, R; Alvarez-Buylla, E R
2009-02-01
A possible consequence of planting genetically modified organisms (GMOs) in centres of crop origin is unintended gene flow into traditional landraces. In 2001, a study reported the presence of the transgenic 35S promoter in maize landraces sampled in 2000 from the Sierra Juarez of Oaxaca, Mexico. Analysis of a large sample taken from the same region in 2003 and 2004 could not confirm the existence of transgenes, thereby casting doubt on the earlier results. These two studies were based on different sampling and analytical procedures and are thus hard to compare. Here, we present new molecular data for this region that confirm the presence of transgenes in three of 23 localities sampled in 2001. Transgene sequences were not detected in samples taken in 2002 from nine localities, while directed samples taken in 2004 from two of the positive 2001 localities were again found to contain transgenic sequences. These findings suggest the persistence or re-introduction of transgenes up until 2004 in this area. We address variability in recombinant sequence detection by analyzing the consistency of current molecular assays. We also present theoretical results on the limitations of estimating the probability of transgene detection in samples taken from landraces. The inclusion of a limited number of female gametes and, more importantly, aggregated transgene distributions may significantly lower detection probabilities. Our analytical and sampling considerations help explain discrepancies among different detection efforts, including the one presented here, and provide considerations for the establishment of monitoring protocols to detect the presence of transgenes among structured populations of landraces.
Korber, B T; Kunstman, K J; Patterson, B K; Furtado, M; McEvilly, M M; Levy, R; Wolinsky, S M
1994-01-01
Human immunodeficiency virus type 1 (HIV-1) sequences were generated from blood and from brain tissue obtained by stereotactic biopsy from six patients undergoing a diagnostic neurosurgical procedure. Proviral DNA was directly amplified by nested PCR, and 8 to 36 clones from each sample were sequenced. Phylogenetic analysis of intrapatient envelope V3-V5 region HIV-1 DNA sequence sets revealed that brain viral sequences were clustered relative to the blood viral sequences, suggestive of tissue-specific compartmentalization of the virus in four of the six cases. In the other two cases, the blood and brain virus sequences were intermingled in the phylogenetic analyses, suggesting trafficking of virus between the two tissues. Slide-based PCR-driven in situ hybridization of two of the patients' brain biopsy samples confirmed our interpretation of the intrapatient phylogenetic analyses. Interpatient V3 region brain-derived sequence distances were significantly less than blood-derived sequence distances. Relative to the tip of the loop, the set of brain-derived viral sequences had a tendency towards negative or neutral charge compared with the set of blood-derived viral sequences. Entropy calculations were used as a measure of the variability at each position in alignments of blood and brain viral sequences. A relatively conserved set of positions were found, with a significantly lower entropy in the brain-than in the blood-derived viral sequences. These sites constitute a brain "signature pattern," or a noncontiguous set of amino acids in the V3 region conserved in viral sequences derived from brain tissue. This brain-derived signature pattern was also well preserved among isolates previously characterized in vitro as macrophage tropic. Macrophage-monocyte tropism may be the biological constraint that results in the conservation of the viral brain signature pattern. Images PMID:7933130
Nield, Blair S.; Willows, Robert D.; Torda, Andrew E.; Gillings, Michael R.; Holmes, Andrew J.; Nevalainen, K.M. Helena; Stokes, H.W.; Mabbutt, Bridget C.
2004-01-01
By targeting gene cassettes by polymerase chain reaction (PCR) directly from environmentally derived DNA, we are able to amplify entire open reading frames (ORFs) independently of prior sequence knowledge. Approximately 10% of the mobile genes recovered by these means can be attributed to known protein families. Here we describe the characterization of two ORFs which show moderate homology to known proteins: (1) an aminoglycoside phosphotransferase displaying 25% sequence identity with APH(7″) from Streptomyces hygroscopicus, and (2) an RNA methyltransferase sharing 25%–28% identity with a group of recently defined bacterial RNA methyltransferases distinct from the SpoU enzyme family. Our novel genes were expressed as recombinant products and assayed for appropriate enzyme activity. The aminoglycoside phosphotransferase displayed ATPase activity, consistent with the presence of characteristic Mg2+-binding residues. Unlike related APH(4) or APH(7″) enzymes, however, this activity was not enhanced by hygromycin B or kanamycin, suggesting the normal substrate to be a different aminoglycoside. The RNA methyltransferase contains sequence motifs of the RNA methyltransferase superfamily, and our recombinant version showed methyltransferase activity with RNA. Our data confirm that gene cassettes present in the environment encode folded enzymes with novel sequence variation and demonstrable catalytic activity. Our PCR approach (cassette PCR) may be used to identify a diverse range of ORFs from any environmental sample, as well as to directly access the gene pool found in mobile gene cassettes commonly associated with integrons. This gene pool can be accessed from both cultured and uncultured microbial samples as a source of new enzymes and proteins. PMID:15152095
Nield, Blair S; Willows, Robert D; Torda, Andrew E; Gillings, Michael R; Holmes, Andrew J; Nevalainen, K M Helena; Stokes, H W; Mabbutt, Bridget C
2004-06-01
By targeting gene cassettes by polymerase chain reaction (PCR) directly from environmentally derived DNA, we are able to amplify entire open reading frames (ORFs) independently of prior sequence knowledge. Approximately 10% of the mobile genes recovered by these means can be attributed to known protein families. Here we describe the characterization of two ORFs which show moderate homology to known proteins: (1) an aminoglycoside phosphotransferase displaying 25% sequence identity with APH(7") from Streptomyces hygroscopicus, and (2) an RNA methyltransferase sharing 25%-28% identity with a group of recently defined bacterial RNA methyltransferases distinct from the SpoU enzyme family. Our novel genes were expressed as recombinant products and assayed for appropriate enzyme activity. The aminoglycoside phosphotransferase displayed ATPase activity, consistent with the presence of characteristic Mg(2+)-binding residues. Unlike related APH(4) or APH(7") enzymes, however, this activity was not enhanced by hygromycin B or kanamycin, suggesting the normal substrate to be a different aminoglycoside. The RNA methyltransferase contains sequence motifs of the RNA methyltransferase superfamily, and our recombinant version showed methyltransferase activity with RNA. Our data confirm that gene cassettes present in the environment encode folded enzymes with novel sequence variation and demonstrable catalytic activity. Our PCR approach (cassette PCR) may be used to identify a diverse range of ORFs from any environmental sample, as well as to directly access the gene pool found in mobile gene cassettes commonly associated with integrons. This gene pool can be accessed from both cultured and uncultured microbial samples as a source of new enzymes and proteins.
Van Neste, Christophe; Gansemans, Yannick; De Coninck, Dieter; Van Hoofstat, David; Van Criekinge, Wim; Deforce, Dieter; Van Nieuwerburgh, Filip
2015-03-01
Routine use of massively parallel sequencing (MPS) for forensic genomics is on the horizon. The last few years, several algorithms and workflows have been developed to analyze forensic MPS data. However, none have yet been tailored to the needs of the forensic analyst who does not possess an extensive bioinformatics background. We developed our previously published forensic MPS data analysis framework MyFLq (My-Forensic-Loci-queries) into an open-source, user-friendly, web-based application. It can be installed as a standalone web application, or run directly from the Illumina BaseSpace environment. In the former, laboratories can keep their data on-site, while in the latter, data from forensic samples that are sequenced on an Illumina sequencer can be uploaded to Basespace during acquisition, and can subsequently be analyzed using the published MyFLq BaseSpace application. Additional features were implemented such as an interactive graphical report of the results, an interactive threshold selection bar, and an allele length-based analysis in addition to the sequenced-based analysis. Practical use of the application is demonstrated through the analysis of four 16-plex short tandem repeat (STR) samples, showing the complementarity between the sequence- and length-based analysis of the same MPS data. Copyright © 2014 The Authors. Published by Elsevier Ireland Ltd.. All rights reserved.
Sampson, Jason S.; Murray, Kermit K.; Muddiman, David C.
2013-01-01
We report the implementation of an infrared laser onto our previously reported matrix-assisted laser desorption electrospray ionization (MALDESI) source with ESI post-ionization yielding multiply charged peptides and proteins. Infrared (IR)-MALDESI is demonstrated for atmospheric pressure desorption and ionization of biological molecules ranging in molecular weight from 1.2 to 17 kDa. High resolving power, high mass accuracy single-acquisition Fourier transform ion cyclotron resonance (FT-ICR) mass spectra were generated from liquid-and solid-state peptide and protein samples by desorption with an infrared laser (2.94 µm) followed by ESI post-ionization. Intact and top-down analysis of equine myoglobin (17 kDa) desorbed from the solid state with ESI post-ionization demonstrates the sequencing capabilities using IR-MALDESI coupled to FT-ICR mass spectrometry. Carbohydrates and lipids were detected through direct analysis of milk and egg yolk using both UV- and IR-MALDESI with minimal sample preparation. Three of the four classes of biological macromolecules (proteins, carbohydrates, and lipids) have been ionized and detected using MALDESI with minimal sample preparation. Sequencing of O-linked glycans, cleaved from mucin using reductive β-elimination chemistry, is also demonstrated. PMID:19185512
2013-01-01
Background Deep sequencing of viruses isolated from infected hosts is an efficient way to measure population-genetic variation and can reveal patterns of dispersal and natural selection. In this study, we mined existing Illumina sequence reads to investigate single-nucleotide polymorphisms (SNPs) within two RNA viruses of the Western honey bee (Apis mellifera), deformed wing virus (DWV) and Israel acute paralysis virus (IAPV). All viral RNA was extracted from North American samples of honey bees or, in one case, the ectoparasitic mite Varroa destructor. Results Coverage depth was generally lower for IAPV than DWV, and marked gaps in coverage occurred in several narrow regions (< 50 bp) of IAPV. These coverage gaps occurred across sequencing runs and were virtually unchanged when reads were re-mapped with greater permissiveness (up to 8% divergence), suggesting a recurrent sequencing artifact rather than strain divergence. Consensus sequences of DWV for each sample showed little phylogenetic divergence, low nucleotide diversity, and strongly negative values of Fu and Li’s D statistic, suggesting a recent population bottleneck and/or purifying selection. The Kakugo strain of DWV fell outside of all other DWV sequences at 100% bootstrap support. IAPV consensus sequences supported the existence of multiple clades as had been previously reported, and Fu and Li’s D was closer to neutral expectation overall, although a sliding-window analysis identified a significantly positive D within the protease region, suggesting selection maintains diversity in that region. Within-sample mean diversity was comparable between the two viruses on average, although for both viruses there was substantial variation among samples in mean diversity at third codon positions and in the number of high-diversity sites. FST values were bimodal for DWV, likely reflecting neutral divergence in two low-diversity populations, whereas IAPV had several sites that were strong outliers with very low FST. Conclusions This initial survey of genetic variation within honey bee RNA viruses suggests future directions for studies examining the underlying causes of population-genetic structure in these economically important pathogens. PMID:23497218
Cornman, Robert Scott; Boncristiani, Humberto; Dainat, Benjamin; Chen, Yanping; vanEngelsdorp, Dennis; Weaver, Daniel; Evans, Jay D
2013-03-07
Deep sequencing of viruses isolated from infected hosts is an efficient way to measure population-genetic variation and can reveal patterns of dispersal and natural selection. In this study, we mined existing Illumina sequence reads to investigate single-nucleotide polymorphisms (SNPs) within two RNA viruses of the Western honey bee (Apis mellifera), deformed wing virus (DWV) and Israel acute paralysis virus (IAPV). All viral RNA was extracted from North American samples of honey bees or, in one case, the ectoparasitic mite Varroa destructor. Coverage depth was generally lower for IAPV than DWV, and marked gaps in coverage occurred in several narrow regions (< 50 bp) of IAPV. These coverage gaps occurred across sequencing runs and were virtually unchanged when reads were re-mapped with greater permissiveness (up to 8% divergence), suggesting a recurrent sequencing artifact rather than strain divergence. Consensus sequences of DWV for each sample showed little phylogenetic divergence, low nucleotide diversity, and strongly negative values of Fu and Li's D statistic, suggesting a recent population bottleneck and/or purifying selection. The Kakugo strain of DWV fell outside of all other DWV sequences at 100% bootstrap support. IAPV consensus sequences supported the existence of multiple clades as had been previously reported, and Fu and Li's D was closer to neutral expectation overall, although a sliding-window analysis identified a significantly positive D within the protease region, suggesting selection maintains diversity in that region. Within-sample mean diversity was comparable between the two viruses on average, although for both viruses there was substantial variation among samples in mean diversity at third codon positions and in the number of high-diversity sites. FST values were bimodal for DWV, likely reflecting neutral divergence in two low-diversity populations, whereas IAPV had several sites that were strong outliers with very low FST. This initial survey of genetic variation within honey bee RNA viruses suggests future directions for studies examining the underlying causes of population-genetic structure in these economically important pathogens.
Botti, Sara; Giuffra, Elisabetta
2010-08-23
DNA barcodes are a global standard for species identification and have countless applications in the medical, forensic and alimentary fields, but few barcoding methods work efficiently in samples in which DNA is degraded, e.g. foods and archival specimens. This limits the choice of target regions harbouring a sufficient number of diagnostic polymorphisms. The method described here uses existing PCR and sequencing methodologies to detect mitochondrial DNA polymorphisms in complex matrices such as foods. The reported application allowed the discrimination among 17 fish species of the Scombridae family with high commercial interest such as mackerels, bonitos and tunas which are often present in processed seafood. The approach can be easily upgraded with the release of new genetic diversity information to increase the range of detected species. Cocktail of primers are designed for PCR using publicly available sequences of the target sequence. They are composed of a fixed 5' region and of variable 3' cocktail portions that allow amplification of any member of a group of species of interest. The population of short amplicons is directly sequenced and indexed using primers containing a longer 5' region and the non polymorphic portion of the cocktail portion. A 226 bp region of CytB was selected as target after collection and screening of 148 online sequences; 85 SNPs were found, of which 75 were present in at least two sequences. Primers were also designed for two shorter sub-fragments that could be amplified from highly degraded samples. The test was used on 103 samples of seafood (canned tuna and scomber, tuna salad, tuna sauce) and could successfully detect the presence of different or additional species that were not identified on the labelling of canned tuna, tuna salad and sauce samples. The described method is largely independent of the degree of degradation of DNA source and can thus be applied to processed seafood. Moreover, the method is highly flexible: publicly available sequence information on mitochondrial genomes are rapidly increasing for most species, facilitating the choice of target sequences and the improvement of resolution of the test. This is particularly important for discrimination of marine and aquaculture species for which genome information is still limited.
Population diversity of ammonium oxidizers investigated by specific PCR amplification
Ward, B.B.; Voytek, M.A.; Witzel, K.-P.
1997-01-01
The species composition of ammonia-oxidizing bacteria in aquatic environments was investigated using PCR primers for 16S rRNA genes to amplify specific subsets of the total ammonia-oxidizer population. The specificity of the amplification reactions was determined using total genomic DNA from known nitrifying strains and non-nitrifying strains identified as having similar rDNA sequences. Specificity of amplification was determined both for direct amplification, using the nitrifier specific primers, and with nested amplification, in which the nitrifier primers were used to reamplify a fragment obtained from direct amplification with Eubacterial universal primers. The present level of specificity allows the distinction between Nitrosomonas europaea, Nitrosomonas sp. (marine) and the other known ammonia-oxidizers in the beta subclass of the Proteobacteria. Using total DNA extracted from natural samples, we used direct amplification to determine presence/absence of different species groups. Species composition was found to differ among depths in vertical profiles of lake samples and among samples and enrichments from various other aquatic environments. Nested PCR yielded several more positive reactions, which implies that nitrifier DNA was present in most samples, but often at very low levels.
NASA Astrophysics Data System (ADS)
Meyer, Matthew; Morris, Antony; Anderson, Mark; MacLeod, Chris
2015-04-01
The Oman ophiolite is an important natural laboratory for understanding the construction of oceanic crust at fast spreading axes and its subsequent tectonic evolution. Previous paleomagnetic research in lavas of the northern ophiolitic blocks (Perrin et al., 2000) has demonstrated substantial clockwise intraoceanic tectonic rotations. Paleomagnetic data from lower crustal sequences in the southern blocks, however, have been more equivocal due to complications arising from remagnetization, and have been used to infer that clockwise rotations seen in the north are internal to the ophiolite rather than regionally significant (Weiler, 2000). Here we demonstrate the importance and advantages of sampling crustal transects in the ophiolite in order to understand the nature and variability in magnetization directions. By systematically sampling the lower crustal sequence exposed in Wadi Abyad (Rustaq block) we resolve for the first time in a single section a pattern of remagnetized lowermost gabbros and retention of earlier magnetizations by uppermost gabbros and the overlying dyke-rooting zone. Results are supported by a positive fold test that shows that remagnetization of lower gabbros occurred prior to the Campanian structural disruption of the Moho. NW-directed remagnetized remanences in the lower units are consistent with those used by Weiler (2000) to infer lack of significant rotation of the southern blocks and to argue, therefore, that rotation of the northern blocks was internal to the ophiolite. In contrast, E/ENE-directed remanences in the uppermost levels of Wadi Abyad imply large, clockwise rotation of the Rustaq block, of a sense and magnitude consistent with intraoceanic rotations inferred from extrusive sections in the northern blocks. We conclude that without the control provided by systematic crustal sampling, the potential for different remanence directions being acquired at different times may lead to erroneous tectonic interpretation.
Mak, Sarah Siu Tze; Gopalakrishnan, Shyam; Carøe, Christian; Geng, Chunyu; Liu, Shanlin; Sinding, Mikkel-Holger S; Kuderna, Lukas F K; Zhang, Wenwei; Fu, Shujin; Vieira, Filipe G; Germonpré, Mietje; Bocherens, Hervé; Fedorov, Sergey; Petersen, Bent; Sicheritz-Pontén, Thomas; Marques-Bonet, Tomas; Zhang, Guojie; Jiang, Hui; Gilbert, M Thomas P
2017-01-01
Abstract Ancient DNA research has been revolutionized following development of next-generation sequencing platforms. Although a number of such platforms have been applied to ancient DNA samples, the Illumina series are the dominant choice today, mainly because of high production capacities and short read production. Recently a potentially attractive alternative platform for palaeogenomic data generation has been developed, the BGISEQ-500, whose sequence output are comparable with the Illumina series. In this study, we modified the standard BGISEQ-500 library preparation specifically for use on degraded DNA, then directly compared the sequencing performance and data quality of the BGISEQ-500 to the Illumina HiSeq2500 platform on DNA extracted from 8 historic and ancient dog and wolf samples. The data generated were largely comparable between sequencing platforms, with no statistically significant difference observed for parameters including level (P = 0.371) and average sequence length (P = 0718) of endogenous nuclear DNA, sequence GC content (P = 0.311), double-stranded DNA damage rate (v. 0.309), and sequence clonality (P = 0.093). Small significant differences were found in single-strand DNA damage rate (δS; slightly lower for the BGISEQ-500, P = 0.011) and the background rate of difference from the reference genome (θ; slightly higher for BGISEQ-500, P = 0.012). This may result from the differences in amplification cycles used to polymerase chain reaction–amplify the libraries. A significant difference was also observed in the mitochondrial DNA percentages recovered (P = 0.018), although we believe this is likely a stochastic effect relating to the extremely low levels of mitochondria that were sequenced from 3 of the samples with overall very low levels of endogenous DNA. Although we acknowledge that our analyses were limited to animal material, our observations suggest that the BGISEQ-500 holds the potential to represent a valid and potentially valuable alternative platform for palaeogenomic data generation that is worthy of future exploration by those interested in the sequencing and analysis of degraded DNA. PMID:28854615
A global sampling approach to designing and reengineering RNA secondary structures.
Levin, Alex; Lis, Mieszko; Ponty, Yann; O'Donnell, Charles W; Devadas, Srinivas; Berger, Bonnie; Waldispühl, Jérôme
2012-11-01
The development of algorithms for designing artificial RNA sequences that fold into specific secondary structures has many potential biomedical and synthetic biology applications. To date, this problem remains computationally difficult, and current strategies to address it resort to heuristics and stochastic search techniques. The most popular methods consist of two steps: First a random seed sequence is generated; next, this seed is progressively modified (i.e. mutated) to adopt the desired folding properties. Although computationally inexpensive, this approach raises several questions such as (i) the influence of the seed; and (ii) the efficiency of single-path directed searches that may be affected by energy barriers in the mutational landscape. In this article, we present RNA-ensign, a novel paradigm for RNA design. Instead of taking a progressive adaptive walk driven by local search criteria, we use an efficient global sampling algorithm to examine large regions of the mutational landscape under structural and thermodynamical constraints until a solution is found. When considering the influence of the seeds and the target secondary structures, our results show that, compared to single-path directed searches, our approach is more robust, succeeds more often and generates more thermodynamically stable sequences. An ensemble approach to RNA design is thus well worth pursuing as a complement to existing approaches. RNA-ensign is available at http://csb.cs.mcgill.ca/RNAensign.
A global sampling approach to designing and reengineering RNA secondary structures
Levin, Alex; Lis, Mieszko; Ponty, Yann; O’Donnell, Charles W.; Devadas, Srinivas; Berger, Bonnie; Waldispühl, Jérôme
2012-01-01
The development of algorithms for designing artificial RNA sequences that fold into specific secondary structures has many potential biomedical and synthetic biology applications. To date, this problem remains computationally difficult, and current strategies to address it resort to heuristics and stochastic search techniques. The most popular methods consist of two steps: First a random seed sequence is generated; next, this seed is progressively modified (i.e. mutated) to adopt the desired folding properties. Although computationally inexpensive, this approach raises several questions such as (i) the influence of the seed; and (ii) the efficiency of single-path directed searches that may be affected by energy barriers in the mutational landscape. In this article, we present RNA-ensign, a novel paradigm for RNA design. Instead of taking a progressive adaptive walk driven by local search criteria, we use an efficient global sampling algorithm to examine large regions of the mutational landscape under structural and thermodynamical constraints until a solution is found. When considering the influence of the seeds and the target secondary structures, our results show that, compared to single-path directed searches, our approach is more robust, succeeds more often and generates more thermodynamically stable sequences. An ensemble approach to RNA design is thus well worth pursuing as a complement to existing approaches. RNA-ensign is available at http://csb.cs.mcgill.ca/RNAensign. PMID:22941632
Lallemand, E; Coiffier, G; Arvieux, C; Brillet, E; Guggenbuhl, P; Jolivet-Gougeon, A
2016-05-01
The rapid identification of bacterial species involved in bone and joint infections (BJI) is an important element to optimize the diagnosis and care of patients. The aim of this study was to evaluate the usefulness of matrix-assisted laser desorption ionization mass spectrometry (MALDI-TOF MS) for the rapid diagnosis of bone infections, directly on synovial fluid (SF) or on crushed osteoarticular samples (CS). From January to October 2013, we prospectively analyzed 111 osteoarticular samples (bone and joint samples, BJS) from 78 patients in care at the University Hospital of Rennes, France. The diagnosis procedure leading to the sample collection was linked to a suspicion of infection, inflammatory disease, arthritis, or for any bone or joint abnormalities. Standard bacteriological diagnosis and molecular biology analysis [16S rRNA polymerase chain reaction (PCR) and sequencing] were conducted. In addition, analysis by MALDI-TOF MS was performed directly on the osteoarticular samples, as soon as the amount allowed. Culture, which remains the gold standard for the diagnosis of BJI, has the highest sensitivity (85.9 %) and remains necessary to test antimicrobial susceptibility. The 16S rDNA PCR results were positive in the group with positive BJI (28.6 %) and negative in the group without infection. Direct examination remains insensitive (31.7 %) but more effective than MALDI-TOF MS directly on the sample (6.3 %). The specificity was 100 % in all cases, except for culture (74.5 %). Bacterial culture remains the gold standard, especially enrichment in blood bottles. Direct analysis of bone samples with MALDI-TOF MS is not useful, possibly due to the low inoculum of BJS.
A Cluster of Fatal Tick-borne Encephalitis Virus Infection in Organ Transplant Setting.
Lipowski, Dariusz; Popiel, Marta; Perlejewski, Karol; Nakamura, Shota; Bukowska-Osko, Iwona; Rzadkiewicz, Ewa; Dzieciatkowski, Tomasz; Milecka, Anna; Wenski, Wojciech; Ciszek, Michal; Debska-Slizien, Alicja; Ignacak, Ewa; Cortes, Kamila Caraballo; Pawelczyk, Agnieszka; Horban, Andrzej; Radkowski, Marek; Laskus, Tomasz
2017-03-15
Tick-borne encephalitis virus (TBEV) infection has become a major health problem in Europe and is currently a common cause of viral brain infection in many countries. Encephalitis in transplant recipients, althrough rare, is becoming a recognized complication. Our study provides the first description of transmission of TBEV through transplantation of solid organs. Three patients who received solid organ transplants from a single donor (2 received kidney, and 1 received liver) developed encephalitis 17-49 days after transplantation and subsequently died. Blood and autopsy tissue samples were tested by next-generation sequencing (NGS) and reverse transcription polymerase chain reaction (RT-PCR). All 3 recipients were first analyzed in autopsy brain tissue samples and/or cerebrospinal fluid by NGS, which yielded 24-52 million sequences per sample and 9-988 matched TBEV sequences in each patient. The presence of TBEV was confirmed by RT-PCR in all recipients and in the donor, and direct sequencing of amplification products corroborated the presence of the same viral strain. We demonstrated transmission of TBEV by transplantation of solid organs. In such a setting, TBEV infection may be fatal, probably due to pharmacological immunosuppression. Organ donors should be screened for TBEV when coming from or visiting endemic areas. © The Author 2017. Published by Oxford University Press for the Infectious Diseases Society of America. All rights reserved. For permissions, e-mail journals.permissions@oup.com.
Barbano, Raffaela; Pasculli, Barbara; Coco, Michelina; Fontana, Andrea; Copetti, Massimiliano; Rendina, Michelina; Valori, Vanna Maria; Graziano, Paolo; Maiello, Evaristo; Fazio, Vito Michele; Parrella, Paola
2015-01-01
BRAF codon 600 mutation testing of melanoma patients is mandatory for the choice of the most appropriate therapy in the clinical setting. Competitive allele specific TaqMan PCR (Cast-PCR) technology allows not only the selective amplification of minor alleles, but it also blocks the amplification of non-mutant allele. We genotyped codon 600 of the BRAF gene in 54 patients’ samples by Cast-PCR and bidirectional direct sequence analysis. All the mutations detected by sequencing were also identified by Cast-PCR. In addition, Cast-PCR assay detected four samples carrying mutations and was able to clearly identify two mutations of uncertain interpretation by Sanger sequencing. The limit of detection of Cast-PCR was evaluated by constructing dilution curves of BRAFV600E and BRAFV600K mutated clinical samples mixed with a not-mutated specimens. Both mutations could be detected until a 1:100 mutated/not mutated ratio. Cloning and sequencing of the clones was used to confirm mutations on representative discrepant cases. Cast PCR performances were not affected by intratumour heterogeneity, and less affected by melanin content. Our results indicate that Cast-PCR is a reliable diagnostic tool for the identification of melanoma patients as eligible to be treated with TKIs and might be implemented in the clinical setting as elective screening method. PMID:26690267
Naccache, Samia N.; Federman, Scot; Veeraraghavan, Narayanan; Zaharia, Matei; Lee, Deanna; Samayoa, Erik; Bouquet, Jerome; Greninger, Alexander L.; Luk, Ka-Cheung; Enge, Barryett; Wadford, Debra A.; Messenger, Sharon L.; Genrich, Gillian L.; Pellegrino, Kristen; Grard, Gilda; Leroy, Eric; Schneider, Bradley S.; Fair, Joseph N.; Martínez, Miguel A.; Isa, Pavel; Crump, John A.; DeRisi, Joseph L.; Sittler, Taylor; Hackett, John; Miller, Steve; Chiu, Charles Y.
2014-01-01
Unbiased next-generation sequencing (NGS) approaches enable comprehensive pathogen detection in the clinical microbiology laboratory and have numerous applications for public health surveillance, outbreak investigation, and the diagnosis of infectious diseases. However, practical deployment of the technology is hindered by the bioinformatics challenge of analyzing results accurately and in a clinically relevant timeframe. Here we describe SURPI (“sequence-based ultrarapid pathogen identification”), a computational pipeline for pathogen identification from complex metagenomic NGS data generated from clinical samples, and demonstrate use of the pipeline in the analysis of 237 clinical samples comprising more than 1.1 billion sequences. Deployable on both cloud-based and standalone servers, SURPI leverages two state-of-the-art aligners for accelerated analyses, SNAP and RAPSearch, which are as accurate as existing bioinformatics tools but orders of magnitude faster in performance. In fast mode, SURPI detects viruses and bacteria by scanning data sets of 7–500 million reads in 11 min to 5 h, while in comprehensive mode, all known microorganisms are identified, followed by de novo assembly and protein homology searches for divergent viruses in 50 min to 16 h. SURPI has also directly contributed to real-time microbial diagnosis in acutely ill patients, underscoring its potential key role in the development of unbiased NGS-based clinical assays in infectious diseases that demand rapid turnaround times. PMID:24899342
Ameur, Adam; Bunikis, Ignas; Enroth, Stefan; Gyllensten, Ulf
2014-01-01
CanvasDB is an infrastructure for management and analysis of genetic variants from massively parallel sequencing (MPS) projects. The system stores SNP and indel calls in a local database, designed to handle very large datasets, to allow for rapid analysis using simple commands in R. Functional annotations are included in the system, making it suitable for direct identification of disease-causing mutations in human exome- (WES) or whole-genome sequencing (WGS) projects. The system has a built-in filtering function implemented to simultaneously take into account variant calls from all individual samples. This enables advanced comparative analysis of variant distribution between groups of samples, including detection of candidate causative mutations within family structures and genome-wide association by sequencing. In most cases, these analyses are executed within just a matter of seconds, even when there are several hundreds of samples and millions of variants in the database. We demonstrate the scalability of canvasDB by importing the individual variant calls from all 1092 individuals present in the 1000 Genomes Project into the system, over 4.4 billion SNPs and indels in total. Our results show that canvasDB makes it possible to perform advanced analyses of large-scale WGS projects on a local server. Database URL: https://github.com/UppsalaGenomeCenter/CanvasDB PMID:25281234
Bacterial taxa–area and distance–decay relationships in marine environments
Zinger, L; Boetius, A; Ramette, A
2014-01-01
The taxa–area relationship (TAR) and the distance–decay relationship (DDR) both describe spatial turnover of taxa and are central patterns of biodiversity. Here, we compared TAR and DDR of bacterial communities across different marine realms and ecosystems at the global scale. To obtain reliable global estimates for both relationships, we quantified the poorly assessed effects of sequencing depth, rare taxa removal and number of sampling sites. Slope coefficients of bacterial TARs were within the range of those of plants and animals, whereas slope coefficients of bacterial DDR were much lower. Slope coefficients were mostly affected by removing rare taxa and by the number of sampling sites considered in the calculations. TAR and DDR slope coefficients were overestimated at sequencing depth <4000 sequences per sample. Noticeably, bacterial TAR and DDR patterns did not correlate with each other both within and across ecosystem types, suggesting that (i) TAR cannot be directly derived from DDR and (ii) TAR and DDR may be influenced by different ecological factors. Nevertheless, we found marine bacterial TAR and DDR to be steeper in ecosystems associated with high environmental heterogeneity or spatial isolation, namely marine sediments and coastal environments compared with pelagic ecosystems. Hence, our study provides information on macroecological patterns of marine bacteria, as well as methodological and conceptual insights, at a time when biodiversity surveys increasingly make use of high-throughput sequencing technologies. PMID:24460915
Løvoll, Marie; Wiik-Nielsen, Jannicke; Grove, Søren; Wiik-Nielsen, Christer R; Kristoffersen, Anja B; Faller, Randi; Poppe, Trygve; Jung, Joonil; Pedamallu, Chandra S; Nederbragt, Alexander J; Meyerson, Matthew; Rimstad, Espen; Tengs, Torstein
2010-11-10
Cardiomyopathy syndrome (CMS) is a severe disease affecting large farmed Atlantic salmon. Mortality often appears without prior clinical signs, typically shortly prior to slaughter. We recently reported the finding and the complete genomic sequence of a novel piscine reovirus (PRV), which is associated with another cardiac disease in Atlantic salmon; heart and skeletal muscle inflammation (HSMI). In the present work we have studied whether PRV or other infectious agents may be involved in the etiology of CMS. Using high throughput sequencing on heart samples from natural outbreaks of CMS and from fish experimentally challenged with material from fish diagnosed with CMS a high number of sequence reads identical to the PRV genome were identified. In addition, a sequence contig from a novel totivirus could also be constructed. Using RT-qPCR, levels of PRV in tissue samples were quantified and the totivirus was detected in all samples tested from CMS fish but not in controls. In situ hybridization supported this pattern indicating a possible association between CMS and the novel piscine totivirus. Although causality for CMS in Atlantic salmon could not be proven for either of the two viruses, our results are compatible with a hypothesis where, in the experimental challenge studied, PRV behaves as an opportunist whereas the totivirus might be more directly linked with the development of CMS.
Ameur, Adam; Bunikis, Ignas; Enroth, Stefan; Gyllensten, Ulf
2014-01-01
CanvasDB is an infrastructure for management and analysis of genetic variants from massively parallel sequencing (MPS) projects. The system stores SNP and indel calls in a local database, designed to handle very large datasets, to allow for rapid analysis using simple commands in R. Functional annotations are included in the system, making it suitable for direct identification of disease-causing mutations in human exome- (WES) or whole-genome sequencing (WGS) projects. The system has a built-in filtering function implemented to simultaneously take into account variant calls from all individual samples. This enables advanced comparative analysis of variant distribution between groups of samples, including detection of candidate causative mutations within family structures and genome-wide association by sequencing. In most cases, these analyses are executed within just a matter of seconds, even when there are several hundreds of samples and millions of variants in the database. We demonstrate the scalability of canvasDB by importing the individual variant calls from all 1092 individuals present in the 1000 Genomes Project into the system, over 4.4 billion SNPs and indels in total. Our results show that canvasDB makes it possible to perform advanced analyses of large-scale WGS projects on a local server. Database URL: https://github.com/UppsalaGenomeCenter/CanvasDB. © The Author(s) 2014. Published by Oxford University Press.
Detection of JAK2 V617F mutation increases the diagnosis of myeloproliferative neoplasms
ZHANG, SHU-PENG; LI, HUI; LAI, REN-SHENG
2015-01-01
The Janus kinase (JAK)2 gene, which is located on chromosome 9p24, is involved in the signaling transduction pathways of the hematopoietic and immune system. Mutations in the JAK2 gene have served as disease markers for myeloproliferative neoplasms (MPNs). The aim of the present study was to investigate the occurrence of the JAK2 gene mutation in 140 clinical samples, and to evaluate its clinical significance in MPNs and other hematological diseases. Genomic DNA was extracted from the peripheral blood leukocytes or bone marrow karyocytes of 140 clinical samples, which included 130 patients with various types of hematological disease and 10 control patients. In addition, exons 12 and 14 of the JAK2 gene were analyzed by direct sequencing and the mutation rates of various MPN subtypes were evaluated. Of the 140 samples, exons 12 and 14 were tested in 74 samples, however, exon 14 only was tested in 66 samples. No mutations were identified in exon 12. The V617F mutation rate in polycythemia vera was 82.1% (23/28), and the mutation rates in essential thrombocythemia histiocytosis, primary myelofibrosis and other MPNs were 53.1% (17/32), 40.0% (4/10) and 60.0% (6/10), respectively. Therefore, the total mutation rate of the JAK2 gene in MPN was 62.5% (50/80). For non-MPN hematological diseases, four V617F mutations were detected in samples of leukocytosis of unknown origin (4/12), however, no JAK2 V617F mutations were identified in the 10 controls. Therefore, JAK2 V617F mutations may present a novel marker for diagnosis of MPNs. Furthermore, the direct sequencing method appeared to be satisfactory for the clinical gene testing of hematological samples. PMID:25624900
[Krigle estimation and its simulated sampling of Chilo suppressalis population density].
Yuan, Zheming; Bai, Lianyang; Wang, Kuiwu; Hu, Xiangyue
2004-07-01
In order to draw up a rational sampling plan for the larvae population of Chilo suppressalis, an original population and its two derivative populations, random population and sequence population, were sampled and compared with random sampling, gap-range-random sampling, and a new systematic sampling integrated Krigle interpolation and random original position. As for the original population whose distribution was up to aggregative and dependence range in line direction was 115 cm (6.9 units), gap-range-random sampling in line direction was more precise than random sampling. Distinguishing the population pattern correctly is the key to get a better precision. Gap-range-random sampling and random sampling are fit for aggregated population and random population, respectively, but both of them are difficult to apply in practice. Therefore, a new systematic sampling named as Krigle sample (n = 441) was developed to estimate the density of partial sample (partial estimation, n = 441) and population (overall estimation, N = 1500). As for original population, the estimated precision of Krigle sample to partial sample and population was better than that of investigation sample. With the increase of the aggregation intensity of population, Krigel sample was more effective than investigation sample in both partial estimation and overall estimation in the appropriate sampling gap according to the dependence range.
Park, Jeong-Hoon; Park, Jong-Hun; Je Seong, Hoon; Sul, Woo Jun; Jin, Kang-Hyun; Park, Hee-Deung
2018-07-01
To provide insight into direct interspecies electron transfer via granular activated carbon (GAC), the effect of GAC supplementation on anaerobic digestion was evaluated. Compared to control samples, the GAC supplementation increased the total amount of methane production and its production rate by 31% and 72%, respectively. 16S rDNA sequencing analysis revealed a shift in the archaeal community composition; the Methanosarcina proportion decreased 17%, while the Methanosaeta proportion increased 5.6%. Metagenomic analyses based on shotgun sequencing demonstrated that the abundance of pilA and omcS genes belonging to Geobacter species decreased 69.4% and 29.4%, respectively. Furthermore, the analyses suggested a carbon dioxide reduction pathway rather than an acetate decarboxylation pathway for methane formation. Taken together, these results suggest that GAC improved methane production performance by shifting the microbial community and altering functional genes associated with direct interspecies electron transfer via conductive materials. Copyright © 2018 Elsevier Ltd. All rights reserved.
Sequence-Based Genotyping for Marker Discovery and Co-Dominant Scoring in Germplasm and Populations
Truong, Hoa T.; Ramos, A. Marcos; Yalcin, Feyruz; de Ruiter, Marjo; van der Poel, Hein J. A.; Huvenaars, Koen H. J.; Hogers, René C. J.; van Enckevort, Leonora. J. G.; Janssen, Antoine; van Orsouw, Nathalie J.; van Eijk, Michiel J. T.
2012-01-01
Conventional marker-based genotyping platforms are widely available, but not without their limitations. In this context, we developed Sequence-Based Genotyping (SBG), a technology for simultaneous marker discovery and co-dominant scoring, using next-generation sequencing. SBG offers users several advantages including a generic sample preparation method, a highly robust genome complexity reduction strategy to facilitate de novo marker discovery across entire genomes, and a uniform bioinformatics workflow strategy to achieve genotyping goals tailored to individual species, regardless of the availability of a reference sequence. The most distinguishing features of this technology are the ability to genotype any population structure, regardless whether parental data is included, and the ability to co-dominantly score SNP markers segregating in populations. To demonstrate the capabilities of SBG, we performed marker discovery and genotyping in Arabidopsis thaliana and lettuce, two plant species of diverse genetic complexity and backgrounds. Initially we obtained 1,409 SNPs for arabidopsis, and 5,583 SNPs for lettuce. Further filtering of the SNP dataset produced over 1,000 high quality SNP markers for each species. We obtained a genotyping rate of 201.2 genotypes/SNP and 58.3 genotypes/SNP for arabidopsis (n = 222 samples) and lettuce (n = 87 samples), respectively. Linkage mapping using these SNPs resulted in stable map configurations. We have therefore shown that the SBG approach presented provides users with the utmost flexibility in garnering high quality markers that can be directly used for genotyping and downstream applications. Until advances and costs will allow for routine whole-genome sequencing of populations, we expect that sequence-based genotyping technologies such as SBG will be essential for genotyping of model and non-model genomes alike. PMID:22662172
Environmental DNA sequencing primers for eutardigrades and bdelloid rotifers
2009-01-01
Background The time it takes to isolate individuals from environmental samples and then extract DNA from each individual is one of the problems with generating molecular data from meiofauna such as eutardigrades and bdelloid rotifers. The lack of consistent morphological information and the extreme abundance of these classes makes morphological identification of rare, or even common cryptic taxa a large and unwieldy task. This limits the ability to perform large-scale surveys of the diversity of these organisms. Here we demonstrate a culture-independent molecular survey approach that enables the generation of large amounts of eutardigrade and bdelloid rotifer sequence data directly from soil. Our PCR primers, specific to the 18s small-subunit rRNA gene, were developed for both eutardigrades and bdelloid rotifers. Results The developed primers successfully amplified DNA of their target organism from various soil DNA extracts. This was confirmed by both the BLAST similarity searches and phylogenetic analyses. Tardigrades showed much better phylogenetic resolution than bdelloids. Both groups of organisms exhibited varying levels of endemism. Conclusion The development of clade-specific primers for characterizing eutardigrades and bdelloid rotifers from environmental samples should greatly increase our ability to characterize the composition of these taxa in environmental samples. Environmental sequencing as shown here differs from other molecular survey methods in that there is no need to pre-isolate the organisms of interest from soil in order to amplify their DNA. The DNA sequences obtained from methods that do not require culturing can be identified post-hoc and placed phylogenetically as additional closely related sequences are obtained from morphologically identified conspecifics. Our non-cultured environmental sequence based approach will be able to provide a rapid and large-scale screening of the presence, absence and diversity of Bdelloidea and Eutardigrada in a variety of soils. PMID:20003362
González-Chávez, Ma Del Carmen A; Torres-Cruz, Terry J; Sánchez, Samantha Albarrán; Carrillo-González, Rogelio; Carrillo-López, Luis Manuel; Porras-Alfaro, Andrea
2018-02-01
Vanilla is an orchid of economic importance widely cultivated in tropical regions and native to Mexico. We sampled three species of Vanilla (V. planifolia, V. pompona, and V. insignis) in different crop systems. We studied the effect of crop system on the abundance, type of fungi, and quality of pelotons found in the roots using light and electron microscopy and direct sequencing of mycorrhizal structures. Fungi were identified directly from pelotons obtained from terrestrial roots of vanilla plants in the flowering stage. Root samples were collected from plants in crop systems located in the Totonacapan area in Mexico (states of Puebla and Veracruz). DNA was extracted directly from 40 pelotons and amplified using ITS rRNA sequencing. Peloton-like structures were observed, presenting a combination of active pelotons characterized by abundant hyphal coils and pelotons in various stages of degradation. The most active pelotons were observed in crop systems throughout living tutors (host tree) in comparison with roots collected from dead or artificial tutors. Fungi identified directly from pelotons included Scleroderma areolatum, a common ectomycorrhizal fungus that has not been reported as a mycorrhizal symbiont in orchids. Direct amplification of pelotons also yielded common plant pathogens, including Fusarium and Pyrenophora seminiperda, especially in those sites with low colonization rates, and where large numbers of degraded pelotons were observed. This research reports for the first time the potential colonization of Vanilla by Scleroderma, as a putative orchid mycorrhizal symbiont in four sites in Mexico and the influence of crop system on mycorrhizal colonization on this orchid.
NASA Astrophysics Data System (ADS)
Munday, Chris; De Deckker, Patrick; Tapper, Nigel; Allison, Gwen
2014-05-01
Following a prolonged drought in Australia in the first decade of the 21st century, several dust storms affected the heavily populated East coast of Australia. The largest such storm occurred on 22-23 September 2009 and had a front of an estimated 3000km. A 24hr average PM10 concentration of over 2,000μg/m3 was recorded in several locations and an hourly peak of over 15,000μg/m3 was recorded (Leys et al. 2011). Over two time periods duplicate aerosol samples were collected on 47mm diameter cellulose nitrate membranes at a location removed from anthropogenic influences. One set of samples was collected in the afternoon the dust event started and another was collected overnight. Additionally, overnight rainfall was collected in a sterile bottle.DNA was directly extracted one membrane from each time point for molecular cloning and high throughput sequencing, while the other was cultivated on Tryptic Soy Agar (TSA). High throughput sequencing was performed using the 454 Titanium platform. From the three samples, 19,945 curated sequences were obtained representing 942 OTUS, with the three samples approximately equal in number. Unclassified Rhizobiales and Stenotrophomonas were the most abundant groups which could be attributed names. A total of 942 OTUs were identified (cutoff = 0.03), and despite the temporal relation of the samples, only eleven were found in all three samples, indicating that the dust storm evolved in composition as it passed over the region. Approximately 800 and 500 CFU/m3 were found in the two cultivated samples, tenfold more than was collected from previous dust events (Lim et al, 2011). Identification of cultivars revealed a dominance of the gram positive Firmicutes phylum, while the clone library showed a more even distribution of taxa, with Actinobacteria the most common and Firmicutes comprising less than 10% of sequences. Collectively, the analyses indicate that the concentration of cultivable organisms during the dust storm dramatically relative to calm conditions. A diverse and variable population of microorganisms were present reflecting the vast source and dynamic nature of the storm.
A complete method, incorporating recently improved reverse transcriptase-PCR primer/probe assays and including controls for determining interferences to phage recoveries from water sample concentrates and for detecting interferences to their analysis, was developed for the direct...
A complete method, incorporating recently improved reverse transcriptase-PCR primer/probe assays and including controls for determining interferences to phage recoveries from water sample concentrates and for detecting interferences to their analysis, was developed for the direct...
Phylogenetic analysis of a transfusion-transmitted hepatitis A outbreak.
Hettmann, Andrea; Juhász, Gabriella; Dencs, Ágnes; Tresó, Bálint; Rusvai, Erzsébet; Barabás, Éva; Takács, Mária
2017-02-01
A transfusion-associated hepatitis A outbreak was found in the first time in Hungary. The outbreak involved five cases. Parenteral transmission of hepatitis A is rare, but may occur during viraemia. Direct sequencing of nested PCR products was performed, and all the examined samples were identical in the VP1/2A region of the hepatitis A virus genome. HAV sequences found in recent years were compared and phylogenetic analysis showed that the strain which caused these cases is the same as that had spread in Hungary recently causing several hepatitis A outbreaks throughout the country.
Erlanger, B.F.; Chen, B.X.
1997-07-22
The subject invention provides an antibody which specifically binds to the product of a reaction between a labeling substance and a substrate. The subject invention also provides a method of making an immunogen used to produce the antibody of the subject invention. The invention further provides methods of using the subject antibody for detecting an antigen of interest in a sample, for example detecting a protein comprising an amino acid sequence of interest and detecting a nucleic acid molecule comprising a nucleic acid sequence of interest. 8 figs.
Erlanger, Bernard F.; Chen, Bi-Xing
1997-01-01
The subject invention provides an antibody which specifically binds to the product of a reaction between a labeling substance and a substrate. The subject invention also provides a method of making an immunogen used to produce the antibody of the subject invention. The invention further provides methods of using the subject antibody for detecting an antigen of interest in a sample, for example detecting a protein comprising an amino acid sequence of interest and detecting a nucleic acid molecule comprising a nucleic acid sequence of interest.
Wright, David; Mallon, Tom; McCormick, Carl; Orton, Richard J.; McDowell, Stanley; Trewby, Hannah; Skuce, Robin A.; Kao, Rowland R.
2012-01-01
Whole genome sequencing (WGS) technology holds great promise as a tool for the forensic epidemiology of bacterial pathogens. It is likely to be particularly useful for studying the transmission dynamics of an observed epidemic involving a largely unsampled ‘reservoir’ host, as for bovine tuberculosis (bTB) in British and Irish cattle and badgers. BTB is caused by Mycobacterium bovis, a member of the M. tuberculosis complex that also includes the aetiological agent for human TB. In this study, we identified a spatio-temporally linked group of 26 cattle and 4 badgers infected with the same Variable Number Tandem Repeat (VNTR) type of M. bovis. Single-nucleotide polymorphisms (SNPs) between sequences identified differences that were consistent with bacterial lineages being persistent on or near farms for several years, despite multiple clear whole herd tests in the interim. Comparing WGS data to mathematical models showed good correlations between genetic divergence and spatial distance, but poor correspondence to the network of cattle movements or within-herd contacts. Badger isolates showed between zero and four SNP differences from the nearest cattle isolate, providing evidence for recent transmissions between the two hosts. This is the first direct genetic evidence of M. bovis persistence on farms over multiple outbreaks with a continued, ongoing interaction with local badgers. However, despite unprecedented resolution, directionality of transmission cannot be inferred at this stage. Despite the often notoriously long timescales between time of infection and time of sampling for TB, our results suggest that WGS data alone can provide insights into TB epidemiology even where detailed contact data are not available, and that more extensive sampling and analysis will allow for quantification of the extent and direction of transmission between cattle and badgers. PMID:23209404
Mohajeri, Parviz; Yazdani, Laya; Shahraki, Abdolrazagh Hashemi; Alvandi, Amirhoshang; Atashi, Sara; Farahani, Abbas; Almasi, Ali; Rezaei, Mansour
2017-04-01
Nontuberculous mycobacteria are habitants of environment, especially in aquatic systems. Some of them cause problems in immunodeficient patients. Over the last decade, 16S rRNA gene sequencing was established in 45 novel species of nontuberculous mycobacteria. Experiences revealed that this method underestimates the diversity, but does not distinguish between some of mycobacterium subsp. To recognize emerging rapidly growing mycobacteria and identify their subsp, rpoB gene sequencing has been developed. To better understand the transmission of nontuberculous mycobacterial species from drinking water and preventing the spread of illness with these bacteria, the aim of this study was to detect the presence of bacteria by PCR-sequencing techniques. Drinking water samples were collected from different areas of Kermanshah city in west of IRAN. After decontamination with cetylpyridinium chloride, samples were filtered with 0.45-micron filters, the filter transferred directly on growth medium waiting to appear in colonies, then DNA extraction and PCR were performed, and products were sent to sequencing. We found 35/110 (32%) nontuberculous mycobacterial species in drinking water samples, isolates included Mycobacterium goodii, Mycobacterium aurum, and Mycobacterium gastri with the most abundance (11.5%), followed by Mycobacterium smegmatis, Mycobacterium porcinum, Mycobacterium peregrinum, Mycobacterium mucogenicum, and Mycobacterium chelonae (8%). In this study, we recognized the evidence of contamination by nontuberculous mycobacteria in corroded water pipes. As a result of the high prevalence of these bacteria in drinking water in Kermanshah, this is important evidence of transmission through drinking water. This finding can also help public health policy makers control these isolates in drinking water supplies in Kermanshah.
Sheik, Cody S.; Reese, Brandi Kiel; Twing, Katrina I.; Sylvan, Jason B.; Grim, Sharon L.; Schrenk, Matthew O.; Sogin, Mitchell L.; Colwell, Frederick S.
2018-01-01
Earth’s subsurface environment is one of the largest, yet least studied, biomes on Earth, and many questions remain regarding what microorganisms are indigenous to the subsurface. Through the activity of the Census of Deep Life (CoDL) and the Deep Carbon Observatory, an open access 16S ribosomal RNA gene sequence database from diverse subsurface environments has been compiled. However, due to low quantities of biomass in the deep subsurface, the potential for incorporation of contaminants from reagents used during sample collection, processing, and/or sequencing is high. Thus, to understand the ecology of subsurface microorganisms (i.e., the distribution, richness, or survival), it is necessary to minimize, identify, and remove contaminant sequences that will skew the relative abundances of all taxa in the sample. In this meta-analysis, we identify putative contaminants associated with the CoDL dataset, recommend best practices for removing contaminants from samples, and propose a series of best practices for subsurface microbiology sampling. The most abundant putative contaminant genera observed, independent of evenness across samples, were Propionibacterium, Aquabacterium, Ralstonia, and Acinetobacter. While the top five most frequently observed genera were Pseudomonas, Propionibacterium, Acinetobacter, Ralstonia, and Sphingomonas. The majority of the most frequently observed genera (high evenness) were associated with reagent or potential human contamination. Additionally, in DNA extraction blanks, we observed potential archaeal contaminants, including methanogens, which have not been discussed in previous contamination studies. Such contaminants would directly affect the interpretation of subsurface molecular studies, as methanogenesis is an important subsurface biogeochemical process. Utilizing previously identified contaminant genera, we found that ∼27% of the total dataset were identified as contaminant sequences that likely originate from DNA extraction and DNA cleanup methods. Thus, controls must be taken at every step of the collection and processing procedure when working with low biomass environments such as, but not limited to, portions of Earth’s deep subsurface. Taken together, we stress that the CoDL dataset is an incredible resource for the broader research community interested in subsurface life, and steps to remove contamination derived sequences must be taken prior to using this dataset. PMID:29780369
Sheik, Cody S; Reese, Brandi Kiel; Twing, Katrina I; Sylvan, Jason B; Grim, Sharon L; Schrenk, Matthew O; Sogin, Mitchell L; Colwell, Frederick S
2018-01-01
Earth's subsurface environment is one of the largest, yet least studied, biomes on Earth, and many questions remain regarding what microorganisms are indigenous to the subsurface. Through the activity of the Census of Deep Life (CoDL) and the Deep Carbon Observatory, an open access 16S ribosomal RNA gene sequence database from diverse subsurface environments has been compiled. However, due to low quantities of biomass in the deep subsurface, the potential for incorporation of contaminants from reagents used during sample collection, processing, and/or sequencing is high. Thus, to understand the ecology of subsurface microorganisms (i.e., the distribution, richness, or survival), it is necessary to minimize, identify, and remove contaminant sequences that will skew the relative abundances of all taxa in the sample. In this meta-analysis, we identify putative contaminants associated with the CoDL dataset, recommend best practices for removing contaminants from samples, and propose a series of best practices for subsurface microbiology sampling. The most abundant putative contaminant genera observed, independent of evenness across samples, were Propionibacterium , Aquabacterium , Ralstonia , and Acinetobacter . While the top five most frequently observed genera were Pseudomonas , Propionibacterium , Acinetobacter , Ralstonia , and Sphingomonas . The majority of the most frequently observed genera (high evenness) were associated with reagent or potential human contamination. Additionally, in DNA extraction blanks, we observed potential archaeal contaminants, including methanogens, which have not been discussed in previous contamination studies. Such contaminants would directly affect the interpretation of subsurface molecular studies, as methanogenesis is an important subsurface biogeochemical process. Utilizing previously identified contaminant genera, we found that ∼27% of the total dataset were identified as contaminant sequences that likely originate from DNA extraction and DNA cleanup methods. Thus, controls must be taken at every step of the collection and processing procedure when working with low biomass environments such as, but not limited to, portions of Earth's deep subsurface. Taken together, we stress that the CoDL dataset is an incredible resource for the broader research community interested in subsurface life, and steps to remove contamination derived sequences must be taken prior to using this dataset.
Microfluidics for genome-wide studies involving next generation sequencing
Murphy, Travis W.; Lu, Chang
2017-01-01
Next-generation sequencing (NGS) has revolutionized how molecular biology studies are conducted. Its decreasing cost and increasing throughput permit profiling of genomic, transcriptomic, and epigenomic features for a wide range of applications. Microfluidics has been proven to be highly complementary to NGS technology with its unique capabilities for handling small volumes of samples and providing platforms for automation, integration, and multiplexing. In this article, we review recent progress on applying microfluidics to facilitate genome-wide studies. We emphasize on several technical aspects of NGS and how they benefit from coupling with microfluidic technology. We also summarize recent efforts on developing microfluidic technology for genomic, transcriptomic, and epigenomic studies, with emphasis on single cell analysis. We envision rapid growth in these directions, driven by the needs for testing scarce primary cell samples from patients in the context of precision medicine. PMID:28396707
Rapid detection of Corynebacterium pseudotuberculosis in clinical samples from sheep.
Kumar, Jyoti; Tripathi, Bhupendra Nath; Kumar, Rajiv; Sonawane, Ganesh Gangaram; Dixit, Shivendra Kumar
2013-08-01
Corynebacterium pseudotuberculosis, a Gram-positive bacterium is the causative agent of caseous lymphadenitis (CLA), a chronic disease of sheep, goats and other warm blooded animals. In the present study, a total of 1,080 sheep reared under semi-intensive system on organized farms situated in the semi arid tropical region of Rajasthan, India, was clinically examined. Pus samples from superficial lymph nodes of 25 (2.31%) adult sheep showing clinical lesions similar to CLA were collected for laboratory analyses. On the basis of morphological, cultural and biochemical characteristics 12 (48%) bacterial isolates from pus identified it as C. pseudotuberculosis. A polymerase chain reaction (PCR) assay targeting Putative oligopeptide/dipeptide ABC transporter, nicotinamide adenine dinucleotide phosphate (NADP) oxidoreductase coenzyme F420-dependent and proline iminopeptidase (PIP) genes of C. pseudotuberculosis was developed that showed 14 pus samples as positive. All C. pseudotuberculosis isolates were also found positive for these genes in the PCR. The specificity of the PCR products was confirmed by sequencing of the amplified products that showed 98-100% homology with published sequences available in the NCBI database. The present study shows the incidence of CLA as 2.31%, 1.1% and 1.29% based on clinical, bacterial culture and direct pus PCR assay, respectively. The PCR assay was rapid, specific and as significant as bacterial culture in detecting bacteria directly in the clinical pus samples. The PCR assay developed in the study can be applied for the diagnosis and control of CLA. Furthermore, the assay can also be applied to detect C. pseudotuberculosis in various clinical samples.
Climate oscillations reflected in the Arabian Sea subseafloor microbiome
NASA Astrophysics Data System (ADS)
Orsi, William; Coolen, Marco; He, Lijun; Wuchter, Cornelia; Irigoien, Xabier; Chust, Guillem; Johnson, Carl; Hemingway, Jordon; Lee, Mitchell; Galy, Valier; Giosan, Liviu
2016-04-01
Marine sediment contains a vast microbial biosphere that influences global biogeochemical cycles over geological timescales. However, the environmental factors controlling the stratigraphy of subseafloor microbial communities are poorly understood. We studied a sediment core directly underlying the Arabian Sea oxygen minimum zone (OMZ), which exhibits organic carbon rich sapropelic laminae deposited under low oxygen conditions. Consistent with several other cores from the same location, age dating revealed the sapropelic layers coincide with warm North Atlantic millennial-scale Dansgaard-Oeschger events, indicating a direct link between the strength of the OMZ and paleoclimate. A total of 214 samples spanning 13 m and 52 Kyr of deposition were selected for geochemical analyses and paleoclimate proxy measurements, as well as high-throughput metagenomic DNA sequencing of bacteria and archaea. A novel DNA extraction protocol was developed that allowed for direct (unamplified) metagenomic sequencing of DNA from each sample. This dataset represents the highest resolved sedimentary metagenomic sampling profile to date. Analysis of these data together with multiple paleoceanographic proxies show that millennial-scale paleoenvironmental conditions correlate with the metabolism and diversity of bacteria and archaea over the last glacial-interglacial cycle in the Arabian Sea. The metabolic potential for bacterial denitrification correlates with climate-driven OMZ strength and concomitant nitrogen stable isotope fractionation, whereas catabolic potential reflects changing marine organic matter sources across the Last Glacial Maximum. These results indicate that the subsisting microbial communities had been stratified to a large extent by paleoceanographic conditions at the time of deposition. Paleoenvironmental conditions should thus be considered as a mechanism that can help explain microbiome stratigraphy in marine sediment.
Denoising time-resolved microscopy image sequences with singular value thresholding.
Furnival, Tom; Leary, Rowan K; Midgley, Paul A
2017-07-01
Time-resolved imaging in microscopy is important for the direct observation of a range of dynamic processes in both the physical and life sciences. However, the image sequences are often corrupted by noise, either as a result of high frame rates or a need to limit the radiation dose received by the sample. Here we exploit both spatial and temporal correlations using low-rank matrix recovery methods to denoise microscopy image sequences. We also make use of an unbiased risk estimator to address the issue of how much thresholding to apply in a robust and automated manner. The performance of the technique is demonstrated using simulated image sequences, as well as experimental scanning transmission electron microscopy data, where surface adatom motion and nanoparticle structural dynamics are recovered at rates of up to 32 frames per second. Copyright © 2016 The Authors. Published by Elsevier B.V. All rights reserved.
Debenham, John J; Atencia, Rebeca; Midtgaard, Fred; Robertson, Lucy J
2015-04-01
The aim of this study was to investigate the occurrence of Giardia duodenalis and Cryptosporidium spp. in primates and determine their zoonotic or anthropozoonotic potential. Direct immunofluorescence was used to identify Giardia and Cryptosporidium from faecal samples. PCR and DNA sequencing was performed on positive results. Giardia cysts were identified from 5.5% (5/90) of captive chimpanzees and 0% (0/11) of captive mandrills in the Republic of Congo; 0% (0/10) of captive chimpanzees in Norway; and 0% of faecal samples (n = 49) from wild Zanzibar red colobus monkeys. Two Giardia positive samples were also positive on PCR, and sequencing revealed identical isolates of Assemblage B. Cryptosporidium oocysts were not detected in any of the samples. In these primate groups, in which interactions with humans and human environments are quite substantial, Giardia and Cryptosporidium are rare pathogens. In chimpanzees, Giardia may have a zoonotic or anthropozoonotic potential. © 2015 John Wiley & Sons A/S. Published by John Wiley & Sons Ltd.
Reconstructing rare soil microbial genomes using in situ enrichments and metagenomics
Delmont, Tom O.; Eren, A. Murat; Maccario, Lorrie; Prestat, Emmanuel; Esen, Özcan C.; Pelletier, Eric; Le Paslier, Denis; Simonet, Pascal; Vogel, Timothy M.
2015-01-01
Despite extensive direct sequencing efforts and advanced analytical tools, reconstructing microbial genomes from soil using metagenomics have been challenging due to the tremendous diversity and relatively uniform distribution of genomes found in this system. Here we used enrichment techniques in an attempt to decrease the complexity of a soil microbiome prior to sequencing by submitting it to a range of physical and chemical stresses in 23 separate microcosms for 4 months. The metagenomic analysis of these microcosms at the end of the treatment yielded 540 Mb of assembly using standard de novo assembly techniques (a total of 559,555 genes and 29,176 functions), from which we could recover novel bacterial genomes, plasmids and phages. The recovered genomes belonged to Leifsonia (n = 2), Rhodanobacter (n = 5), Acidobacteria (n = 2), Sporolactobacillus (n = 2, novel nitrogen fixing taxon), Ktedonobacter (n = 1, second representative of the family Ktedonobacteraceae), Streptomyces (n = 3, novel polyketide synthase modules), and Burkholderia (n = 2, includes mega-plasmids conferring mercury resistance). Assembled genomes averaged to 5.9 Mb, with relative abundances ranging from rare (<0.0001%) to relatively abundant (>0.01%) in the original soil microbiome. Furthermore, we detected them in samples collected from geographically distant locations, particularly more in temperate soils compared to samples originating from high-latitude soils and deserts. To the best of our knowledge, this study is the first successful attempt to assemble multiple bacterial genomes directly from a soil sample. Our findings demonstrate that developing pertinent enrichment conditions can stimulate environmental genomic discoveries that would have been impossible to achieve with canonical approaches that focus solely upon post-sequencing data treatment. PMID:25983722
Sparks, Jackson T; Dickens, Joseph C
2014-12-30
Electrophysiological recording of action potentials from sensory neurons of mosquitoes provides investigators a glimpse into the chemical perception of these disease vectors. We have recently identified a bitter sensing neuron in the labellum of female Aedes aegypti that responds to DEET and other repellents, as well as bitter quinine, through direct electrophysiological investigation. These gustatory receptor neuron responses prompted our sequencing of total mRNA from both male and female labella and tarsi samples to elucidate the putative chemoreception genes expressed in these contact chemoreception tissues. Samples of tarsi were divided into pro-, meso- and metathoracic subtypes for both sexes. We then validated our dataset by conducting qRT-PCR on the same tissue samples and used statistical methods to compare results between the two methods. Studies addressing molecular function may now target specific genes to determine those involved in repellent perception by mosquitoes. These receptor pathways may be used to screen novel repellents towards disruption of host-seeking behavior to curb the spread of harmful viruses.
RNA-Seq analysis to capture the transcriptome landscape of a single cell
Tang, Fuchou; Barbacioru, Catalin; Nordman, Ellen; Xu, Nanlan; Bashkirov, Vladimir I; Lao, Kaiqin; Surani, M. Azim
2013-01-01
We describe here a protocol for digital transcriptome analysis in a single mouse blastomere using a deep sequencing approach. An individual blastomere was first isolated and put into lysate buffer by mouth pipette. Reverse transcription was then performed directly on the whole cell lysate. After this, the free primers were removed by Exonuclease I and a poly(A) tail was added to the 3′ end of the first-strand cDNA by Terminal Deoxynucleotidyl Transferase. Then the single cell cDNAs were amplified by 20 plus 9 cycles of PCR. Then 100-200 ng of these amplified cDNAs were used to construct a sequencing library. The sequencing library can be used for deep sequencing using the SOLiD system. Compared with the cDNA microarray technique, our assay can capture up to 75% more genes expressed in early embryos. The protocol can generate deep sequencing libraries within 6 days for 16 single cell samples. PMID:20203668
[Spatial distribution pattern of Chilo suppressalis analyzed by classical method and geostatistics].
Yuan, Zheming; Fu, Wei; Li, Fangyi
2004-04-01
Two original samples of Chilo suppressalis and their grid, random and sequence samples were analyzed by classical method and geostatistics to characterize the spatial distribution pattern of C. suppressalis. The limitations of spatial distribution analysis with classical method, especially influenced by the original position of grid, were summarized rather completely. On the contrary, geostatistics characterized well the spatial distribution pattern, congregation intensity and spatial heterogeneity of C. suppressalis. According to geostatistics, the population was up to Poisson distribution in low density. As for higher density population, its distribution was up to aggregative, and the aggregation intensity and dependence range were 0.1056 and 193 cm, respectively. Spatial heterogeneity was also found in the higher density population. Its spatial correlativity in line direction was more closely than that in row direction, and the dependence ranges in line and row direction were 115 and 264 cm, respectively.
Pestaña Nieto, Montserrat; Santolamazza Carbone, Serena
2009-02-01
Using ectomycorrhizal root tip morphotyping (anatomical and morphological identification), molecular analysis (internal transcribed spacer region amplification and sequencing), and fruitbody sampling, we assessed diversity and composition of the ectomycorrhizal fungal community colonizing juvenile Pinus pinaster Ait. under natural conditions in NW Spain. Overall, we found 15 Basidiomycetes and two Ascomycetes. Members of the family Thelephoraceae represented up to 59.4% of the samples. The most frequent species was Tomentella sublilacina followed by Thelephora terrestris, Russula drimeia, Suillus bovinus, and Paxillus involutus, while the less frequent were Pseudotomentella tristis, Lactarius subdulcis, Russula ochroleuca, and Entoloma conferendum. From October 2007 to June 2008, we sampled 208 sporocarps belonging to seven genera and nine species: Thelephora terrestris, Paxillus involutus, Suillus bovinus, Xerocomus badius, Scleroderma verrucosum, Amanita gemmata, A. rubescens, Amanita sp., and Russula sp. The species belonging to the genus Amanita, X. badius and S. verrucosum were not found on root samples. By comparing our results with a bibliographic review of papers published from 1922 to 2006, we found five genera and six species which have not been previously reported in symbiosis with P. pinaster. This is the first time that the diversity of the ectomycorrhizal fungal community associated with P. pinaster was investigated using molecular techniques. Considering that only 38% of the genera found by sequencing were found as fruitbodies, we conclude that integrating morphotyping and sporocarps surveys with molecular analysis of ectomycorrhizas is important to documenting the ectomycorrhizal fungus community.
Fakoli, Lawrence S.; Bolay, Kpehe; Bolay, Fatorma K.; Diclaro, Joseph W.; Brackney, Doug E.; Stenglein, Mark D.; Ebel, Gregory D.
2018-01-01
Background Novel surveillance strategies are needed to detect the rapid and continuous emergence of infectious disease agents. Ideally, new sampling strategies should be simple to implement, technologically uncomplicated, and applicable to areas where emergence events are known to occur. To this end, xenosurveillance is a technique that makes use of blood collected by hematophagous arthropods to monitor and identify vertebrate pathogens. Mosquitoes are largely ubiquitous animals that often exist in sizable populations. As well, many domestic or peridomestic species of mosquitoes will preferentially take blood-meals from humans, making them a unique and largely untapped reservoir to collect human blood. Methodology/Principal findings We sought to take advantage of this phenomenon by systematically collecting blood-fed mosquitoes during a field trail in Northern Liberia to determine whether pathogen sequences from blood engorged mosquitoes accurately mirror those obtained directly from humans. Specifically, blood was collected from humans via finger-stick and by aspirating bloodfed mosquitoes from the inside of houses. Shotgun metagenomic sequencing of RNA and DNA derived from these specimens was performed to detect pathogen sequences. Samples obtained from xenosurveillance and from finger-stick blood collection produced a similar number and quality of reads aligning to two human viruses, GB virus C and hepatitis B virus. Conclusions/Significance This study represents the first systematic comparison between xenosurveillance and more traditional sampling methodologies, while also demonstrating the viability of xenosurveillance as a tool to sample human blood for circulating pathogens. PMID:29561834
Fauver, Joseph R; Weger-Lucarelli, James; Fakoli, Lawrence S; Bolay, Kpehe; Bolay, Fatorma K; Diclaro, Joseph W; Brackney, Doug E; Foy, Brian D; Stenglein, Mark D; Ebel, Gregory D
2018-03-01
Novel surveillance strategies are needed to detect the rapid and continuous emergence of infectious disease agents. Ideally, new sampling strategies should be simple to implement, technologically uncomplicated, and applicable to areas where emergence events are known to occur. To this end, xenosurveillance is a technique that makes use of blood collected by hematophagous arthropods to monitor and identify vertebrate pathogens. Mosquitoes are largely ubiquitous animals that often exist in sizable populations. As well, many domestic or peridomestic species of mosquitoes will preferentially take blood-meals from humans, making them a unique and largely untapped reservoir to collect human blood. We sought to take advantage of this phenomenon by systematically collecting blood-fed mosquitoes during a field trail in Northern Liberia to determine whether pathogen sequences from blood engorged mosquitoes accurately mirror those obtained directly from humans. Specifically, blood was collected from humans via finger-stick and by aspirating bloodfed mosquitoes from the inside of houses. Shotgun metagenomic sequencing of RNA and DNA derived from these specimens was performed to detect pathogen sequences. Samples obtained from xenosurveillance and from finger-stick blood collection produced a similar number and quality of reads aligning to two human viruses, GB virus C and hepatitis B virus. This study represents the first systematic comparison between xenosurveillance and more traditional sampling methodologies, while also demonstrating the viability of xenosurveillance as a tool to sample human blood for circulating pathogens.
Bontempi, Iván A; Bizai, María L; Ortiz, Sylvia; Manattini, Silvia; Fabbro, Diana; Solari, Aldo; Diez, Cristina
2016-09-01
Different DNA markers to genotype Trypanosoma cruzi are now available. However, due to the low quantity of parasites present in biological samples, DNA markers with high copy number like kinetoplast minicircles are needed. The aim of this study was to complete a DNA assay called minicircle lineage specific-PCR (MLS-PCR) previously developed to genotype the T. cruzi DTUs TcV and TcVI, in order to genotype DTUs TcI and TcII and to improve TcVI detection. We screened kinetoplast minicircle hypervariable sequences from cloned PCR products from reference strains belonging to the mentioned DTUs using specific kDNA probes. With the four highly specific sequences selected, we designed primers to be used in the MLS-PCR to directly genotype T. cruzi from biological samples. High specificity and sensitivity were obtained when we evaluated the new approach for TcI, TcII, TcV and TcVI genotyping in twenty two T. cruzi reference strains. Afterward, we compared it with hybridization tests using specific kDNA probes in 32 blood samples from chronic chagasic patients from North Eastern Argentina. With both tests we were able to genotype 94% of the samples and the concordance between them was very good (kappa=0.855). The most frequent T. cruzi DTUs detected were TcV and TcVI, followed by TcII and much lower TcI. A unique T. cruzi DTU was detected in 18 samples meantime more than one in the remaining; being TcV and TcVI the most frequent association. A high percentage of mixed detections were obtained with both assays and its impact was discussed. Copyright © 2016 Elsevier B.V. All rights reserved.
Owor, Betty E; Shepherd, Dionne N; Taylor, Nigel J; Edema, Richard; Monjane, Adérito L; Thomson, Jennifer A; Martin, Darren P; Varsani, Arvind
2007-03-01
Leaf samples from 155 maize streak virus (MSV)-infected maize plants were collected from 155 farmers' fields in 23 districts in Uganda in May/June 2005 by leaf-pressing infected samples onto FTA Classic Cards. Viral DNA was successfully extracted from cards stored at room temperature for 9 months. The diversity of 127 MSV isolates was analysed by PCR-generated RFLPs. Six representative isolates having different RFLP patterns and causing either severe, moderate or mild disease symptoms, were chosen for amplification from FTA cards by bacteriophage phi29 DNA polymerase using the TempliPhi system. Full-length genomes were inserted into a cloning vector using a unique restriction enzyme site, and sequenced. The 1.3-kb PCR product amplified directly from FTA-eluted DNA and used for RFLP analysis was also cloned and sequenced. Comparison of cloned whole genome sequences with those of the original PCR products indicated that the correct virus genome had been cloned and that no errors were introduced by the phi29 polymerase. This is the first successful large-scale application of FTA card technology to the field, and illustrates the ease with which large numbers of infected samples can be collected and stored for downstream molecular applications such as diversity analysis and cloning of potentially new virus genomes.
High prevalence of Hepatitis C virus genotype 6 in Vietnam.
Pham, Duc Anh; Leuangwutiwong, Pornsawan; Jittmittraphap, Akanitt; Luplertlop, Nattanej; Bach, Hoa Khanh; Akkarathamrongsin, Srunthron; Theamboonlers, Apiradee; Poovorawan, Yong
2009-01-01
This study aimed to update the prevalence of the various Hepatitis C virus genotypes in Vietnamese blood donors. One hundred and three HCV antibody-positive plasma samples were collected from blood donors at the National Institute of Hematology and Blood Transfusion, Hanoi, Vietnam. All specimens were subjected to RT-PCR of the 5' untranslated region (UTR) to confirm the presence of HCV RNA. The core and NS5B regions of thh positive samples were subsequently amplified by RT-PCR followed by direct sequencing and phylogenetic analysis. Seventy out of 103 samples (68.0%) were RNA positive. Core and NS5B were successfully amplified and sequences were obtained for 70 and 65 samples, respectively. Phylogenetic analysis revealed that genotype 6a was the most predominant among Vietnamese blood donors with a prevalence of 37.1% (26/70), followed by genotype 1a at 30.0% (21/70) and genotype 1b at 17.1% (12/70). The prevalence of two other genotype 6 variants, 6e and 61 was 8.6% and 1.4%, respectively. Further analysis of recent studies showed that the geographic distribution of genotype 6 covered mainly southern China and the mainland of Southeast Asia including Vietnam, Laos, Thailand, and Myanmar. The GenBank accession numbers for the sequences reported in this study are FJ768772-FJ768906.
The microbiome of Brazilian mangrove sediments as revealed by metagenomics.
Andreote, Fernando Dini; Jiménez, Diego Javier; Chaves, Diego; Dias, Armando Cavalcante Franco; Luvizotto, Danice Mazzer; Dini-Andreote, Francisco; Fasanella, Cristiane Cipola; Lopez, Maryeimy Varon; Baena, Sandra; Taketani, Rodrigo Gouvêa; de Melo, Itamar Soares
2012-01-01
Here we embark in a deep metagenomic survey that revealed the taxonomic and potential metabolic pathways aspects of mangrove sediment microbiology. The extraction of DNA from sediment samples and the direct application of pyrosequencing resulted in approximately 215 Mb of data from four distinct mangrove areas (BrMgv01 to 04) in Brazil. The taxonomic approaches applied revealed the dominance of Deltaproteobacteria and Gammaproteobacteria in the samples. Paired statistical analysis showed higher proportions of specific taxonomic groups in each dataset. The metabolic reconstruction indicated the possible occurrence of processes modulated by the prevailing conditions found in mangrove sediments. In terms of carbon cycling, the sequences indicated the prevalence of genes involved in the metabolism of methane, formaldehyde, and carbon dioxide. With respect to the nitrogen cycle, evidence for sequences associated with dissimilatory reduction of nitrate, nitrogen immobilization, and denitrification was detected. Sequences related to the production of adenylsulfate, sulfite, and H(2)S were relevant to the sulphur cycle. These data indicate that the microbial core involved in methane, nitrogen, and sulphur metabolism consists mainly of Burkholderiaceae, Planctomycetaceae, Rhodobacteraceae, and Desulfobacteraceae. Comparison of our data to datasets from soil and sea samples resulted in the allotment of the mangrove sediments between those samples. The results of this study add valuable data about the composition of microbial communities in mangroves and also shed light on possible transformations promoted by microbial organisms in mangrove sediments.
The Microbiome of Brazilian Mangrove Sediments as Revealed by Metagenomics
Andreote, Fernando Dini; Jiménez, Diego Javier; Chaves, Diego; Dias, Armando Cavalcante Franco; Luvizotto, Danice Mazzer; Dini-Andreote, Francisco; Fasanella, Cristiane Cipola; Lopez, Maryeimy Varon; Baena, Sandra; Taketani, Rodrigo Gouvêa; de Melo, Itamar Soares
2012-01-01
Here we embark in a deep metagenomic survey that revealed the taxonomic and potential metabolic pathways aspects of mangrove sediment microbiology. The extraction of DNA from sediment samples and the direct application of pyrosequencing resulted in approximately 215 Mb of data from four distinct mangrove areas (BrMgv01 to 04) in Brazil. The taxonomic approaches applied revealed the dominance of Deltaproteobacteria and Gammaproteobacteria in the samples. Paired statistical analysis showed higher proportions of specific taxonomic groups in each dataset. The metabolic reconstruction indicated the possible occurrence of processes modulated by the prevailing conditions found in mangrove sediments. In terms of carbon cycling, the sequences indicated the prevalence of genes involved in the metabolism of methane, formaldehyde, and carbon dioxide. With respect to the nitrogen cycle, evidence for sequences associated with dissimilatory reduction of nitrate, nitrogen immobilization, and denitrification was detected. Sequences related to the production of adenylsulfate, sulfite, and H2S were relevant to the sulphur cycle. These data indicate that the microbial core involved in methane, nitrogen, and sulphur metabolism consists mainly of Burkholderiaceae, Planctomycetaceae, Rhodobacteraceae, and Desulfobacteraceae. Comparison of our data to datasets from soil and sea samples resulted in the allotment of the mangrove sediments between those samples. The results of this study add valuable data about the composition of microbial communities in mangroves and also shed light on possible transformations promoted by microbial organisms in mangrove sediments. PMID:22737213
Studier, F. William
1995-04-18
Random and directed priming methods for determining nucleotide sequences by enzymatic sequencing techniques, using libraries of primers of lengths 8, 9 or 10 bases, are disclosed. These methods permit direct sequencing of nucleic acids as large as 45,000 base pairs or larger without the necessity for subcloning. Individual primers are used repeatedly to prime sequence reactions in many different nucleic acid molecules. Libraries containing as few as 10,000 octamers, 14,200 nonamers, or 44,000 decamers would have the capacity to determine the sequence of almost any cosmid DNA. Random priming with a fixed set of primers from a smaller library can also be used to initiate the sequencing of individual nucleic acid molecules, with the sequence being completed by directed priming with primers from the library. In contrast to random cloning techniques, a combined random and directed priming strategy is far more efficient.
Studier, F.W.
1995-04-18
Random and directed priming methods for determining nucleotide sequences by enzymatic sequencing techniques, using libraries of primers of lengths 8, 9 or 10 bases, are disclosed. These methods permit direct sequencing of nucleic acids as large as 45,000 base pairs or larger without the necessity for subcloning. Individual primers are used repeatedly to prime sequence reactions in many different nucleic acid molecules. Libraries containing as few as 10,000 octamers, 14,200 nonamers, or 44,000 decamers would have the capacity to determine the sequence of almost any cosmid DNA. Random priming with a fixed set of primers from a smaller library can also be used to initiate the sequencing of individual nucleic acid molecules, with the sequence being completed by directed priming with primers from the library. In contrast to random cloning techniques, a combined random and directed priming strategy is far more efficient. 2 figs.
Wooley, John C.; Godzik, Adam; Friedberg, Iddo
2010-01-01
Metagenomics is a discipline that enables the genomic study of uncultured microorganisms. Faster, cheaper sequencing technologies and the ability to sequence uncultured microbes sampled directly from their habitats are expanding and transforming our view of the microbial world. Distilling meaningful information from the millions of new genomic sequences presents a serious challenge to bioinformaticians. In cultured microbes, the genomic data come from a single clone, making sequence assembly and annotation tractable. In metagenomics, the data come from heterogeneous microbial communities, sometimes containing more than 10,000 species, with the sequence data being noisy and partial. From sampling, to assembly, to gene calling and function prediction, bioinformatics faces new demands in interpreting voluminous, noisy, and often partial sequence data. Although metagenomics is a relative newcomer to science, the past few years have seen an explosion in computational methods applied to metagenomic-based research. It is therefore not within the scope of this article to provide an exhaustive review. Rather, we provide here a concise yet comprehensive introduction to the current computational requirements presented by metagenomics, and review the recent progress made. We also note whether there is software that implements any of the methods presented here, and briefly review its utility. Nevertheless, it would be useful if readers of this article would avail themselves of the comment section provided by this journal, and relate their own experiences. Finally, the last section of this article provides a few representative studies illustrating different facets of recent scientific discoveries made using metagenomics. PMID:20195499
msgbsR: An R package for analysing methylation-sensitive restriction enzyme sequencing data.
Mayne, Benjamin T; Leemaqz, Shalem Y; Buckberry, Sam; Rodriguez Lopez, Carlos M; Roberts, Claire T; Bianco-Miotto, Tina; Breen, James
2018-02-01
Genotyping-by-sequencing (GBS) or restriction-site associated DNA marker sequencing (RAD-seq) is a practical and cost-effective method for analysing large genomes from high diversity species. This method of sequencing, coupled with methylation-sensitive enzymes (often referred to as methylation-sensitive restriction enzyme sequencing or MRE-seq), is an effective tool to study DNA methylation in parts of the genome that are inaccessible in other sequencing techniques or are not annotated in microarray technologies. Current software tools do not fulfil all methylation-sensitive restriction sequencing assays for determining differences in DNA methylation between samples. To fill this computational need, we present msgbsR, an R package that contains tools for the analysis of methylation-sensitive restriction enzyme sequencing experiments. msgbsR can be used to identify and quantify read counts at methylated sites directly from alignment files (BAM files) and enables verification of restriction enzyme cut sites with the correct recognition sequence of the individual enzyme. In addition, msgbsR assesses DNA methylation based on read coverage, similar to RNA sequencing experiments, rather than methylation proportion and is a useful tool in analysing differential methylation on large populations. The package is fully documented and available freely online as a Bioconductor package ( https://bioconductor.org/packages/release/bioc/html/msgbsR.html ).
Iwasaki, Yuki; Abe, Takashi; Wada, Kennosuke; Wada, Yoshiko; Ikemura, Toshimichi
2013-11-20
With the remarkable increase of genomic sequence data of microorganisms, novel tools are needed for comprehensive analyses of the big sequence data available. The self-organizing map (SOM) is an effective tool for clustering and visualizing high-dimensional data, such as oligonucleotide composition on one map. By modifying the conventional SOM, we developed batch-learning SOM (BLSOM), which allowed classification of sequence fragments (e.g., 1 kb) according to phylotypes, solely depending on oligonucleotide composition. Metagenomics studies of uncultivable microorganisms in clinical and environmental samples should allow extensive surveys of genes important in life sciences. BLSOM is most suitable for phylogenetic assignment of metagenomic sequences, because fragmental sequences can be clustered according to phylotypes, solely depending on oligonucleotide composition. We first constructed oligonucleotide BLSOMs for all available sequences from genomes of known species, and by mapping metagenomic sequences on these large-scale BLSOMs, we can predict phylotypes of individual metagenomic sequences, revealing a microbial community structure of uncultured microorganisms, including viruses. BLSOM has shown that influenza viruses isolated from humans and birds clearly differ in oligonucleotide composition. Based on this host-dependent oligonucleotide composition, we have proposed strategies for predicting directional changes of virus sequences and for surveilling potentially hazardous strains when introduced into humans from non-human sources.
Spancerniene, Ugne; Grigas, Juozas; Buitkuviene, Jurate; Zymantiene, Judita; Juozaitiene, Vida; Stankeviciute, Milda; Razukevicius, Dainius; Zienius, Dainius; Stankevicius, Arunas
2018-02-23
Hepatitis E virus (HEV) is one of the major causes of acute viral hepatitis worldwide. In Europe, food-borne zoonotic transmission of HEV genotype 3 has been associated with domestic pigs and wild boar. Controversial data are available on the circulation of the virus in animals that are used for human consumption, and to date, no gold standard has yet been defined for the diagnosis of HEV-associated hepatitis. To investigate the current HEV infection status in Lithuanian pigs and wild ungulates, the presence of viral RNA was analyzed by nested reverse transcription polymerase chain reaction (RT-nPCR) in randomly selected samples, and the viral RNA was subsequently genotyped. In total, 32.98 and 22.55% of the domestic pig samples were HEV-positive using RT-nPCR targeting the ORF1 and ORF2 fragments, respectively. Among ungulates, 25.94% of the wild boar samples, 22.58% of the roe deer samples, 6.67% of the red deer samples and 7.69% of the moose samples were positive for HEV RNA using primers targeting the ORF1 fragment. Using primers targeting the ORF2 fragment of the HEV genome, viral RNA was only detected in 17.03% of the wild boar samples and 12.90% of the roe deer samples. Phylogenetic analysis based on a 348-nucleotide-long region of the HEV ORF2 showed that all obtained sequences detected in Lithuanian domestic pigs and wildlife belonged to genotype 3. In this study, the sequences identified from pigs, wild boars and roe deer clustered within the 3i subtype reference sequences from the GenBank database. The sequences obtained from pig farms located in two different counties of Lithuania were of the HEV 3f subtype. The wild boar sequences clustered within subtypes 3i and 3h, clearly indicating that wild boars can harbor additional subtypes of HEV. For the first time, the ORF2 nucleotide sequences obtained from roe deer proved that HEV subtype 3i can be found in a novel host. The results of the viral prevalence and phylogenetic analyses clearly demonstrated viral infection in Lithuanian pigs and wild ungulates, thus highlighting a significant concern for zoonotic virus transmission through both the food chain and direct contact with animals. Unexpected HEV genotype 3 subtype diversity in Lithuania and neighboring countries revealed that further studies are necessary to understand the mode of HEV transmission between animals and humans in the Baltic States region.
Damiani, Céline; Balthazard-Accou, Ketty; Clervil, Elmyre; Diallo, Aïssata; Da Costa, Cécilia; Emmanuel, Evens; Totet, Anne; Agnamey, Patrice
2013-01-01
The protozoan parasite Cryptosporidium sp. has emerged as one of the most important water contaminants, causing waterborne outbreaks of diarrhoeal diseases worldwide. In Haiti, cryptosporidiosis is a frequent cause of diarrhoea in children under the age of five years, HIV-infected individuals, and people living in low socioeconomic conditions, mainly due to the consumption of water or food polluted by Cryptosporidium oocysts. The aim of this study was to detect and identify Cryptosporidium oocysts present in 12 water samples collected in Port-au-Prince and 4 water samples collected in Cap Haïtien. Initial detection consisted of immunomagnetic separation – immunofluorescence assay (IMS-IFA), which was confirmed by nested PCR, targeting the most polymorphic region of the 18S rRNA gene in 15/16 samples. Genotyping was performed by PCR-restriction fragment length polymorphism (RFLP) analysis and DNA sequencing. Under our working conditions, neither nested PCR-RFLP nor direct DNA sequencing revealed the expected species diversity, as only Cryptosporidium parvum was identified in the water samples studied. This study highlights the difficulty of detecting mixed populations of Cryptosporidium species in environmental samples. PMID:24252814
Occurrence of Giardia in Swedish Red Foxes ( Vulpes vulpes ).
Debenham, John J; Landuyt, Hanne; Troell, Karin; Tysnes, Kristoffer; Robertson, Lucy J
2017-07-01
Giardia duodenalis is an intestinal protozoan capable of causing gastrointestinal disease in a range of vertebrate hosts. It is transmitted via the fecal-oral route. Understanding the epidemiology of G. duodenalis in animals is important, both for public health and for the health of the animals it infects. We investigated the occurrence of G. duodenalis in wild Swedish red foxes ( Vulpes vulpes ), with the aim of providing preliminary information on how this abundant predator might be involved in the transmission and epidemiology of G. duodenalis . Fecal samples (n=104) were analysed for G. duodenalis using a commercially available direct immunofluorescent antibody test. Giardia duodenalis cysts were found in 44% (46/104) of samples, with foxes excreting 100 to 140,500 cysts per gram of feces (mean, 4,930; median, 600). Molecular analysis, using PCR with sequencing of PCR amplicons, was performed on 14 samples, all containing over 2,000 cysts per gram feces. Amplification only occurred in four samples at the tpi gene, sequencing of which revealed assemblage B in all four samples. This study provides baseline information on the role of red foxes in the transmission dynamics of G. duodenalis in Sweden.
Rozenberg, Andrey; Leese, Florian; Weiss, Linda C; Tollrian, Ralph
2016-01-01
Tag-Seq is a high-throughput approach used for discovering SNPs and characterizing gene expression. In comparison to RNA-Seq, Tag-Seq eases data processing and allows detection of rare mRNA species using only one tag per transcript molecule. However, reduced library complexity raises the issue of PCR duplicates, which distort gene expression levels. Here we present a novel Tag-Seq protocol that uses the least biased methods for RNA library preparation combined with a novel approach for joint PCR template and sample labeling. In our protocol, input RNA is fragmented by hydrolysis, and poly(A)-bearing RNAs are selected and directly ligated to mixed DNA-RNA P5 adapters. The P5 adapters contain i5 barcodes composed of sample-specific (moderately) degenerate base regions (mDBRs), which later allow detection of PCR duplicates. The P7 adapter is attached via reverse transcription with individual i7 barcodes added during the amplification step. The resulting libraries can be sequenced on an Illumina sequencer. After sample demultiplexing and PCR duplicate removal with a free software tool we designed, the data are ready for downstream analysis. Our protocol was tested on RNA samples from predator-induced and control Daphnia microcrustaceans.
NASA Astrophysics Data System (ADS)
Tung, Min-Che; Chang, Tien-Yu; Hsu, Bing-Mu; Shen, Shu-Min; Huang, Jen-Te; Kao, Po-Min; Chiu, Yi-Chou; Fan, Cheng-Wei; Huang, Yu-Li
2013-07-01
In this study, we evaluated the presence and amount of Legionella in along a river in Taiwan, and the relations between seasonal distribution of Legionella spp. and geographic characteristics in the watershed were also evaluated. Water samples were pre-treated and analyzed with culture-confirmed and direct DNA extraction methods. For culture-confirmed method, water samples were cultivated through a series of selective media, and candidate colonies were confirmed by PCR. For direct DNA extraction method, direct DNA extraction was performed from pre-treated water samples. The DNA extracts were analyzed with PCR and DNA sequence analysis for species determination, quantitative PCR (qPCR) was performed to quantify Legionella concentration in the water sample. In all, 150 water samples were included in this study, with 73 (48.6%) water samples detected with Legionella spp., and 17 with L. pneumophila. Over 80% Legionella spp. detections were through direct DNA extraction method, but more than 80% L. pneumophila detections were through culture-confirmed method. While detection of Legionella spp. was done with two methods, positive results were found through only one method. Legionella spp. was detected in all seasons with detection rate ranging between 34.3-58.8% and seasonal average concentration from 1.9 × 102 to 7.1 × 103 CFU/L. Most of the L. pneumophila detections were from samples collected in fall (38.2%) and summer (6.0%), which also coincided with increased cases of Legionellosis reported through Center of Disease Control in Taiwan. The high prevalence and concentration of Legionella spp. and L. pneumophila in the surface waters should be further evaluated for potential health risks.
Lefèvre, Emilie; Bardot, Corinne; Noël, Christophe; Carrias, Jean-François; Viscogliosi, Eric; Amblard, Christian; Sime-Ngando, Télesphore
2007-01-01
This study presents an original 18S rRNA PCR survey of the freshwater picoeukaryote community, and was designed to detect unidentified heterotrophic picoflagellates (size range 0.6-5 microm) which are prevalent throughout the year within the heterotrophic flagellate assemblage in Lake Pavin. Four clone libraries were constructed from samples collected in two contrasting zones in the lake. Computerized statistic tools have suggested that sequence retrieval was representative of the in situ picoplankton diversity. The two sampling zones exhibited similar diversity patterns but shared only about 5% of the operational taxonomic units (OTUs). Phylogenetic analysis clustered our sequences into three taxonomic groups: Alveolates (30% of OTUs), Fungi (23%) and Cercozoa (19%). Fungi thus substantially contributed to the detected diversity, as was additionally supported by direct microscopic observations of fungal zoospores and sporangia. A large fraction of the sequences belonged to parasites, including Alveolate sequences affiliated to the genus Perkinsus known as zooparasites, and chytrids that include host-specific parasitic fungi of various freshwater phytoplankton species, primarily diatoms. Phylogenetic analysis revealed five novel clades that probably include typical freshwater environmental sequences. Overall, from the unsuspected fungal diversity unveiled, we think that fungal zooflagellates have been misidentified as phagotrophic nanoflagellates in previous studies. This is in agreement with a recent experimental demonstration that zoospore-producing fungi and parasitic activity may play an important role in aquatic food webs.
BAsE-Seq: a method for obtaining long viral haplotypes from short sequence reads.
Hong, Lewis Z; Hong, Shuzhen; Wong, Han Teng; Aw, Pauline P K; Cheng, Yan; Wilm, Andreas; de Sessions, Paola F; Lim, Seng Gee; Nagarajan, Niranjan; Hibberd, Martin L; Quake, Stephen R; Burkholder, William F
2014-01-01
We present a method for obtaining long haplotypes, of over 3 kb in length, using a short-read sequencer, Barcode-directed Assembly for Extra-long Sequences (BAsE-Seq). BAsE-Seq relies on transposing a template-specific barcode onto random segments of the template molecule and assembling the barcoded short reads into complete haplotypes. We applied BAsE-Seq on mixed clones of hepatitis B virus and accurately identified haplotypes occurring at frequencies greater than or equal to 0.4%, with >99.9% specificity. Applying BAsE-Seq to a clinical sample, we obtained over 9,000 viral haplotypes, which provided an unprecedented view of hepatitis B virus population structure during chronic infection. BAsE-Seq is readily applicable for monitoring quasispecies evolution in viral diseases.
Pyle, Angela; Hudson, Gavin; Wilson, Ian J; Coxhead, Jonathan; Smertenko, Tania; Herbert, Mary; Santibanez-Koref, Mauro; Chinnery, Patrick F
2015-05-01
Recent reports have questioned the accepted dogma that mammalian mitochondrial DNA (mtDNA) is strictly maternally inherited. In humans, the argument hinges on detecting a signature of inter-molecular recombination in mtDNA sequences sampled at the population level, inferring a paternal source for the mixed haplotypes. However, interpreting these data is fraught with difficulty, and direct experimental evidence is lacking. Using extreme-high depth mtDNA re-sequencing up to ~1.2 million-fold coverage, we find no evidence that paternal mtDNA haplotypes are transmitted to offspring in humans, thus excluding a simple dilution mechanism for uniparental transmission of mtDNA present in all healthy individuals. Our findings indicate that an active mechanism eliminates paternal mtDNA which likely acts at the molecular level.
Pyle, Angela; Hudson, Gavin; Wilson, Ian J.; Coxhead, Jonathan; Smertenko, Tania; Herbert, Mary; Santibanez-Koref, Mauro; Chinnery, Patrick F.
2015-01-01
Recent reports have questioned the accepted dogma that mammalian mitochondrial DNA (mtDNA) is strictly maternally inherited. In humans, the argument hinges on detecting a signature of inter-molecular recombination in mtDNA sequences sampled at the population level, inferring a paternal source for the mixed haplotypes. However, interpreting these data is fraught with difficulty, and direct experimental evidence is lacking. Using extreme-high depth mtDNA re-sequencing up to ~1.2 million-fold coverage, we find no evidence that paternal mtDNA haplotypes are transmitted to offspring in humans, thus excluding a simple dilution mechanism for uniparental transmission of mtDNA present in all healthy individuals. Our findings indicate that an active mechanism eliminates paternal mtDNA which likely acts at the molecular level. PMID:25973765
Andrulis, Mindaugas; Capper, David; Luft, Thomas; Hartmann, Christian; Zentgraf, Hanswalter; von Deimling, Andreas
2010-08-01
Sequencing of the acute myeloid leukemia genome revealed somatic mutations in isocitrate dehydrogenase-1. Acute myeloid leukemia frequently develops from myelodysplastic syndrome. In order to test whether myelodysplastic syndrome also carries isocitrate dehydrogenase-1 mutations, we stained a series of bone marrow samples from patients with myelodysplastic syndrome using an antibody specific for the R132H mutation. Three out of 71 patients exhibited antibody binding to myeloid precursor cells. The presence of the R132H mutation was confirmed by DNA sequencing. We demonstrated that isocitrate dehydrogenase-1 mutations occur in myelodysplasia preceding acute myeloid leukemia and that the R132H alteration can be detected by immunohistochemistry. Copyright (c) 2010 Elsevier Ltd. All rights reserved.
Ferro, Myriam; Tardif, Marianne; Reguer, Erwan; Cahuzac, Romain; Bruley, Christophe; Vermat, Thierry; Nugues, Estelle; Vigouroux, Marielle; Vandenbrouck, Yves; Garin, Jérôme; Viari, Alain
2008-05-01
PepLine is a fully automated software which maps MS/MS fragmentation spectra of trypsic peptides to genomic DNA sequences. The approach is based on Peptide Sequence Tags (PSTs) obtained from partial interpretation of QTOF MS/MS spectra (first module). PSTs are then mapped on the six-frame translations of genomic sequences (second module) giving hits. Hits are then clustered to detect potential coding regions (third module). Our work aimed at optimizing the algorithms of each component to allow the whole pipeline to proceed in a fully automated manner using raw nucleic acid sequences (i.e., genomes that have not been "reduced" to a database of ORFs or putative exons sequences). The whole pipeline was tested on controlled MS/MS spectra sets from standard proteins and from Arabidopsis thaliana envelope chloroplast samples. Our results demonstrate that PepLine competed with protein database searching softwares and was fast enough to potentially tackle large data sets and/or high size genomes. We also illustrate the potential of this approach for the detection of the intron/exon structure of genes.
Johnston, Christine; Magaret, Amalia; Roychoudhury, Pavitra; Greninger, Alexander L; Cheng, Anqi; Diem, Kurt; Fitzgibbon, Matthew P; Huang, Meei-Li; Selke, Stacy; Lingappa, Jairam R; Celum, Connie; Jerome, Keith R; Wald, Anna; Koelle, David M
2017-10-01
Understanding the variability in circulating herpes simplex virus type 2 (HSV-2) genomic sequences is critical to the development of HSV-2 vaccines. Genital lesion swabs containing ≥ 10 7 log 10 copies HSV DNA collected from Africa, the USA, and South America underwent next-generation sequencing, followed by K-mer based filtering and de novo genomic assembly. Sites of heterogeneity within coding regions in unique long and unique short (U L _U S ) regions were identified. Phylogenetic trees were created using maximum likelihood reconstruction. Among 46 samples from 38 persons, 1468 intragenic base-pair substitutions were identified. The maximum nucleotide distance between strains for concatenated U L_ U S segments was 0.4%. Phylogeny did not reveal geographic clustering. The most variable proteins had non-synonymous mutations in < 3% of amino acids. Unenriched HSV-2 DNA can undergo next-generation sequencing to identify intragenic variability. The use of clinical swabs for sequencing expands the information that can be gathered directly from these specimens. Copyright © 2017 Elsevier Inc. All rights reserved.
A practical guide to single-cell RNA-sequencing for biomedical research and clinical applications.
Haque, Ashraful; Engel, Jessica; Teichmann, Sarah A; Lönnberg, Tapio
2017-08-18
RNA sequencing (RNA-seq) is a genomic approach for the detection and quantitative analysis of messenger RNA molecules in a biological sample and is useful for studying cellular responses. RNA-seq has fueled much discovery and innovation in medicine over recent years. For practical reasons, the technique is usually conducted on samples comprising thousands to millions of cells. However, this has hindered direct assessment of the fundamental unit of biology-the cell. Since the first single-cell RNA-sequencing (scRNA-seq) study was published in 2009, many more have been conducted, mostly by specialist laboratories with unique skills in wet-lab single-cell genomics, bioinformatics, and computation. However, with the increasing commercial availability of scRNA-seq platforms, and the rapid ongoing maturation of bioinformatics approaches, a point has been reached where any biomedical researcher or clinician can use scRNA-seq to make exciting discoveries. In this review, we present a practical guide to help researchers design their first scRNA-seq studies, including introductory information on experimental hardware, protocol choice, quality control, data analysis and biological interpretation.
Photothermal method for in situ microanalysis of the chemical composition of coal samples
Amer, Nabil M.
1986-01-01
Successive minute regions (13) along a scan path on a coal sample (11) are individually analyzed, at a series of different depths if desired, to determine chemical composition including the locations, sizes and distributions of different maceral inclusions (12). A sequence of infrared light pulses (17) of progressively changing wavelengths is directed into each minute region (13) and a probe light beam (22) is directed along the sample surface (21) adjacent the region (13). Infrared wavelengths at which strong absorption occurs in the region (13) are identified by detecting the resulting deflections (.phi.) of the probe beam (22) caused by thermally induced index of refraction changes in the air or other medium (19) adjacent the region (13). The detected peak absorption wavelengths are correlated with known characteristic peak absorption wavelengths of specific coal constituents to identify the composition of each such minute region (13) of the sample (11). The method enables rapid, convenient and non-destructive analyses of coal specimens to facilitate mining, processing and utilization of coals.
Photothermal method for in situ microanalysis of the chemical composition of coal samples
Amer, N.M.
1983-10-25
Successive minute regions along a scan path on a coal sample are individually analyzed, at a series of different depths if desired, to determine chemical composition including the locations, sizes and distributions of different maceral inclusions. A sequence of infrared light pulses of progressively changing wavelengths is directed into each minute region and a probe light beam is directed along the sample surface adjacent the region. Infrared wavelengths at which strong absorption occurs in the region are identified by detecting the resulting deflections of the probe beam caused by thermally induced index of refraction changes in the air or other medium adjacent the region. The detected peak absorption wavelengths are correlated with known characteristic peak absorption wavelengths of specific coal constituents to identify the composition of each such minute region of the sample. The method enables rapid, convenient and non-destructive analyses of coal specimens to facilitate mining, processing and utilization of coals. 2 figures.
Microbial community structure in three deep-sea carbonate crusts.
Heijs, S K; Aloisi, G; Bouloubassi, I; Pancost, R D; Pierre, C; Sinninghe Damsté, J S; Gottschal, J C; van Elsas, J D; Forney, L J
2006-10-01
Carbonate crusts in marine environments can act as sinks for carbon dioxide. Therefore, understanding carbonate crust formation could be important for understanding global warming. In the present study, the microbial communities of three carbonate crust samples from deep-sea mud volcanoes in the eastern Mediterranean were characterized by sequencing 16S ribosomal RNA (rRNA) genes amplified from DNA directly retrieved from the samples. In combination with the mineralogical composition of the crusts and lipid analyses, sequence data were used to assess the possible role of prokaryotes in crust formation. Collectively, the obtained data showed the presence of highly diverse communities, which were distinct in each of the carbonate crusts studied. Bacterial 16S rRNA gene sequences were found in all crusts and the majority was classified as alpha-, gamma-, and delta- Proteobacteria. Interestingly, sequences of Proteobacteria related to Halomonas and Halovibrio sp., which can play an active role in carbonate mineral formation, were present in all crusts. Archaeal 16S rRNA gene sequences were retrieved from two of the crusts studied. Several of those were closely related to archaeal sequences of organisms that have previously been linked to the anaerobic oxidation of methane (AOM). However, the majority of archaeal sequences were not related to sequences of organisms known to be involved in AOM. In combination with the strongly negative delta 13C values of archaeal lipids, these results open the possibility that organisms with a role in AOM may be more diverse within the Archaea than previously suggested. Different communities found in the crusts could carry out similar processes that might play a role in carbonate crust formation.
Oligonucleotide gap-fill ligation for mutation detection and sequencing in situ
Mignardi, Marco; Mezger, Anja; Qian, Xiaoyan; La Fleur, Linnea; Botling, Johan; Larsson, Chatarina; Nilsson, Mats
2015-01-01
In clinical diagnostics a great need exists for targeted in situ multiplex nucleic acid analysis as the mutational status can offer guidance for effective treatment. One well-established method uses padlock probes for mutation detection and multiplex expression analysis directly in cells and tissues. Here, we use oligonucleotide gap-fill ligation to further increase specificity and to capture molecular substrates for in situ sequencing. Short oligonucleotides are joined at both ends of a padlock gap probe by two ligation events and are then locally amplified by target-primed rolling circle amplification (RCA) preserving spatial information. We demonstrate the specific detection of the A3243G mutation of mitochondrial DNA and we successfully characterize a single nucleotide variant in the ACTB mRNA in cells by in situ sequencing of RCA products generated by padlock gap-fill ligation. To demonstrate the clinical applicability of our assay, we show specific detection of a point mutation in the EGFR gene in fresh frozen and formalin-fixed, paraffin-embedded (FFPE) lung cancer samples and confirm the detected mutation by in situ sequencing. This approach presents several advantages over conventional padlock probes allowing simpler assay design for multiplexed mutation detection to screen for the presence of mutations in clinically relevant mutational hotspots directly in situ. PMID:26240388
Transcriptome and Small RNA Deep Sequencing Reveals Deregulation of miRNA Biogenesis in Human Glioma
Moore, Lynette M.; Kivinen, Virpi; Liu, Yuexin; Annala, Matti; Cogdell, David; Liu, Xiuping; Liu, Chang-Gong; Sawaya, Raymond; Yli-Harja, Olli; Shmulevich, Ilya; Fuller, Gregory N.; Zhang, Wei; Nykter, Matti
2013-01-01
Altered expression of oncogenic and tumor-suppressing microRNAs (miRNAs) is widely associated with tumorigenesis. However, the regulatory mechanisms underlying these alterations are poorly understood. We sought to shed light on the deregulation of miRNA biogenesis promoting the aberrant miRNA expression profiles identified in these tumors. Using sequencing technology to perform both whole-transcriptome and small RNA sequencing of glioma patient samples, we examined precursor and mature miRNAs to directly evaluate the miRNA maturation process, and interrogated expression profiles for genes involved in the major steps of miRNA biogenesis. We found that ratios of mature to precursor forms of a large number of miRNAs increased with the progression from normal brain to low-grade and then to high-grade gliomas. The expression levels of genes involved in each of the three major steps of miRNA biogenesis (nuclear processing, nucleo-cytoplasmic transport, and cytoplasmic processing) were systematically altered in glioma tissues. Survival analysis of an independent data set demonstrated that the alteration of genes involved in miRNA maturation correlates with survival in glioma patients. Direct quantification of miRNA maturation with deep sequencing demonstrated that deregulation of the miRNA biogenesis pathway is a hallmark for glioma genesis and progression. PMID:23007860
Zhang, Wanying; Wang, Tao; Huang, Shuaiwu; Zhao, Xiuli
2018-04-10
To detect mutation of HPGD gene among three pedigrees affected with primary hypertrophic osteoarthropathy (PHO) by DNA sequencing and high-resolution melting (HRM) analysis. Genomic DNA was extracted from peripheral blood samples collected from the pedigrees. PCR and direct sequencing were carried out to identify potential mutations of the HPGD gene. Amplicons containing the mutation spot were generated by nested PCR. The products were then subjected to HRM analysis using the HR-1 instrument. Direct sequencing was carried out in family members and healthy individuals to confirm the result of HRM analysis. A homozygous mutation c.310_311delCT was detected in 2 affected probands, while a heterozygous mutation c.310_311delCT was detected in the third proband. HRM analysis of the fragments encompassing HPGD exon 3 showed 3 curve patterns representing three different genotypes, i.e., the wild type, the c.310_311delCT homozygote, and the c.310_311delCT heterozygote. Result of DNA sequencing was consistent with that of the HRM analysis and phenotype of the subjects. The c.310_311delCT mutation may be the most prevalent mutation among Chinese population. HRM analysis has provided an optimized method for genetic testing of HPGD mutation for its simplicity, rapid turnover and high sensitivity.
Rapid and efficient cDNA library screening by self-ligation of inverse PCR products (SLIP).
Hoskins, Roger A; Stapleton, Mark; George, Reed A; Yu, Charles; Wan, Kenneth H; Carlson, Joseph W; Celniker, Susan E
2005-12-02
cDNA cloning is a central technology in molecular biology. cDNA sequences are used to determine mRNA transcript structures, including splice junctions, open reading frames (ORFs) and 5'- and 3'-untranslated regions (UTRs). cDNA clones are valuable reagents for functional studies of genes and proteins. Expressed Sequence Tag (EST) sequencing is the method of choice for recovering cDNAs representing many of the transcripts encoded in a eukaryotic genome. However, EST sequencing samples a cDNA library at random, and it recovers transcripts with low expression levels inefficiently. We describe a PCR-based method for directed screening of plasmid cDNA libraries. We demonstrate its utility in a screen of libraries used in our Drosophila EST projects for 153 transcription factor genes that were not represented by full-length cDNA clones in our Drosophila Gene Collection. We recovered high-quality, full-length cDNAs for 72 genes and variously compromised clones for an additional 32 genes. The method can be used at any scale, from the isolation of cDNA clones for a particular gene of interest, to the improvement of large gene collections in model organisms and the human. Finally, we discuss the relative merits of directed cDNA library screening and RT-PCR approaches.
Mökkönen, Harri; Ala-Nissila, Tapio; Jónsson, Hannes
2016-09-07
The recrossing correction to the transition state theory estimate of a thermal rate can be difficult to calculate when the energy barrier is flat. This problem arises, for example, in polymer escape if the polymer is long enough to stretch between the initial and final state energy wells while the polymer beads undergo diffusive motion back and forth over the barrier. We present an efficient method for evaluating the correction factor by constructing a sequence of hyperplanes starting at the transition state and calculating the probability that the system advances from one hyperplane to another towards the product. This is analogous to what is done in forward flux sampling except that there the hyperplane sequence starts at the initial state. The method is applied to the escape of polymers with up to 64 beads from a potential well. For high temperature, the results are compared with direct Langevin dynamics simulations as well as forward flux sampling and excellent agreement between the three rate estimates is found. The use of a sequence of hyperplanes in the evaluation of the recrossing correction speeds up the calculation by an order of magnitude as compared with the traditional approach. As the temperature is lowered, the direct Langevin dynamics simulations as well as the forward flux simulations become computationally too demanding, while the harmonic transition state theory estimate corrected for recrossings can be calculated without significant increase in the computational effort.
Fusarium diversity in soil using a specific molecular approach and a cultural approach.
Edel-Hermann, Véronique; Gautheron, Nadine; Mounier, Arnaud; Steinberg, Christian
2015-04-01
Fusarium species are ubiquitous in soil. They cause plant and human diseases and can produce mycotoxins. Surveys of Fusarium species diversity in environmental samples usually rely on laborious culture-based methods. In the present study, we have developed a molecular method to analyze Fusarium diversity directly from soil DNA. We designed primers targeting the translation elongation factor 1-alpha (EF-1α) gene and demonstrated their specificity toward Fusarium using a large collection of fungi. We used the specific primers to construct a clone library from three contrasting soils. Sequence analysis confirmed the specificity of the assay, with 750 clones identified as Fusarium and distributed among eight species or species complexes. The Fusarium oxysporum species complex (FOSC) was the most abundant one in the three soils, followed by the Fusarium solani species complex (FSSC). We then compared our molecular approach results with those obtained by isolating Fusarium colonies on two culture media and identifying species by sequencing part of the EF-1α gene. The 750 isolates were distributed into eight species or species complexes, with the same dominant species as with the cloning method. Sequence diversity was much higher in the clone library than in the isolate collection. The molecular approach proved to be a valuable tool to assess Fusarium diversity in environmental samples. Combined with high throughput sequencing, it will allow for in-depth analysis of large numbers of samples. Published by Elsevier B.V.
Naccache, Samia N; Federman, Scot; Veeraraghavan, Narayanan; Zaharia, Matei; Lee, Deanna; Samayoa, Erik; Bouquet, Jerome; Greninger, Alexander L; Luk, Ka-Cheung; Enge, Barryett; Wadford, Debra A; Messenger, Sharon L; Genrich, Gillian L; Pellegrino, Kristen; Grard, Gilda; Leroy, Eric; Schneider, Bradley S; Fair, Joseph N; Martínez, Miguel A; Isa, Pavel; Crump, John A; DeRisi, Joseph L; Sittler, Taylor; Hackett, John; Miller, Steve; Chiu, Charles Y
2014-07-01
Unbiased next-generation sequencing (NGS) approaches enable comprehensive pathogen detection in the clinical microbiology laboratory and have numerous applications for public health surveillance, outbreak investigation, and the diagnosis of infectious diseases. However, practical deployment of the technology is hindered by the bioinformatics challenge of analyzing results accurately and in a clinically relevant timeframe. Here we describe SURPI ("sequence-based ultrarapid pathogen identification"), a computational pipeline for pathogen identification from complex metagenomic NGS data generated from clinical samples, and demonstrate use of the pipeline in the analysis of 237 clinical samples comprising more than 1.1 billion sequences. Deployable on both cloud-based and standalone servers, SURPI leverages two state-of-the-art aligners for accelerated analyses, SNAP and RAPSearch, which are as accurate as existing bioinformatics tools but orders of magnitude faster in performance. In fast mode, SURPI detects viruses and bacteria by scanning data sets of 7-500 million reads in 11 min to 5 h, while in comprehensive mode, all known microorganisms are identified, followed by de novo assembly and protein homology searches for divergent viruses in 50 min to 16 h. SURPI has also directly contributed to real-time microbial diagnosis in acutely ill patients, underscoring its potential key role in the development of unbiased NGS-based clinical assays in infectious diseases that demand rapid turnaround times. © 2014 Naccache et al.; Published by Cold Spring Harbor Laboratory Press.
Phan, My V. T.; Anh, Pham Hong; Cuong, Nguyen Van; Munnink, Bas B. Oude; van der Hoek, Lia; My, Phuc Tran; Tri, Tue Ngo; Bryant, Juliet E.; Baker, Stephen; Thwaites, Guy; Woolhouse, Mark; Kellam, Paul; Rabaa, Maia A.
2016-01-01
Abstract Coordinated and synchronous surveillance for zoonotic viruses in both human clinical cases and animal reservoirs provides an opportunity to identify interspecies virus movement. Rotavirus (RV) is an important cause of viral gastroenteritis in humans and animals. In this study, we document the RV diversity within co-located humans and animals sampled from the Mekong delta region of Vietnam using a primer-independent, agnostic, deep sequencing approach. A total of 296 stool samples (146 from diarrhoeal human patients and 150 from pigs living in the same geographical region) were directly sequenced, generating the genomic sequences of sixty human rotaviruses (all group A) and thirty-one porcine rotaviruses (thirteen group A, seven group B, six group C, and five group H). Phylogenetic analyses showed the co-circulation of multiple distinct RV group A (RVA) genotypes/strains, many of which were divergent from the strain components of licensed RVA vaccines, as well as considerable virus diversity in pigs including full genomes of rotaviruses in groups B, C, and H, none of which have been previously reported in Vietnam. Furthermore, the detection of an atypical RVA genotype constellation (G4-P[6]-I1-R1-C1-M1-A8-N1-T7-E1-H1) in a human patient and a pig from the same region provides some evidence for a zoonotic event. PMID:28748110
Arai, Y T; Takahashi, H; Kameoka, Y; Shiino, T; Wimalaratne, O; Lodmell, D L
2001-01-01
Thirty-four suspected rabid brain samples from 2 humans, 24 dogs, 4 cats, 2 mongooses, I jackal and I water buffalo were collected in 1995-1996 in Sri Lanka. Total RNA was extracted directly from brain suspensions and examined using a one-step reverse transcription-polymerase chain reaction (RT-PCR) for the rabies virus nucleoprotein (N) gene. Twenty-eight samples were found positive for the virus N gene by RT-PCR and also for the virus antigens by fluorescent antibody (FA) test. Rabies virus isolates obtained from different animal species in different regions of Sri Lanka were genetically homogenous. Sequences of 203 nucleotides (nt)-long RT-PCR products obtained from 16 of 27 samples were found identical. Sequences of 1350 nt of N genes of 14 RT-PCR products were determined. The Sri Lanka isolates under study formed a specific cluster that included also an earlier isolate from India but did not include the known isolates from China, Thailand, Malaysia, Israel, Iran, Oman, Saudi Arabia, Russia, Nepal, Philippines, Japan and from several other countries. These results suggest that one type of rabies virus is circulating among human, dog, cat, mongoose, jackal and water buffalo living near Colombo City and in other five remote regions in Sri Lanka.
Vollenhofer-Schrumpf, Sabine; Buresch, Ronald; Schinkinger, Manfred
2007-03-01
We have developed a new method for the detection of nucleic acid hybridization, based on a simple latex agglutination test that can be evaluated by the unaided eye. Nucleic acid, e.g., a polymerase chain reaction (PCR) product, is denatured and incubated with polystyrene beads carrying covalently bound complementary oligonucleotide sequences. Hybridization of the nucleic acids leads to aggregation of the latex particles, thereby verifying the presence of target sequence. The test is performed at room temperature, and results are available within 10 min. As a proof of principle, the hybridization/latex agglutination assay was applied to the detection of purified PCR fragments either specific for Salmonella spp. or a synthetic sequence, and to the detection of Salmonella enterica in artificially contaminated chicken samples. A few nanograms of purified PCR fragments were detectable. In artificially contaminated chicken samples, 3 colony-forming units (cfu)/25 g were detected in one of three replicates, and 30 cfu/25 g were detected in both of two replicates when samples for PCR were taken directly from primary enrichment, demonstrating the practical applicability of this test system. Even multiplex detection might be achievable. This novel kind of assay could be useful for a range of applications where hybridization of nucleic acids, e.g., PCR fragments, is to be detected.
Kamau, Everlyn; Agoti, Charles N; Lewa, Clement S; Oketch, John; Owor, Betty E; Otieno, Grieven P; Bett, Anne; Cane, Patricia A; Nokes, D James
2017-03-01
Direct immuno-fluorescence test (IFAT) and multiplex real-time RT-PCR have been central to RSV diagnosis in Kilifi, Kenya. Recently, these two methods showed discrepancies with an increasing number of PCR undetectable RSV-B viruses. Establish if mismatches in the primer and probe binding sites could have reduced real-time RT-PCR sensitivity. Nucleoprotein (N) and glycoprotein (G) genes were sequenced for real-time RT-PCR positive and negative samples. Primer and probe binding regions in N gene were checked for mismatches and phylogenetic analyses done to determine molecular epidemiology of these viruses. New primers and probe were designed and tested on the previously real-time RT-PCR negative samples. N gene sequences revealed 3 different mismatches in the probe target site of PCR negative, IFAT positive viruses. The primers target sites had no mismatches. Phylogenetic analysis of N and G genes showed that real-time RT-PCR positive and negative samples fell into distinct clades. Newly designed primers-probe pair improved detection and recovered previous PCR undetectable viruses. An emerging RSV-B variant is undetectable by a quite widely used real-time RT-PCR assay due to polymorphisms that influence probe hybridization affecting PCR accuracy. Copyright © 2017 The Authors. Published by Elsevier B.V. All rights reserved.
Noise and drift analysis of non-equally spaced timing data
NASA Technical Reports Server (NTRS)
Vernotte, F.; Zalamansky, G.; Lantz, E.
1994-01-01
Generally, it is possible to obtain equally spaced timing data from oscillators. The measurement of the drifts and noises affecting oscillators is then performed by using a variance (Allan variance, modified Allan variance, or time variance) or a system of several variances (multivariance method). However, in some cases, several samples, or even several sets of samples, are missing. In the case of millisecond pulsar timing data, for instance, observations are quite irregularly spaced in time. Nevertheless, since some observations are very close together (one minute) and since the timing data sequence is very long (more than ten years), information on both short-term and long-term stability is available. Unfortunately, a direct variance analysis is not possible without interpolating missing data. Different interpolation algorithms (linear interpolation, cubic spline) are used to calculate variances in order to verify that they neither lose information nor add erroneous information. A comparison of the results of the different algorithms is given. Finally, the multivariance method was adapted to the measurement sequence of the millisecond pulsar timing data: the responses of each variance of the system are calculated for each type of noise and drift, with the same missing samples as in the pulsar timing sequence. An estimation of precision, dynamics, and separability of this method is given.
Chemale, Gustavo; Paneto, Greiciane Gaburro; Menezes, Meiga Aurea Mendes; de Freitas, Jorge Marcelo; Jacques, Guilherme Silveira; Cicarelli, Regina Maria Barretto; Fagundes, Paulo Roberto
2013-05-01
Mitochondrial DNA (mtDNA) analysis is usually a last resort in routine forensic DNA casework. However, it has become a powerful tool for the analysis of highly degraded samples or samples containing too little or no nuclear DNA, such as old bones and hair shafts. The gold standard methodology still constitutes the direct sequencing of polymerase chain reaction (PCR) products or cloned amplicons from the HVS-1 and HVS-2 (hypervariable segment) control region segments. Identifications using mtDNA are time consuming, expensive and can be very complex, depending on the amount and nature of the material being tested. The main goal of this work is to develop a less labour-intensive and less expensive screening method for mtDNA analysis, in order to aid in the exclusion of non-matching samples and as a presumptive test prior to final confirmatory DNA sequencing. We have selected 14 highly discriminatory single nucleotide polymorphisms (SNPs) based on simulations performed by Salas and Amigo (2010) to be typed using SNaPShot(TM) (Applied Biosystems, Foster City, CA, USA). The assay was validated by typing more than 100 HVS-1/HVS-2 sequenced samples. No differences were observed between the SNP typing and DNA sequencing when results were compared, with the exception of allelic dropouts observed in a few haplotypes. Haplotype diversity simulations were performed using 172 mtDNA sequences representative of the Brazilian population and a score of 0.9794 was obtained when the 14 SNPs were used, showing that the theoretical prediction approach for the selection of highly discriminatory SNPs suggested by Salas and Amigo (2010) was confirmed in the population studied. As the main goal of the work is to develop a screening assay to skip the sequencing of all samples in a particular case, a pair-wise comparison of the sequences was done using the selected SNPs. When both HVS-1/HVS-2 SNPs were used for simulations, at least two differences were observed in 93.2% of the comparisons performed. The assay was validated with casework samples. Results show that the method is straightforward and can be used for exclusionary purposes, saving time and laboratory resources. The assay confirms the theoretic prediction suggested by Salas and Amigo (2010). All forensic advantages, such as high sensitivity and power of discrimination, as also the disadvantages, such as the occurrence of allele dropouts, are discussed throughout the article. Copyright © 2013 Elsevier Ireland Ltd. All rights reserved.
Rowe, Will; Baker, Kate S; Verner-Jeffreys, David; Baker-Austin, Craig; Ryan, Jim J; Maskell, Duncan; Pearce, Gareth
2015-01-01
Antimicrobial resistance remains a growing and significant concern in human and veterinary medicine. Current laboratory methods for the detection and surveillance of antimicrobial resistant bacteria are limited in their effectiveness and scope. With the rapidly developing field of whole genome sequencing beginning to be utilised in clinical practice, the ability to interrogate sequencing data quickly and easily for the presence of antimicrobial resistance genes will become increasingly important and useful for informing clinical decisions. Additionally, use of such tools will provide insight into the dynamics of antimicrobial resistance genes in metagenomic samples such as those used in environmental monitoring. Here we present the Search Engine for Antimicrobial Resistance (SEAR), a pipeline and web interface for detection of horizontally acquired antimicrobial resistance genes in raw sequencing data. The pipeline provides gene information, abundance estimation and the reconstructed sequence of antimicrobial resistance genes; it also provides web links to additional information on each gene. The pipeline utilises clustering and read mapping to annotate full-length genes relative to a user-defined database. It also uses local alignment of annotated genes to a range of online databases to provide additional information. We demonstrate SEAR's application in the detection and abundance estimation of antimicrobial resistance genes in two novel environmental metagenomes, 32 human faecal microbiome datasets and 126 clinical isolates of Shigella sonnei. We have developed a pipeline that contributes to the improved capacity for antimicrobial resistance detection afforded by next generation sequencing technologies, allowing for rapid detection of antimicrobial resistance genes directly from sequencing data. SEAR uses raw sequencing data via an intuitive interface so can be run rapidly without requiring advanced bioinformatic skills or resources. Finally, we show that SEAR is effective in detecting antimicrobial resistance genes in metagenomic and isolate sequencing data from both environmental metagenomes and sequencing data from clinical isolates.
Ndunguru, Joseph; Taylor, Nigel J; Yadav, Jitender; Aly, Haytham; Legg, James P; Aveling, Terry; Thompson, Graham; Fauquet, Claude M
2005-05-18
Plant viral diseases present major constraints to crop production. Effective sampling of the viruses infecting plants is required to facilitate their molecular study and is essential for the development of crop protection and improvement programs. Retaining integrity of viral pathogens within sampled plant tissues is often a limiting factor in this process, most especially when sample sizes are large and when operating in developing counties and regions remote from laboratory facilities. FTA is a paper-based system designed to fix and store nucleic acids directly from fresh tissues pressed into the treated paper. We report here the use of FTA as an effective technology for sampling and retrieval of DNA and RNA viruses from plant tissues and their subsequent molecular analysis. DNA and RNA viruses were successfully recovered from leaf tissues of maize, cassava, tomato and tobacco pressed into FTA Classic Cards. Viral nucleic acids eluted from FTA cards were found to be suitable for diagnostic molecular analysis by PCR-based techniques and restriction analysis, and for cloning and nucleotide sequencing in a manner equivalent to that offered by tradition isolation methods. Efficacy of the technology was demonstrated both from sampled greenhouse-grown plants and from leaf presses taken from crop plants growing in farmer's fields in East Africa. In addition, FTA technology was shown to be suitable for recovery of viral-derived transgene sequences integrated into the plant genome. Results demonstrate that FTA is a practical, economical and sensitive method for sampling, storage and retrieval of viral pathogens and plant genomic sequences, when working under controlled conditions and in the field. Application of this technology has the potential to significantly increase ability to bring modern analytical techniques to bear on the viral pathogens infecting crop plants.
Gene calling and bacterial genome annotation with BG7.
Tobes, Raquel; Pareja-Tobes, Pablo; Manrique, Marina; Pareja-Tobes, Eduardo; Kovach, Evdokim; Alekhin, Alexey; Pareja, Eduardo
2015-01-01
New massive sequencing technologies are providing many bacterial genome sequences from diverse taxa but a refined annotation of these genomes is crucial for obtaining scientific findings and new knowledge. Thus, bacterial genome annotation has emerged as a key point to investigate in bacteria. Any efficient tool designed specifically to annotate bacterial genomes sequenced with massively parallel technologies has to consider the specific features of bacterial genomes (absence of introns and scarcity of nonprotein-coding sequence) and of next-generation sequencing (NGS) technologies (presence of errors and not perfectly assembled genomes). These features make it convenient to focus on coding regions and, hence, on protein sequences that are the elements directly related with biological functions. In this chapter we describe how to annotate bacterial genomes with BG7, an open-source tool based on a protein-centered gene calling/annotation paradigm. BG7 is specifically designed for the annotation of bacterial genomes sequenced with NGS. This tool is sequence error tolerant maintaining their capabilities for the annotation of highly fragmented genomes or for annotating mixed sequences coming from several genomes (as those obtained through metagenomics samples). BG7 has been designed with scalability as a requirement, with a computing infrastructure completely based on cloud computing (Amazon Web Services).
DNA-encoded chemistry: enabling the deeper sampling of chemical space.
Goodnow, Robert A; Dumelin, Christoph E; Keefe, Anthony D
2017-02-01
DNA-encoded chemical library technologies are increasingly being adopted in drug discovery for hit and lead generation. DNA-encoded chemistry enables the exploration of chemical spaces four to five orders of magnitude more deeply than is achievable by traditional high-throughput screening methods. Operation of this technology requires developing a range of capabilities including aqueous synthetic chemistry, building block acquisition, oligonucleotide conjugation, large-scale molecular biological transformations, selection methodologies, PCR, sequencing, sequence data analysis and the analysis of large chemistry spaces. This Review provides an overview of the development and applications of DNA-encoded chemistry, highlighting the challenges and future directions for the use of this technology.
Representing and computing regular languages on massively parallel networks
DOE Office of Scientific and Technical Information (OSTI.GOV)
Miller, M.I.; O'Sullivan, J.A.; Boysam, B.
1991-01-01
This paper proposes a general method for incorporating rule-based constraints corresponding to regular languages into stochastic inference problems, thereby allowing for a unified representation of stochastic and syntactic pattern constraints. The authors' approach first established the formal connection of rules to Chomsky grammars, and generalizes the original work of Shannon on the encoding of rule-based channel sequences to Markov chains of maximum entropy. This maximum entropy probabilistic view leads to Gibb's representations with potentials which have their number of minima growing at precisely the exponential rate that the language of deterministically constrained sequences grow. These representations are coupled to stochasticmore » diffusion algorithms, which sample the language-constrained sequences by visiting the energy minima according to the underlying Gibbs' probability law. The coupling to stochastic search methods yields the all-important practical result that fully parallel stochastic cellular automata may be derived to generate samples from the rule-based constraint sets. The production rules and neighborhood state structure of the language of sequences directly determines the necessary connection structures of the required parallel computing surface. Representations of this type have been mapped to the DAP-510 massively-parallel processor consisting of 1024 mesh-connected bit-serial processing elements for performing automated segmentation of electron-micrograph images.« less
Denoising Algorithm for CFA Image Sensors Considering Inter-Channel Correlation.
Lee, Min Seok; Park, Sang Wook; Kang, Moon Gi
2017-05-28
In this paper, a spatio-spectral-temporal filter considering an inter-channel correlation is proposed for the denoising of a color filter array (CFA) sequence acquired by CCD/CMOS image sensors. Owing to the alternating under-sampled grid of the CFA pattern, the inter-channel correlation must be considered in the direct denoising process. The proposed filter is applied in the spatial, spectral, and temporal domain, considering the spatio-tempo-spectral correlation. First, nonlocal means (NLM) spatial filtering with patch-based difference (PBD) refinement is performed by considering both the intra-channel correlation and inter-channel correlation to overcome the spatial resolution degradation occurring with the alternating under-sampled pattern. Second, a motion-compensated temporal filter that employs inter-channel correlated motion estimation and compensation is proposed to remove the noise in the temporal domain. Then, a motion adaptive detection value controls the ratio of the spatial filter and the temporal filter. The denoised CFA sequence can thus be obtained without motion artifacts. Experimental results for both simulated and real CFA sequences are presented with visual and numerical comparisons to several state-of-the-art denoising methods combined with a demosaicing method. Experimental results confirmed that the proposed frameworks outperformed the other techniques in terms of the objective criteria and subjective visual perception in CFA sequences.
Passos-Castilho, Ana Maria; Granato, Celso Francisco Hernandes
Hepatitis E virus is responsible for acute and chronic liver infections worldwide. Swine hepatitis E virus has been isolated in Brazil, and a probable zoonotic transmission has been described, although data are still scarce. The aim of this study was to investigate the frequency of hepatitis E virus infection in pigs from a small-scale farm in the rural area of Paraná State, South Brazil. Fecal samples were collected from 170 pigs and screened for hepatitis E virus RNA using a duplex real-time RT-PCR targeting a highly conserved 70nt long sequence within overlapping parts of ORF2 and ORF3 as well as a 113nt sequence of ORF2. Positive samples with high viral loads were subjected to direct sequencing and phylogenetic analysis. hepatitis E virus RNA was detected in 34 (20.0%) of the 170 pigs following positive results in at least one set of screening real-time RT-PCR primers and probes. The swine hepatitis E virus strains clustered with the genotype hepatitis E virus-3b reference sequences in the phylogenetic analysis and showed close similarity to human hepatitis E virus isolates previously reported in Brazil. Copyright © 2017 Sociedade Brasileira de Microbiologia. Published by Elsevier Editora Ltda. All rights reserved.
Emerging genotype (GGIIb) of norovirus in drinking water, Sweden.
Nygård, Karin; Torvén, Maria; Ancker, Camilla; Knauth, Siv Britt; Hedlund, Kjell-Olof; Giesecke, Johan; Andersson, Yvonne; Svensson, Lennart
2003-12-01
From May through June 2001, an outbreak of acute gastroenteritis that affected at least 200 persons occurred in a combined activity camp and conference center in Stockholm County. The source of illness was contaminated drinking water obtained from private wells. The outbreak appears to have started with sewage pipeline problems near the kitchen, which caused overflow of the sewage system and contaminated the environment. While no pathogenic bacteria were found in water or stools specimens, norovirus was detected in 8 of 11 stool specimens and 2 of 3 water samples by polymerase chain reaction. Nucleotide sequencing of amplicons from two patients and two water samples identified an emerging genotype designated GGIIb, which was circulating throughout several European countries during 2000 and 2001. This investigation documents the first waterborne outbreak of viral gastroenteritis in Sweden, where nucleotide sequencing showed a direct link between contaminated water and illness.
Liu, Bin; Wu, Hao; Zhang, Deyuan; Wang, Xiaolong; Chou, Kuo-Chen
2017-02-21
To expedite the pace in conducting genome/proteome analysis, we have developed a Python package called Pse-Analysis. The powerful package can automatically complete the following five procedures: (1) sample feature extraction, (2) optimal parameter selection, (3) model training, (4) cross validation, and (5) evaluating prediction quality. All the work a user needs to do is to input a benchmark dataset along with the query biological sequences concerned. Based on the benchmark dataset, Pse-Analysis will automatically construct an ideal predictor, followed by yielding the predicted results for the submitted query samples. All the aforementioned tedious jobs can be automatically done by the computer. Moreover, the multiprocessing technique was adopted to enhance computational speed by about 6 folds. The Pse-Analysis Python package is freely accessible to the public at http://bioinformatics.hitsz.edu.cn/Pse-Analysis/, and can be directly run on Windows, Linux, and Unix.
Genomics approach to the environmental community of microorganisms
NASA Astrophysics Data System (ADS)
Kawarabayasi, Y.; Maruyama, A.
2004-12-01
It was indicated by microscopic observation or comparison of 16S rDNA sequence that many extremophiles were surviving in many hydrothermal environments. But it is generally said that over 99% of total microbes are now uncultivable. Thus, we planned to identify uncultivable microbes through direct sequencing of environmental DNA. At first, shotgun plasmid libraries were directly constructed with the DNA molecules prepared from mixed microbes collected from low-temperature hydrothermal water at RM24 in the Southern East Pacific Rise (S-EPR). It was shown that the sequences of some number of clones indicated the similar feature to the intron in eukaryote or tandem repetitive sequence identified in some human familiar diseases. The results indicated that many microorganisms with eukaryotic feature were dominant in low temperature water of S-EPR. Secondly, shotgun plasmid libraries were constructed from the environmental DNA prepared from Beppu hot springs. The ORFs were easily identified all clones determined entire sequence. Thus it can be said that hot springs is good resources for searching novel genes. At last, the mixed microbes isolated from Suiyo seamount were used for construction of shotgun library. The clones in this library contained the ORFs. From some clones in hot spring and Suiyo sample, aminoacyl-tRNA synthatase, which is generally present in all organisms, was isolated by similarity. The phylogenetic analysis of aminoacyl-tRNA synthetase identified indicated that novel and unidentified microorganisms should be present in hot spring or Suiyo seamount. The novel genes identified from Suiyo seamount were also utilized for expression in E. coli. Some gene products were successfully obtained from the E. coli cells as soluble proteins. Some protein indicated the thermostability up to 70_E#8249;C, meaning that the original host cell of this gene should be stable up to the same temperature. Our work indicates that environmental genomics, including the direct cloning, sequencing of environmental DNA and expression of gene identified, is powerful approach to collect novel uncultivable microbes or novel active genes.
Zhao, Xinyan; Dong, Tao
2012-10-16
This study reports a quantitative nucleic acid sequence-based amplification (Q-NASBA) microfluidic platform composed of a membrane-based sampling module, a sample preparation cassette, and a 24-channel Q-NASBA chip for environmental investigations on aquatic microorganisms. This low-cost and highly efficient sampling module, having seamless connection with the subsequent steps of sample preparation and quantitative detection, is designed for the collection of microbial communities from aquatic environments. Eight kinds of commercial membrane filters are relevantly analyzed using Saccharomyces cerevisiae, Escherichia coli, and Staphylococcus aureus as model microorganisms. After the microorganisms are concentrated on the membrane filters, the retentate can be easily conserved in a transport medium (TM) buffer and sent to a remote laboratory. A Q-NASBA-oriented sample preparation cassette is originally designed to extract DNA/RNA molecules directly from the captured cells on the membranes. Sequentially, the extract is analyzed within Q-NASBA chips that are compatible with common microplate readers in laboratories. Particularly, a novel analytical algorithmic method is developed for simple but robust on-chip Q-NASBA assays. The reported multifunctional microfluidic system could detect a few microorganisms quantitatively and simultaneously. Further research should be conducted to simplify and standardize ecological investigations on aquatic environments.
Ehsan, Amimul M.; Geurden, Thomas; Casaert, Stijn; Parvin, Sonia M.; Islam, Taohidul M.; Ahmed, Uddin M.; Levecke, Bruno; Vercruysse, Jozef; Claerebout, Edwin
2015-01-01
Giardia and Cryptosporidium are important causes of diarrhoea in Bangladesh. The high prevalence of both parasites in humans and cattle in rural Bangladesh and the common use of water ponds by village inhabitants and their animals suggest a potential for zoonotic transmission. Direct transmission of Giardia and Cryptosporidium between cattle and their handlers and indirect transmission through water ponds was investigated. Faecal/stool samples were collected from 623 calves and 125 calf handlers in a cross-sectional survey. In two villages, water samples were collected monthly from water ponds and faecal/stool samples were collected monthly from inhabitants and their cattle. Giardia cysts and Cryptosporidium oocysts were detected in water samples and in faecal/stool samples and positive samples were genotyped, to determine their human or animal origin. The prevalence of Giardia and Cryptosporidium in calves was 22% and 5% respectively. In calf handlers, the prevalence of Giardia and Cryptosporidium was 11.2% and 3.2% respectively. Both in the cross-sectional survey and in the longitudinal study in the villages, G. duodenalis assemblage E was most prevalent in calves, while in humans assemblage AII, BIII and BIV were found. In cattle, Cryptosporidium parvum, C. bovis and C. andersoni were identified, but no Cryptosporidium sequences were obtained from humans. Giardia and Cryptosporidium were detected in 14/24 and 12/24 water samples respectively. G. duodenalis assemblage E and BIV (-like), as well as C. andersoni and C. hominis were identified. Although the presence of Giardia and Cryptosporidium in both water ponds suggests that water-borne transmission of Giardia and Cryptosporidium is possible, the genotyping results indicate that there is no significant direct or indirect (water-borne) transmission of Giardia between cattle and people in this area of rural Bangladesh. No conclusions could be drawn for Cryptosporidium, because of the low number of sequences that were obtained from human and water samples. PMID:25695662
Ehsan, Amimul M; Geurden, Thomas; Casaert, Stijn; Parvin, Sonia M; Islam, Taohidul M; Ahmed, Uddin M; Levecke, Bruno; Vercruysse, Jozef; Claerebout, Edwin
2015-01-01
Giardia and Cryptosporidium are important causes of diarrhoea in Bangladesh. The high prevalence of both parasites in humans and cattle in rural Bangladesh and the common use of water ponds by village inhabitants and their animals suggest a potential for zoonotic transmission. Direct transmission of Giardia and Cryptosporidium between cattle and their handlers and indirect transmission through water ponds was investigated. Faecal/stool samples were collected from 623 calves and 125 calf handlers in a cross-sectional survey. In two villages, water samples were collected monthly from water ponds and faecal/stool samples were collected monthly from inhabitants and their cattle. Giardia cysts and Cryptosporidium oocysts were detected in water samples and in faecal/stool samples and positive samples were genotyped, to determine their human or animal origin. The prevalence of Giardia and Cryptosporidium in calves was 22% and 5% respectively. In calf handlers, the prevalence of Giardia and Cryptosporidium was 11.2% and 3.2% respectively. Both in the cross-sectional survey and in the longitudinal study in the villages, G. duodenalis assemblage E was most prevalent in calves, while in humans assemblage AII, BIII and BIV were found. In cattle, Cryptosporidium parvum, C. bovis and C. andersoni were identified, but no Cryptosporidium sequences were obtained from humans. Giardia and Cryptosporidium were detected in 14/24 and 12/24 water samples respectively. G. duodenalis assemblage E and BIV (-like), as well as C. andersoni and C. hominis were identified. Although the presence of Giardia and Cryptosporidium in both water ponds suggests that water-borne transmission of Giardia and Cryptosporidium is possible, the genotyping results indicate that there is no significant direct or indirect (water-borne) transmission of Giardia between cattle and people in this area of rural Bangladesh. No conclusions could be drawn for Cryptosporidium, because of the low number of sequences that were obtained from human and water samples.
Pramanik, Sreemanta; Li, Honghua
2002-01-01
Direct polymerase chain reaction (PCR) detection of insertion/deletion (indel) polymorphisms requires sample homozygosity. For the indel polymorphisms that have the deletion allele with a relatively low frequency in the autosomal regions, direct PCR detection becomes difficult or impossible. The present study is, to our knowledge, the first designed to directly detect indel polymorphisms in a human autosomal region (i.e., the immunoglobulin VH region), through use of single haploid sperm cells as subjects. Unique marker sequences (n=32), spaced at ∼5-kb intervals, were selected near the 3′ end of the VH region. A two-round multiplex PCR protocol was used to amplify these sequences from single sperm samples from nine unrelated healthy donors. The parental haplotypes of the donors were determined by examining the presence or absence of these markers. Seven clustered markers in 6 of the 18 haplotypes were missing and likely represented a 35–40-kb indel polymorphism. The genotypes of the donors, with respect to this polymorphism, perfectly matched the expectation under Hardy-Weinberg equilibrium. Three VH gene segments, of which two are functional, are affected by this polymorphism. According to these results, >10% of individuals in the human population may not have these gene segments in their genome, and ∼44% may have only one copy of these gene segments. The biological impact of this polymorphism would be very interesting to study. The approach used in the present study could be applied to understand the physical structure and diversity of all other autosomal regions. PMID:12442231
PanFP: Pangenome-based functional profiles for microbial communities
Jun, Se -Ran; Hauser, Loren John; Schadt, Christopher Warren; ...
2015-09-26
For decades there has been increasing interest in understanding the relationships between microbial communities and ecosystem functions. Current DNA sequencing technologies allows for the exploration of microbial communities in two principle ways: targeted rRNA gene surveys and shotgun metagenomics. For large study designs, it is often still prohibitively expensive to sequence metagenomes at both the breadth and depth necessary to statistically capture the true functional diversity of a community. Although rRNA gene surveys provide no direct evidence of function, they do provide a reasonable estimation of microbial diversity, while being a very cost effective way to screen samples of interestmore » for later shotgun metagenomic analyses. However, there is a great deal of 16S rRNA gene survey data currently available from diverse environments, and thus a need for tools to infer functional composition of environmental samples based on 16S rRNA gene survey data. As a result, we present a computational method called pangenome based functional profiles (PanFP), which infers functional profiles of microbial communities from 16S rRNA gene survey data for Bacteria and Archaea. PanFP is based on pangenome reconstruction of a 16S rRNA gene operational taxonomic unit (OTU) from known genes and genomes pooled from the OTU s taxonomic lineage. From this lineage, we derive an OTU functional profile by weighting a pangenome s functional profile with the OTUs abundance observed in a given sample. We validated our method by comparing PanFP to the functional profiles obtained from the direct shotgun metagenomic measurement of 65 diverse communities via Spearman correlation coefficients. These correlations improved with increasing sequencing depth, within the range of 0.8 0.9 for the most deeply sequenced Human Microbiome Project mock community samples. PanFP is very similar in performance to another recently released tool, PICRUSt, for almost all of survey data analysed here. But, our method is unique in that any OTU building method can be used, as opposed to being limited to closed reference OTU picking strategies against specific reference sequence databases. In conclusion, we developed an automated computational method, which derives an inferred functional profile based on the 16S rRNA gene surveys of microbial communities. The inferred functional profile provides a cost effective way to study complex ecosystems through predicted comparative functional metagenomes and metadata analysis. All PanFP source code and additional documentation are freely available online at GitHub.« less
PanFP: pangenome-based functional profiles for microbial communities.
Jun, Se-Ran; Robeson, Michael S; Hauser, Loren J; Schadt, Christopher W; Gorin, Andrey A
2015-09-26
For decades there has been increasing interest in understanding the relationships between microbial communities and ecosystem functions. Current DNA sequencing technologies allows for the exploration of microbial communities in two principle ways: targeted rRNA gene surveys and shotgun metagenomics. For large study designs, it is often still prohibitively expensive to sequence metagenomes at both the breadth and depth necessary to statistically capture the true functional diversity of a community. Although rRNA gene surveys provide no direct evidence of function, they do provide a reasonable estimation of microbial diversity, while being a very cost-effective way to screen samples of interest for later shotgun metagenomic analyses. However, there is a great deal of 16S rRNA gene survey data currently available from diverse environments, and thus a need for tools to infer functional composition of environmental samples based on 16S rRNA gene survey data. We present a computational method called pangenome-based functional profiles (PanFP), which infers functional profiles of microbial communities from 16S rRNA gene survey data for Bacteria and Archaea. PanFP is based on pangenome reconstruction of a 16S rRNA gene operational taxonomic unit (OTU) from known genes and genomes pooled from the OTU's taxonomic lineage. From this lineage, we derive an OTU functional profile by weighting a pangenome's functional profile with the OTUs abundance observed in a given sample. We validated our method by comparing PanFP to the functional profiles obtained from the direct shotgun metagenomic measurement of 65 diverse communities via Spearman correlation coefficients. These correlations improved with increasing sequencing depth, within the range of 0.8-0.9 for the most deeply sequenced Human Microbiome Project mock community samples. PanFP is very similar in performance to another recently released tool, PICRUSt, for almost all of survey data analysed here. But, our method is unique in that any OTU building method can be used, as opposed to being limited to closed-reference OTU picking strategies against specific reference sequence databases. We developed an automated computational method, which derives an inferred functional profile based on the 16S rRNA gene surveys of microbial communities. The inferred functional profile provides a cost effective way to study complex ecosystems through predicted comparative functional metagenomes and metadata analysis. All PanFP source code and additional documentation are freely available online at GitHub ( https://github.com/srjun/PanFP ).
DOE Office of Scientific and Technical Information (OSTI.GOV)
Jun, Se -Ran; Hauser, Loren John; Schadt, Christopher Warren
For decades there has been increasing interest in understanding the relationships between microbial communities and ecosystem functions. Current DNA sequencing technologies allows for the exploration of microbial communities in two principle ways: targeted rRNA gene surveys and shotgun metagenomics. For large study designs, it is often still prohibitively expensive to sequence metagenomes at both the breadth and depth necessary to statistically capture the true functional diversity of a community. Although rRNA gene surveys provide no direct evidence of function, they do provide a reasonable estimation of microbial diversity, while being a very cost effective way to screen samples of interestmore » for later shotgun metagenomic analyses. However, there is a great deal of 16S rRNA gene survey data currently available from diverse environments, and thus a need for tools to infer functional composition of environmental samples based on 16S rRNA gene survey data. As a result, we present a computational method called pangenome based functional profiles (PanFP), which infers functional profiles of microbial communities from 16S rRNA gene survey data for Bacteria and Archaea. PanFP is based on pangenome reconstruction of a 16S rRNA gene operational taxonomic unit (OTU) from known genes and genomes pooled from the OTU s taxonomic lineage. From this lineage, we derive an OTU functional profile by weighting a pangenome s functional profile with the OTUs abundance observed in a given sample. We validated our method by comparing PanFP to the functional profiles obtained from the direct shotgun metagenomic measurement of 65 diverse communities via Spearman correlation coefficients. These correlations improved with increasing sequencing depth, within the range of 0.8 0.9 for the most deeply sequenced Human Microbiome Project mock community samples. PanFP is very similar in performance to another recently released tool, PICRUSt, for almost all of survey data analysed here. But, our method is unique in that any OTU building method can be used, as opposed to being limited to closed reference OTU picking strategies against specific reference sequence databases. In conclusion, we developed an automated computational method, which derives an inferred functional profile based on the 16S rRNA gene surveys of microbial communities. The inferred functional profile provides a cost effective way to study complex ecosystems through predicted comparative functional metagenomes and metadata analysis. All PanFP source code and additional documentation are freely available online at GitHub.« less
Ma, Xin-Ye; Xie, Cai-Xiang; Liu, Chang; Song, Jing-Yuan; Yao, Hui; Luo, Kun; Zhu, Ying-Jie; Gao, Ting; Pang, Xiao-Hui; Qian, Jun; Chen, Shi-Lin
2010-01-01
Medicinal pteridophytes are an important group used in traditional Chinese medicine; however, there is no simple and universal way to differentiate various species of this group by morphological traits. A novel technology termed "DNA barcoding" could discriminate species by a standard DNA sequence with universal primers and sufficient variation. To determine whether DNA barcoding would be effective for differentiating pteridophyte species, we first analyzed five DNA sequence markers (psbA-trnH intergenic region, rbcL, rpoB, rpoC1, and matK) using six chloroplast genomic sequences from GeneBank and found psbA-trnH intergenic region the best candidate for availability of universal primers. Next, we amplified the psbA-trnH region from 79 samples of medicinal pteridophyte plants. These samples represented 51 species from 24 families, including all the authentic pteridophyte species listed in the Chinese pharmacopoeia (2005 version) and some commonly used adulterants. We found that the sequence of the psbA-trnH intergenic region can be determined with both high polymerase chain reaction (PCR) amplification efficiency (94.1%) and high direct sequencing success rate (81.3%). Combined with GeneBank data (54 species cross 12 pteridophyte families), species discriminative power analysis showed that 90.2% of species could be separated/identified successfully by the TaxonGap method in conjunction with the Basic Local Alignment Search Tool 1 (BLAST1) method. The TaxonGap method results further showed that, for 37 out of 39 separable species with at least two samples each, between-species variation was higher than the relevant within-species variation. Thus, the psbA-trnH intergenic region is a suitable DNA marker for species identification in medicinal pteridophytes.
Odegaard, Justin I; Vincent, John J; Mortimer, Stefanie; Vowles, James V; Ulrich, Bryan C; Banks, Kimberly C; Fairclough, Stephen R; Zill, Oliver A; Sikora, Marcin; Mokhtari, Reza; Abdueva, Diana; Nagy, Rebecca J; Lee, Christine E; Kiedrowski, Lesli A; Paweletz, Cloud P; Eltoukhy, Helmy; Lanman, Richard B; Chudova, Darya I; Talasaz, AmirAli
2018-04-24
Purpose: To analytically and clinically validate a circulating cell-free tumor DNA sequencing test for comprehensive tumor genotyping and demonstrate its clinical feasibility. Experimental Design: Analytic validation was conducted according to established principles and guidelines. Blood-to-blood clinical validation comprised blinded external comparison with clinical droplet digital PCR across 222 consecutive biomarker-positive clinical samples. Blood-to-tissue clinical validation comprised comparison of digital sequencing calls to those documented in the medical record of 543 consecutive lung cancer patients. Clinical experience was reported from 10,593 consecutive clinical samples. Results: Digital sequencing technology enabled variant detection down to 0.02% to 0.04% allelic fraction/2.12 copies with ≤0.3%/2.24-2.76 copies 95% limits of detection while maintaining high specificity [prevalence-adjusted positive predictive values (PPV) >98%]. Clinical validation using orthogonal plasma- and tissue-based clinical genotyping across >750 patients demonstrated high accuracy and specificity [positive percent agreement (PPAs) and negative percent agreement (NPAs) >99% and PPVs 92%-100%]. Clinical use in 10,593 advanced adult solid tumor patients demonstrated high feasibility (>99.6% technical success rate) and clinical sensitivity (85.9%), with high potential actionability (16.7% with FDA-approved on-label treatment options; 72.0% with treatment or trial recommendations), particularly in non-small cell lung cancer, where 34.5% of patient samples comprised a directly targetable standard-of-care biomarker. Conclusions: High concordance with orthogonal clinical plasma- and tissue-based genotyping methods supports the clinical accuracy of digital sequencing across all four types of targetable genomic alterations. Digital sequencing's clinical applicability is further supported by high rates of technical success and biomarker target discovery. Clin Cancer Res; 1-11. ©2018 AACR. ©2018 American Association for Cancer Research.
HIV Transmission Networks in the San Diego–Tijuana Border Region
Mehta, Sanjay R.; Wertheim, Joel O.; Brouwer, Kimberly C.; Wagner, Karla D.; Chaillon, Antoine; Strathdee, Steffanie; Patterson, Thomas L.; Rangel, Maria G.; Vargas, Mlenka; Murrell, Ben; Garfein, Richard; Little, Susan J.; Smith, Davey M.
2015-01-01
Background HIV sequence data can be used to reconstruct local transmission networks. Along international borders, like the San Diego–Tijuana region, understanding the dynamics of HIV transmission across reported risks, racial/ethnic groups, and geography can help direct effective prevention efforts on both sides of the border. Methods We gathered sociodemographic, geographic, clinical, and viral sequence data from HIV infected individuals participating in ten studies in the San Diego–Tijuana border region. Phylogenetic and network analysis was performed to infer putative relationships between HIV sequences. Correlates of identified clusters were evaluated and spatiotemporal relationships were explored using Bayesian phylogeographic analysis. Findings After quality filtering, 843 HIV sequences with associated demographic data and 263 background sequences from the region were analyzed, and 138 clusters were inferred (2–23 individuals). Overall, the rate of clustering did not differ by ethnicity, residence, or sex, but bisexuals were less likely to cluster than heterosexuals or men who have sex with men (p = 0.043), and individuals identifying as white (p ≤ 0.01) were more likely to cluster than other races. Clustering individuals were also 3.5 years younger than non-clustering individuals (p < 0.001). Although the sampled San Diego and Tijuana epidemics were phylogenetically compartmentalized, five clusters contained individuals residing on both sides of the border. Interpretation This study sampled ~ 7% of HIV infected individuals in the border region, and although the sampled networks on each side of the border were largely separate, there was evidence of persistent bidirectional cross-border transmissions that linked risk groups, thus highlighting the importance of the border region as a “melting pot” of risk groups. Funding NIH, VA, and Pendleton Foundation. PMID:26629540
HIV Transmission Networks in the San Diego-Tijuana Border Region.
Mehta, Sanjay R; Wertheim, Joel O; Brouwer, Kimberly C; Wagner, Karla D; Chaillon, Antoine; Strathdee, Steffanie; Patterson, Thomas L; Rangel, Maria G; Vargas, Mlenka; Murrell, Ben; Garfein, Richard; Little, Susan J; Smith, Davey M
2015-10-01
HIV sequence data can be used to reconstruct local transmission networks. Along international borders, like the San Diego-Tijuana region, understanding the dynamics of HIV transmission across reported risks, racial/ethnic groups, and geography can help direct effective prevention efforts on both sides of the border. We gathered sociodemographic, geographic, clinical, and viral sequence data from HIV infected individuals participating in ten studies in the San Diego-Tijuana border region. Phylogenetic and network analysis was performed to infer putative relationships between HIV sequences. Correlates of identified clusters were evaluated and spatiotemporal relationships were explored using Bayesian phylogeographic analysis. After quality filtering, 843 HIV sequences with associated demographic data and 263 background sequences from the region were analyzed, and 138 clusters were inferred (2-23 individuals). Overall, the rate of clustering did not differ by ethnicity, residence, or sex, but bisexuals were less likely to cluster than heterosexuals or men who have sex with men (p = 0.043), and individuals identifying as white (p ≤ 0.01) were more likely to cluster than other races. Clustering individuals were also 3.5 years younger than non-clustering individuals (p < 0.001). Although the sampled San Diego and Tijuana epidemics were phylogenetically compartmentalized, five clusters contained individuals residing on both sides of the border. This study sampled ~ 7% of HIV infected individuals in the border region, and although the sampled networks on each side of the border were largely separate, there was evidence of persistent bidirectional cross-border transmissions that linked risk groups, thus highlighting the importance of the border region as a "melting pot" of risk groups. NIH, VA, and Pendleton Foundation.
Ethical issues in consumer genome sequencing: Use of consumers' samples and data
Niemiec, Emilia; Howard, Heidi Carmen
2016-01-01
High throughput approaches such as whole genome sequencing (WGS) and whole exome sequencing (WES) create an unprecedented amount of data providing powerful resources for clinical care and research. Recently, WGS and WES services have been made available by commercial direct-to-consumer (DTC) companies. The DTC offer of genetic testing (GT) has already brought attention to potentially problematic issues such as the adequacy of consumers' informed consent and transparency of companies' research activities. In this study, we analysed the websites of four DTC GT companies offering WGS and/or WES with regard to their policies governing storage and future use of consumers' data and samples. The results are discussed in relation to recommendations and guiding principles such as the “Statement of the European Society of Human Genetics on DTC GT for health-related purposes” (2010) and the “Framework for responsible sharing of genomic and health-related data” (Global Alliance for Genomics and Health, 2014). The analysis reveals that some companies may store and use consumers' samples or sequencing data for unspecified research and share the data with third parties. Moreover, the companies do not provide sufficient or clear information to consumers about this, which can undermine the validity of the consent process. Furthermore, while all companies state that they provide privacy safeguards for data and mention the limitations of these, information about the possibility of re-identification is lacking. Finally, although the companies that may conduct research do include information regarding proprietary claims and commercialisation of the results, it is not clear whether consumers are aware of the consequences of these policies. These results indicate that DTC GT companies still need to improve the transparency regarding handling of consumers' samples and data, including having an explicit and clear consent process for research activities. PMID:27047756
Lopez, Philippe; Halary, Sébastien; Bapteste, Eric
2015-10-26
Microbial genetic diversity is often investigated via the comparison of relatively similar 16S molecules through multiple alignments between reference sequences and novel environmental samples using phylogenetic trees, direct BLAST matches, or phylotypes counts. However, are we missing novel lineages in the microbial dark universe by relying on standard phylogenetic and BLAST methods? If so, how can we probe that universe using alternative approaches? We performed a novel type of multi-marker analysis of genetic diversity exploiting the topology of inclusive sequence similarity networks. Our protocol identified 86 ancient gene families, well distributed and rarely transferred across the 3 domains of life, and retrieved their environmental homologs among 10 million predicted ORFs from human gut samples and other metagenomic projects. Numerous highly divergent environmental homologs were observed in gut samples, although the most divergent genes were over-represented in non-gut environments. In our networks, most divergent environmental genes grouped exclusively with uncultured relatives, in maximal cliques. Sequences within these groups were under strong purifying selection and presented a range of genetic variation comparable to that of a prokaryotic domain. Many genes families included environmental homologs that were highly divergent from cultured homologs: in 79 gene families (including 18 ribosomal proteins), Bacteria and Archaea were less divergent than some groups of environmental sequences were to any cultured or viral homologs. Moreover, some groups of environmental homologs branched very deeply in phylogenetic trees of life, when they were not too divergent to be aligned. These results underline how limited our understanding of the most diverse elements of the microbial world remains, and encourage a deeper exploration of natural communities and their genetic resources, hinting at the possibility that still unknown yet major divisions of life have yet to be discovered.
Hahn, Andrea; Bendall, Matthew L; Gibson, Keylie M; Chaney, Hollis; Sami, Iman; Perez, Geovanny F; Koumbourlis, Anastassios C; McCaffrey, Timothy A; Freishtat, Robert J; Crandall, Keith A
2018-01-01
Cystic fibrosis (CF) is an autosomal recessive disease associated with recurrent lung infections that can lead to morbidity and mortality. The impact of antibiotics for treatment of acute pulmonary exacerbations on the CF airway microbiome remains unclear with prior studies giving conflicting results and being limited by their use of 16S ribosomal RNA sequencing. Our primary objective was to validate the use of true single molecular sequencing (tSMS) and PathoScope in the analysis of the CF airway microbiome. Three control samples were created with differing amounts of Burkholderia cepacia , Pseudomonas aeruginosa , and Prevotella melaninogenica , three common bacteria found in cystic fibrosis lungs. Paired sputa were also obtained from three study participants with CF before and >6 days after initiation of antibiotics. Antibiotic resistant B. cepacia and P. aeruginosa were identified in concurrently obtained respiratory cultures. Direct sequencing was performed using tSMS, and filtered reads were aligned to reference genomes from NCBI using PathoScope and Kraken and unique clade-specific marker genes using MetaPhlAn. A total of 180-518 K of 6-12 million filtered reads were aligned for each sample. Detection of known pathogens in control samples was most successful using PathoScope. In the CF sputa, alpha diversity measures varied based on the alignment method used, but similar trends were found between pre- and post-antibiotic samples. PathoScope outperformed Kraken and MetaPhlAn in our validation study of artificial bacterial community controls and also has advantages over Kraken and MetaPhlAn of being able to determine bacterial strains and the presence of fungal organisms. PathoScope can be confidently used when evaluating metagenomic data to determine CF airway microbiome diversity.
Duarte, Gabriela Frois; Rosado, Alexandre Soares; Seldin, Lucy; de Araujo, Welington; van Elsas, Jan Dirk
2001-01-01
The selective effects of sulfur-containing hydrocarbons, with respect to changes in bacterial community structure and selection of desulfurizing organisms and genes, were studied in soil. Samples taken from a polluted field soil (A) along a concentration gradient of sulfurous oil and from soil microcosms treated with dibenzothiophene (DBT)-containing petroleum (FSL soil) were analyzed. Analyses included plate counts of total bacteria and of DBT utilizers, molecular community profiling via soil DNA-based PCR-denaturing gradient gel electrophoresis (PCR-DGGE), and detection of genes that encode enzymes involved in the desulfurization of hydrocarbons, i.e., dszA, dszB, and dszC.Data obtained from the A soil showed no discriminating effects of oil levels on the culturable bacterial numbers on either medium used. Generally, counts of DBT degraders were 10- to 100-fold lower than the total culturable counts. However, PCR-DGGE showed that the numbers of bands detected in the molecular community profiles decreased with increasing oil content of the soil. Analysis of the sequences of three prominent bands of the profiles generated with the highly polluted soil samples suggested that the underlying organisms were related to Actinomyces sp., Arthrobacter sp., and a bacterium of uncertain affiliation. dszA, dszB, and dszC genes were present in all A soil samples, whereas a range of unpolluted soils gave negative results in this analysis. Results from the study of FSL soil revealed minor effects of the petroleum-DBT treatment on culturable bacterial numbers and clear effects on the DBT-utilizing communities. The molecular community profiles were largely stable over time in the untreated soil, whereas they showed a progressive change over time following treatment with DBT-containing petroleum. Direct PCR assessment revealed the presence of dszB-related signals in the untreated FSL soil and the apparent selection of dszA- and dszC-related sequences by the petroleum-DBT treatment. PCR-DGGE applied to sequential enrichment cultures in DBT-containing sulfur-free basal salts medium prepared from the A and treated FSL soils revealed the selection of up to 10 distinct bands. Sequencing a subset of these bands provided evidence for the presence of organisms related to Pseudomonas putida, a Pseudomonas sp., Stenotrophomonas maltophilia, and Rhodococcus erythropolis. Several of 52 colonies obtained from the A and FSL soils on agar plates with DBT as the sole sulfur source produced bands that matched the migration of bands selected in the enrichment cultures. Evidence for the presence of dszB in 12 strains was obtained, whereas dszA and dszC genes were found in only 7 and 6 strains, respectively. Most of the strains carrying dszA or dszC were classified as R. erythropolis related, and all revealed the capacity to desulfurize DBT. A comparison of 37 dszA sequences, obtained via PCR from the A and FSL soils, from enrichments of these soils, and from isolates, revealed the great similarity of all sequences to the canonical (R. erythropolis strain IGTS8) dszA sequence and a large degree of internal conservation. The 37 sequences recovered were grouped in three clusters. One group, consisting of 30 sequences, was minimally 98% related to the IGTS8 sequence, a second group of 2 sequences was slightly different, and a third group of 5 sequences was 95% similar. The first two groups contained sequences obtained from both soil types and enrichment cultures (including isolates), but the last consisted of sequences obtained directly from the polluted A soil. PMID:11229891
2010-01-01
Background BamHI-A rightward frame-1 (BARF1) is a carcinoma-specific Epstein-Barr virus (EBV) encoded oncogene. Here we describe the BARF1 sequence diversity in nasopharyngeal carcinoma (NPC), other EBV-related diseases and Indonesian healthy EBV carriers in relation to EBV genotype, viral load and serology markers. Nasopharyngeal brushings from 56 NPC cases, blood or tissue from 15 other EBV-related disorders, spontaneous B cell lines (LCL) from 5 Indonesian healthy individuals and several prototype EBV isolates were analysed by PCR-direct sequencing. Results Most NPC isolates revealed specific BARF1 nucleotide changes compared to prototype B95-8 virus. At the protein level these mutations resulted in 3 main substitutions (V29A, W72G, H130R), which are not considered to cause gross tertiary structure alterations in the hexameric BARF1 protein. At least one amino acid conversion was detected in 80.3% of NPC samples compared to 33.3% of non-NPC samples (p < 0.001) and 40.0% of healthy LCLs (p = 0.074). NPC isolates also showed more frequent codon mutation than non-NPC samples. EBV strain typing revealed most isolates as EBV type 1. The viral load of either NPC or non-NPC samples was high, but only in non- NPC group it related to a particular BARF1 variant. Serology on NPC sera using IgA/EBNA-1 ELISA, IgA/VCA-p18 ELISA and immunoblot score showed no relation with BARF1 sequence diversity (p = 0.802, 0.382 and 0.058, respectively). NPC patients had variable antibody reactivity against purified hexameric NPC-derived BARF1 irrespective of the endogenous BARF1 sequence. Conclusion The sequence variation of BARF1 observed in Indonesian NPC patients and controls may reflect a natural selection of EBV strains unlikely to be predisposing to carcinogenesis. The conserved nature of BARF1 may reflect an important role in EBV (epithelial) persistence. PMID:20849661
Hutajulu, Susanna H; Hoebe, Eveline K; Verkuijlen, Sandra Awm; Fachiroh, Jajah; Hariwijanto, Bambang; Haryana, Sofia M; Stevens, Servi Jc; Greijer, Astrid E; Middeldorp, Jaap M
2010-09-19
BamHI-A rightward frame-1 (BARF1) is a carcinoma-specific Epstein-Barr virus (EBV) encoded oncogene. Here we describe the BARF1 sequence diversity in nasopharyngeal carcinoma (NPC), other EBV-related diseases and Indonesian healthy EBV carriers in relation to EBV genotype, viral load and serology markers. Nasopharyngeal brushings from 56 NPC cases, blood or tissue from 15 other EBV-related disorders, spontaneous B cell lines (LCL) from 5 Indonesian healthy individuals and several prototype EBV isolates were analysed by PCR-direct sequencing. Most NPC isolates revealed specific BARF1 nucleotide changes compared to prototype B95-8 virus. At the protein level these mutations resulted in 3 main substitutions (V29A, W72G, H130R), which are not considered to cause gross tertiary structure alterations in the hexameric BARF1 protein. At least one amino acid conversion was detected in 80.3% of NPC samples compared to 33.3% of non-NPC samples (p < 0.001) and 40.0% of healthy LCLs (p = 0.074). NPC isolates also showed more frequent codon mutation than non-NPC samples. EBV strain typing revealed most isolates as EBV type 1. The viral load of either NPC or non-NPC samples was high, but only in non- NPC group it related to a particular BARF1 variant. Serology on NPC sera using IgA/EBNA-1 ELISA, IgA/VCA-p18 ELISA and immunoblot score showed no relation with BARF1 sequence diversity (p = 0.802, 0.382 and 0.058, respectively). NPC patients had variable antibody reactivity against purified hexameric NPC-derived BARF1 irrespective of the endogenous BARF1 sequence. The sequence variation of BARF1 observed in Indonesian NPC patients and controls may reflect a natural selection of EBV strains unlikely to be predisposing to carcinogenesis. The conserved nature of BARF1 may reflect an important role in EBV (epithelial) persistence.
Deep Sequencing to Identify the Causes of Viral Encephalitis
Chan, Benjamin K.; Wilson, Theodore; Fischer, Kael F.; Kriesel, John D.
2014-01-01
Deep sequencing allows for a rapid, accurate characterization of microbial DNA and RNA sequences in many types of samples. Deep sequencing (also called next generation sequencing or NGS) is being developed to assist with the diagnosis of a wide variety of infectious diseases. In this study, seven frozen brain samples from deceased subjects with recent encephalitis were investigated. RNA from each sample was extracted, randomly reverse transcribed and sequenced. The sequence analysis was performed in a blinded fashion and confirmed with pathogen-specific PCR. This analysis successfully identified measles virus sequences in two brain samples and herpes simplex virus type-1 sequences in three brain samples. No pathogen was identified in the other two brain specimens. These results were concordant with pathogen-specific PCR and partially concordant with prior neuropathological examinations, demonstrating that deep sequencing can accurately identify viral infections in frozen brain tissue. PMID:24699691
Microbial Diversity of Groundwater from Deep Subsurface Environment
NASA Astrophysics Data System (ADS)
Lin, L.; Onstott, T. C.; Hall, J.
2002-12-01
The subsurface environment harbors one of the most abundant reservoirs of biomass on Earth. The distribution of microbial ecosystems and the diversity of microbial metabolisms there remained poorly understood due to lack of detailed sampling over three-dimensional space with extremely heterogeneous characteristics. South African Au mines, however, provide the best access in the world to various types of groundwater and rocks at depths up to 4 km below surface. In this study, we present our recent analyses of microbial community structure of groundwater (with residence time of several million years) collected from depths between 850 to 1500 mbsl of Beatrix Au mine, South Africa. Five groundwater samples were collected anaerobically from freshly drilling boreholes with flow rates of 1 to 38 L/min. Cells were concentrated through filtration and total DNA were extracted from filters and PCR-amplified with primers targeting 16S rDNA gene. The amplicons were cloned and digested with restriction enzymes to identify the unique clone type. Sequences were obtained through direct sequencing of representative clones and compared with the closest matching sequences deposited in the gene bank for the construction of phylogenetic tree. The archaeal signatures were only found in one sample and close to the lineage of methanosarcina. The most predominant ribotype was similar to the environmental clone found in the same mine under the species level while the rest of ribotypes were either close to those capable of methanogenesis from long-chain alkanes or found in rice field or were distant from other environmental clones reported in previous study (Takai et al., 2001). The bacterial community exhibited a wide range of diversity among samples. Most samples were dominated by sequences close to alpha proteobacteria with various proportions of beta, gamma proteobacteria and environmental clones. A significant proportion of sequences close to thermophilic delta proteobacteria and clostridia were observed from one of the deepest samples. Since the in-situ water or rock temperature at sampling location is below the temperature range for thermophilic bacteria, this might indicate that the microorganisms once colonizing in the deeper and hotter portion of crust were transported upward with hydrothermal fluid and preserved in the sealed water pocket for a time scale of millions of years.
Head direction cells in the postsubiculum do not show replay of prior waking sequences during sleep
Brandon, Mark P.; Bogaard, Andrew; Andrews, Chris M.; Hasselmo, Michael E.
2011-01-01
During slow-wave sleep and REM sleep, hippocampal place cells in the rat show replay of sequences previously observed during waking. We tested the hypothesis from computational modelling that the temporal structure of REM sleep replay could arise from an interplay of place cells with head direction cells in the postsubiculum. Physiological single-unit recording was performed simultaneously from five or more head direction or place by head direction cells in the postsubiculum during running on a circular track allowing sampling of a full range of head directions, and during sleep periods before and after running on the circular track. Data analysis compared the spiking activity during individual REM periods with waking as in previous analysis procedures for REM sleep. We also used a new procedure comparing groups of similar runs during waking with REM sleep periods. There was no consistent evidence for a statistically significant correlation of the temporal structure of spiking during REM sleep with spiking during waking running periods. Thus, the spiking activity of head direction cells during REM sleep does not show replay of head direction cell activity occurring during a previous waking period of running on the task. In addition, we compared the spiking of postsubiculum neurons during hippocampal sharp wave ripple events. We show that head direction cells are not activated during sharp wave ripples, while neurons responsive to place in the postsubiculum show reliable spiking at ripple events. PMID:21509854
NASA Astrophysics Data System (ADS)
Salman Shahid, Syed; Gaul, Robert T.; Kerskens, Christian; Flamini, Vittoria; Lally, Caitríona
2017-12-01
Diffusion magnetic resonance imaging (dMRI) can provide insights into the microstructure of intact arterial tissue. The current study employed high magnetic field MRI to obtain ultra-high resolution dMRI at an isotropic voxel resolution of 117 µm3 in less than 2 h of scan time. A parameter selective single shell (128 directions) diffusion-encoding scheme based on Stejskel-Tanner sequence with echo-planar imaging (EPI) readout was used. EPI segmentation was used to reduce the echo time (TE) and to minimise the susceptibility-induced artefacts. The study utilised the dMRI analysis with diffusion tensor imaging (DTI) framework to investigate structural heterogeneity in intact arterial tissue and to quantify variations in tissue composition when the tissue is cut open and flattened. For intact arterial samples, the region of interest base comparison showed significant differences in fractional anisotropy and mean diffusivity across the media layer (p < 0.05). For open cut flat samples, DTI based directionally invariant indices did not show significant differences across the media layer. For intact samples, fibre tractography based indices such as calculated helical angle and fibre dispersion showed near circumferential alignment and a high degree of fibre dispersion, respectively. This study demonstrates the feasibility of fast dMRI acquisition with ultra-high spatial and angular resolution at 7 T. Using the optimised sequence parameters, this study shows that DTI based markers are sensitive to local structural changes in intact arterial tissue samples and these markers may have clinical relevance in the diagnosis of atherosclerosis and aneurysm.
Apparatus for improved DNA sequencing
Douthart, R.J.; Crowell, S.L.
1996-05-07
This invention is a means for the rapid sequencing of DNA samples. More specifically, it consists of a new design direct blotting electrophoresis unit. The DNA sequence is deposited on a membrane attached to a rotating drum. Initial data compaction is facilitated by the use of a machined multi-channeled plate called a ribbon channel plate. Each channel is an isolated mini gel system much like a gel filled capillary. The system as a whole, however, is in a slab gel like format with the advantages of uniformity and easy reusability. The system can be used in different embodiments. The drum system is unique in that after deposition the drum rotates the deposited DNA into a large non-buffer open space where processing and detection can occur. The drum can also be removed in toto to special workstations for downstream processing, multiplexing and detection. 18 figs.
Apparatus for improved DNA sequencing
Douthart, Richard J.; Crowell, Shannon L.
1996-01-01
This invention is a means for the rapid sequencing of DNA samples. More specifically, it consists of a new design direct blotting electrophoresis unit. The DNA sequence is deposited on a membrane attached to a rotating drum. Initial data compaction is facilitated by the use of a machined multi-channeled plate called a ribbon channel plate. Each channel is an isolated mini gel system much like a gel filled capillary. The system as a whole, however, is in a slab gel like format with the advantages of uniformity and easy reusability. The system can be used in different embodiments. The drum system is unique in that after deposition the drum rotates the deposited DNA into a large non-buffer open space where processing and detection can occur. The drum can also be removed in toto to special workstations for downstream processing, multiplexing and detection.
Fritz, Megan L; Miller, James R; Bayoh, M Nabie; Vulule, John M; Landgraf, Jeffrey R; Walker, Edward D
2012-01-01
A DNA-DNA hybridization method, reverse dot blot analysis (RDBA), was used for identification of Anopheles gambiae s.s. and An. arabiensis hosts. Of 299 blood fed and half gravid An. gambiae s.l. collected from Kisian, Kenya, 244 individuals were identifiable to species; 69.5% were An. arabiensis, and 29.5% were An. gambiae s.s. Host identifications with RDBA were comparable to conventional PCR followed by direct sequencing of amplicons of the vertebrate mitochondrial cytochrome B gene. Of the 174 amplicon-producing samples used for comparison of these two methods, 147 were identifiable by direct sequencing, and 139 of these same by RDBA. An. arabiensis blood meals were mostly (>90%) bovine in origin, whereas An. gambiae s.s. fed upon humans > 90% of the time. RDBA detected that 2 of 112 An. arabiensis had blood from more than one host species, whereas PCR and direct sequencing did not. Recent insecticide-treated bednet (ITN) use in Kisian has likely caused the shift in the dominant vector species from An. gambiae s.s. to An. arabiensis. RDBA provides an opportunity to study changes in host-feeding by members of the An. gambiae complex as a response to the broadening distribution of vector control measures targeting host-selection behaviors. PMID:24188164
NASA Astrophysics Data System (ADS)
Meyer, M.; Morris, A.; Anderson, M.; MacLeod, C. J.
2014-12-01
The Oman ophiolite is an important natural laboratory for understanding the construction of oceanic crust at fast spreading axes and its subsequent tectonic evolution. Previous paleomagnetic research in lavas of the northern ophiolitic blocks (Perrin et al., 2000, Mar. Geophys. Res.) has demonstrated substantial clockwise intraoceanic tectonic rotations. Paleomagnetic data from lower crustal sequences in the southern blocks, however, have been more equivocal due to complications arising from remagnetization, and have been used to infer that clockwise rotations seen in the north are internal to the ophiolite rather than regionally significant (Weiler, 2000, Mar. Geophys. Res.). Here we demonstrate the importance and advantages of sampling crustal transects in the ophiolite in order to understand the nature and variability in magnetization directions. By systematically sampling the lower crustal sequence exposed in Wadi Abyad (Rustaq block) we resolve for the first time in a single section a pattern of remagnetized lowermost gabbros and retention of earlier magnetizations by uppermost gabbros and the overlying dyke-rooting zone. Results are supported by a positive fold test that shows that remagnetization of lower gabbros occurred prior to the Campanian structural disruption of the Moho. NW-directed remagnetized remanences in the lower units are consistent with those used by Weiler (2000, Mar. Geophys. Res.) to infer lack of significant rotation of the southern blocks and to argue, therefore, that rotation of the northern blocks was internal to the ophiolite. In contrast, E/ENE-directed remanences in the uppermost levels of Wadi Abyad imply a large, clockwise rotation of the Rustaq block, of a sense and magnitude consistent with intraoceanic rotations inferred from extrusive sections in the northern blocks. We conclude that without the control provided by systematic crustal sampling, the potential for different remanence directions being acquired at different times may lead to erroneous tectonic interpretation.
Narrow field electromagnetic sensor system and method
McEwan, Thomas E.
1996-01-01
A narrow field electromagnetic sensor system and method of sensing a characteristic of an object provide the capability to realize a characteristic of an object such as density, thickness, or presence, for any desired coordinate position on the object. One application is imaging. The sensor can also be used as an obstruction detector or an electronic trip wire with a narrow field without the disadvantages of impaired performance when exposed to dirt, snow, rain, or sunlight. The sensor employs a transmitter for transmitting a sequence of electromagnetic signals in response to a transmit timing signal, a receiver for sampling only the initial direct RF path of the electromagnetic signal while excluding all other electromagnetic signals in response to a receive timing signal, and a signal processor for processing the sampled direct RF path electromagnetic signal and providing an indication of the characteristic of an object. Usually, the electromagnetic signal is a short RF burst and the obstruction must provide a substantially complete eclipse of the direct RF path. By employing time-of-flight techniques, a timing circuit controls the receiver to sample only the initial direct RF path of the electromagnetic signal while not sampling indirect path electromagnetic signals. The sensor system also incorporates circuitry for ultra-wideband spread spectrum operation that reduces interference to and from other RF services while allowing co-location of multiple electronic sensors without the need for frequency assignments.
Narrow field electromagnetic sensor system and method
McEwan, T.E.
1996-11-19
A narrow field electromagnetic sensor system and method of sensing a characteristic of an object provide the capability to realize a characteristic of an object such as density, thickness, or presence, for any desired coordinate position on the object. One application is imaging. The sensor can also be used as an obstruction detector or an electronic trip wire with a narrow field without the disadvantages of impaired performance when exposed to dirt, snow, rain, or sunlight. The sensor employs a transmitter for transmitting a sequence of electromagnetic signals in response to a transmit timing signal, a receiver for sampling only the initial direct RF path of the electromagnetic signal while excluding all other electromagnetic signals in response to a receive timing signal, and a signal processor for processing the sampled direct RF path electromagnetic signal and providing an indication of the characteristic of an object. Usually, the electromagnetic signal is a short RF burst and the obstruction must provide a substantially complete eclipse of the direct RF path. By employing time-of-flight techniques, a timing circuit controls the receiver to sample only the initial direct RF path of the electromagnetic signal while not sampling indirect path electromagnetic signals. The sensor system also incorporates circuitry for ultra-wideband spread spectrum operation that reduces interference to and from other RF services while allowing co-location of multiple electronic sensors without the need for frequency assignments. 12 figs.
Lisi, Simonetta; Chirichella, Michele; Arisi, Ivan; Goracci, Martina; Cremisi, Federico; Cattaneo, Antonino
2017-01-01
Antibody libraries are important resources to derive antibodies to be used for a wide range of applications, from structural and functional studies to intracellular protein interference studies to developing new diagnostics and therapeutics. Whatever the goal, the key parameter for an antibody library is its complexity (also known as diversity), i.e. the number of distinct elements in the collection, which directly reflects the probability of finding in the library an antibody against a given antigen, of sufficiently high affinity. Quantitative evaluation of antibody library complexity and quality has been for a long time inadequately addressed, due to the high similarity and length of the sequences of the library. Complexity was usually inferred by the transformation efficiency and tested either by fingerprinting and/or sequencing of a few hundred random library elements. Inferring complexity from such a small sampling is, however, very rudimental and gives limited information about the real diversity, because complexity does not scale linearly with sample size. Next-generation sequencing (NGS) has opened new ways to tackle the antibody library complexity quality assessment. However, much remains to be done to fully exploit the potential of NGS for the quantitative analysis of antibody repertoires and to overcome current limitations. To obtain a more reliable antibody library complexity estimate here we show a new, PCR-free, NGS approach to sequence antibody libraries on Illumina platform, coupled to a new bioinformatic analysis and software (Diversity Estimator of Antibody Library, DEAL) that allows to reliably estimate the complexity, taking in consideration the sequencing error. PMID:28505201
Evaluation of the genetic diversity of Plum pox virus in a single plum tree.
Predajňa, Lukáš; Šubr, Zdeno; Candresse, Thierry; Glasa, Miroslav
2012-07-01
Genetic diversity of Plum pox virus (PPV) and its distribution within a single perennial woody host (plum, Prunus domestica) has been evaluated. A plum tree was triply infected by chip-budding with PPV-M, PPV-D and PPV-Rec isolates in 2003 and left to develop untreated under open field conditions. In September 2010 leaf and fruit samples were collected from different parts of the tree canopy. A 745-bp NIb-CP fragment of PPV genome, containing the hypervariable region encoding the CP N-terminal end was amplified by RT-PCR from each sample and directly sequenced to determine the dominant sequence. In parallel, the PCR products were cloned and a total of 105 individual clones were sequenced. Sequence analysis revealed that after 7 years of infection, only PPV-M was still detectable in the tree and that the two other isolates (PPV-Rec and PPV-D) had been displaced. Despite the fact that the analysis targeted a relatively short portion of the genome, a substantial amount of intra-isolate variability was observed for PPV-M. A total of 51 different haplotypes could be identified from the 105 individual sequences, two of which were largely dominant. However, no clear-cut structuration of the viral population by the tree architecture could be highlighted although the results obtained suggest the possibility of intra-leaf/fruit differentiation of the viral population. Comparison of the consensus sequence with the original source isolate showed no difference, suggesting within-plant stability of this original isolate under open field conditions. Copyright © 2012 Elsevier B.V. All rights reserved.
Fantini, Marco; Pandolfini, Luca; Lisi, Simonetta; Chirichella, Michele; Arisi, Ivan; Terrigno, Marco; Goracci, Martina; Cremisi, Federico; Cattaneo, Antonino
2017-01-01
Antibody libraries are important resources to derive antibodies to be used for a wide range of applications, from structural and functional studies to intracellular protein interference studies to developing new diagnostics and therapeutics. Whatever the goal, the key parameter for an antibody library is its complexity (also known as diversity), i.e. the number of distinct elements in the collection, which directly reflects the probability of finding in the library an antibody against a given antigen, of sufficiently high affinity. Quantitative evaluation of antibody library complexity and quality has been for a long time inadequately addressed, due to the high similarity and length of the sequences of the library. Complexity was usually inferred by the transformation efficiency and tested either by fingerprinting and/or sequencing of a few hundred random library elements. Inferring complexity from such a small sampling is, however, very rudimental and gives limited information about the real diversity, because complexity does not scale linearly with sample size. Next-generation sequencing (NGS) has opened new ways to tackle the antibody library complexity quality assessment. However, much remains to be done to fully exploit the potential of NGS for the quantitative analysis of antibody repertoires and to overcome current limitations. To obtain a more reliable antibody library complexity estimate here we show a new, PCR-free, NGS approach to sequence antibody libraries on Illumina platform, coupled to a new bioinformatic analysis and software (Diversity Estimator of Antibody Library, DEAL) that allows to reliably estimate the complexity, taking in consideration the sequencing error.
Evaluation of sequencing approaches for high-throughput ...
Whole-genome in vitro transcriptomics has shown the capability to identify mechanisms of action and estimates of potency for chemical-mediated effects in a toxicological framework, but with limited throughput and high cost. We present the evaluation of three toxicogenomics platforms for potential application to high-throughput screening: 1. TempO-Seq utilizing custom designed paired probes per gene; 2. Targeted sequencing (TSQ) utilizing Illumina’s TruSeq RNA Access Library Prep Kit containing tiled exon-specific probe sets; 3. Low coverage whole transcriptome sequencing (LSQ) using Illumina’s TruSeq Stranded mRNA Kit. Each platform was required to cover the ~20,000 genes of the full transcriptome, operate directly with cell lysates, and be automatable with 384-well plates. Technical reproducibility was assessed using MAQC control RNA samples A and B, while functional utility for chemical screening was evaluated using six treatments at a single concentration after 6 hr in MCF7 breast cancer cells: 10 µM chlorpromazine, 10 µM ciclopriox, 10 µM genistein, 100 nM sirolimus, 1 µM tanespimycin, and 1 µM trichostatin A. All RNA samples and chemical treatments were run with 5 technical replicates. The three platforms achieved different read depths, with the TempO-Seq having ~34M mapped reads per sample, while TSQ and LSQ averaged 20M and 11M aligned reads per sample, respectively. Inter-replicate correlation averaged ≥0.95 for raw log2 expression values i
Rapid identification of causative species in patients with Old World leishmaniasis.
Minodier, P; Piarroux, R; Gambarelli, F; Joblet, C; Dumon, H
1997-01-01
Conventional methods for the identification of species of Leishmania parasite causing infections have limitations. By using a DNA-based alternative, the present study tries to develop a new tool for this purpose. Thirty-three patients living in Marseilles (in the south of France) were suffering from visceral or cutaneous leishmaniasis. DNA of the parasite in clinical samples (bone marrow, peripheral blood, or skin) from these patients were amplified by PCR and were directly sequenced. The sequences observed were compared to these of 30 strains of the genus causing Old World leishmaniasis collected in Europe, Africa, or Asia. In the analysis of the sequences of the strains, two different sequence patterns for Leishmania infantum, one sequence for Leishmania donovani, one sequence for Leishmania major, two sequences for Leishmania tropica, and one sequence for Leishmania aethiopica were obtained. Four sequences were observed among the strains from the patients: one was similar to the sequence for the L. major strains, two were identical to the sequences for the L. infantum strains, and the last sequence was not observed within the strains but had a high degree of homology with the sequences of the L. infantum and L. donovani strains. The L. infantum strains from all immunocompetent patients had the same sequence. The L. infantum strains from immunodeficient patients suffering from visceral leishmaniasis had three different sequences. This fact might signify that some variants of L. infantum acquire pathogenicity exclusively in immunocompromised patients. To dispense with the sequencing step, a restriction assay with HaeIII was used. Some restriction patterns might support genetic exchanges in members of the genus Leishmania. PMID:9316906
Trémeaux, P; Caporossi, A; Ramière, C; Santoni, E; Tarbouriech, N; Thélu, M-A; Fusillier, K; Geneletti, L; François, O; Leroy, V; Burmeister, W P; André, P; Morand, P; Larrat, S
2016-05-01
Directly acting antiviral drugs have contributed considerable progress to hepatitis C virus (HCV) treatment, but they show variable activity depending on virus genotypes and subtypes. Therefore, accurate genotyping including recombinant form detection is still of major importance, as is the detection of resistance-associated mutations in case of therapeutic failure. To meet these goals, an approach to amplify the HCV near-complete genome with a single long-range PCR and sequence it with Roche GS Junior was developed. After optimization, the overall amplification success rate was 73% for usual genotypes (i.e. HCV 1a, 1b, 3a and 4a, 16/22) and 45% for recombinant forms RF_2k/1b (5/11). After pyrosequencing and subsequent de novo assembly, a near-full-length genomic consensus sequence was obtained for 19 of 21 samples. The genotype and subtype were confirmed by phylogenetic analysis for every sample, including the suspected recombinant forms. Resistance-associated mutations were detected in seven of 13 samples at baseline, in the NS3 (n = 3) or NS5A (n = 4) region. Of these samples, the treatment of one patient included daclatasvir, and that patient experienced a relapse. Virus sequences from pre- and posttreatment samples of four patients who experienced relapse after sofosbuvir-based therapy were compared: the selected variants seem too far from the NS5B catalytic site to be held responsible. Although tested on a limited set of samples and with technical improvements still necessary, this assay has proven to be successful for both genotyping and resistance-associated variant detection on several HCV types. Copyright © 2016 European Society of Clinical Microbiology and Infectious Diseases. Published by Elsevier Ltd. All rights reserved.
Ahmed, Md Atique; Fong, Mun Yik; Lau, Yee Ling; Yusof, Ruhani
2016-04-26
The zoonotic malaria parasite Plasmodium knowlesi has become an emerging threat to South East Asian countries particular in Malaysia. A recent study from Sarawak (Malaysian Borneo) discovered two distinct normocyte binding protein xa (Pknbpxa) types of P. knowlesi. In the present study, the Pknbpxa of clinical isolates from Peninsular Malaysia and Sabah (Malaysian Borneo) were investigated for the presence of Pknbpxa types and natural selection force acting on the gene. Blood samples were collected from 47 clinical samples from Peninsular Malaysia (n = 35) and Sabah (Malaysian Borneo, n = 12) were used in the study. The Pknbpxa gene was successfully amplified and directly sequenced from 38 of the samples (n = 31, Peninsular Malaysia and n = 7, Sabah, Malaysian Borneo). The Pknbpxa sequences of P. knowlesi isolates from Sarawak (Malaysian Borneo) were retrieved from GenBank and included in the analysis. Polymorphism, genetic diversity and natural selection of Pknbpxa sequences were analysed using DNAsp v 5.10, MEGA5. Phylogentics of Pknbpxa sequences was analysed using MrBayes v3.2 and Splits Tree v4.13.1. The pairwise F ST indices were used to determine the genetic differentiation between the Pknbpxa types and was calculated using Arlequin 3.5.1.3. Analyses of the sequences revealed Pknbpxa dimorphism throughout Malaysia indicating co-existence of the two types (Type-1 and Type-2) of Pknbpxa. More importantly, a third type (Type 3) closely related to Type 2 Pknbpxa was also detected. This third type was found only in the isolates originating from Peninsular Malaysia. Negative natural selection was observed, suggesting functional constrains within the Pknbpxa types. This study revealed the existence of three Pknbpxa types in Malaysia. Types 1 and 2 were found not only in Malaysian Borneo (Sarawak and Sabah) but also in Peninsular Malaysia. A third type which was specific only to samples originating from Peninsular Malaysia was discovered. Further genetic studies with a larger sample size will be necessary to determine whether natural selection is driving this genetic differentiation and geographical separation.
Mitra, Abhishek; Skrzypczak, Magdalena; Ginalski, Krzysztof; Rowicka, Maga
2015-01-01
Sequencing microRNA, reduced representation sequencing, Hi-C technology and any method requiring the use of in-house barcodes result in sequencing libraries with low initial sequence diversity. Sequencing such data on the Illumina platform typically produces low quality data due to the limitations of the Illumina cluster calling algorithm. Moreover, even in the case of diverse samples, these limitations are causing substantial inaccuracies in multiplexed sample assignment (sample bleeding). Such inaccuracies are unacceptable in clinical applications, and in some other fields (e.g. detection of rare variants). Here, we discuss how both problems with quality of low-diversity samples and sample bleeding are caused by incorrect detection of clusters on the flowcell during initial sequencing cycles. We propose simple software modifications (Long Template Protocol) that overcome this problem. We present experimental results showing that our Long Template Protocol remarkably increases data quality for low diversity samples, as compared with the standard analysis protocol; it also substantially reduces sample bleeding for all samples. For comprehensiveness, we also discuss and compare experimental results from alternative approaches to sequencing low diversity samples. First, we discuss how the low diversity problem, if caused by barcodes, can be avoided altogether at the barcode design stage. Second and third, we present modified guidelines, which are more stringent than the manufacturer’s, for mixing low diversity samples with diverse samples and lowering cluster density, which in our experience consistently produces high quality data from low diversity samples. Fourth and fifth, we present rescue strategies that can be applied when sequencing results in low quality data and when there is no more biological material available. In such cases, we propose that the flowcell be re-hybridized and sequenced again using our Long Template Protocol. Alternatively, we discuss how analysis can be repeated from saved sequencing images using the Long Template Protocol to increase accuracy. PMID:25860802
USDA-ARS?s Scientific Manuscript database
Current technologies for next generation sequencing (NGS) have revolutionized metagenomics analysis of clinical samples. One advantage of the NGS platform is the possibility to sequence the genetic material in samples without any prior knowledge of the sequence contained within. Sequence-Independent...
Toward high-resolution NMR spectroscopy of microscopic liquid samples
DOE Office of Scientific and Technical Information (OSTI.GOV)
Butler, Mark C.; Mehta, Hardeep S.; Chen, Ying
A longstanding limitation of high-resolution NMR spectroscopy is the requirement for samples to have macroscopic dimensions. Commercial probes, for example, are designed for volumes of at least 5 mL, in spite of decades of work directed toward the goal of miniaturization. Progress in miniaturizing inductive detectors has been limited by a perceived need to meet two technical requirements: (1) minimal separation between the sample and the detector, which is essential for sensitivity, and (2) near-perfect magnetic-field homogeneity at the sample, which is typically needed for spectral resolution. The first of these requirements is real, but the second can be relaxed,more » as we demonstrate here. By using pulse sequences that yield high-resolution spectra in an inhomogeneous field, we eliminate the need for near-perfect field homogeneity and the accompanying requirement for susceptibility matching of microfabricated detector components. With this requirement removed, typical imperfections in microfabricated components can be tolerated, and detector dimensions can be matched to those of the sample, even for samples of volume << 5 uL. Pulse sequences that are robust to field inhomogeneity thus enable small-volume detection with optimal sensitivity. We illustrate the potential of this approach to miniaturization by presenting spectra acquired with a flat-wire detector that can easily be scaled to subnanoliter volumes. In particular, we report high-resolution NMR spectroscopy of an alanine sample of volume 500 pL.« less
Meghdadi, Hossein; Khosravi, Azar D.; Ghadiri, Ata A.; Sina, Amir H.; Alami, Ameneh
2015-01-01
Present study was aimed to examine the diagnostic utility of polymerase chain reaction (PCR) and nested PCR techniques for the detection of Mycobacterium tuberculosis (MTB) DNA in samples from patients with extra pulmonary tuberculosis (EPTB). In total 80 formalin-fixed, paraffin-embedded (FFPE) samples comprising 70 samples with definite diagnosis of EPTB and 10 samples from known non- EPTB on the basis of histopathology examination, were included in the study. PCR amplification targeting IS6110, rpoB gene and nested PCR targeting the rpoB gene were performed on the extracted DNAs from 80 FFPE samples. The strong positive samples were directly sequenced. For negative samples and those with weak band in nested-rpoB PCR, TA cloning was performed by cloning the products into the plasmid vector with subsequent sequencing. The 95% confidence intervals (CI) for the estimates of sensitivity and specificity were calculated for each method. Fourteen (20%), 34 (48.6%), and 60 (85.7%) of the 70 positive samples confirmed by histopathology, were positive by rpoB-PCR, IS6110-PCR, and nested-rpoB PCR, respectively. By performing TA cloning on samples that yielded weak (n = 8) or negative results (n = 10) in the PCR methods, we were able to improve their quality for later sequencing. All samples with weak band and 7 out of 10 negative samples, showed strong positive results after cloning. So nested-rpoB PCR cloning revealed positivity in 67 out of 70 confirmed samples (95.7%). The sensitivity of these combination methods was calculated as 95.7% in comparison with histopathology examination. The CI for sensitivity of the PCR methods were calculated as 11.39–31.27% for rpoB-PCR, 36.44–60.83% for IS6110- PCR, 75.29–92.93% for nested-rpoB PCR, and 87.98–99.11% for nested-rpoB PCR cloning. The 10 true EPTB negative samples by histopathology, were negative by all tested methods including cloning and were used to calculate the specificity of the applied methods. The CI for 100% specificity of each PCR method were calculated as 69.15–100%. Our results indicated that nested-rpoB PCR combined with TA cloning and sequencing is a preferred method for the detection of MTB DNA in EPTB samples with high sensitivity and specificity which confirm the histopathology results. PMID:26191059
Meghdadi, Hossein; Khosravi, Azar D; Ghadiri, Ata A; Sina, Amir H; Alami, Ameneh
2015-01-01
Present study was aimed to examine the diagnostic utility of polymerase chain reaction (PCR) and nested PCR techniques for the detection of Mycobacterium tuberculosis (MTB) DNA in samples from patients with extra pulmonary tuberculosis (EPTB). In total 80 formalin-fixed, paraffin-embedded (FFPE) samples comprising 70 samples with definite diagnosis of EPTB and 10 samples from known non- EPTB on the basis of histopathology examination, were included in the study. PCR amplification targeting IS6110, rpoB gene and nested PCR targeting the rpoB gene were performed on the extracted DNAs from 80 FFPE samples. The strong positive samples were directly sequenced. For negative samples and those with weak band in nested-rpoB PCR, TA cloning was performed by cloning the products into the plasmid vector with subsequent sequencing. The 95% confidence intervals (CI) for the estimates of sensitivity and specificity were calculated for each method. Fourteen (20%), 34 (48.6%), and 60 (85.7%) of the 70 positive samples confirmed by histopathology, were positive by rpoB-PCR, IS6110-PCR, and nested-rpoB PCR, respectively. By performing TA cloning on samples that yielded weak (n = 8) or negative results (n = 10) in the PCR methods, we were able to improve their quality for later sequencing. All samples with weak band and 7 out of 10 negative samples, showed strong positive results after cloning. So nested-rpoB PCR cloning revealed positivity in 67 out of 70 confirmed samples (95.7%). The sensitivity of these combination methods was calculated as 95.7% in comparison with histopathology examination. The CI for sensitivity of the PCR methods were calculated as 11.39-31.27% for rpoB-PCR, 36.44-60.83% for IS6110- PCR, 75.29-92.93% for nested-rpoB PCR, and 87.98-99.11% for nested-rpoB PCR cloning. The 10 true EPTB negative samples by histopathology, were negative by all tested methods including cloning and were used to calculate the specificity of the applied methods. The CI for 100% specificity of each PCR method were calculated as 69.15-100%. Our results indicated that nested-rpoB PCR combined with TA cloning and sequencing is a preferred method for the detection of MTB DNA in EPTB samples with high sensitivity and specificity which confirm the histopathology results.
NASA Astrophysics Data System (ADS)
Schnepp, Elisabeth; Pucher, Rudolf; Goedicke, Christian; Manzano, Ana; Müller, Uwe; Lanos, Philippe
2003-02-01
A record of about 450 years of geomagnetic secular variation is presented from a single archaeological site in Lübeck (Germany) where a sequence of 25 bread oven floors has been preserved in a bakery from medieval times until today. The age dating of the oven-floor sequence is based on historical documents, 14C-dating and thermoluminescence dating. It confines the time interval from about 1300 to 1800 A.D. Paleomagnetic directions have been determined from each oven floor by means of 198 oriented hand samples. After alternating field as well as thermal demagnetization experiments, the characteristic remanent magnetization direction was obtained using principal component analysis. The mean directions of 24 oven floors are characterized by high Fisherian precision parameters (>146) and small α95 confidence limits (1.2°-4.6°). For obtaining a smooth curve of geomagnetic secular variation for Lübeck, a spherical spline function was fitted to the data using a Bayesian approach, which considers not only the obtained ages, but also stratigraphic order. Correlation with historical magnetic records suggests that the age estimation for the upper 10 layers was too young and must date from the end of the sixteenth to the mid of the eighteenth century. For the lowermost 14 layers, dating is reliable and provides a secular variation curve for Germany. The inclination shows a minimum in the fourteenth century and then increases by more than 10°. Declination shows a local minimum around 1400 A.D. followed by a maximum in the seventeenth century. This is followed by the movement of declination about 30° to western directions.
DOE Office of Scientific and Technical Information (OSTI.GOV)
Ton, H.; Yeung, E.S.
1997-02-15
An integrated on-line prototype for coupling a microreactor to capillary electrophoresis for DNA sequencing has been demonstrated. A dye-labeled terminator cycle-sequencing reaction is performed in a fused-silica capillary. Subsequently, the sequencing ladder is directly injected into a size-exclusion chromatographic column operated at nearly 95{degree}C for purification. On-line injection to a capillary for electrophoresis is accomplished at a junction set at nearly 70{degree}C. High temperature at the purification column and injection junction prevents the renaturation of DNA fragments during on-line transfer without affecting the separation. The high solubility of DNA in and the relatively low ionic strength of 1 x TEmore » buffer permit both effective purification and electrokinetic injection of the DNA sample. The system is compatible with highly efficient separations by a replaceable poly(ethylene oxide) polymer solution in uncoated capillary tubes. Future automation and adaptation to a multiple-capillary array system should allow high-speed, high-throughput DNA sequencing from templates to called bases in one step. 32 refs., 5 figs.« less
Qiu, Ping; Pang, Ling; Arreaza, Gladys; Maguire, Maureen; Chang, Ken C. N.; Marton, Matthew J.; Levitan, Diane
2016-01-01
Immune checkpoint inhibitors, which unleash a patient’s own T cells to kill tumors, are revolutionizing cancer treatment. Several independent studies suggest that higher non-synonymous mutational burden assessed by whole exome sequencing (WES) in tumors is associated with improved objective response, durable clinical benefit, and progression-free survival in immune checkpoint inhibitors treatment. Next-generation sequencing (NGS) is a promising technology being used in the clinic to direct patient treatment. Cancer genome WES poses a unique challenge due to tumor heterogeneity and sequencing artifacts introduced by formalin-fixed, paraffin-embedded (FFPE) tissue. In order to evaluate the data interoperability of WES data from different sources to survey tumor mutational landscape, we compared WES data of several tumor/normal matched samples from five commercial vendors. A large data discrepancy was observed from vendors’ self-reported data. Independent data analysis from vendors’ raw NGS data shows that whole exome sequencing data from qualified vendors can be combined and analyzed uniformly to derive comparable quantitative estimates of tumor mutational burden. PMID:27136543
The use of museum specimens with high-throughput DNA sequencers
Burrell, Andrew S.; Disotell, Todd R.; Bergey, Christina M.
2015-01-01
Natural history collections have long been used by morphologists, anatomists, and taxonomists to probe the evolutionary process and describe biological diversity. These biological archives also offer great opportunities for genetic research in taxonomy, conservation, systematics, and population biology. They allow assays of past populations, including those of extinct species, giving context to present patterns of genetic variation and direct measures of evolutionary processes. Despite this potential, museum specimens are difficult to work with because natural postmortem processes and preservation methods fragment and damage DNA. These problems have restricted geneticists’ ability to use natural history collections primarily by limiting how much of the genome can be surveyed. Recent advances in DNA sequencing technology, however, have radically changed this, making truly genomic studies from museum specimens possible. We review the opportunities and drawbacks of the use of museum specimens, and suggest how to best execute projects when incorporating such samples. Several high-throughput (HT) sequencing methodologies, including whole genome shotgun sequencing, sequence capture, and restriction digests (demonstrated here), can be used with archived biomaterials. PMID:25532801
SDSS-IV MaNGA: Spatially Resolved Star Formation Main Sequence and LI(N)ER Sequence
NASA Astrophysics Data System (ADS)
Hsieh, B. C.; Lin, Lihwai; Lin, J. H.; Pan, H. A.; Hsu, C. H.; Sánchez, S. F.; Cano-Díaz, M.; Zhang, K.; Yan, R.; Barrera-Ballesteros, J. K.; Boquien, M.; Riffel, R.; Brownstein, J.; Cruz-González, I.; Hagen, A.; Ibarra, H.; Pan, K.; Bizyaev, D.; Oravetz, D.; Simmons, A.
2017-12-01
We present our study on the spatially resolved Hα and M * relation for 536 star-forming and 424 quiescent galaxies taken from the MaNGA survey. We show that the star formation rate surface density ({{{Σ }}}{SFR}), derived based on the Hα emissions, is strongly correlated with the M * surface density ({{{Σ }}}* ) on kiloparsec scales for star-forming galaxies and can be directly connected to the global star-forming sequence. This suggests that the global main sequence may be a consequence of a more fundamental relation on small scales. On the other hand, our result suggests that ∼20% of quiescent galaxies in our sample still have star formation activities in the outer region with lower specific star formation rate (SSFR) than typical star-forming galaxies. Meanwhile, we also find a tight correlation between {{{Σ }}}{{H}α } and {{{Σ }}}* for LI(N)ER regions, named the resolved “LI(N)ER” sequence, in quiescent galaxies, which is consistent with the scenario that LI(N)ER emissions are primarily powered by the hot, evolved stars as suggested in the literature.
SAMSA2: a standalone metatranscriptome analysis pipeline.
Westreich, Samuel T; Treiber, Michelle L; Mills, David A; Korf, Ian; Lemay, Danielle G
2018-05-21
Complex microbial communities are an area of growing interest in biology. Metatranscriptomics allows researchers to quantify microbial gene expression in an environmental sample via high-throughput sequencing. Metatranscriptomic experiments are computationally intensive because the experiments generate a large volume of sequence data and each sequence must be compared with reference sequences from thousands of organisms. SAMSA2 is an upgrade to the original Simple Annotation of Metatranscriptomes by Sequence Analysis (SAMSA) pipeline that has been redesigned for standalone use on a supercomputing cluster. SAMSA2 is faster due to the use of the DIAMOND aligner, and more flexible and reproducible because it uses local databases. SAMSA2 is available with detailed documentation, and example input and output files along with examples of master scripts for full pipeline execution. SAMSA2 is a rapid and efficient metatranscriptome pipeline for analyzing large RNA-seq datasets in a supercomputing cluster environment. SAMSA2 provides simplified output that can be examined directly or used for further analyses, and its reference databases may be upgraded, altered or customized to fit the needs of any experiment.
Jayakumar, Amal; Chang, Bonnie X; Widner, Brittany; Bernhardt, Peter; Mulholland, Margaret R; Ward, Bess B
2017-10-01
Biological nitrogen fixation (BNF) was investigated above and within the oxygen-depleted waters of the oxygen-minimum zone of the Eastern Tropical North Pacific Ocean. BNF rates were estimated using an isotope tracer method that overcame the uncertainty of the conventional bubble method by directly measuring the tracer enrichment during the incubations. Highest rates of BNF (~4 nM day -1 ) occurred in coastal surface waters and lowest detectable rates (~0.2 nM day -1 ) were found in the anoxic region of offshore stations. BNF was not detectable in most samples from oxygen-depleted waters. The composition of the N 2 -fixing assemblage was investigated by sequencing of nifH genes. The diazotrophic assemblage in surface waters contained mainly Proteobacterial sequences (Cluster I nifH), while both Proteobacterial sequences and sequences with high identities to those of anaerobic microbes characterized as Clusters III and IV type nifH sequences were found in the anoxic waters. Our results indicate modest input of N through BNF in oxygen-depleted zones mainly due to the activity of proteobacterial diazotrophs.
Baele, Guy; Lemey, Philippe; Vansteelandt, Stijn
2013-03-06
Accurate model comparison requires extensive computation times, especially for parameter-rich models of sequence evolution. In the Bayesian framework, model selection is typically performed through the evaluation of a Bayes factor, the ratio of two marginal likelihoods (one for each model). Recently introduced techniques to estimate (log) marginal likelihoods, such as path sampling and stepping-stone sampling, offer increased accuracy over the traditional harmonic mean estimator at an increased computational cost. Most often, each model's marginal likelihood will be estimated individually, which leads the resulting Bayes factor to suffer from errors associated with each of these independent estimation processes. We here assess the original 'model-switch' path sampling approach for direct Bayes factor estimation in phylogenetics, as well as an extension that uses more samples, to construct a direct path between two competing models, thereby eliminating the need to calculate each model's marginal likelihood independently. Further, we provide a competing Bayes factor estimator using an adaptation of the recently introduced stepping-stone sampling algorithm and set out to determine appropriate settings for accurately calculating such Bayes factors, with context-dependent evolutionary models as an example. While we show that modest efforts are required to roughly identify the increase in model fit, only drastically increased computation times ensure the accuracy needed to detect more subtle details of the evolutionary process. We show that our adaptation of stepping-stone sampling for direct Bayes factor calculation outperforms the original path sampling approach as well as an extension that exploits more samples. Our proposed approach for Bayes factor estimation also has preferable statistical properties over the use of individual marginal likelihood estimates for both models under comparison. Assuming a sigmoid function to determine the path between two competing models, we provide evidence that a single well-chosen sigmoid shape value requires less computational efforts in order to approximate the true value of the (log) Bayes factor compared to the original approach. We show that the (log) Bayes factors calculated using path sampling and stepping-stone sampling differ drastically from those estimated using either of the harmonic mean estimators, supporting earlier claims that the latter systematically overestimate the performance of high-dimensional models, which we show can lead to erroneous conclusions. Based on our results, we argue that highly accurate estimation of differences in model fit for high-dimensional models requires much more computational effort than suggested in recent studies on marginal likelihood estimation.
2013-01-01
Background Accurate model comparison requires extensive computation times, especially for parameter-rich models of sequence evolution. In the Bayesian framework, model selection is typically performed through the evaluation of a Bayes factor, the ratio of two marginal likelihoods (one for each model). Recently introduced techniques to estimate (log) marginal likelihoods, such as path sampling and stepping-stone sampling, offer increased accuracy over the traditional harmonic mean estimator at an increased computational cost. Most often, each model’s marginal likelihood will be estimated individually, which leads the resulting Bayes factor to suffer from errors associated with each of these independent estimation processes. Results We here assess the original ‘model-switch’ path sampling approach for direct Bayes factor estimation in phylogenetics, as well as an extension that uses more samples, to construct a direct path between two competing models, thereby eliminating the need to calculate each model’s marginal likelihood independently. Further, we provide a competing Bayes factor estimator using an adaptation of the recently introduced stepping-stone sampling algorithm and set out to determine appropriate settings for accurately calculating such Bayes factors, with context-dependent evolutionary models as an example. While we show that modest efforts are required to roughly identify the increase in model fit, only drastically increased computation times ensure the accuracy needed to detect more subtle details of the evolutionary process. Conclusions We show that our adaptation of stepping-stone sampling for direct Bayes factor calculation outperforms the original path sampling approach as well as an extension that exploits more samples. Our proposed approach for Bayes factor estimation also has preferable statistical properties over the use of individual marginal likelihood estimates for both models under comparison. Assuming a sigmoid function to determine the path between two competing models, we provide evidence that a single well-chosen sigmoid shape value requires less computational efforts in order to approximate the true value of the (log) Bayes factor compared to the original approach. We show that the (log) Bayes factors calculated using path sampling and stepping-stone sampling differ drastically from those estimated using either of the harmonic mean estimators, supporting earlier claims that the latter systematically overestimate the performance of high-dimensional models, which we show can lead to erroneous conclusions. Based on our results, we argue that highly accurate estimation of differences in model fit for high-dimensional models requires much more computational effort than suggested in recent studies on marginal likelihood estimation. PMID:23497171
The past, present and future of mitochondrial genomics: have we sequenced enough mtDNAs?
Smith, David Roy
2016-01-01
The year 2014 saw more than a thousand new mitochondrial genome sequences deposited in GenBank-an almost 15% increase from the previous year. Hundreds of peer-reviewed articles accompanied these genomes, making mitochondrial DNAs (mtDNAs) the most sequenced and reported type of eukaryotic chromosome. These mtDNA data have advanced a wide range of scientific fields, from forensics to anthropology to medicine to molecular evolution. But for many biological lineages, mtDNAs are so well sampled that newly published genomes are arguably no longer contributing significantly to the progression of science, and in some cases they are tying up valuable resources, particularly journal editors and referees. Is it time to acknowledge that as a research community we have published enough mitochondrial genome papers? Here, I address this question, exploring the history, milestones and impacts of mitochondrial genomics, the benefits and drawbacks of continuing to publish mtDNAs at a high rate and what the future may hold for such an important and popular genetic marker. I highlight groups for which mtDNAs are still poorly sampled, thus meriting further investigation, and recommend that more energy be spent characterizing aspects of mitochondrial genomes apart from the DNA sequence, such as their chromosomal and transcriptional architectures. Ultimately, one should be mindful before writing a mitochondrial genome paper. Consider perhaps sending the sequence directly to GenBank instead, and be sure to annotate it correctly before submission. © The Author 2015. Published by Oxford University Press.
Validating a Coarse-Grained Potential Energy Function through Protein Loop Modelling
MacDonald, James T.; Kelley, Lawrence A.; Freemont, Paul S.
2013-01-01
Coarse-grained (CG) methods for sampling protein conformational space have the potential to increase computational efficiency by reducing the degrees of freedom. The gain in computational efficiency of CG methods often comes at the expense of non-protein like local conformational features. This could cause problems when transitioning to full atom models in a hierarchical framework. Here, a CG potential energy function was validated by applying it to the problem of loop prediction. A novel method to sample the conformational space of backbone atoms was benchmarked using a standard test set consisting of 351 distinct loops. This method used a sequence-independent CG potential energy function representing the protein using -carbon positions only and sampling conformations with a Monte Carlo simulated annealing based protocol. Backbone atoms were added using a method previously described and then gradient minimised in the Rosetta force field. Despite the CG potential energy function being sequence-independent, the method performed similarly to methods that explicitly use either fragments of known protein backbones with similar sequences or residue-specific /-maps to restrict the search space. The method was also able to predict with sub-Angstrom accuracy two out of seven loops from recently solved crystal structures of proteins with low sequence and structure similarity to previously deposited structures in the PDB. The ability to sample realistic loop conformations directly from a potential energy function enables the incorporation of additional geometric restraints and the use of more advanced sampling methods in a way that is not possible to do easily with fragment replacement methods and also enable multi-scale simulations for protein design and protein structure prediction. These restraints could be derived from experimental data or could be design restraints in the case of computational protein design. C++ source code is available for download from http://www.sbg.bio.ic.ac.uk/phyre2/PD2/. PMID:23824634
DOE Office of Scientific and Technical Information (OSTI.GOV)
Gardner, Shea N.; McLoughlin, Kevin; Be, Nicholas A.
Venezuelan equine encephalitis virus (VEEV) is a mosquito-borne alphavirus that has caused large outbreaks of severe illness in both horses and humans. New approaches are needed to rapidly infer the origin of a newly discovered VEEV strain, estimate its equine amplification and resultant epidemic potential, and predict human virulence phenotype. We performed whole genome single nucleotide polymorphism (SNP) analysis of all available VEE antigenic complex genomes, verified that a SNP-based phylogeny accurately captured the features of a phylogenetic tree based on multiple sequence alignment, and developed a high resolution genome-wide SNP microarray. We used the microarray to analyze a broadmore » panel of VEEV isolates, found excellent concordance between array- and sequence-based SNP calls, genotyped unsequenced isolates, and placed them on a phylogeny with sequenced genomes. The microarray successfully genotyped VEEV directly from tissue samples of an infected mouse, bypassing the need for viral isolation, culture and genomic sequencing. Lastly, we identified genomic variants associated with serotypes and host species, revealing a complex relationship between genotype and phenotype.« less
Improved PCR primers for the detection and identification of arbuscular mycorrhizal fungi.
Lee, Jaikoo; Lee, Sangsun; Young, J Peter W
2008-08-01
A set of PCR primers that should amplify all subgroups of arbuscular mycorrhizal fungi (AMF, Glomeromycota), but exclude sequences from other organisms, was designed to facilitate rapid detection and identification directly from field-grown plant roots. The small subunit rRNA gene was targeted for the new primers (AML1 and AML2) because phylogenetic relationships among the Glomeromycota are well understood for this gene. Sequence comparisons indicate that the new primers should amplify all published AMF sequences except those from Archaeospora trappei. The specificity of the new primers was tested using 23 different AMF spore morphotypes from trap cultures and Miscanthus sinensis, Glycine max and Panax ginseng roots sampled from the field. Non-AMF DNA of 14 plants, 14 Basidiomycota and 18 Ascomycota was also tested as negative controls. Sequences amplified from roots using the new primers were compared with those obtained using the established NS31 and AM1 primer combination. The new primers have much better specificity and coverage of all known AMF groups.
Bulgari, Daniela; Casati, Paola; Brusetti, Lorenzo; Quaglino, Fabio; Brasca, Milena; Daffonchio, Daniele; Bianco, Piero Attilio
2009-08-01
Diversity of bacterial endophytes associated with grapevine leaf tissues was analyzed by cultivation and cultivation-independent methods. In order to identify bacterial endophytes directly from metagenome, a protocol for bacteria enrichment and DNA extraction was optimized. Sequence analysis of 16S rRNA gene libraries underscored five diverse Operational Taxonomic Units (OTUs), showing best sequence matches with gamma-Proteobacteria, family Enterobacteriaceae, with a dominance of the genus Pantoea. Bacteria isolation through cultivation revealed the presence of six OTUs, showing best sequence matches with Actinobacteria, genus Curtobacterium, and with Firmicutes genera Bacillus and Enterococcus. Length Heterogeneity-PCR (LH-PCR) electrophoretic peaks from single bacterial clones were used to setup a database representing the bacterial endophytes identified in association with grapevine tissues. Analysis of healthy and phytoplasma-infected grapevine plants showed that LH-PCR could be a useful complementary tool for examining the diversity of bacterial endophytes especially for diversity survey on a large number of samples.
TP53, PIK3CA, FBXW7 and KRAS Mutations in Esophageal Cancer Identified by Targeted Sequencing.
Zheng, Huili; Wang, Yan; Tang, Chuanning; Jones, Lindsey; Ye, Hua; Zhang, Guangchun; Cao, Weihai; Li, Jingwen; Liu, Lifeng; Liu, Zhencong; Zhang, Chao; Lou, Feng; Liu, Zhiyuan; Li, Yangyang; Shi, Zhenfen; Zhang, Jingbo; Zhang, Dandan; Sun, Hong; Dong, Haichao; Dong, Zhishou; Guo, Baishuai; Yan, H E; Lu, Qingyu; Huang, Xue; Chen, Si-Yi
2016-01-01
Esophageal cancer (EC) is a common malignancy with significant morbidity and mortality. As individual cancers exhibit unique mutation patterns, identifying and characterizing gene mutations in EC that may serve as biomarkers might help predict patient outcome and guide treatment. Traditionally, personalized cancer DNA sequencing was impractical and expensive. Recent technological advancements have made targeted DNA sequencing more cost- and time-effective with reliable results. This technology may be useful for clinicians to direct patient treatment. The Ion PGM and AmpliSeq Cancer Panel was used to identify mutations at 737 hotspot loci of 45 cancer-related genes in 64 EC samples from Chinese patients. Frequent mutations were found in TP53 and less frequent mutations in PIK3CA, FBXW7 and KRAS. These results demonstrate that targeted sequencing can reliably identify mutations in individual tumors that make this technology a possibility for clinical use. Copyright© 2016, International Institute of Anticancer Research (Dr. John G. Delinasios), All rights reserved.
Beyhan, Yunus E; Karakus, Mehmet; Karagoz, Alper; Mungan, Mesut; Ozkan, Aysegul T; Hokelek, Murat
2017-09-01
To characterize the cutaneous leishmaniasis (CL) isolates of Syrian and Central Anatolia patients at species levels. Methods: Skin scrapings of 3 patients (2 Syrian, 1 Turkish) were taken and examined by direct examination, culture in Novy-MacNeal-Nicole (NNN) medium, internal transcribed spacer polymerase chain reaction and sequence analysis (PCR). Results:According to microscopic examination, culture and PCR methods, 3 samples were detected positive. The sequencing results of all isolates in the study were identified as Leishmania tropica. The same genotypes were detected in the 3 isolates and nucleotide sequence submitted into GenBank with the accession number: KP689599. Conclusion: This finding could give information about the transmission of CL between Turkey and Syria. Because of the Syrian civil war, most of the Syrian citizens circulating in Turkey and different part of Europe, this can be increase the risk of spreading the disease. So, prevention measurements must be taken urgently.
Camunas-Soler, Joan; Kertesz, Michael; De Vlaminck, Iwijn; Koh, Winston; Pan, Wenying; Martin, Lance; Neff, Norma F.; Okamoto, Jennifer; Wong, Ronald J.; Kharbanda, Sandhya; El-Sayed, Yasser; Blumenfeld, Yair; Stevenson, David K.; Shaw, Gary M.; Wolfe, Nathan D.; Quake, Stephen R.
2017-01-01
Blood circulates throughout the human body and contains molecules drawn from virtually every tissue, including the microbes and viruses which colonize the body. Through massive shotgun sequencing of circulating cell-free DNA from the blood, we identified hundreds of new bacteria and viruses which represent previously unidentified members of the human microbiome. Analyzing cumulative sequence data from 1,351 blood samples collected from 188 patients enabled us to assemble 7,190 contiguous regions (contigs) larger than 1 kbp, of which 3,761 are novel with little or no sequence homology in any existing databases. The vast majority of these novel contigs possess coding sequences, and we have validated their existence both by finding their presence in independent experiments and by performing direct PCR amplification. When their nearest neighbors are located in the tree of life, many of the organisms represent entirely novel taxa, showing that microbial diversity within the human body is substantially broader than previously appreciated. PMID:28830999
Statistical analysis of archeomagnetic samples of Teotihuacan, Mexico
NASA Astrophysics Data System (ADS)
Soler-Arechalde, A. M.
2012-12-01
Teotihuacan was the one of the most important metropolis of Mesoamerica during the Classic Period (1 to 600 AC). The city had a continuous growth in different stages that usually concluded with a ritual. Fire was an important element natives would burn entire structures. An example of this is the Quetzalcoatl pyramid in La Ciudadela (350 AC), it was burned and a new structure was built over it, also the Big Fire at 570 AC, that marks its end. These events are suitable to archaeomagnetic dating. The inclusion of ash in the stucco enhances the magnetic signal of detrital type that also allows us to make dating. This increases the number of samples to be processed as well as the number of dates. The samples have been analyzed according to their type: floor, wall, talud and painting and whether or not exposed to fire. Sequences of directions obtained in excavations in strict stratigraphic control will be shown. A sequence of images was used to analyze the improving of Teotihuacan secular variation curve through more than a decade of continuous work at the area.
Fusion genes with ALK as recurrent partner in ependymoma-like gliomas: a new brain tumor entity?
Olsen, Thale Kristin; Panagopoulos, Ioannis; Meling, Torstein R.; Micci, Francesca; Gorunova, Ludmila; Thorsen, Jim; Due-Tønnessen, Bernt; Scheie, David; Lund-Iversen, Marius; Krossnes, Bård; Saxhaug, Cathrine; Heim, Sverre; Brandal, Petter
2015-01-01
Background We have previously characterized 19 ependymal tumors using Giemsa banding and high-resolution comparative genomic hybridization. The aim of this study was to analyze these tumors searching for fusion genes. Methods RNA sequencing was performed in 12 samples. Potential fusion transcripts were assessed by seed count and structural chromosomal aberrations. Transcripts of interest were validated using fluorescence in situ hybridization and PCR followed by direct sequencing. Results RNA sequencing identified rearrangements of the anaplastic lymphoma kinase gene (ALK) in 2 samples. Both tumors harbored structural aberrations involving the ALK locus 2p23. Tumor 1 had an unbalanced t(2;14)(p23;q22) translocation which led to the fusion gene KTN1-ALK. Tumor 2 had an interstitial del(2)(p16p23) deletion causing the fusion of CCDC88A and ALK. In both samples, the breakpoint of ALK was located between exons 19 and 20. Both patients were infants and both tumors were supratentorial. The tumors were well demarcated from surrounding tissue and had both ependymal and astrocytic features but were diagnosed and treated as ependymomas. Conclusions By combining karyotyping and RNA sequencing, we identified the 2 first ever reported ALK rearrangements in CNS tumors. Such rearrangements may represent the hallmark of a new entity of pediatric glioma characterized by both ependymal and astrocytic features. Our findings are of particular importance because crizotinib, a selective ALK inhibitor, has demonstrated effect in patients with lung cancer harboring ALK rearrangements. Thus, ALK emerges as an interesting therapeutic target in patients with ependymal tumors carrying ALK fusions. PMID:25795305
Catonella morbi and Granulicatella adiacens: new species in endodontic infections.
Siqueira, José F; Rôças, Isabela N
2006-08-01
This study intended to examine samples from primary endodontic infections for the presence of Catonella morbi and Granulicatella adiacens, 2 species that have been recently suggested to be involved with infections in other oral sites. Genomic DNA was isolated directly from samples taken from teeth with different forms of apical periodontitis, and a devised culture-independent 16S rRNA gene-based heminested PCR assay was used to determine the prevalence of these 2 target species. Species-specific primers were developed by aligning closely related 16S rRNA gene sequences. Species-specificity for each primer pair was confirmed by running PCR against a panel of oral bacteria and by sequencing of DNA from representative positive samples. C morbi and G adiacens were detected in 33% and 19%, respectively, of the root canals associated with chronic apical periodontitis; 30% and 10%, respectively, of the cases diagnosed as acute apical periodontitis, and 16% and 11%, respectively, of the pus samples taken from acute apical abscesses. Overall, C morbi occurred in 26% and G adiacens in 14% of the samples taken from primary endodontic infections. Our findings demonstrate that C morbi and G adiacens can take part in the microbiota associated with primary endodontic infections, and their specific role in the disease process warrants further elucidation.
Use of whole genome sequencing in surveillance of drug resistant tuberculosis.
McNerney, Ruth; Zignol, Matteo; Clark, Taane G
2018-05-01
The threat of resistance to anti-tuberculosis drugs is of global concern. Current efforts to monitor resistance rely on phenotypic testing where cultured bacteria are exposed to critical concentrations of the drugs. Capacity for such testing is low in TB endemic countries. Drug resistance is caused by mutations in the Mycobacterium tuberculosis genome and whole genome sequencing to detect these mutations offers an alternative means of assessing resistance. Areas covered: The challenges of assessing TB drug resistance are discussed. Progress in elucidating the M. tuberculosis resistome and evidence of the accuracy of next generation sequencing for detecting resistance is reviewed. Expert Commentary: There are considerable advantages to using next generation sequencing for TB drug resistance surveillance. Accuracy is high for detecting resistance to the major first-line drugs but is currently lower for the second-line drugs due to our incomplete knowledge regarding resistance causing mutations. With the advances in sequencing technology and the opportunity to replace phenotypic drug susceptibility testing with safer and more cost effective methods it would appear that the question is when to implement. Current bottlenecks are sample extraction to allow whole genome sequencing directly from sputum and the lack of bioinformatics expertise in some TB endemic countries.
Detection and characterization of Pasteuria 16S rRNA gene sequences from nematodes and soils.
Duan, Y P; Castro, H F; Hewlett, T E; White, J H; Ogram, A V
2003-01-01
Various bacterial species in the genus Pasteuria have great potential as biocontrol agents against plant-parasitic nematodes, although study of this important genus is hampered by the current inability to cultivate Pasteuria species outside their host. To aid in the study of this genus, an extensive 16S rRNA gene sequence phylogeny was constructed and this information was used to develop cultivation-independent methods for detection of Pasteuria in soils and nematodes. Thirty new clones of Pasteuria 16S rRNA genes were obtained directly from nematodes and soil samples. These were sequenced and used to construct an extensive phylogeny of this genus. These sequences were divided into two deeply branching clades within the low-G + C, Gram-positive division; some sequences appear to represent novel species within the genus Pasteuria. In addition, a surprising degree of 16S rRNA gene sequence diversity was observed within what had previously been designated a single strain of Pasteuria penetrans (P-20). PCR primers specific to Pasteuria 16S rRNA for detection of Pasteuria in soils were also designed and evaluated. Detection limits for soil DNA were 100-10,000 Pasteuria endospores (g soil)(-1).
A Sensitive Assay for Virus Discovery in Respiratory Clinical Samples
de Vries, Michel; Deijs, Martin; Canuti, Marta; van Schaik, Barbera D. C.; Faria, Nuno R.; van de Garde, Martijn D. B.; Jachimowski, Loes C. M.; Jebbink, Maarten F.; Jakobs, Marja; Luyf, Angela C. M.; Coenjaerts, Frank E. J.; Claas, Eric C. J.; Molenkamp, Richard; Koekkoek, Sylvie M.; Lammens, Christine; Leus, Frank; Goossens, Herman; Ieven, Margareta; Baas, Frank; van der Hoek, Lia
2011-01-01
In 5–40% of respiratory infections in children, the diagnostics remain negative, suggesting that the patients might be infected with a yet unknown pathogen. Virus discovery cDNA-AFLP (VIDISCA) is a virus discovery method based on recognition of restriction enzyme cleavage sites, ligation of adaptors and subsequent amplification by PCR. However, direct discovery of unknown pathogens in nasopharyngeal swabs is difficult due to the high concentration of ribosomal RNA (rRNA) that acts as competitor. In the current study we optimized VIDISCA by adjusting the reverse transcription enzymes and decreasing rRNA amplification in the reverse transcription, using hexamer oligonucleotides that do not anneal to rRNA. Residual cDNA synthesis on rRNA templates was further reduced with oligonucleotides that anneal to rRNA but can not be extended due to 3′-dideoxy-C6-modification. With these modifications >90% reduction of rRNA amplification was established. Further improvement of the VIDISCA sensitivity was obtained by high throughput sequencing (VIDISCA-454). Eighteen nasopharyngeal swabs were analysed, all containing known respiratory viruses. We could identify the proper virus in the majority of samples tested (11/18). The median load in the VIDISCA-454 positive samples was 7.2 E5 viral genome copies/ml (ranging from 1.4 E3–7.7 E6). Our results show that optimization of VIDISCA and subsequent high-throughput-sequencing enhances sensitivity drastically and provides the opportunity to perform virus discovery directly in patient material. PMID:21283679
Nanopore DNA Sequencing and Genome Assembly on the International Space Station.
Castro-Wallace, Sarah L; Chiu, Charles Y; John, Kristen K; Stahl, Sarah E; Rubins, Kathleen H; McIntyre, Alexa B R; Dworkin, Jason P; Lupisella, Mark L; Smith, David J; Botkin, Douglas J; Stephenson, Timothy A; Juul, Sissel; Turner, Daniel J; Izquierdo, Fernando; Federman, Scot; Stryke, Doug; Somasekar, Sneha; Alexander, Noah; Yu, Guixia; Mason, Christopher E; Burton, Aaron S
2017-12-21
We evaluated the performance of the MinION DNA sequencer in-flight on the International Space Station (ISS), and benchmarked its performance off-Earth against the MinION, Illumina MiSeq, and PacBio RS II sequencing platforms in terrestrial laboratories. Samples contained equimolar mixtures of genomic DNA from lambda bacteriophage, Escherichia coli (strain K12, MG1655) and Mus musculus (female BALB/c mouse). Nine sequencing runs were performed aboard the ISS over a 6-month period, yielding a total of 276,882 reads with no apparent decrease in performance over time. From sequence data collected aboard the ISS, we constructed directed assemblies of the ~4.6 Mb E. coli genome, ~48.5 kb lambda genome, and a representative M. musculus sequence (the ~16.3 kb mitochondrial genome), at 100%, 100%, and 96.7% consensus pairwise identity, respectively; de novo assembly of the E. coli genome from raw reads yielded a single contig comprising 99.9% of the genome at 98.6% consensus pairwise identity. Simulated real-time analyses of in-flight sequence data using an automated bioinformatic pipeline and laptop-based genomic assembly demonstrated the feasibility of sequencing analysis and microbial identification aboard the ISS. These findings illustrate the potential for sequencing applications including disease diagnosis, environmental monitoring, and elucidating the molecular basis for how organisms respond to spaceflight.
Zhang, Changsheng; Cai, Hongmin; Huang, Jingying; Song, Yan
2016-09-17
Variations in DNA copy number have an important contribution to the development of several diseases, including autism, schizophrenia and cancer. Single-cell sequencing technology allows the dissection of genomic heterogeneity at the single-cell level, thereby providing important evolutionary information about cancer cells. In contrast to traditional bulk sequencing, single-cell sequencing requires the amplification of the whole genome of a single cell to accumulate enough samples for sequencing. However, the amplification process inevitably introduces amplification bias, resulting in an over-dispersing portion of the sequencing data. Recent study has manifested that the over-dispersed portion of the single-cell sequencing data could be well modelled by negative binomial distributions. We developed a read-depth based method, nbCNV to detect the copy number variants (CNVs). The nbCNV method uses two constraints-sparsity and smoothness to fit the CNV patterns under the assumption that the read signals are negatively binomially distributed. The problem of CNV detection was formulated as a quadratic optimization problem, and was solved by an efficient numerical solution based on the classical alternating direction minimization method. Extensive experiments to compare nbCNV with existing benchmark models were conducted on both simulated data and empirical single-cell sequencing data. The results of those experiments demonstrate that nbCNV achieves superior performance and high robustness for the detection of CNVs in single-cell sequencing data.
The Viral Evolution Core within the AIDS and Cancer Virus Program will extract viral RNA/DNA from cell-free or cell-associated samples. Complementary (cDNA) will be generated as needed, and cDNA or DNA will be diluted to a single copy prior to nested
Italo Jr. Delalibera; Archana Vasanthakumar; Benjamin J. Burwitz; Patrick D. Schloss; Kier D. Klepzig; Jo Handelsman; Kenneth F. Raffa
2007-01-01
The gut bacterial community of a bark beetle, the pine engraver Ips pini (Say), was characterized using culture-dependent and culture-independent methods. Bacteria from individual guts of larvae, pupae and adults were cultured and DNA was extracted from samples of pooled larval guts. Analysis of 16S rRNA gene sequences amplified directly from the gut...
Erlanger, Bernard F.; Chen, Bi-Xing
1999-01-01
The subject invention provides an antibody which specifically binds to the product of a reaction between a labeling substance and a substrate. The subject invention also provides a method of making an immunogen used to produce the antibody of the subject invention. The invention further provides methods of using the subject antibody for detecting an antigen of interest in a sample, for example, detecting a protein comprising an amino acid sequence of interest and detecting a nucleic acid molecule comprising a nucleic acid sequence of interest, detecting a polypeptide such as those expressed by infectious agents, fungi or parasites.
Erlanger, B.F.; Chen, B.
1999-07-20
The subject invention provides an antibody which specifically binds to the product of a reaction between a labeling substance and a substrate. The subject invention also provides a method of making an immunogen used to produce the antibody of the subject invention. The invention further provides methods of using the subject antibody for detecting an antigen of interest in a sample, for example, detecting a protein comprising an amino acid sequence of interest and detecting a nucleic acid molecule comprising a nucleic acid sequence of interest, detecting a polypeptide such as those expressed by infectious agents, fungi or parasites. 25 figs.
Fraley, Stephanie I; Hardick, Justin; Masek, Billie J; Jo Masek, Billie; Athamanolap, Pornpat; Rothman, Richard E; Gaydos, Charlotte A; Carroll, Karen C; Wakefield, Teresa; Wang, Tza-Huei; Yang, Samuel
2013-10-01
Comprehensive profiling of nucleic acids in genetically heterogeneous samples is important for clinical and basic research applications. Universal digital high-resolution melt (U-dHRM) is a new approach to broad-based PCR diagnostics and profiling technologies that can overcome issues of poor sensitivity due to contaminating nucleic acids and poor specificity due to primer or probe hybridization inaccuracies for single nucleotide variations. The U-dHRM approach uses broad-based primers or ligated adapter sequences to universally amplify all nucleic acid molecules in a heterogeneous sample, which have been partitioned, as in digital PCR. Extensive assay optimization enables direct sequence identification by algorithm-based matching of melt curve shape and Tm to a database of known sequence-specific melt curves. We show that single-molecule detection and single nucleotide sensitivity is possible. The feasibility and utility of U-dHRM is demonstrated through detection of bacteria associated with polymicrobial blood infection and microRNAs (miRNAs) associated with host response to infection. U-dHRM using broad-based 16S rRNA gene primers demonstrates universal single cell detection of bacterial pathogens, even in the presence of larger amounts of contaminating bacteria; U-dHRM using universally adapted Lethal-7 miRNAs in a heterogeneous mixture showcases the single copy sensitivity and single nucleotide specificity of this approach.
Sequence analysis of DBL2β domain of vargene of Indonesian Plasmodium falciparum
NASA Astrophysics Data System (ADS)
Sulistyaningsih, E.; Romadhon, B. D.; Palupi, I.; Hidayah, F.; Dewi, R.; Prasetyo, A.
2018-03-01
Malaria is a major health problem in tropical countries including Indonesia. The most deadly agent is Plasmodium falciparum. In P. falciparum infection, PfEMP1 is supposed to play an important role in the pathogenesis of malaria. PfEMP1 is encoded by var gene family, it is a polymorphic protein where the extra-cellular portion contains of three distinct binding domains: Duffy binding-like (DBL), Cysteine-rich interdomain regions (CIDR) and C2. PfEMP1 varies in domain composition and binding specificity. The study explored the characteristic of Indonesian DBL2β-var genes and investigated its role to the malaria outcome. Twenty blood samples from clinically mild to severe malaria patients in Jember, East Java were collected for DNA extraction. Diagnosis was confirmed by Giemsa-stained thick blood smear. PCR was conducted using specific primer targeting on the full-length of DBL2ß and resulted approximately single band of 1,7 kb in a sample. This band was observed only from severe malaria sample. Sequence analysis directly from PCR product showed 74-99% similarities with previous sequences in Gene Bank. In conclusion, the DBL2β domain of vargene of Indonesian isolates was 1603 nucleotides in length and there was a possible association of the existence of DBL2β domain with the severity of malaria outcome.
Forster, Samuel C; Browne, Hilary P; Kumar, Nitin; Hunt, Martin; Denise, Hubert; Mitchell, Alex; Finn, Robert D; Lawley, Trevor D
2016-01-04
The Human Pan-Microbe Communities (HPMC) database (http://www.hpmcd.org/) provides a manually curated, searchable, metagenomic resource to facilitate investigation of human gastrointestinal microbiota. Over the past decade, the application of metagenome sequencing to elucidate the microbial composition and functional capacity present in the human microbiome has revolutionized many concepts in our basic biology. When sufficient high quality reference genomes are available, whole genome metagenomic sequencing can provide direct biological insights and high-resolution classification. The HPMC database provides species level, standardized phylogenetic classification of over 1800 human gastrointestinal metagenomic samples. This is achieved by combining a manually curated list of bacterial genomes from human faecal samples with over 21000 additional reference genomes representing bacteria, viruses, archaea and fungi with manually curated species classification and enhanced sample metadata annotation. A user-friendly, web-based interface provides the ability to search for (i) microbial groups associated with health or disease state, (ii) health or disease states and community structure associated with a microbial group, (iii) the enrichment of a microbial gene or sequence and (iv) enrichment of a functional annotation. The HPMC database enables detailed analysis of human microbial communities and supports research from basic microbiology and immunology to therapeutic development in human health and disease. © The Author(s) 2015. Published by Oxford University Press on behalf of Nucleic Acids Research.
RAS testing in metastatic colorectal cancer: advances in Europe.
Van Krieken, J Han J M; Rouleau, Etienne; Ligtenberg, Marjolijn J L; Normanno, Nicola; Patterson, Scott D; Jung, Andreas
2016-04-01
Personalized medicine shows promise for maximizing efficacy and minimizing toxicity of anti-cancer treatment. KRAS exon 2 mutations are predictive of resistance to epidermal growth factor receptor-directed monoclonal antibodies in patients with metastatic colorectal cancer. Recent studies have shown that broader RAS testing (KRAS and NRAS) is needed to select patients for treatment. While Sanger sequencing is still used, approaches based on various methodologies are available. Few CE-approved kits, however, detect the full spectrum of RAS mutations. More recently, "next-generation" sequencing has been developed for research use, including parallel semiconductor sequencing and reversible termination. These techniques have high technical sensitivities for detecting mutations, although the ideal threshold is currently unknown. Finally, liquid biopsy has the potential to become an additional tool to assess tumor-derived DNA. For accurate and timely RAS testing, appropriate sampling and prompt delivery of material is critical. Processes to ensure efficient turnaround from sample request to RAS evaluation must be implemented so that patients receive the most appropriate treatment. Given the variety of methodologies, external quality assurance programs are important to ensure a high standard of RAS testing. Here, we review technical and practical aspects of RAS testing for pathologists working with metastatic colorectal cancer tumor samples. The extension of markers from KRAS to RAS testing is the new paradigm for biomarker testing in colorectal cancer.
Considerations for standardizing predictive molecular pathology for cancer prognosis.
Fiorentino, Michelangelo; Scarpelli, Marina; Lopez-Beltran, Antonio; Cheng, Liang; Montironi, Rodolfo
2017-01-01
Molecular tests that were once ancillary to the core business of cyto-histopathology are becoming the most relevant workload in pathology departments after histopathology/cytopathology and before autopsies. This has resulted from innovations in molecular biology techniques, which have developed at an incredibly fast pace. Areas covered: Most of the current widely used techniques in molecular pathology such as FISH, direct sequencing, pyrosequencing, and allele-specific PCR will be replaced by massive parallel sequencing that will not be considered next generation, but rather, will be considered to be current generation sequencing. The pre-analytical steps of molecular techniques such as DNA extraction or sample preparation will be largely automated. Moreover, all the molecular pathology instruments will be part of an integrated workflow that traces the sample from extraction to the analytical steps until the results are reported; these steps will be guided by expert laboratory information systems. In situ hybridization and immunohistochemistry for quantification will be largely digitalized as much as histology will be mostly digitalized rather than viewed using microscopy. Expert commentary: This review summarizes the technical and regulatory issues concerning the standardization of molecular tests in pathology. A vision of the future perspectives of technological changes is also provided.
Stepanauskas, Ramunas; Fergusson, Elizabeth A; Brown, Joseph; Poulton, Nicole J; Tupper, Ben; Labonté, Jessica M; Becraft, Eric D; Brown, Julia M; Pachiadaki, Maria G; Povilaitis, Tadas; Thompson, Brian P; Mascena, Corianna J; Bellows, Wendy K; Lubys, Arvydas
2017-07-20
Microbial single-cell genomics can be used to provide insights into the metabolic potential, interactions, and evolution of uncultured microorganisms. Here we present WGA-X, a method based on multiple displacement amplification of DNA that utilizes a thermostable mutant of the phi29 polymerase. WGA-X enhances genome recovery from individual microbial cells and viral particles while maintaining ease of use and scalability. The greatest improvements are observed when amplifying high G+C content templates, such as those belonging to the predominant bacteria in agricultural soils. By integrating WGA-X with calibrated index-cell sorting and high-throughput genomic sequencing, we are able to analyze genomic sequences and cell sizes of hundreds of individual, uncultured bacteria, archaea, protists, and viral particles, obtained directly from marine and soil samples, in a single experiment. This approach may find diverse applications in microbiology and in biomedical and forensic studies of humans and other multicellular organisms.Single-cell genomics can be used to study uncultured microorganisms. Here, Stepanauskas et al. present a method combining improved multiple displacement amplification and FACS, to obtain genomic sequences and cell size information from uncultivated microbial cells and viral particles in environmental samples.
Wang, Yang; Zhang, Rongmin; Li, Jiyun; Wu, Zuowei; Yin, Wenjuan; Schwarz, Stefan; Tyrrell, Jonathan M; Zheng, Yongjun; Wang, Shaolin; Shen, Zhangqi; Liu, Zhihai; Liu, Jianye; Lei, Lei; Li, Mei; Zhang, Qidi; Wu, Congming; Zhang, Qijing; Wu, Yongning; Walsh, Timothy R; Shen, Jianzhong
2017-02-06
By 2030, the global population will be 8.5 billion, placing pressure on international poultry production, of which China is a key producer 1 . From April 2017, China will implement the withdrawal of colistin as a growth promoter, removing over 8,000 tonnes per year from the Chinese farming sector 2 . To understand the impact of banning colistin and the epidemiology of multi-drug-resistant (MDR) Escherichia coli (using bla NDM and mcr-1 as marker genes), we sampled poultry, dogs, sewage, wild birds and flies. Here, we show that mcr-1, but not bla NDM , is prevalent in hatcheries, but bla NDM quickly contaminates flocks through dogs, flies and wild birds. We also screened samples directly for resistance genes to understand the true breadth and depth of the environmental and animal resistome. Direct sample testing for bla NDM and mcr-1 in hatcheries, commercial farms, a slaughterhouse and supermarkets revealed considerably higher levels of positive samples than the bla NDM - and mcr-1-positive E. coli, indicating a substantial segment of unseen resistome-a phenomenon we have termed the 'phantom resistome'. Whole-genome sequencing identified common bla NDM -positive E. coli shared among farms, flies, dogs and farmers, providing direct evidence of carbapenem-resistant E. coli transmission and environmental contamination.
Eyre, David W.; Cule, Madeleine L.; Griffiths, David; Crook, Derrick W.; Peto, Tim E. A.
2013-01-01
Bacterial whole genome sequencing offers the prospect of rapid and high precision investigation of infectious disease outbreaks. Close genetic relationships between microorganisms isolated from different infected cases suggest transmission is a strong possibility, whereas transmission between cases with genetically distinct bacterial isolates can be excluded. However, undetected mixed infections—infection with ≥2 unrelated strains of the same species where only one is sequenced—potentially impairs exclusion of transmission with certainty, and may therefore limit the utility of this technique. We investigated the problem by developing a computationally efficient method for detecting mixed infection without the need for resource-intensive independent sequencing of multiple bacterial colonies. Given the relatively low density of single nucleotide polymorphisms within bacterial sequence data, direct reconstruction of mixed infection haplotypes from current short-read sequence data is not consistently possible. We therefore use a two-step maximum likelihood-based approach, assuming each sample contains up to two infecting strains. We jointly estimate the proportion of the infection arising from the dominant and minor strains, and the sequence divergence between these strains. In cases where mixed infection is confirmed, the dominant and minor haplotypes are then matched to a database of previously sequenced local isolates. We demonstrate the performance of our algorithm with in silico and in vitro mixed infection experiments, and apply it to transmission of an important healthcare-associated pathogen, Clostridium difficile. Using hospital ward movement data in a previously described stochastic transmission model, 15 pairs of cases enriched for likely transmission events associated with mixed infection were selected. Our method identified four previously undetected mixed infections, and a previously undetected transmission event, but no direct transmission between the pairs of cases under investigation. These results demonstrate that mixed infections can be detected without additional sequencing effort, and this will be important in assessing the extent of cryptic transmission in our hospitals. PMID:23658511
Song, Junfang; Duc, Céline; Storey, Kate G.; McLean, W. H. Irwin; Brown, Sara J.; Simpson, Gordon G.; Barton, Geoffrey J.
2014-01-01
The reference annotations made for a genome sequence provide the framework for all subsequent analyses of the genome. Correct and complete annotation in addition to the underlying genomic sequence is particularly important when interpreting the results of RNA-seq experiments where short sequence reads are mapped against the genome and assigned to genes according to the annotation. Inconsistencies in annotations between the reference and the experimental system can lead to incorrect interpretation of the effect on RNA expression of an experimental treatment or mutation in the system under study. Until recently, the genome-wide annotation of 3′ untranslated regions received less attention than coding regions and the delineation of intron/exon boundaries. In this paper, data produced for samples in Human, Chicken and A. thaliana by the novel single-molecule, strand-specific, Direct RNA Sequencing technology from Helicos Biosciences which locates 3′ polyadenylation sites to within +/− 2 nt, were combined with archival EST and RNA-Seq data. Nine examples are illustrated where this combination of data allowed: (1) gene and 3′ UTR re-annotation (including extension of one 3′ UTR by 5.9 kb); (2) disentangling of gene expression in complex regions; (3) clearer interpretation of small RNA expression and (4) identification of novel genes. While the specific examples displayed here may become obsolete as genome sequences and their annotations are refined, the principles laid out in this paper will be of general use both to those annotating genomes and those seeking to interpret existing publically available annotations in the context of their own experimental data. PMID:24722185
Tso, Kai-Yuen; Lee, Sau Dan; Lo, Kwok-Wai; Yip, Kevin Y
2014-12-23
Patient-derived tumor xenografts in mice are widely used in cancer research and have become important in developing personalized therapies. When these xenografts are subject to DNA sequencing, the samples could contain various amounts of mouse DNA. It has been unclear how the mouse reads would affect data analyses. We conducted comprehensive simulations to compare three alignment strategies at different mutation rates, read lengths, sequencing error rates, human-mouse mixing ratios and sequenced regions. We also sequenced a nasopharyngeal carcinoma xenograft and a cell line to test how the strategies work on real data. We found the "filtering" and "combined reference" strategies performed better than aligning reads directly to human reference in terms of alignment and variant calling accuracies. The combined reference strategy was particularly good at reducing false negative variants calls without significantly increasing the false positive rate. In some scenarios the performance gain of these two special handling strategies was too small for special handling to be cost-effective, but it was found crucial when false non-synonymous SNVs should be minimized, especially in exome sequencing. Our study systematically analyzes the effects of mouse contamination in the sequencing data of human-in-mouse xenografts. Our findings provide information for designing data analysis pipelines for these data.
Ni, Xiangyang; Westpheling, Janet
1997-01-01
The chi63 promoter directs glucose-sensitive, chitin-dependent transcription of a gene involved in the utilization of chitin as carbon source. Analysis of 5′ and 3′ deletions of the promoter region revealed that a 350-bp segment is sufficient for wild-type levels of expression and regulation. The analysis of single base changes throughout the promoter region, introduced by random and site-directed mutagenesis, identified several sequences to be important for activity and regulation. Single base changes at −10, −12, −32, −33, −35, and −37 upstream of the transcription start site resulted in loss of activity from the promoter, suggesting that bases in these positions are important for RNA polymerase interaction. The sequences centered around −10 (TATTCT) and −35 (TTGACC) in this promoter are, in fact, prototypical of eubacterial promoters. Overlapping the RNA polymerase binding site is a perfect 12-bp direct repeat sequence. Some base changes within this direct repeat resulted in constitutive expression, suggesting that this sequence is an operator for negative regulation. Other base changes resulted in loss of glucose repression while retaining the requirement for chitin induction, suggesting that this sequence is also involved in glucose repression. The fact that cis-acting mutations resulted in glucose resistance but not inducer independence rules out the possibility that glucose repression acts exclusively by inducer exclusion. The fact that mutations that affect glucose repression and chitin induction fall within the same direct repeat sequence module suggests that the direct repeat sequence facilitates both chitin induction and glucose repression. PMID:9371809
Simultaneous Multi-Slice fMRI using Spiral Trajectories
Zahneisen, Benjamin; Poser, Benedikt A.; Ernst, Thomas; Stenger, V. Andrew
2014-01-01
Parallel imaging methods using multi-coil receiver arrays have been shown to be effective for increasing MRI acquisition speed. However parallel imaging methods for fMRI with 2D sequences show only limited improvements in temporal resolution because of the long echo times needed for BOLD contrast. Recently, Simultaneous Multi-Slice (SMS) imaging techniques have been shown to increase fMRI temporal resolution by factors of four and higher. In SMS fMRI multiple slices can be acquired simultaneously using Echo Planar Imaging (EPI) and the overlapping slices are un-aliased using a parallel imaging reconstruction with multiple receivers. The slice separation can be further improved using the “blipped-CAIPI” EPI sequence that provides a more efficient sampling of the SMS 3D k-space. In this paper a blipped-spiral SMS sequence for ultra-fast fMRI is presented. The blipped-spiral sequence combines the sampling efficiency of spiral trajectories with the SMS encoding concept used in blipped-CAIPI EPI. We show that blipped spiral acquisition can achieve almost whole brain coverage at 3 mm isotropic resolution in 168 ms. It is also demonstrated that the high temporal resolution allows for dynamic BOLD lag time measurement using visual/motor and retinotopic mapping paradigms. The local BOLD lag time within the visual cortex following the retinotopic mapping stimulation of expanding flickering rings is directly measured and easily translated into an eccentricity map of the cortex. PMID:24518259
Discovering Deeply Divergent RNA Viruses in Existing Metatranscriptome Data with Machine Learning
NASA Astrophysics Data System (ADS)
Rivers, A. R.
2016-02-01
Most sampling of RNA viruses and phages has been directed toward a narrow range of hosts and environments. Several marine metagenomic studies have examined the RNA viral fraction in aquatic samples and found a number of picornaviruses and uncharacterized sequences. The lack of homology to known protein families has limited the discovery of new RNA viruses. We developed a computational method for identifying RNA viruses that relies on information in the codon transition probabilities of viral sequences to train a classifier. This approach does not rely on homology, but it has higher information content than other reference-free methods such as tetranucleotide frequency. Training and validation with RefSeq data gave true positive and true negative rates of 99.6% and 99.5% on the highly imbalanced validation sets (0.2% viruses) that, like the metatranscriptomes themselves, contain mostly non-viral sequences. To further test the method, a validation dataset of putative RNA virus genomes were identified in metatransciptomes by the presence of RNA dependent RNA polymerase, an essential gene for RNA viruses. The classifier successfully identified 99.4% of those contigs as viral. This approach is currently being extended to screen all metatranscriptome data sequenced at the DOE Joint Genome Institute, presently 4.5 Gb of assembled data from 504 public projects representing a wide range of marine, aquatic and terrestrial environments.
Nucleotide-Specific Contrast for DNA Sequencing by Electron Spectroscopy.
Mankos, Marian; Persson, Henrik H J; N'Diaye, Alpha T; Shadman, Khashayar; Schmid, Andreas K; Davis, Ronald W
2016-01-01
DNA sequencing by imaging in an electron microscope is an approach that holds promise to deliver long reads with low error rates and without the need for amplification. Earlier work using transmission electron microscopes, which use high electron energies on the order of 100 keV, has shown that low contrast and radiation damage necessitates the use of heavy atom labeling of individual nucleotides, which increases the read error rates. Other prior work using scattering electrons with much lower energy has shown to suppress beam damage on DNA. Here we explore possibilities to increase contrast by employing two methods, X-ray photoelectron and Auger electron spectroscopy. Using bulk DNA samples with monomers of each base, both methods are shown to provide contrast mechanisms that can distinguish individual nucleotides without labels. Both spectroscopic techniques can be readily implemented in a low energy electron microscope, which may enable label-free DNA sequencing by direct imaging.
Staphylococcus nepalensis in the guano of bats (Mammalia).
Vandžurová, A; Bačkor, P; Javorský, P; Pristaš, P
2013-05-31
Thirty randomly selected mesophilic isolates from the six years old guano sample from mixed Myotis myotis and M. blythii summer roosts colony were isolated and identified as Staphylococcus nepalensis using MALDI TOF analysis. 16S rRNA gene sequencing of selected five isolates and subsequent phylogenetic analysis confirmed that all sequences showed the highest similarity to S. nepalensis sequences. Several virulence factors were produced by tested isolates, mainly capsule formation and resistance to tetracycline, ampicillin, gentamycin, and chloramphenicol antibiotics. Our experiments show that the majority of cultivable mesophilic bacteria from the guano of bats belong to the S. nepalensis species. This is the first report on the occurrence of this species in the guano of bats and our results indicate that the guano accumulated near or directly in human dwellings and buildings may represent a significant risk for human health. Copyright © 2013 Elsevier B.V. All rights reserved.
Ramos, Enrique; Levinson, Benjamin T; Chasnoff, Sara; Hughes, Andrew; Young, Andrew L; Thornton, Katherine; Li, Allie; Vallania, Francesco L M; Province, Michael; Druley, Todd E
2012-12-06
Rare genetic variation in the human population is a major source of pathophysiological variability and has been implicated in a host of complex phenotypes and diseases. Finding disease-related genes harboring disparate functional rare variants requires sequencing of many individuals across many genomic regions and comparing against unaffected cohorts. However, despite persistent declines in sequencing costs, population-based rare variant detection across large genomic target regions remains cost prohibitive for most investigators. In addition, DNA samples are often precious and hybridization methods typically require large amounts of input DNA. Pooled sample DNA sequencing is a cost and time-efficient strategy for surveying populations of individuals for rare variants. We set out to 1) create a scalable, multiplexing method for custom capture with or without individual DNA indexing that was amenable to low amounts of input DNA and 2) expand the functionality of the SPLINTER algorithm for calling substitutions, insertions and deletions across either candidate genes or the entire exome by integrating the variant calling algorithm with the dynamic programming aligner, Novoalign. We report methodology for pooled hybridization capture with pre-enrichment, indexed multiplexing of up to 48 individuals or non-indexed pooled sequencing of up to 92 individuals with as little as 70 ng of DNA per person. Modified solid phase reversible immobilization bead purification strategies enable no sample transfers from sonication in 96-well plates through adapter ligation, resulting in 50% less library preparation reagent consumption. Custom Y-shaped adapters containing novel 7 base pair index sequences with a Hamming distance of ≥2 were directly ligated onto fragmented source DNA eliminating the need for PCR to incorporate indexes, and was followed by a custom blocking strategy using a single oligonucleotide regardless of index sequence. These results were obtained aligning raw reads against the entire genome using Novoalign followed by variant calling of non-indexed pools using SPLINTER or SAMtools for indexed samples. With these pipelines, we find sensitivity and specificity of 99.4% and 99.7% for pooled exome sequencing. Sensitivity, and to a lesser degree specificity, proved to be a function of coverage. For rare variants (≤2% minor allele frequency), we achieved sensitivity and specificity of ≥94.9% and ≥99.99% for custom capture of 2.5 Mb in multiplexed libraries of 22-48 individuals with only ≥5-fold coverage/chromosome, but these parameters improved to ≥98.7 and 100% with 20-fold coverage/chromosome. This highly scalable methodology enables accurate rare variant detection, with or without individual DNA sample indexing, while reducing the amount of required source DNA and total costs through less hybridization reagent consumption, multi-sample sonication in a standard PCR plate, multiplexed pre-enrichment pooling with a single hybridization and lesser sequencing coverage required to obtain high sensitivity.
Lee, Seung-Bum; Kaittanis, Charalambos; Jansen, Robert K; Hostetler, Jessica B; Tallon, Luke J; Town, Christopher D; Daniell, Henry
2006-01-01
Background Cotton (Gossypium hirsutum) is the most important fiber crop grown in 90 countries. In 2004–2005, US farmers planted 79% of the 5.7-million hectares of nuclear transgenic cotton. Unfortunately, genetically modified cotton has the potential to hybridize with other cultivated and wild relatives, resulting in geographical restrictions to cultivation. However, chloroplast genetic engineering offers the possibility of containment because of maternal inheritance of transgenes. The complete chloroplast genome of cotton provides essential information required for genetic engineering. In addition, the sequence data were used to assess phylogenetic relationships among the major clades of rosids using cotton and 25 other completely sequenced angiosperm chloroplast genomes. Results The complete cotton chloroplast genome is 160,301 bp in length, with 112 unique genes and 19 duplicated genes within the IR, containing a total of 131 genes. There are four ribosomal RNAs, 30 distinct tRNA genes and 17 intron-containing genes. The gene order in cotton is identical to that of tobacco but lacks rpl22 and infA. There are 30 direct and 24 inverted repeats 30 bp or longer with a sequence identity ≥ 90%. Most of the direct repeats are within intergenic spacer regions, introns and a 72 bp-long direct repeat is within the psaA and psaB genes. Comparison of protein coding sequences with expressed sequence tags (ESTs) revealed nucleotide substitutions resulting in amino acid changes in ndhC, rpl23, rpl20, rps3 and clpP. Phylogenetic analysis of a data set including 61 protein-coding genes using both maximum likelihood and maximum parsimony were performed for 28 taxa, including cotton and five other angiosperm chloroplast genomes that were not included in any previous phylogenies. Conclusion Cotton chloroplast genome lacks rpl22 and infA and contains a number of dispersed direct and inverted repeats. RNA editing resulted in amino acid changes with significant impact on their hydropathy. Phylogenetic analysis provides strong support for the position of cotton in the Malvales in the eurosids II clade sister to Arabidopsis in the Brassicales. Furthermore, there is strong support for the placement of the Myrtales sister to the eurosid I clade, although expanded taxon sampling is needed to further test this relationship. PMID:16553962
NASA Astrophysics Data System (ADS)
Benkler, Erik; Telle, Harald R.
2007-06-01
An improved phase-locked loop (PLL) for versatile synchronization of a sampling pulse train to an optical data stream is presented. It enables optical sampling of the true waveform of repetitive high bit-rate optical time division multiplexed (OTDM) data words such as pseudorandom bit sequences. Visualization of the true waveform can reveal details, which cause systematic bit errors. Such errors cannot be inferred from eye diagrams and require word-synchronous sampling. The programmable direct-digital-synthesis circuit used in our novel PLL approach allows flexible adaption of virtually any problem-specific synchronization scenario, including those required for waveform sampling, for jitter measurements by slope detection, and for classical eye-diagrams. Phase comparison of the PLL is performed at 10-GHz OTDM base clock rate, leading to a residual synchronization jitter of less than 70 fs.
Bomfim, Maria Rosa Quaresma; Koury, Matilde Cota
2006-12-20
We evaluated the use of low-stringency single specific primer PCR (LSSP-PCR) for genetically typing Leptospira directly from urine samples of cattle with clinical suspicion of leptospirosis. Urine samples obtained from 40 cattle with clinical suspicion of leptospirosis were amplified by specific PCR using the following primers: Internal 1/Internal 2 and G1/G2. The internal primers were designed from the gene sequence of the outer membrane lipoprotein Lip32 from Leptospira kirschneri, strain RM52. The PCR products were amplified with these two pairs of primers, which had approximately 497 and 285bp, respectively, and were subsequently used as a template for LSSP-PCR analysis. The genetic signatures from the leptospires which were present in the urine samples allowed us to make a preliminary identification of the leptospires by comparing the LSSP-PCR profiles obtained directly from urine samples with those from reference leptospires. The LSSP-PCR profiles obtained with the Internal 1 primer or with the G1 primer allowed the grouping of the leptospires into serogroups. LSSP-PCR was found to be a useful and sensitive approach capable of identifying leptospires directly from biological samples without the need for prior bacterial isolation. In conclusion, the LSSP-PCR technique may still be helpful in discriminating serogroups of Leptospira from different animal reservoirs, since the early identification of carrier animals and information on the shedding state are crucial to prevent the spread of leptospiral infection to other animals and humans.
Direct-push geochemical profiling for assessment of inorganic chemical heterogeneity in aquifers
Schulmeister, M.K.; Healey, J.M.; Butler, J.J.; McCall, G.W.
2004-01-01
Discrete-depth sampling of inorganic groundwater chemistry is essential for a variety of site characterization activities. Although the mobility and rapid sampling capabilities of direct-push techniques have led to their widespread use for evaluating the distribution of organic contaminants, complementary methods for the characterization of spatial variations in geochemical conditions have not been developed. In this study, a direct-push-based approach for high-resolution inorganic chemical profiling was developed at a site where sharp chemical contrasts and iron-reducing conditions had previously been observed. Existing multilevel samplers (MLSs) that span a fining-upward alluvial sequence were used for comparison with the direct-push profiling. Chemical profiles obtained with a conventional direct-push exposed-screen sampler differed from those obtained with an adjacent MLS because of sampler reactivity and mixing with water from previous sampling levels. The sampler was modified by replacing steel sampling components with stainless-steel and heat-treated parts, and adding an adapter that prevents mixing. Profiles obtained with the modified approach were in excellent agreement with those obtained from an adjacent MLS for all constituents and parameters monitored (Cl, NO3, Fe, Mn, DO, ORP, specific conductance and pH). Interpretations of site redox conditions based on field-measured parameters were supported by laboratory analysis of dissolved Fe. The discrete-depth capability of this approach allows inorganic chemical variations to be described at a level of detail that has rarely been possible. When combined with the mobility afforded by direct-push rigs and on-site methods of chemical analysis, the new approach is well suited for a variety of interactive site-characterization endeavors. ?? 2003 Elsevier B.V. All rights reserved.
Orlando, Ludovic; Mauffrey, Jean-François; Cuisin, Jacques; Patton, James L; Hänni, Catherine; Catzeflis, François
2003-04-01
The spiny rat Mesomys hispidus is one of many South American rodents that lack adequate taxonomic definition. The few sampled populations of this broadly distributed trans-Amazonian arboreal rat have come from widely separated regions and are typically highly divergent. The holotype was described in 1817 by A.-G. Desmarest, after Napoleon's army brought it to Paris following the plunder of Lisbon in 1808; however, the locality of origin has remained unknown. Here we examine the taxonomic status of this species by direct comparison of 50 extant individuals with the holotype at the morphometric and genetic levels, the latter based on 331 bp of the mitochondrial cytochrome b gene retrieved from a small skin fragment of the holotype with ancient DNA technology. Extensive sequence divergence is present among samples of M. hispidus collected from throughout its range, from French Guiana across Amazonia to Bolivia and Peru, with at least seven mitochondrial clades recognized (average divergence of 7.7% Kimura 2-parameter distance). Sequence from the holotype is, however, only weakly divergent from those of recent samples from French Guiana. Moreover, the holotype clusters with greater that 99% posterior probability with samples from this part of Amazonia in a discriminant analysis based on 22 cranial and dental measurements. Thus, we suggest that the holotype was originally obtained in eastern Amazonia north of the Amazon River, most likely in the Brazilian state of Amapá. Despite the high level of sequence diversity and marked morphological differences in size across the range of M. hispidus, we continue to regard this assemblage as a single species until additional samples and analyses suggest otherwise. Copyright 2002 Elsevier Science (USA)
Laassri, Majid; Dragunsky, Eugenia; Enterline, Joan; Eremeeva, Tatiana; Ivanova, Olga; Lottenbach, Kathleen; Belshe, Robert; Chumakov, Konstantin
2005-01-01
Sabin strains of poliovirus used in the manufacture of oral poliovirus vaccine (OPV) are prone to genetic variations that occur during growth in cell cultures and the organisms of vaccine recipients. Such derivative viruses often have increased neurovirulence and transmissibility, and in some cases they can reestablish chains of transmission in human populations. Monitoring for vaccine-derived polioviruses is an important part of the worldwide campaign to eradicate poliomyelitis. Analysis of vaccine-derived polioviruses requires, as a first step, their isolation in cell cultures, which takes significant time and may yield viral stocks that are not fully representative of the strains present in the original sample. Here we demonstrate that full-length viral cDNA can be PCR amplified directly from stool samples and immediately subjected to genomic analysis by oligonucleotide microarray hybridization and nucleotide sequencing. Most fecal samples from healthy children who received OPV were found to contain variants of Sabin vaccine viruses. Sequence changes in the 5′ untranslated region were common, as were changes in the VP1-coding region, including changes in a major antigenic site. Analysis of stool samples taken from cases of acute flaccid paralysis revealed the presence of mixtures of recombinant polioviruses, in addition to the emergence of new sequence variants. Avoiding the need for cell culture isolation dramatically shortened the time needed for identification and analysis of vaccine-derived polioviruses and could be useful for preliminary screening of clinical samples. The amplified full-length viral cDNA can be archived and used to recover live virus for further virological studies. PMID:15956413
Seyer, Ayse; Karasartova, Djursun; Ruh, Emrah; Güreser, Ayse Semra; Imir, Turgut; Taylan-Ozkan, Aysegul
2016-12-01
PCR and DNA sequencing are currently the diagnostic methods of choice for detection of Blastocystis spp. and their suptypes. Fresh or frozen stool samples have disadvantages in terms of several aspects such as transportation, storage, and existence of PCR inhibitors. Filter paper technology may provide a solution to these issues. The aim of the present study was to detect Blastocystis spp. and their subtypes by employing two different preservation methods: conventional frozen stool (FS) and dried stool spots on filter paper (DSSFP). Concentration and purity of DNA, sensitivity of PCR, and DNA sequencing results obtained from the two methods were also compared. A total of 230 fecal samples were included and separated into two parts: one part of the fecal samples were directly frozen and stored at -20 °C. The remaining portion of the specimens were homogenized with saline and spread onto the filter papers as thin layer with a diameter of approximately 3 cm. After air-dried, the filter papers were stored at room temperature. DSSFP samples were collected by scraping from the filter papers. DNA were extracted by EURx Stool DNA Extraction Kit from both samples. Concentration and purity were measured with Nano-Drop, then PCR and sequencing were conducted for detection of Blastocystis spp. and its genotypes. Pure DNA was obtained with a A260/A280 ratio of 1.7-2.2 in both methods. DNA yield from FS was 25-405 ng/μl and average DNA concentration was 151 ng/μl, while these were 7-339 and 122 ng/μl for DSSFP, respectively. No PCR inhibition was observed in two methods. DNA from DSSFP were found to be stable and PCR were reproducible for at least 1 year. FS-PCR- and DSSFP-PCR-positive samples were 49 (21.3 %) and 58 (25.3 %), respectively (p = 0.078). The 43 specimens were concordantly positive by both FS-PCR and DSSFP-PCR. When the microscopy was taken as the gold standard, sensitivity of DSSFP-PCR and FS-PCR was 95.5 and 86.4 %, while specificity of both tests was 99.4 and 98.3 %, respectively. DNA sequencing results of 19 microscopically confirmed cases were strictly identical (concordance 100 %) in both methods, and ST2:6, ST3:8, ST4:3, and ST6:2 were the detected subtypes. Among the 230 fecal samples, the most predominant subtypes were ST3, ST2, ST4, and ST1 by both FS and DSSFP methods. Concordance of DNA sequencing results obtained from the two methods was noted to be 90.7 %. To our knowledge, this is the first study that demonstrates DNA extraction from DSSFP is more sensitive and effective than the FS method for diagnosis of Blastocystis spp. and their subtypes by PCR and DNA sequencing.
Frange, Pierre; Meyer, Laurence; Jung, Matthieu; Goujard, Cecile; Zucman, David; Abel, Sylvie; Hochedez, Patrick; Gousset, Marine; Gascuel, Olivier; Rouzioux, Christine; Chaix, Marie-Laure
2013-01-01
Objective Characterization of HIV-1 sequences in newly infected individuals is important for elucidating the mechanisms of viral sexual transmission. We report the identification of transmitted/founder viruses in eight pairs of HIV-1 sexually-infected patients enrolled at the time of primary infection (“recipients”) and their transmitting partners (“donors”). Methods Using a single genome-amplification approach, we compared quasispecies in donors and recipients on the basis of 316 and 376 C2V5 env sequences amplified from plasma viral RNA and PBMC-associated DNA, respectively. Results Both DNA and RNA sequences indicated very homogeneous viral populations in all recipients, suggesting transmission of a single variant, even in cases of recent sexually transmitted infections (STIs) in donors (n = 2) or recipients (n = 3). In all pairs, the transmitted/founder virus was derived from an infrequent variant population within the blood of the donor. The donor variant sequences most closely related to the recipient sequences were found in plasma samples in 3/8 cases and/or in PBMC samples in 6/8 cases. Although donors were exclusively (n = 4) or predominantly (n = 4) infected by CCR5-tropic (R5) strains, two recipients were infected with highly homogeneous CXCR4/dual-mixed-tropic (X4/DM) viral populations, identified in both DNA and RNA. The proportion of X4/DM quasispecies in donors was higher in cases of X4/DM than R5 HIV transmission (16.7–22.0% versus 0–2.6%), suggesting that X4/DM transmission may be associated with a threshold population of X4/DM circulating quasispecies in donors. Conclusions These suggest that a severe genetic bottleneck occurs during subtype B HIV-1 heterosexual and homosexual transmission. Sexually-transmitted/founder virus cannot be directly predicted by analysis of the donor’s quasispecies in plasma and/or PBMC. Additional studies are required to fully understand the traits that confer the capacity to transmit and establish infection, and determine the role of concomitant STIs in mitigating the genetic bottleneck in mucosal HIV transmission. PMID:23874894
A metagenomic framework for the study of airborne microbial communities.
Yooseph, Shibu; Andrews-Pfannkoch, Cynthia; Tenney, Aaron; McQuaid, Jeff; Williamson, Shannon; Thiagarajan, Mathangi; Brami, Daniel; Zeigler-Allen, Lisa; Hoffman, Jeff; Goll, Johannes B; Fadrosh, Douglas; Glass, John; Adams, Mark D; Friedman, Robert; Venter, J Craig
2013-01-01
Understanding the microbial content of the air has important scientific, health, and economic implications. While studies have primarily characterized the taxonomic content of air samples by sequencing the 16S or 18S ribosomal RNA gene, direct analysis of the genomic content of airborne microorganisms has not been possible due to the extremely low density of biological material in airborne environments. We developed sampling and amplification methods to enable adequate DNA recovery to allow metagenomic profiling of air samples collected from indoor and outdoor environments. Air samples were collected from a large urban building, a medical center, a house, and a pier. Analyses of metagenomic data generated from these samples reveal airborne communities with a high degree of diversity and different genera abundance profiles. The identities of many of the taxonomic groups and protein families also allows for the identification of the likely sources of the sampled airborne bacteria.
A Metagenomic Framework for the Study of Airborne Microbial Communities
Tenney, Aaron; McQuaid, Jeff; Williamson, Shannon; Thiagarajan, Mathangi; Brami, Daniel; Zeigler-Allen, Lisa; Hoffman, Jeff; Goll, Johannes B.; Fadrosh, Douglas; Glass, John; Adams, Mark D.; Friedman, Robert; Venter, J. Craig
2013-01-01
Understanding the microbial content of the air has important scientific, health, and economic implications. While studies have primarily characterized the taxonomic content of air samples by sequencing the 16S or 18S ribosomal RNA gene, direct analysis of the genomic content of airborne microorganisms has not been possible due to the extremely low density of biological material in airborne environments. We developed sampling and amplification methods to enable adequate DNA recovery to allow metagenomic profiling of air samples collected from indoor and outdoor environments. Air samples were collected from a large urban building, a medical center, a house, and a pier. Analyses of metagenomic data generated from these samples reveal airborne communities with a high degree of diversity and different genera abundance profiles. The identities of many of the taxonomic groups and protein families also allows for the identification of the likely sources of the sampled airborne bacteria. PMID:24349140
Concerted evolution at the population level: pupfish HindIII satellite DNA sequences.
Elder, J F; Turner, B J
1994-01-01
The canonical monomers (approximately 170 bp) of an abundant (1.9 x 10(6) copies per diploid genome) satellite DNA sequence family in the genome of Cyprinodon variegatus, a "pupfish" that ranges along the Atlantic coast from Cape Cod to central Mexico, are divergent in base sequence in 10 of 12 samples collected from natural populations. The divergence involves substitutions, deletions, and insertions, is marked in scope (mean pairwise sequence similarity = 61.6%; range = 35-95.9%), is largely confined to the 3' half of the monomer, and is not correlated with the distance among collecting sites. Repetitive cloning and direct genomic sequencing experiments failed to detect intrapopulation and intraindividual variation, suggesting high levels of sequence homogeneity within populations. The satellite sequence has therefore undergone "concerted evolution," at the level of the local population. Concerted evolution has previously almost always been discussed in terms of the divergence of species or higher taxa; its intraspecific occurrence apparently has not been reported previously. The generality of the observation is difficult to evaluate, for although satellite DNAs from a large number of organisms have been studied in detail, there appear to be little or no other data on their sequence variation in natural populations. The relationship (if any) between concerted, population level, satellite DNA divergence and the extent of gene flow/genetic isolation among conspecific natural populations remains to be established. Images PMID:8302879
Tanner, Hannah; Evans, Jason T; Gossain, Savita; Hussain, Abid
2017-01-18
Patient mortality is significantly reduced by rapid identification of bacteria from sterile sites. MALDI-TOF can identify bacteria directly from positive blood cultures and multiple sample preparation methods are available. We evaluated three sample preparation methods and two MALDI-TOF score cut-off values. Positive blood culture bottles with organisms present in Gram stains were prospectively analysed by MALDI-TOF. Three lysis reagents (Saponin, SDS, and SepsiTyper lysis bufer) were applied to each positive culture followed by centrifugation, washing and protein extraction steps. Methods were compared using the McNemar test and 16S rDNA sequencing was used to assess discordant results. In 144 monomicrobial cultures, using ≥2.000 as the cut-off value, species level identifications were obtained from 69/144 (48%) samples using Saponin, 86/144 (60%) using SDS, and 91/144 (63%) using SepsiTyper. The difference between SDS and SepsiTyper was not statistically significant (P = 0.228). Differences between Saponin and the other two reagents were significant (P < 0.01). Using ≥1.700 plus top three results matching as the cut-off value, species level identifications were obtained from 100/144 (69%) samples using Saponin, 103/144 (72%) using SDS, and 106/144 (74%) using SepsiTyper and there was no statistical difference between the methods. No true discordances between culture and direct MALDI-TOF identification were observed in monomicrobial cultures. In 32 polymicrobial cultures, MALDI-TOF identified one organism in 34-75% of samples depending on the method. This study demonstrates two inexpensive in-house detergent lysis methods are non-inferior to a commercial kit for analysis of positive blood cultures by direct MALDI-TOF in a clinical diagnostic microbiology laboratory.
Microbiome in the Apical Root Canal System of Teeth with Post-Treatment Apical Periodontitis
Siqueira, José F.; Antunes, Henrique S.; Rôças, Isabela N.; Rachid, Caio T. C. C.
2016-01-01
Introduction Bacteria present in the apical root canal system are directly involved with the pathogenesis of post-treatment apical periodontitis. This study used a next-generation sequencing approach to identify the bacterial taxa occurring in cryopulverized apical root samples from root canal-treated teeth with post-treatment disease. Methods Apical root specimens obtained during periradicular surgery of ten adequately treated teeth with persistent apical periodontitis were cryogenically ground. DNA was extracted from the powder and the microbiome was characterized on the basis of the V4 hypervariable region of the 16S rRNA gene by using paired-end sequencing on Illumina MiSeq device. Results All samples were positive for the presence of bacterial DNA. Bacterial taxa were mapped to 11 phyla and 103 genera composed by 538 distinct operational taxonomic units (OTUs) at 3% of dissimilarity. Over 85% of the sequences belonged to 4 phyla: Proteobacteria, Firmicutes, Fusobacteria and Actinobacteria. In general, these 4 phyla accounted for approximately 80% of the distinct OTUs found in the apical root samples. Proteobacteria was the most abundant phylum in 6/10 samples. Fourteen genera had representatives identified in all cases. Overall, the genera Fusobacterium and Pseudomonas were the most dominant. Enterococcus was found in 4 cases, always in relatively low abundance. Conclusions This study showed a highly complex bacterial community in the apical root canal system of adequately treated teeth with persistent apical periodontitis. This suggests that this disease is characterized by multispecies bacterial communities and has a heterogeneous etiology, because the community composition largely varied from case to case. PMID:27689802
Microbiome in the Apical Root Canal System of Teeth with Post-Treatment Apical Periodontitis.
Siqueira, José F; Antunes, Henrique S; Rôças, Isabela N; Rachid, Caio T C C; Alves, Flávio R F
Bacteria present in the apical root canal system are directly involved with the pathogenesis of post-treatment apical periodontitis. This study used a next-generation sequencing approach to identify the bacterial taxa occurring in cryopulverized apical root samples from root canal-treated teeth with post-treatment disease. Apical root specimens obtained during periradicular surgery of ten adequately treated teeth with persistent apical periodontitis were cryogenically ground. DNA was extracted from the powder and the microbiome was characterized on the basis of the V4 hypervariable region of the 16S rRNA gene by using paired-end sequencing on Illumina MiSeq device. All samples were positive for the presence of bacterial DNA. Bacterial taxa were mapped to 11 phyla and 103 genera composed by 538 distinct operational taxonomic units (OTUs) at 3% of dissimilarity. Over 85% of the sequences belonged to 4 phyla: Proteobacteria, Firmicutes, Fusobacteria and Actinobacteria. In general, these 4 phyla accounted for approximately 80% of the distinct OTUs found in the apical root samples. Proteobacteria was the most abundant phylum in 6/10 samples. Fourteen genera had representatives identified in all cases. Overall, the genera Fusobacterium and Pseudomonas were the most dominant. Enterococcus was found in 4 cases, always in relatively low abundance. This study showed a highly complex bacterial community in the apical root canal system of adequately treated teeth with persistent apical periodontitis. This suggests that this disease is characterized by multispecies bacterial communities and has a heterogeneous etiology, because the community composition largely varied from case to case.
Miller, Mark P.; Bellinger, R.M.; Forsman, E.D.; Haig, Susan M.
2006-01-01
Phylogeographical analyses conducted in the Pacific Northwestern United States have often revealed concordant patterns of genetic diversity among taxa. These studies demonstrate distinct North/South genetic discontinuities that have been attributed to Pleistocene glaciation. We examined phylogeographical patterns of red tree voles (Phenacomys longicaudus) in western Oregon by analysing mitochondrial control region sequences for 169 individuals from 18 areas across the species' range. Cytochrome b sequences were also analysed from a subset of our samples to confirm the presence of major haplotype groups. Phylogenetic network analyses suggested the presence of two haplotype groups corresponding to northern and southern regions of P. longicaudus' range. Spatial genetic analyses (samova and Genetic Landscape Shapes) of control region sequences demonstrated a primary genetic discontinuity separating northern and southern sampling areas, while a secondary discontinuity separated northern sampling areas into eastern and western groups divided by the Willamette Valley. The North/South discontinuity likely corresponds to a region of secondary contact between lineages rather than an overt barrier. Although the Cordilleran ice sheet (maximum a??12 000 years ago) did not move southward to directly affect the region occupied by P. longicaudus, climate change during glaciation fragmented the forest landscape that it inhabits. Signatures of historical fragmentation were reflected by positive associations between latitude and variables such as Tajima's D and patterns associated with location-specific alleles. Genetic distances between southern sampling areas were smaller, suggesting that forest fragmentation was reduced in southern vs. northern regions.
DOE Office of Scientific and Technical Information (OSTI.GOV)
Tew, Karen
I spent the last ten weeks working in the Systems Biology department at Sandia National Laboratories in Livermore, CA. Under the direction of Zachary Bent, I helped do preliminary testing/optimization of a vacuum-driven, capture-based system for pathogen RNA transcript enrichment. I also worked on a project to create mutant Yersinia enterocolitica strains in order to test which genes are involved in intracellular pathogen virulence, as well as sequencing several Klebsiella pneumoniae samples for use by a bioinformaticist.
Fritz, M L; Miller, J R; Bayoh, M N; Vulule, J M; Landgraf, J R; Walker, E D
2013-12-01
A DNA-DNA hybridization method, reverse dot blot analysis (RDBA), was used to identify Anopheles gambiae s.s. and Anopheles arabiensis (Diptera: Culicidae) hosts. Of 299 blood-fed and semi-gravid An. gambiae s.l. collected from Kisian, Kenya, 244 individuals were identifiable to species; of these, 69.5% were An. arabiensis and 29.5% were An. gambiae s.s. Host identifications with RDBA were comparable with those of conventional polymerase chain reaction (PCR) followed by direct sequencing of amplicons of the vertebrate mitochondrial cytochrome b gene. Of the 174 amplicon-producing samples used to compare these two methods, 147 were identifiable by direct sequencing and 139 of these were identifiable by RDBA. Anopheles arabiensis bloodmeals were mostly (94.6%) bovine in origin, whereas An. gambiae s.s. fed upon humans more than 91.8% of the time. Tests by RDBA detected that two of 112 An. arabiensis contained blood from more than one host species, whereas PCR and direct sequencing did not. Recent use of insecticide-treated bednets in Kisian is likely to have caused the shift in the dominant vector species from An. gambiae s.s. to An. arabiensis. Reverse dot blot analysis provides an opportunity to study changes in host-feeding by members of the An. gambiae complex in response to the broadening distribution of vector control measures targeting host-selection behaviours. © 2013 The Royal Entomological Society.
Fine-tuning gene networks using simple sequence repeats
Egbert, Robert G.; Klavins, Eric
2012-01-01
The parameters in a complex synthetic gene network must be extensively tuned before the network functions as designed. Here, we introduce a simple and general approach to rapidly tune gene networks in Escherichia coli using hypermutable simple sequence repeats embedded in the spacer region of the ribosome binding site. By varying repeat length, we generated expression libraries that incrementally and predictably sample gene expression levels over a 1,000-fold range. We demonstrate the utility of the approach by creating a bistable switch library that programmatically samples the expression space to balance the two states of the switch, and we illustrate the need for tuning by showing that the switch’s behavior is sensitive to host context. Further, we show that mutation rates of the repeats are controllable in vivo for stability or for targeted mutagenesis—suggesting a new approach to optimizing gene networks via directed evolution. This tuning methodology should accelerate the process of engineering functionally complex gene networks. PMID:22927382
Anand, Prachi; Grigoryan, Alexandre; Bhuiyan, Mohammed H; Ueberheide, Beatrix; Russell, Victoria; Quinoñez, Jose; Moy, Patrick; Chait, Brian T; Poget, Sébastien F; Holford, Mandë
2014-01-01
Disulfide-rich peptide toxins found in the secretions of venomous organisms such as snakes, spiders, scorpions, leeches, and marine snails are highly efficient and effective tools for novel therapeutic drug development. Venom peptide toxins have been used extensively to characterize ion channels in the nervous system and platelet aggregation in haemostatic systems. A significant hurdle in characterizing disulfide-rich peptide toxins from venomous animals is obtaining significant quantities needed for sequence and structural analyses. Presented here is a strategy for the structural characterization of venom peptide toxins from sample limited (4 ng) specimens via direct mass spectrometry sequencing, chemical synthesis and NMR structure elucidation. Using this integrated approach, venom peptide Tv1 from Terebra variegata was discovered. Tv1 displays a unique fold not witnessed in prior snail neuropeptides. The novel structural features found for Tv1 suggest that the terebrid pool of peptide toxins may target different neuronal agents with varying specificities compared to previously characterized snail neuropeptides.
Ndunguru, Joseph; Taylor, Nigel J; Yadav, Jitender; Aly, Haytham; Legg, James P; Aveling, Terry; Thompson, Graham; Fauquet, Claude M
2005-01-01
Background Plant viral diseases present major constraints to crop production. Effective sampling of the viruses infecting plants is required to facilitate their molecular study and is essential for the development of crop protection and improvement programs. Retaining integrity of viral pathogens within sampled plant tissues is often a limiting factor in this process, most especially when sample sizes are large and when operating in developing counties and regions remote from laboratory facilities. FTA is a paper-based system designed to fix and store nucleic acids directly from fresh tissues pressed into the treated paper. We report here the use of FTA as an effective technology for sampling and retrieval of DNA and RNA viruses from plant tissues and their subsequent molecular analysis. Results DNA and RNA viruses were successfully recovered from leaf tissues of maize, cassava, tomato and tobacco pressed into FTA® Classic Cards. Viral nucleic acids eluted from FTA cards were found to be suitable for diagnostic molecular analysis by PCR-based techniques and restriction analysis, and for cloning and nucleotide sequencing in a manner equivalent to that offered by tradition isolation methods. Efficacy of the technology was demonstrated both from sampled greenhouse-grown plants and from leaf presses taken from crop plants growing in farmer's fields in East Africa. In addition, FTA technology was shown to be suitable for recovery of viral-derived transgene sequences integrated into the plant genome. Conclusion Results demonstrate that FTA is a practical, economical and sensitive method for sampling, storage and retrieval of viral pathogens and plant genomic sequences, when working under controlled conditions and in the field. Application of this technology has the potential to significantly increase ability to bring modern analytical techniques to bear on the viral pathogens infecting crop plants. PMID:15904535
A comparative molecular analysis of water-filled limestone sinkholes in north-eastern Mexico.
Sahl, Jason W; Gary, Marcus O; Harris, J Kirk; Spear, John R
2011-01-01
Sistema Zacatón in north-eastern Mexico is host to several deep, water-filled, anoxic, karstic sinkholes (cenotes). These cenotes were explored, mapped, and geochemically and microbiologically sampled by the autonomous underwater vehicle deep phreatic thermal explorer (DEPTHX). The community structure of the filterable fraction of the water column and extensive microbial mats that coat the cenote walls was investigated by comparative analysis of small-subunit (SSU) 16S rRNA gene sequences. Full-length Sanger gene sequence analysis revealed novel microbial diversity that included three putative bacterial candidate phyla and three additional groups that showed high intra-clade distance with poorly characterized bacterial candidate phyla. Limited functional gene sequence analysis in these anoxic environments identified genes associated with methanogenesis, sulfate reduction and anaerobic ammonium oxidation. A directed, barcoded amplicon, multiplex pyrosequencing approach was employed to compare ∼100,000 bacterial SSU gene sequences from water column and wall microbial mat samples from five cenotes in Sistema Zacatón. A new, high-resolution sequence distribution profile (SDP) method identified changes in specific phylogenetic types (phylotypes) in microbial mats at varied depths; Mantel tests showed a correlation of the genetic distances between mat communities in two cenotes and the geographic location of each cenote. Community structure profiles from the water column of three neighbouring cenotes showed distinct variation; statistically significant differences in the concentration of geochemical constituents suggest that the variation observed in microbial communities between neighbouring cenotes are due to geochemical variation. © 2010 Society for Applied Microbiology and Blackwell Publishing Ltd.
Poole, Anthony M; Willerslev, Eske
2007-10-01
A central question in astrobiology is whether life exists elsewhere in the universe. If so, is it related to Earth life? Technologies exist that enable identification of DNA- or RNA-based microbial life directly from environmental samples here on Earth. Such technologies could, in principle, be applied to the search for life elsewhere; indeed, efforts are underway to initiate such a search. However, surveying for nucleic acid-based life on other planets, if attempted, must be carried out with caution, owing to the risk of contamination by Earth-based life. Here we argue that the null hypothesis must be that any DNA discovered and sequenced from samples taken elsewhere in the universe are Earth-based contaminants. Experience from studies of low-biomass ancient DNA demonstrates that some results, by their very nature, will not enable complete rejection of the null hypothesis. In terms of eliminating contamination as an explanation of the data, there may be value in identification of sequences that lie outside the known diversity of the three domains of life. We therefore have examined whether a fourth domain could be readily identified from environmental DNA sequence data alone. We concluded that, even on Earth, this would be far from trivial, and we illustrate this point by way of examples drawn from the literature. Overall, our conclusions do not bode well for planned PCR-based surveys for life on Mars, and we argue that other independent biosignatures will be essential in corroborating any claims for the presence of life based on nucleic acid sequences.
Nahku, Ranno; Peebo, Karl; Valgepea, Kaspar; Barrick, Jeffrey E; Adamberg, Kaarel; Vilu, Raivo
2011-09-01
Nutrient-limited continuous cultures in chemostats have been used to study microbial cell physiology for over 60 years. Genome instability and genetic heterogeneity are possible uncontrolled factors in continuous cultivation experiments. We investigated these issues by using high-throughput (HT) DNA sequencing to characterize samples from different phases of a glucose-limited accelerostat (A-stat) experiment with Escherichia coli K-12 MG1655 and a duration regularly used in cell physiology studies (20 generations of continuous cultivation). Seven consensus mutations from the reference sequence and five subpopulations characterized by different mutations were detected in the HT-sequenced samples. This genetic heterogeneity was confirmed to result from the stock culture by Sanger sequencing. All the subpopulations in which allele frequencies increased (betA, cspG/cspH, glyA) during the experiment were also present at the end of replicate A-stats, indicating that no new subpopulations emerged during our experiments. The fact that ~31 % of the cells in our initial cultures obtained directly from a culture stock centre were mutants raises concerns that even if cultivations are started from single colonies, there is a significant chance of picking a mutant clone with an altered phenotype. Our results show that current HT DNA sequencing technology allows accurate subpopulation analysis and demonstrates that a glucose-limited E. coli K-12 MG1655 A-stat experiment with a duration of tens of generations is suitable for studying cell physiology and collecting quantitative data for metabolic modelling without interference from new mutations.
Nahku, Ranno; Peebo, Karl; Valgepea, Kaspar; Barrick, Jeffrey E.; Adamberg, Kaarel
2011-01-01
Nutrient-limited continuous cultures in chemostats have been used to study microbial cell physiology for over 60 years. Genome instability and genetic heterogeneity are possible uncontrolled factors in continuous cultivation experiments. We investigated these issues by using high-throughput (HT) DNA sequencing to characterize samples from different phases of a glucose-limited accelerostat (A-stat) experiment with Escherichia coli K-12 MG1655 and a duration regularly used in cell physiology studies (20 generations of continuous cultivation). Seven consensus mutations from the reference sequence and five subpopulations characterized by different mutations were detected in the HT-sequenced samples. This genetic heterogeneity was confirmed to result from the stock culture by Sanger sequencing. All the subpopulations in which allele frequencies increased (betA, cspG/cspH, glyA) during the experiment were also present at the end of replicate A-stats, indicating that no new subpopulations emerged during our experiments. The fact that ~31 % of the cells in our initial cultures obtained directly from a culture stock centre were mutants raises concerns that even if cultivations are started from single colonies, there is a significant chance of picking a mutant clone with an altered phenotype. Our results show that current HT DNA sequencing technology allows accurate subpopulation analysis and demonstrates that a glucose-limited E. coli K-12 MG1655 A-stat experiment with a duration of tens of generations is suitable for studying cell physiology and collecting quantitative data for metabolic modelling without interference from new mutations. PMID:21700661
Modularity of Protein Folds as a Tool for Template-Free Modeling of Structures.
Vallat, Brinda; Madrid-Aliste, Carlos; Fiser, Andras
2015-08-01
Predicting the three-dimensional structure of proteins from their amino acid sequences remains a challenging problem in molecular biology. While the current structural coverage of proteins is almost exclusively provided by template-based techniques, the modeling of the rest of the protein sequences increasingly require template-free methods. However, template-free modeling methods are much less reliable and are usually applicable for smaller proteins, leaving much space for improvement. We present here a novel computational method that uses a library of supersecondary structure fragments, known as Smotifs, to model protein structures. The library of Smotifs has saturated over time, providing a theoretical foundation for efficient modeling. The method relies on weak sequence signals from remotely related protein structures to create a library of Smotif fragments specific to the target protein sequence. This Smotif library is exploited in a fragment assembly protocol to sample decoys, which are assessed by a composite scoring function. Since the Smotif fragments are larger in size compared to the ones used in other fragment-based methods, the proposed modeling algorithm, SmotifTF, can employ an exhaustive sampling during decoy assembly. SmotifTF successfully predicts the overall fold of the target proteins in about 50% of the test cases and performs competitively when compared to other state of the art prediction methods, especially when sequence signal to remote homologs is diminishing. Smotif-based modeling is complementary to current prediction methods and provides a promising direction in addressing the structure prediction problem, especially when targeting larger proteins for modeling.
Improved multiple displacement amplification (iMDA) and ultraclean reagents.
Motley, S Timothy; Picuri, John M; Crowder, Chris D; Minich, Jeremiah J; Hofstadler, Steven A; Eshoo, Mark W
2014-06-06
Next-generation sequencing sample preparation requires nanogram to microgram quantities of DNA; however, many relevant samples are comprised of only a few cells. Genomic analysis of these samples requires a whole genome amplification method that is unbiased and free of exogenous DNA contamination. To address these challenges we have developed protocols for the production of DNA-free consumables including reagents and have improved upon multiple displacement amplification (iMDA). A specialized ethylene oxide treatment was developed that renders free DNA and DNA present within Gram positive bacterial cells undetectable by qPCR. To reduce DNA contamination in amplification reagents, a combination of ion exchange chromatography, filtration, and lot testing protocols were developed. Our multiple displacement amplification protocol employs a second strand-displacing DNA polymerase, improved buffers, improved reaction conditions and DNA free reagents. The iMDA protocol, when used in combination with DNA-free laboratory consumables and reagents, significantly improved efficiency and accuracy of amplification and sequencing of specimens with moderate to low levels of DNA. The sensitivity and specificity of sequencing of amplified DNA prepared using iMDA was compared to that of DNA obtained with two commercial whole genome amplification kits using 10 fg (~1-2 bacterial cells worth) of bacterial genomic DNA as a template. Analysis showed >99% of the iMDA reads mapped to the template organism whereas only 0.02% of the reads from the commercial kits mapped to the template. To assess the ability of iMDA to achieve balanced genomic coverage, a non-stochastic amount of bacterial genomic DNA (1 pg) was amplified and sequenced, and data obtained were compared to sequencing data obtained directly from genomic DNA. The iMDA DNA and genomic DNA sequencing had comparable coverage 99.98% of the reference genome at ≥1X coverage and 99.9% at ≥5X coverage while maintaining both balance and representation of the genome. The iMDA protocol in combination with DNA-free laboratory consumables, significantly improved the ability to sequence specimens with low levels of DNA. iMDA has broad utility in metagenomics, diagnostics, ancient DNA analysis, pre-implantation embryo screening, single-cell genomics, whole genome sequencing of unculturable organisms, and forensic applications for both human and microbial targets.
Yokoi, H; Kitahashi, T; Tanaka, T; Utagawa, E
2001-04-01
Through a year from April, 1999 to March, 2000, 20 samples, which consisted of raw sewage (2), chlorine-treated sewage (2), seawater (10) and naturally grown oysters (6), were collected monthly both from the sewage works at Mihama-ku, Chiba City and at a yacht harbor in Chiba City Bay, Japan. Astrovirus RNA were detected by reverse transcription-polymerase chain reaction (RT-PCR) and was typed by direct sequencing. Astrovirus positive products were detected from 9 samples (raw sewage; 1/2, chlorine-treated sewage; 2/2, seawater; 5/10 and oysters; 1/6) collected in April, 1999. In May, positive products were detected from 4 samples (raw sewage; 2/2 and seawater; 2/10). In June, only 1 positive product was detected from raw sewage. The number of positive samples showed a tendency to decrease and no positive products were detected from samples collected in July, 1999 to January, 2000. After that period, positive products were again detected from 3 samples (raw sewage; 1/2, chlorine-treated sewage; 2/2) collected in February, 2000. In March, the number of positive samples showed the peak and positive products were detected from 12 samples (raw sewage; 2/2, chlorine-treated sewage; 2/2, seawater; 7/10 and oysters: 1/6). Astrovirus positive products detected in April, May, June, July, 1999 and February, 2000 were classified into type 1 or 2 by sequencing, whereas in March, 2000 were type 1, 2, 3, 6 and 7.
Cosart, Ted; Beja-Pereira, Albano; Luikart, Gordon
2014-11-01
The computer program EXONSAMPLER automates the sampling of thousands of exon sequences from publicly available reference genome sequences and gene annotation databases. It was designed to provide exon sequences for the efficient, next-generation gene sequencing method called exon capture. The exon sequences can be sampled by a list of gene name abbreviations (e.g. IFNG, TLR1), or by sampling exons from genes spaced evenly across chromosomes. It provides a list of genomic coordinates (a bed file), as well as a set of sequences in fasta format. User-adjustable parameters for collecting exon sequences include a minimum and maximum acceptable exon length, maximum number of exonic base pairs (bp) to sample per gene, and maximum total bp for the entire collection. It allows for partial sampling of very large exons. It can preferentially sample upstream (5 prime) exons, downstream (3 prime) exons, both external exons, or all internal exons. It is written in the Python programming language using its free libraries. We describe the use of EXONSAMPLER to collect exon sequences from the domestic cow (Bos taurus) genome for the design of an exon-capture microarray to sequence exons from related species, including the zebu cow and wild bison. We collected ~10% of the exome (~3 million bp), including 155 candidate genes, and ~16,000 exons evenly spaced genomewide. We prioritized the collection of 5 prime exons to facilitate discovery and genotyping of SNPs near upstream gene regulatory DNA sequences, which control gene expression and are often under natural selection. © 2014 John Wiley & Sons Ltd.
LLNL Genomic Assessment: Viral and Bacterial Sequencing Needs for TMTI, Task 1.4.2 Report
DOE Office of Scientific and Technical Information (OSTI.GOV)
Slezak, T; Borucki, M; Lam, M
Good progress has been made on both bacterial and viral sequencing by the TMTI centers. While access to appropriate samples is a limiting factor to throughput, excellent progress has been made with respect to getting agreements in place with key sources of relevant materials. Sharing of sequenced genomes funded by TMTI has been extremely limited to date. The April 2010 exercise should force a resolution to this, but additional managerial pressures may be needed to ensure that rapid sharing of TMTI-funded sequencing occurs, regardless of collaborator constraints concerning ultimate publication(s). Policies to permit TMTI-internal rapid sharing of sequenced genomes shouldmore » be written into all TMTI agreements with collaborators now being negotiated. TMTI needs to establish a Web-based system for tracking samples destined for sequencing. This includes metadata on sample origins and contributor, information on sample shipment/receipt, prioritization by TMTI, assignment to one or more sequencing centers (including possible TMTI-sponsored sequencing at a contributor site), and status history of the sample sequencing effort. While this system could be a component of the AFRL system, it is not part of any current development effort. Policy and standardized procedures are needed to ensure appropriate verification of all TMTI samples prior to the investment in sequencing. PCR, arrays, and classical biochemical tests are examples of potential verification methods. Verification is needed to detect miss-labeled, degraded, mixed or contaminated samples. Regular QC exercises are needed to ensure that the TMTI-funded centers are meeting all standards for producing quality genomic sequence data.« less
Nováková, Markéta; Heneberg, Petr; Heylen, Dieter J A; Medvecký, Matej; Muñoz-Leal, Sebastián; Šmajs, David; Literák, Ivan
2018-03-01
In the last two decades, the advent of molecular methods has revealed a remarkable diversity of rickettsiae (Rickettsiales: Rickettsiaceae) in invertebrates. Several species of these obligate intracellular bacteria are known to cause human infections, hence more attention has been directed towards human-biting ectoparasites. A spotted fever group Rickettsia sp. was previously detected in Ixodes lividus ticks (Ixodidae) associated with sand martins (Hirundinidae: Riparia riparia). In order to identify whether this rickettsia varies among isolated tick populations, a total of 1758 I. lividus ticks and five Ixodes ricinus ticks (Ixodidae) were collected in the Czech Republic and 148 I. lividus ticks were collected in Belgium, from nests of sand martins, European bee-eaters (Meropidae: Merops apiaster), Eurasian tree sparrows (Passeridae: Passer montanus), and from captured sand martins. We screened 165 and 78 I. lividus ticks (from the Czech Republic and Belgium, respectively) and all five I. ricinus ticks for the presence of rickettsial DNA. Only I. lividus samples were positive for Rickettsia vini, a spotted fever group rickettsia that commonly infects the tree-hole tick Ixodes arboricola (Ixodidae). Maximum likelihood analysis of the rickettsial sequences showed that the most closely related organism to R. vini corresponds to an uncharacterized rickettsia detected in Argas lagenoplastis (Argasidae), a nidicolous soft tick of the fairy martin (Hirundinidae: Petrochelidon ariel) in Australia. The observed variability of R. vini sequences from isolated tick populations was low; all 85 sequenced samples were identical to each other in five out of six partial rickettsial genes, except for the sca4 sequence (99.9% identity, 808/809 nt) that differed in I. lividus ticks from two sampling sites in the Czech Republic. Copyright © 2018 Elsevier GmbH. All rights reserved.
Serša, Igor; Kranjc, Matej; Miklavčič, Damijan
2015-01-01
Electroporation is gaining its importance in everyday clinical practice of cancer treatment. For its success it is extremely important that coverage of the target tissue, i.e. treated tumor, with electric field is within the specified range. Therefore, an efficient tool for the electric field monitoring in the tumor during delivery of electroporation pulses is needed. The electric field can be reconstructed by the magnetic resonance electric impedance tomography method from current density distribution data. In this study, the use of current density imaging with MRI for monitoring current density distribution during delivery of irreversible electroporation pulses was demonstrated. Using a modified single-shot RARE sequence, where four 3000 V and 100 μs long pulses were included at the start, current distribution between a pair of electrodes inserted in a liver tissue sample was imaged. Two repetitions of the sequence with phases of refocusing radiofrequency pulses 90° apart were needed to acquire one current density image. For each sample in total 45 current density images were acquired to follow a standard protocol for irreversible electroporation where 90 electric pulses are delivered at 1 Hz. Acquired current density images showed that the current density in the middle of the sample increased from first to last electric pulses by 60%, i.e. from 8 kA/m2 to 13 kA/m2 and that direction of the current path did not change with repeated electric pulses significantly. The presented single-shot RARE-based current density imaging sequence was used successfully to image current distribution during delivery of short high-voltage electric pulses. The method has a potential to enable monitoring of tumor coverage by electric field during irreversible electroporation tissue ablation.
info-gibbs: a motif discovery algorithm that directly optimizes information content during sampling.
Defrance, Matthieu; van Helden, Jacques
2009-10-15
Discovering cis-regulatory elements in genome sequence remains a challenging issue. Several methods rely on the optimization of some target scoring function. The information content (IC) or relative entropy of the motif has proven to be a good estimator of transcription factor DNA binding affinity. However, these information-based metrics are usually used as a posteriori statistics rather than during the motif search process itself. We introduce here info-gibbs, a Gibbs sampling algorithm that efficiently optimizes the IC or the log-likelihood ratio (LLR) of the motif while keeping computation time low. The method compares well with existing methods like MEME, BioProspector, Gibbs or GAME on both synthetic and biological datasets. Our study shows that motif discovery techniques can be enhanced by directly focusing the search on the motif IC or the motif LLR. http://rsat.ulb.ac.be/rsat/info-gibbs
Necrotizing enterocolitis and preterm infant gut bacteria
Warner, Barbara B.; Tarr, Phillip I.
2016-01-01
Summary Necrotizing enterocolitis remains an intractable consequence of preterm birth. Gut microbial communities, especially bacterial communities, have long been suspected to play a role in the development of necrotizing enterocolitis. Direct-from-stool nucleic acid sequencing technology now offers insights into the make-up of these communities. Data are now converging on the roles of Gram-negative bacteria as causative agents, despite the dynamic nature of bacterial populations, the varying technologies and sampling strategies, and the overall small sample sizes in these case–control studies. Bacteria that confer protection from necrotizing enterocolitis have not been identified across studies. The beneficial effect of probiotics is not apparent in infants with birth weights <1000 g (these infants are at highest risk of, and have the highest case fatality rate from, necrotizing enterocolitis). Further work should be directed to the modulating gut microbes, or the products they produce, to prevent this devastating complication of preterm birth. PMID:27343151
Barb, Jennifer J; Oler, Andrew J; Kim, Hyung-Suk; Chalmers, Natalia; Wallen, Gwenyth R; Cashion, Ann; Munson, Peter J; Ames, Nancy J
2016-01-01
There is much speculation on which hypervariable region provides the highest bacterial specificity in 16S rRNA sequencing. The optimum solution to prevent bias and to obtain a comprehensive view of complex bacterial communities would be to sequence the entire 16S rRNA gene; however, this is not possible with second generation standard library design and short-read next-generation sequencing technology. This paper examines a new process using seven hypervariable or V regions of the 16S rRNA (six amplicons: V2, V3, V4, V6-7, V8, and V9) processed simultaneously on the Ion Torrent Personal Genome Machine (Life Technologies, Grand Island, NY). Four mock samples were amplified using the 16S Ion Metagenomics Kit™ (Life Technologies) and their sequencing data is subjected to a novel analytical pipeline. Results are presented at family and genus level. The Kullback-Leibler divergence (DKL), a measure of the departure of the computed from the nominal bacterial distribution in the mock samples, was used to infer which region performed best at the family and genus levels. Three different hypervariable regions, V2, V4, and V6-7, produced the lowest divergence compared to the known mock sample. The V9 region gave the highest (worst) average DKL while the V4 gave the lowest (best) average DKL. In addition to having a high DKL, the V9 region in both the forward and reverse directions performed the worst finding only 17% and 53% of the known family level and 12% and 47% of the genus level bacteria, while results from the forward and reverse V4 region identified all 17 family level bacteria. The results of our analysis have shown that our sequencing methods using 6 hypervariable regions of the 16S rRNA and subsequent analysis is valid. This method also allowed for the assessment of how well each of the variable regions might perform simultaneously. Our findings will provide the basis for future work intended to assess microbial abundance at different time points throughout a clinical protocol.
Direct and long-term detection of gene doping in conventional blood samples.
Beiter, T; Zimmermann, M; Fragasso, A; Hudemann, J; Niess, A M; Bitzer, M; Lauer, U M; Simon, P
2011-03-01
The misuse of somatic gene therapy for the purpose of enhancing athletic performance is perceived as a coming threat to the world of sports and categorized as 'gene doping'. This article describes a direct detection approach for gene doping that gives a clear yes-or-no answer based on the presence or absence of transgenic DNA in peripheral blood samples. By exploiting a priming strategy to specifically amplify intronless DNA sequences, we developed PCR protocols allowing the detection of very small amounts of transgenic DNA in genomic DNA samples to screen for six prime candidate genes. Our detection strategy was verified in a mouse model, giving positive signals from minute amounts (20 μl) of blood samples for up to 56 days following intramuscular adeno-associated virus-mediated gene transfer, one of the most likely candidate vector systems to be misused for gene doping. To make our detection strategy amenable for routine testing, we implemented a robust sample preparation and processing protocol that allows cost-efficient analysis of small human blood volumes (200 μl) with high specificity and reproducibility. The practicability and reliability of our detection strategy was validated by a screening approach including 327 blood samples taken from professional and recreational athletes under field conditions.
Weiss, Eric R.; Alter, Galit; Ogembo, Javier Gordon; Henderson, Jennifer L.; Tabak, Barbara; Bakiş, Yasin; Somasundaran, Mohan; Garber, Manuel; Selin, Liisa
2016-01-01
ABSTRACT The Epstein-Barr virus (EBV) gp350 glycoprotein interacts with the cellular receptor to mediate viral entry and is thought to be the major target for neutralizing antibodies. To better understand the role of EBV-specific antibodies in the control of viral replication and the evolution of sequence diversity, we measured EBV gp350-specific antibody responses and sequenced the gp350 gene in samples obtained from individuals experiencing primary EBV infection (acute infectious mononucleosis [AIM]) and again 6 months later (during convalescence [CONV]). EBV gp350-specific IgG was detected in the sera of 17 (71%) of 24 individuals at the time of AIM and all 24 (100%) individuals during CONV; binding antibody titers increased from AIM through CONV, reaching levels equivalent to those in age-matched, chronically infected individuals. Antibody-dependent cell-mediated phagocytosis (ADCP) was rarely detected during AIM (4 of 24 individuals; 17%) but was commonly detected during CONV (19 of 24 individuals; 79%). The majority (83%) of samples taken during AIM neutralized infection of primary B cells; all samples obtained at 6 months postdiagnosis neutralized EBV infection of cultured and primary target cells. Deep sequencing revealed interpatient gp350 sequence variation but conservation of the CR2-binding site. The levels of gp350-specific neutralizing activity directly correlated with higher peripheral blood EBV DNA levels during AIM and a greater evolution of diversity in gp350 nucleotide sequences from AIM to CONV. In summary, we conclude that the viral load and EBV gp350 diversity during early infection are associated with the development of neutralizing antibody responses following AIM. IMPORTANCE Antibodies against viral surface proteins can blunt the spread of viral infection by coating viral particles, mediating uptake by immune cells, or blocking interaction with host cell receptors, making them a desirable component of a sterilizing vaccine. The EBV surface protein gp350 is a major target for antibodies. We report the detection of EBV gp350-specific antibodies capable of neutralizing EBV infection in vitro. The majority of gp350-directed vaccines focus on glycoproteins from lab-adapted strains, which may poorly reflect primary viral envelope diversity. We report some of the first primary gp350 sequences, noting that the gp350 host receptor binding site is remarkably stable across patients and time. However, changes in overall gene diversity were detectable during infection. Patients with higher peripheral blood viral loads in primary infection and greater changes in viral diversity generated more efficient antibodies. Our findings provide insight into the generation of functional antibodies, necessary for vaccine development. PMID:27733645
Weiss, Eric R; Alter, Galit; Ogembo, Javier Gordon; Henderson, Jennifer L; Tabak, Barbara; Bakiş, Yasin; Somasundaran, Mohan; Garber, Manuel; Selin, Liisa; Luzuriaga, Katherine
2017-01-01
The Epstein-Barr virus (EBV) gp350 glycoprotein interacts with the cellular receptor to mediate viral entry and is thought to be the major target for neutralizing antibodies. To better understand the role of EBV-specific antibodies in the control of viral replication and the evolution of sequence diversity, we measured EBV gp350-specific antibody responses and sequenced the gp350 gene in samples obtained from individuals experiencing primary EBV infection (acute infectious mononucleosis [AIM]) and again 6 months later (during convalescence [CONV]). EBV gp350-specific IgG was detected in the sera of 17 (71%) of 24 individuals at the time of AIM and all 24 (100%) individuals during CONV; binding antibody titers increased from AIM through CONV, reaching levels equivalent to those in age-matched, chronically infected individuals. Antibody-dependent cell-mediated phagocytosis (ADCP) was rarely detected during AIM (4 of 24 individuals; 17%) but was commonly detected during CONV (19 of 24 individuals; 79%). The majority (83%) of samples taken during AIM neutralized infection of primary B cells; all samples obtained at 6 months postdiagnosis neutralized EBV infection of cultured and primary target cells. Deep sequencing revealed interpatient gp350 sequence variation but conservation of the CR2-binding site. The levels of gp350-specific neutralizing activity directly correlated with higher peripheral blood EBV DNA levels during AIM and a greater evolution of diversity in gp350 nucleotide sequences from AIM to CONV. In summary, we conclude that the viral load and EBV gp350 diversity during early infection are associated with the development of neutralizing antibody responses following AIM. Antibodies against viral surface proteins can blunt the spread of viral infection by coating viral particles, mediating uptake by immune cells, or blocking interaction with host cell receptors, making them a desirable component of a sterilizing vaccine. The EBV surface protein gp350 is a major target for antibodies. We report the detection of EBV gp350-specific antibodies capable of neutralizing EBV infection in vitro The majority of gp350-directed vaccines focus on glycoproteins from lab-adapted strains, which may poorly reflect primary viral envelope diversity. We report some of the first primary gp350 sequences, noting that the gp350 host receptor binding site is remarkably stable across patients and time. However, changes in overall gene diversity were detectable during infection. Patients with higher peripheral blood viral loads in primary infection and greater changes in viral diversity generated more efficient antibodies. Our findings provide insight into the generation of functional antibodies, necessary for vaccine development. Copyright © 2016 American Society for Microbiology.
kWIP: The k-mer weighted inner product, a de novo estimator of genetic similarity.
Murray, Kevin D; Webers, Christfried; Ong, Cheng Soon; Borevitz, Justin; Warthmann, Norman
2017-09-01
Modern genomics techniques generate overwhelming quantities of data. Extracting population genetic variation demands computationally efficient methods to determine genetic relatedness between individuals (or "samples") in an unbiased manner, preferably de novo. Rapid estimation of genetic relatedness directly from sequencing data has the potential to overcome reference genome bias, and to verify that individuals belong to the correct genetic lineage before conclusions are drawn using mislabelled, or misidentified samples. We present the k-mer Weighted Inner Product (kWIP), an assembly-, and alignment-free estimator of genetic similarity. kWIP combines a probabilistic data structure with a novel metric, the weighted inner product (WIP), to efficiently calculate pairwise similarity between sequencing runs from their k-mer counts. It produces a distance matrix, which can then be further analysed and visualised. Our method does not require prior knowledge of the underlying genomes and applications include establishing sample identity and detecting mix-up, non-obvious genomic variation, and population structure. We show that kWIP can reconstruct the true relatedness between samples from simulated populations. By re-analysing several published datasets we show that our results are consistent with marker-based analyses. kWIP is written in C++, licensed under the GNU GPL, and is available from https://github.com/kdmurray91/kwip.
Ndombo, P K; Ndze, V N; Mbarga, F D; Anderson, R; Acho, A; Ebua Chia, J; Njamnshi, A K; Rota, P A; Waku-Kouomou, D
2018-02-01
Measles is a highly infectious human viral disease caused by measles virus (MeV). An estimated 114 900 measles deaths occurred worldwide in 2014. There are currently eight clades (A-H) comprised 24 MeV genotypes. We sought to characterise MeVs among Central African Republic (CAR) refugees during the 2014 measles epidemic in Cameroon. Samples were collected from children <15 years with suspected measles infections in two refugee camps in the east region of Cameroon. Viral RNA was extracted directly from urine samples. RNA detection of MeV RNA was performed with real-time reverse transcription polymerase chain reaction (PCR) to amplify a 634 bp nucleotide fragment of the N gene. The sequence of the PCR product was obtained to determine the genotype. MeV RNA was detected in 25 out of 30 samples from suspected cases, and among the 25 positive samples, MeV sequences were obtained from 20. The MeV strains characterised were all genotype B3. The MeV strains from genotype B3 found in this outbreak were more similar to those circulating in Northern Cameroon in 2010-2011 than to MeV strains circulating in the CAR in 2011. Surveillance system should be improved to focus on refugees for early detection of and response to outbreaks.
2013-01-01
Following recent trends in environmental microbiology, food microbiology has benefited from the advances in molecular biology and adopted novel strategies to detect, identify, and monitor microbes in food. An in-depth study of the microbial diversity in food can now be achieved by using high-throughput sequencing (HTS) approaches after direct nucleic acid extraction from the sample to be studied. In this review, the workflow of applying culture-independent HTS to food matrices is described. The current scenario and future perspectives of HTS uses to study food microbiota are presented, and the decision-making process leading to the best choice of working conditions to fulfill the specific needs of food research is described. PMID:23475615
Ji, Feng; Zhao, Jing-Zhuang; Liu, Miao; Lu, Tong-Yan; Liu, Hong-Bai; Yin, Jiasheng; Xu, Li-Ming
2017-04-01
Infectious pancreatic necrosis (IPN) is a significant disease of farmed salmonids resulting in direct economic losses due to high mortality in China. However, no gene sequence of any Chinese infectious pancreatic necrosis virus (IPNV) isolates was available. In the study, moribund rainbow trout fry samples were collected during an outbreak of IPN in Yunnan province of southwest China in 2013. An IPNV was isolated and tentatively named ChRtm213. We determined the full genome sequence of the IPNV ChRtm213 and compared it with previously identified IPNV sequences worldwide. The sequences of different structural and non-structural protein genes were compared to those of other aquatic birnaviruses sequenced to date. The results indicated that the complete genome sequence of ChRtm213 strain contains a segment A (3099 nucleotides) coding a polyprotein VP2-VP4-VP3, and a segment B (2789 nucleotides) coding a RNA-dependent RNA polymerase VP1. The phylogenetic analyses showed that ChRtm213 strain fell within genogroup 1, serotype A9 (Jasper), having similarities of 96.3% (segment A) and 97.3% (segment B) with the IPNV strain AM98 from Japan. The results suggest that the Chinese IPNV isolate has relative closer relationship with Japanese IPNV strains. The sequence of ChRtm213 was the first gene sequence of IPNV isolates in China. This study provided a robust reference for diagnosis and/or control of IPNV prevalent in China.
Dröge, J.; Gregor, I.; McHardy, A. C.
2015-01-01
Motivation: Metagenomics characterizes microbial communities by random shotgun sequencing of DNA isolated directly from an environment of interest. An essential step in computational metagenome analysis is taxonomic sequence assignment, which allows identifying the sequenced community members and reconstructing taxonomic bins with sequence data for the individual taxa. For the massive datasets generated by next-generation sequencing technologies, this cannot be performed with de-novo phylogenetic inference methods. We describe an algorithm and the accompanying software, taxator-tk, which performs taxonomic sequence assignment by fast approximate determination of evolutionary neighbors from sequence similarities. Results: Taxator-tk was precise in its taxonomic assignment across all ranks and taxa for a range of evolutionary distances and for short as well as for long sequences. In addition to the taxonomic binning of metagenomes, it is well suited for profiling microbial communities from metagenome samples because it identifies bacterial, archaeal and eukaryotic community members without being affected by varying primer binding strengths, as in marker gene amplification, or copy number variations of marker genes across different taxa. Taxator-tk has an efficient, parallelized implementation that allows the assignment of 6 Gb of sequence data per day on a standard multiprocessor system with 10 CPU cores and microbial RefSeq as the genomic reference data. Availability and implementation: Taxator-tk source and binary program files are publicly available at http://algbio.cs.uni-duesseldorf.de/software/. Contact: Alice.McHardy@uni-duesseldorf.de Supplementary information: Supplementary data are available at Bioinformatics online. PMID:25388150
Gregor, Ivan; Dröge, Johannes; Schirmer, Melanie; Quince, Christopher; McHardy, Alice C
2016-01-01
Background. Metagenomics is an approach for characterizing environmental microbial communities in situ, it allows their functional and taxonomic characterization and to recover sequences from uncultured taxa. This is often achieved by a combination of sequence assembly and binning, where sequences are grouped into 'bins' representing taxa of the underlying microbial community. Assignment to low-ranking taxonomic bins is an important challenge for binning methods as is scalability to Gb-sized datasets generated with deep sequencing techniques. One of the best available methods for species bins recovery from deep-branching phyla is the expert-trained PhyloPythiaS package, where a human expert decides on the taxa to incorporate in the model and identifies 'training' sequences based on marker genes directly from the sample. Due to the manual effort involved, this approach does not scale to multiple metagenome samples and requires substantial expertise, which researchers who are new to the area do not have. Results. We have developed PhyloPythiaS+, a successor to our PhyloPythia(S) software. The new (+) component performs the work previously done by the human expert. PhyloPythiaS+ also includes a new k-mer counting algorithm, which accelerated the simultaneous counting of 4-6-mers used for taxonomic binning 100-fold and reduced the overall execution time of the software by a factor of three. Our software allows to analyze Gb-sized metagenomes with inexpensive hardware, and to recover species or genera-level bins with low error rates in a fully automated fashion. PhyloPythiaS+ was compared to MEGAN, taxator-tk, Kraken and the generic PhyloPythiaS model. The results showed that PhyloPythiaS+ performs especially well for samples originating from novel environments in comparison to the other methods. Availability. PhyloPythiaS+ in a virtual machine is available for installation under Windows, Unix systems or OS X on: https://github.com/algbioi/ppsp/wiki.
Effects of 16S rDNA sampling on estimates of the number of endosymbiont lineages in sucking lice
Burleigh, J. Gordon; Light, Jessica E.; Reed, David L.
2016-01-01
Phylogenetic trees can reveal the origins of endosymbiotic lineages of bacteria and detect patterns of co-evolution with their hosts. Although taxon sampling can greatly affect phylogenetic and co-evolutionary inference, most hypotheses of endosymbiont relationships are based on few available bacterial sequences. Here we examined how different sampling strategies of Gammaproteobacteria sequences affect estimates of the number of endosymbiont lineages in parasitic sucking lice (Insecta: Phthirapatera: Anoplura). We estimated the number of louse endosymbiont lineages using both newly obtained and previously sequenced 16S rDNA bacterial sequences and more than 42,000 16S rDNA sequences from other Gammaproteobacteria. We also performed parametric and nonparametric bootstrapping experiments to examine the effects of phylogenetic error and uncertainty on these estimates. Sampling of 16S rDNA sequences affects the estimates of endosymbiont diversity in sucking lice until we reach a threshold of genetic diversity, the size of which depends on the sampling strategy. Sampling by maximizing the diversity of 16S rDNA sequences is more efficient than randomly sampling available 16S rDNA sequences. Although simulation results validate estimates of multiple endosymbiont lineages in sucking lice, the bootstrap results suggest that the precise number of endosymbiont origins is still uncertain. PMID:27547523
Biondi, Enrico; Zamorano, Alan; Vega, Ernesto; Ardizzi, Stefano; Sitta, Davide; De Salvador, Flavio Roberto; Campos-Vargas, Reinaldo; Meneses, Claudio; Perez, Set; Bertaccini, Assunta; Fiore, Nicola
2018-05-01
Kiwifruit bleeding sap samples, collected in Italian and Chilean orchards from symptomatic and asymptomatic plants, were evaluated for the presence of Pseudomonas syringae pv. actinidiae, the causal agent of bacterial canker. The saps were sampled during the spring in both hemispheres, before the bud sprouting, during the optimal time window for the collection of an adequate volume of sample for the early detection of the pathogen, preliminarily by molecular assays, and then through its direct isolation and identification. The results of molecular analyses showed more effectiveness in the P. syringae pv. actinidiae detection when compared with those of microbiological analyses through the pathogen isolation on the nutritive and semiselective media selected. The bleeding sap analyses allowed the isolation and identification of two hypersensitive response (HR) negative and hypovirulent P. syringae pv. actinidiae strains from different regions in Italy. Moreover, multilocus sequence analysis (MLSA) and whole genome sequence (WGS) were carried out on selected Italian and Chilean P. syringae pv. actinidiae virulent strains to verify the presence of genetic variability compared with the HR negative strains and to compare the variability of selected gene clusters between strains isolated in both countries. All the strains showed the lack of argK and coronatine gene clusters as reported for the biovar 3 P. syringae pv. actinidiae strains. Despite the biologic differences obtained in the tobacco bioassays and in pathogenicity assays, the MLSA and WGS analyses did not show significant differences between the WGS of the HR negative and HR positive strains; the difference, on the other hand, between PAC_ICE sequences of Italian and Chilean P. syringae pv. actinidiae strains was confirmed. The inability of the hypovirulent strains IPV-BO 8893 and IPV-BO 9286 to provoke HR in tobacco and the low virulence shown in this host could not be associated with mutations or recombinations in T3SS island.
Bergfors, Assar; Leenheer, Daniël; Bergqvist, Anders; Ameur, Adam; Lennerstrand, Johan
2016-02-01
Development of Hepatitis C virus (HCV) resistance against direct-acting antivirals (DAAs), including NS5A inhibitors, is an obstacle to successful treatment of HCV when DAAs are used in sub-optimal combinations. Furthermore, it has been shown that baseline (pre-existing) resistance against DAAs is present in treatment naïve-patients and this will potentially complicate future treatment strategies in different HCV genotypes (GTs). Thus the aim was to detect low levels of NS5A resistant associated variants (RAVs) in a limited sample set of treatment-naïve patients of HCV GT1a and 3a, since such polymorphisms can display in vitro resistance as high as 60000 fold. Ultra-deep single molecule real time (SMRT) sequencing with the Pacific Biosciences (PacBio) RSII instrument was used to detect these RAVs. The SMRT sequencing was conducted on ten samples; three of them positive with Sanger sequencing (GT1a Q30H and Y93N, and GT3a Y93H), five GT1a samples, and two GT3a non-positive samples. The same methods were applied to the HCV GT1a H77-plasmid in a dilution series, in order to determine the error rates of replication, which in turn was used to determine the limit of detection (LOD), as defined by mean + 3SD, of minority variants down to 0.24%. We found important baseline NS5A RAVs at levels between 0.24 and 0.5%, which could potentially have clinical relevance. This new method with low level detection of baseline RAVs could be useful in predicting the most cost-efficient combination of DAA treatment, and reduce the treatment duration for an HCV infected individual. Copyright © 2015 Elsevier B.V. All rights reserved.
Genomics and museum specimens.
Nachman, Michael W
2013-12-01
Nearly 25 years ago, Allan Wilson and colleagues isolated DNA sequences from museum specimens of kangaroo rats (Dipodomys panamintinus) and compared these sequences with those from freshly collected animals (Thomas et al. 1990). The museum specimens had been collected up to 78 years earlier, so the two samples provided a direct temporal comparison of patterns of genetic variation. This was not the first time DNA sequences had been isolated from preserved material, but it was the first time it had been carried out with a population sample. Population geneticists often try to make inferences about the influence of historical processes such as selection, drift, mutation and migration on patterns of genetic variation in the present. The work of Wilson and colleagues was important in part because it suggested a way in which population geneticists could actually study genetic change in natural populations through time, much the same way that experimentalists can do with artificial populations in the laboratory. Indeed, the work of Thomas et al. (1990) spawned dozens of studies in which museum specimens were used to compare historical and present-day genetic diversity (reviewed in Wandeler et al. 2007). All of these studies, however, were limited by the same fundamental problem: old DNA is degraded into short fragments. As a consequence, these studies mostly involved PCR amplification of short templates, usually short stretches of mitochondrial DNA or microsatellites. In this issue, Bi et al. (2013) report a breakthrough that should open the door to studies of genomic variation in museum specimens. They used target enrichment (exon capture) and next-generation (Illumina) sequencing to compare patterns of genetic variation in historic and present-day population samples of alpine chipmunks (Tamias alpinus) (Fig. 1). The historic samples came from specimens collected in 1915, so the temporal span of this comparison is nearly 100 years. © 2013 John Wiley & Sons Ltd.
High-resolution melting analysis for detection of MYH9 mutations.
Provaznikova, Dana; Kumstyrova, Tereza; Kotlin, Roman; Salaj, Peter; Matoska, Vaclav; Hrachovinova, Ingrid; Rittich, Simon
2008-09-01
May-Hegglin anomaly (MHA), Sebastian (SBS), Fechtner (FTNS) and Epstein (EPS) syndromes are rare autosomal dominant disorders with giant platelets and thrombocytopenia. Other manifestations of these disorders are combinations of the presence of granulocyte inclusions and deafness, cataracts and renal failure. Currently, MHA, SBS, FTNS and EPS are considered to be distinct clinical manifestation of a single illness caused by mutations of the MYH9 gene encoding the heavy chain of non-muscle myosin IIA (NMMHC-IIA). As the MYH9 gene has a high number of exons, it takes much time and material to use this method for the detection of MYH9 mutations. Recently, a new method has been introduced for scanning DNA mutations without the need for direct sequencing: high-resolution melting analysis (HRMA). Mutation detection with HRMA relies on the intercalation of the specific dye (LC Green plus) in double-strand DNA and fluorescence monitoring of PCR product melting profiles. In our study, we optimized the conditions and used HRMA for rapid screening of mutations in all MYH9 exons in seven affected individuals from four unrelated families with suspected MYH9 disorders. Samples identified by HRMA as positive for the mutation were analysed by direct sequencing. HRMA saved us over 85% of redundant sequencing.
Finnerty, J R; Block, B A
1992-06-01
We were able to differentiate between species of billfish (Istiophoridae family) and to detect considerable intraspecific variation in the blue marlin (Makaira nigricans) by directly sequencing a polymerase chain reaction (PCR)-amplified, 612-bp fragment of the mitochondrial cytochrome b gene. Thirteen variable nucleotide sites separated blue marlin (n = 26) into 7 genotypes. On average, these genotypes differed by 5.7 base substitutions. A smaller sample of swordfish from an equally broad geographic distribution displayed relatively little intraspecific variation, with an average of 1.3 substitutions separating different genotypes. A cladistic analysis of blue marlin cytochrome b variants indicates two major divergent evolutionary lines within the species. The frequencies of these two major evolutionary lines differ significantly between Atlantic and Pacific ocean basins. This finding is important given that the Atlantic stocks of blue marlin are considered endangered. Migration from the Pacific can help replenish the numbers of blue marlin in the Atlantic, but the loss of certain mitochondrial DNA haplotypes in the Atlantic due to overfishing probably could not be remedied by an influx of Pacific fish because of their absence in the Pacific population. Fishery management strategies should attempt to preserve the genetic diversity within the species. The detection of DNA sequence polymorphism indicates the utility of PCR technology in pelagic fishery genetics.
Using pre-screening methods for an effective and reliable site characterization at megasites.
Algreen, Mette; Kalisz, Mariusz; Stalder, Marcel; Martac, Eugeniu; Krupanek, Janusz; Trapp, Stefan; Bartke, Stephan
2015-10-01
This paper illustrates the usefulness of pre-screening methods for an effective characterization of polluted sites. We applied a sequence of site characterization methods to a former Soviet military airbase with likely fuel and benzene, toluene, ethylbenzene, and xylene (BTEX) contamination in shallow groundwater and subsoil. The methods were (i) phytoscreening with tree cores; (ii) soil gas measurements for CH4, O2, and photoionization detector (PID); (iii) direct-push with membrane interface probe (MIP) and laser-induced fluorescence (LIF) sensors; (iv) direct-push sampling; and (v) sampling from soil and from groundwater monitoring wells. Phytoscreening and soil gas measurements are rapid and inexpensive pre-screening methods. Both indicated subsurface pollution and hot spots successfully. The direct-push sensors yielded 3D information about the extension and the volume of the subsurface plume. This study also expanded the applicability of tree coring to BTEX compounds and tested the use of high-resolution direct-push sensors for light hydrocarbons. Comparison of screening results to results from conventional soil and groundwater sampling yielded in most cases high rank correlation and confirmed the findings. The large-scale application of non- or low-invasive pre-screening can be of help in directing and focusing the subsequent, more expensive investigation methods. The rapid pre-screening methods also yielded useful information about potential remediation methods. Overall, we see several benefits of a stepwise screening and site characterization scheme, which we propose in conclusion.
Single molecule targeted sequencing for cancer gene mutation detection.
Gao, Yan; Deng, Liwei; Yan, Qin; Gao, Yongqian; Wu, Zengding; Cai, Jinsen; Ji, Daorui; Li, Gailing; Wu, Ping; Jin, Huan; Zhao, Luyang; Liu, Song; Ge, Liangjin; Deem, Michael W; He, Jiankui
2016-05-19
With the rapid decline in cost of sequencing, it is now affordable to examine multiple genes in a single disease-targeted clinical test using next generation sequencing. Current targeted sequencing methods require a separate step of targeted capture enrichment during sample preparation before sequencing. Although there are fast sample preparation methods available in market, the library preparation process is still relatively complicated for physicians to use routinely. Here, we introduced an amplification-free Single Molecule Targeted Sequencing (SMTS) technology, which combined targeted capture and sequencing in one step. We demonstrated that this technology can detect low-frequency mutations using artificially synthesized DNA sample. SMTS has several potential advantages, including simple sample preparation thus no biases and errors are introduced by PCR reaction. SMTS has the potential to be an easy and quick sequencing technology for clinical diagnosis such as cancer gene mutation detection, infectious disease detection, inherited condition screening and noninvasive prenatal diagnosis.
Meta sequence analysis of human blood peptides and their parent proteins.
Bowden, Peter; Pendrak, Voitek; Zhu, Peihong; Marshall, John G
2010-04-18
Sequence analysis of the blood peptides and their qualities will be key to understanding the mechanisms that contribute to error in LC-ESI-MS/MS. Analysis of peptides and their proteins at the level of sequences is much more direct and informative than the comparison of disparate accession numbers. A portable database of all blood peptide and protein sequences with descriptor fields and gene ontology terms might be useful for designing immunological or MRM assays from human blood. The results of twelve studies of human blood peptides and/or proteins identified by LC-MS/MS and correlated against a disparate array of genetic libraries were parsed and matched to proteins from the human ENSEMBL, SwissProt and RefSeq databases by SQL. The reported peptide and protein sequences were organized into an SQL database with full protein sequences and up to five unique peptides in order of prevalence along with the peptide count for each protein. Structured query language or BLAST was used to acquire descriptive information in current databases. Sampling error at the level of peptides is the largest source of disparity between groups. Chi Square analysis of peptide to protein distributions confirmed the significant agreement between groups on identified proteins. Copyright 2010. Published by Elsevier B.V.
Kolacsek, Orsolya; Pergel, Enikő; Varga, Nóra; Apáti, Ágota; Orbán, Tamás I
2017-01-20
There are numerous applications of quantitative PCR for both diagnostic and basic research. As in many other techniques the basis of quantification is that comparisons are made between different (unknown and known or reference) specimens of the same entity. When the aim is to compare real quantities of different species in samples, one cannot escape their separate precise absolute quantification. We have established a simple and reliable method for this purpose (Ct shift method) which combines the absolute and the relative approach. It requires a plasmid standard containing both sequences of amplicons to be compared (e.g. the target of interest and the endogenous control). It can serve as a reference sample with equal copies of templates for both targets. Using the ΔΔCt formula we can quantify the exact ratio of the two templates in each unknown sample. The Ct shift method has been successfully applied for transposon gene copy measurements, as well as for comparison of different mRNAs in cDNA samples. This study provides the proof of concept and introduces some potential applications of the method; the absolute nature of results even without the need for real reference samples can contribute to the universality of the method and comparability of different studies. Copyright © 2016 Elsevier B.V. All rights reserved.
Knierim, Dennis; Tsai, Wen-Shi; Kenyon, Lawrence
2013-06-01
Polerovirus infection was detected by reverse transcription polymerase chain reaction (RT-PCR) in 29 pepper plants (Capsicum spp.) and one black nightshade plant (Solanum nigrum) sample collected from fields in India, Indonesia, Mali, Philippines, Thailand and Taiwan. At least two representative samples for each country were selected to generate a general polerovirus RT-PCR product of 1.4 kb length for sequencing. Sequence analysis of the partial genome sequences revealed the presence of pepper vein yellows virus (PeVYV) in all 13 samples. A 1990 Australian herbarium sample of pepper described by serological means as infected with capsicum yellows virus (CYV) was identified by sequence analysis of a partial CP sequence as probably infected with a potato leaf roll virus (PLRV) isolate.
Differentially Private Frequent Sequence Mining via Sampling-based Candidate Pruning
Xu, Shengzhi; Cheng, Xiang; Li, Zhengyi; Xiong, Li
2016-01-01
In this paper, we study the problem of mining frequent sequences under the rigorous differential privacy model. We explore the possibility of designing a differentially private frequent sequence mining (FSM) algorithm which can achieve both high data utility and a high degree of privacy. We found, in differentially private FSM, the amount of required noise is proportionate to the number of candidate sequences. If we could effectively reduce the number of unpromising candidate sequences, the utility and privacy tradeoff can be significantly improved. To this end, by leveraging a sampling-based candidate pruning technique, we propose a novel differentially private FSM algorithm, which is referred to as PFS2. The core of our algorithm is to utilize sample databases to further prune the candidate sequences generated based on the downward closure property. In particular, we use the noisy local support of candidate sequences in the sample databases to estimate which sequences are potentially frequent. To improve the accuracy of such private estimations, a sequence shrinking method is proposed to enforce the length constraint on the sample databases. Moreover, to decrease the probability of misestimating frequent sequences as infrequent, a threshold relaxation method is proposed to relax the user-specified threshold for the sample databases. Through formal privacy analysis, we show that our PFS2 algorithm is ε-differentially private. Extensive experiments on real datasets illustrate that our PFS2 algorithm can privately find frequent sequences with high accuracy. PMID:26973430
Sobel Leonard, Ashley; McClain, Micah T; Smith, Gavin J D; Wentworth, David E; Halpin, Rebecca A; Lin, Xudong; Ransier, Amy; Stockwell, Timothy B; Das, Suman R; Gilbert, Anthony S; Lambkin-Williams, Robert; Ginsburg, Geoffrey S; Woods, Christopher W; Koelle, Katia
2016-12-15
Knowledge of influenza virus evolution at the point of transmission and at the intrahost level remains limited, particularly for human hosts. Here, we analyze a unique viral data set of next-generation sequencing (NGS) samples generated from a human influenza challenge study wherein 17 healthy subjects were inoculated with cell- and egg-passaged virus. Nasal wash samples collected from 7 of these subjects were successfully deep sequenced. From these, we characterized changes in the subjects' viral populations during infection and identified differences between the virus in these samples and the viral stock used to inoculate the subjects. We first calculated pairwise genetic distances between the subjects' nasal wash samples, the viral stock, and the influenza virus A/Wisconsin/67/2005 (H3N2) reference strain used to generate the stock virus. These distances revealed that considerable viral evolution occurred at various points in the human challenge study. Further quantitative analyses indicated that (i) the viral stock contained genetic variants that originated and likely were selected for during the passaging process, (ii) direct intranasal inoculation with the viral stock resulted in a selective bottleneck that reduced nonsynonymous genetic diversity in the viral hemagglutinin and nucleoprotein, and (iii) intrahost viral evolution continued over the course of infection. These intrahost evolutionary dynamics were dominated by purifying selection. Our findings indicate that rapid viral evolution can occur during acute influenza infection in otherwise healthy human hosts when the founding population size of the virus is large, as is the case with direct intranasal inoculation. Influenza viruses circulating among humans are known to rapidly evolve over time. However, little is known about how influenza virus evolves across single transmission events and over the course of a single infection. To address these issues, we analyze influenza virus sequences from a human challenge experiment that initiated infection with a cell- and egg-passaged viral stock, which appeared to have adapted during its preparation. We find that the subjects' viral populations differ genetically from the viral stock, with subjects' viral populations having lower representation of the amino-acid-changing variants that arose during viral preparation. We also find that most of the viral evolution occurring over single infections is characterized by further decreases in the frequencies of these amino-acid-changing variants and that only limited intrahost genetic diversification through new mutations is apparent. Our findings indicate that influenza virus populations can undergo rapid genetic changes during acute human infections. Copyright © 2016 Sobel Leonard et al.
Yao, Qiu-Mei; Zhou, Jiao; Gale, Robert Peter; Li, Jin-Lan; Li, Ling-Di; Li, Ning; Chen, Shan-Shan; Ruan, Guo-Rui
2015-10-01
Calreticulin (CALR) mutations were recently identified in a substantial proportion of persons with essential thrombocythemia (ET) and with primary myelofibrosis (PMF) without JAK2(V617F). Consequently rapid, sensitive, and specific methods to detect and quantify these mutations are needed. We studied samples from 1088 persons with myeloproliferative neoplasms (MPNs) including 421 JAK2(V617F) negative subjects with ET, PMF, polycythemia vera (PV), chronic myeloid leukemia (CML) and hyper-eosinophilic syndrome (HES). Detection of CALR exon 9 mutations was done by PCR amplification followed by fragment length analysis and direct sequencing. Dilution assays were used to determine CALR mutant allele burden. We detected CALR mutations in blood and bone marrow samples from 152 subjects with ET and with PMF but not in samples from normal or persons with PV, CML, or HES. CALR mutant peaks were distinct from wild-type peaks and dilution experiments indicated a sensitivity level of 0.5-5% for a CALR mutant allele in a wild-type background. Diverse types of mutations were detected including deletions, insertions, and complex indels. All mutations were confirmed by direct sequencing. We also used dilution experiments to quantify mutant allele burden. We were able to reproducibly detect mutant allele levels as low 5% (0.5-5%) in a wild-type background. PCR amplification followed by fragment length analysis is a rapid, sensitive, and specific method for screening persons with MPNs for CALR mutations, especially those with ET and PMF and for estimating mutant allele burden.
A novel, privacy-preserving cryptographic approach for sharing sequencing data
Cassa, Christopher A; Miller, Rachel A; Mandl, Kenneth D
2013-01-01
Objective DNA samples are often processed and sequenced in facilities external to the point of collection. These samples are routinely labeled with patient identifiers or pseudonyms, allowing for potential linkage to identity and private clinical information if intercepted during transmission. We present a cryptographic scheme to securely transmit externally generated sequence data which does not require any patient identifiers, public key infrastructure, or the transmission of passwords. Materials and methods This novel encryption scheme cryptographically protects participant sequence data using a shared secret key that is derived from a unique subset of an individual’s genetic sequence. This scheme requires access to a subset of an individual’s genetic sequence to acquire full access to the transmitted sequence data, which helps to prevent sample mismatch. Results We validate that the proposed encryption scheme is robust to sequencing errors, population uniqueness, and sibling disambiguation, and provides sufficient cryptographic key space. Discussion Access to a set of an individual’s genotypes and a mutually agreed cryptographic seed is needed to unlock the full sequence, which provides additional sample authentication and authorization security. We present modest fixed and marginal costs to implement this transmission architecture. Conclusions It is possible for genomics researchers who sequence participant samples externally to protect the transmission of sequence data using unique features of an individual’s genetic sequence. PMID:23125421
Lewandowska, Dagmara W; Zagordi, Osvaldo; Geissberger, Fabienne-Desirée; Kufner, Verena; Schmutz, Stefan; Böni, Jürg; Metzner, Karin J; Trkola, Alexandra; Huber, Michael
2017-08-08
Sequence-specific PCR is the most common approach for virus identification in diagnostic laboratories. However, as specific PCR only detects pre-defined targets, novel virus strains or viruses not included in routine test panels will be missed. Recently, advances in high-throughput sequencing allow for virus-sequence-independent identification of entire virus populations in clinical samples, yet standardized protocols are needed to allow broad application in clinical diagnostics. Here, we describe a comprehensive sample preparation protocol for high-throughput metagenomic virus sequencing using random amplification of total nucleic acids from clinical samples. In order to optimize metagenomic sequencing for application in virus diagnostics, we tested different enrichment and amplification procedures on plasma samples spiked with RNA and DNA viruses. A protocol including filtration, nuclease digestion, and random amplification of RNA and DNA in separate reactions provided the best results, allowing reliable recovery of viral genomes and a good correlation of the relative number of sequencing reads with the virus input. We further validated our method by sequencing a multiplexed viral pathogen reagent containing a range of human viruses from different virus families. Our method proved successful in detecting the majority of the included viruses with high read numbers and compared well to other protocols in the field validated against the same reference reagent. Our sequencing protocol does work not only with plasma but also with other clinical samples such as urine and throat swabs. The workflow for virus metagenomic sequencing that we established proved successful in detecting a variety of viruses in different clinical samples. Our protocol supplements existing virus-specific detection strategies providing opportunities to identify atypical and novel viruses commonly not accounted for in routine diagnostic panels.
Characterization of genetic variability of Venezuelan equine encephalitis viruses
Gardner, Shea N.; McLoughlin, Kevin; Be, Nicholas A.; ...
2016-04-07
Venezuelan equine encephalitis virus (VEEV) is a mosquito-borne alphavirus that has caused large outbreaks of severe illness in both horses and humans. New approaches are needed to rapidly infer the origin of a newly discovered VEEV strain, estimate its equine amplification and resultant epidemic potential, and predict human virulence phenotype. We performed whole genome single nucleotide polymorphism (SNP) analysis of all available VEE antigenic complex genomes, verified that a SNP-based phylogeny accurately captured the features of a phylogenetic tree based on multiple sequence alignment, and developed a high resolution genome-wide SNP microarray. We used the microarray to analyze a broadmore » panel of VEEV isolates, found excellent concordance between array- and sequence-based SNP calls, genotyped unsequenced isolates, and placed them on a phylogeny with sequenced genomes. The microarray successfully genotyped VEEV directly from tissue samples of an infected mouse, bypassing the need for viral isolation, culture and genomic sequencing. Lastly, we identified genomic variants associated with serotypes and host species, revealing a complex relationship between genotype and phenotype.« less
NASA Astrophysics Data System (ADS)
Wu, Jiangling; Huang, Yu; Bian, Xintong; Li, DanDan; Cheng, Quan; Ding, Shijia
2016-10-01
In this work, a custom-made intensity-interrogation surface plasmon resonance imaging (SPRi) system has been developed to directly detect a specific sequence of BCR/ABL fusion gene in chronic myelogenous leukemia (CML). The variation in the reflected light intensity detected from the sensor chip composed of gold islands array is proportional to the change of refractive index due to the selective hybridization of surface-bound DNA probes with target ssDNA. SPRi measurements were performed with different concentrations of synthetic target DNA sequence. The calibration curve of synthetic target sequence shows a good relationship between the concentration of synthetic target and the change of reflected light intensity. The detection limit of this SPRi measurement could approach 10.29 nM. By comparing SPRi images, the target ssDNA and non-complementary DNA sequence are able to be distinguished. This SPRi system has been applied for assay of BCR/ABL fusion gene extracted from real samples. This nucleic acid-based SPRi biosensor therefore offers an alternative high-effective, high-throughput label-free tool for DNA detection in biomedical research and molecular diagnosis.
Human Y chromosome copy number variation in the next generation sequencing era and beyond.
Massaia, Andrea; Xue, Yali
2017-05-01
The human Y chromosome provides a fertile ground for structural rearrangements owing to its haploidy and high content of repeated sequences. The methodologies used for copy number variation (CNV) studies have developed over the years. Low-throughput techniques based on direct observation of rearrangements were developed early on, and are still used, often to complement array-based or sequencing approaches which have limited power in regions with high repeat content and specifically in the presence of long, identical repeats, such as those found in human sex chromosomes. Some specific rearrangements have been investigated for decades; because of their effects on fertility, or their outstanding evolutionary features, the interest in these has not diminished. However, following the flourishing of large-scale genomics, several studies have investigated CNVs across the whole chromosome. These studies sometimes employ data generated within large genomic projects such as the DDD study or the 1000 Genomes Project, and often survey large samples of healthy individuals without any prior selection. Novel technologies based on sequencing long molecules and combinations of technologies, promise to stimulate the study of Y-CNVs in the immediate future.
Geochemistry of volcanic rocks from the Wawa greenstone belt
NASA Technical Reports Server (NTRS)
Schulz, K. J.; Sylvester, P. J.; Attoh, K.
1983-01-01
The Wawa greenstone belt is located in the District of Algoma and extends east-northeast from Lake Superior to the western part of the Sudbury District in Ontario, Canada. Recent mapping by Attoh has shown that an unconformity at the base of the Dore' Formation and equivalent sedimentary rocks marks a significant stratigraphic break which can be traced throughout the volcanic belt. This break has been used to subdivide the volcanic-sedimentary into pre- and post-Dore' sequences. The pre-Dore' sequence includes at least two cycles of mafic-to-felsic volcanism, each capped by an iron-formation unit. The post-Dore' sequence includes an older mafic-to-felsic unit, which directly overlies sedimentary rocks correlated with the Dore' Formation, and a younger felsic breccia unit interpreted to have formed as debris flows from a felsic volcanic center. In the present study, samples of both the pre-and post-Dore' volcanic sequences were analyzed for major and trace elements, incuding rare earths (REE). This preliminary study is part of an ongoing program to assess the petrogenesis of the volcanic rocks of the Wawa greenstone belt.
Tuininga, Amy R; Miller, Jessica L; Morath, Shannon U; Daniels, Thomas J; Falco, Richard C; Marchese, Michael; Sahabi, Sadia; Rosa, Dieshia; Stafford, Kirby C
2009-05-01
Entomopathogenic fungi are commonly found in forested soils that provide tick habitat, and many species are pathogenic to Ixodes scapularis Say, the blacklegged tick. As a first step to developing effective biocontrol strategies, the objective of this study was to determine the best methods to isolate entomopathogenic fungal species from field-collected samples of soils and ticks from an Eastern deciduous forest where I. scapularis is common. Several methods were assessed: (1) soils, leaf litter, and ticks were plated on two types of media; (2) soils were assayed for entomopathogenic fungi using the Galleria bait method; (3) DNA from internal transcribed spacer (ITS) regions of the nuclear ribosomal repeat was extracted from pure cultures obtained from soils, Galleria, and ticks and was amplified and sequenced; and (4) DNA was extracted directly from ticks, amplified, and sequenced. We conclude that (1) ticks encounter potentially entomopathogenic fungi more often in soil than in leaf litter, (2) many species of potentially entomopathogenic fungi found in the soil can readily be cultured, (3) the Galleria bait method is a sufficiently efficient method for isolation of these fungi from soils, and (4) although DNA extraction from ticks was not possible in this study because of small sample size, DNA extraction from fungi isolated from soils and from ticks was successful and provided clean sequences in 100 and 73% of samples, respectively. A combination of the above methods is clearly necessary for optimal characterization of entomopathogenic fungi associated with ticks in the environment.
Wood, David L. A.; Nones, Katia; Steptoe, Anita; Christ, Angelika; Harliwong, Ivon; Newell, Felicity; Bruxner, Timothy J. C.; Miller, David; Cloonan, Nicole; Grimmond, Sean M.
2015-01-01
Genetic variation modulates gene expression transcriptionally or post-transcriptionally, and can profoundly alter an individual’s phenotype. Measuring allelic differential expression at heterozygous loci within an individual, a phenomenon called allele-specific expression (ASE), can assist in identifying such factors. Massively parallel DNA and RNA sequencing and advances in bioinformatic methodologies provide an outstanding opportunity to measure ASE genome-wide. In this study, matched DNA and RNA sequencing, genotyping arrays and computationally phased haplotypes were integrated to comprehensively and conservatively quantify ASE in a single human brain and liver tissue sample. We describe a methodological evaluation and assessment of common bioinformatic steps for ASE quantification, and recommend a robust approach to accurately measure SNP, gene and isoform ASE through the use of personalized haplotype genome alignment, strict alignment quality control and intragenic SNP aggregation. Our results indicate that accurate ASE quantification requires careful bioinformatic analyses and is adversely affected by sample specific alignment confounders and random sampling even at moderate sequence depths. We identified multiple known and several novel ASE genes in liver, including WDR72, DSP and UBD, as well as genes that contained ASE SNPs with imbalance direction discordant with haplotype phase, explainable by annotated transcript structure, suggesting isoform derived ASE. The methods evaluated in this study will be of use to researchers performing highly conservative quantification of ASE, and the genes and isoforms identified as ASE of interest to researchers studying those loci. PMID:25965996
O'Flaherty, Brigid M; Li, Yan; Tao, Ying; Paden, Clinton R; Queen, Krista; Zhang, Jing; Dinwiddie, Darrell L; Gross, Stephen M; Schroth, Gary P; Tong, Suxiang
2018-06-01
Next generation sequencing (NGS) technologies have revolutionized the genomics field and are becoming more commonplace for identification of human infectious diseases. However, due to the low abundance of viral nucleic acids (NAs) in relation to host, viral identification using direct NGS technologies often lacks sufficient sensitivity. Here, we describe an approach based on two complementary enrichment strategies that significantly improves the sensitivity of NGS-based virus identification. To start, we developed two sets of DNA probes to enrich virus NAs associated with respiratory diseases. The first set of probes spans the genomes, allowing for identification of known viruses and full genome sequencing, while the second set targets regions conserved among viral families or genera, providing the ability to detect both known and potentially novel members of those virus groups. Efficiency of enrichment was assessed by NGS testing reference virus and clinical samples with known infection. We show significant improvement in viral identification using enriched NGS compared to unenriched NGS. Without enrichment, we observed an average of 0.3% targeted viral reads per sample. However, after enrichment, 50%-99% of the reads per sample were the targeted viral reads for both the reference isolates and clinical specimens using both probe sets. Importantly, dramatic improvements on genome coverage were also observed following virus-specific probe enrichment. The methods described here provide improved sensitivity for virus identification by NGS, allowing for a more comprehensive analysis of disease etiology. © 2018 O'Flaherty et al.; Published by Cold Spring Harbor Laboratory Press.
Cai, Sheng; Tian, Xueke; Sun, Lianli; Hu, Haihong; Zheng, Shirui; Jiang, Huidi; Yu, Lushan; Zeng, Su
2015-10-20
Wide use of platinum-based chemotherapeutic regimens for the treatment for carcinoma calls for a simple and selective detection of platinum compound in biological samples. On the basis of the platinum(II)-base pair coordination, a novel type of aptameric platform for platinum detection has been introduced. This chemiluminescence (CL) aptasensor consists of a designed streptavidin (SA) aptamer sequence in which several base pairs were replaced by G-G mismatches. Only in the presence of platinum, coordination occurs between the platinum and G-G base pairs as opposed to the hydrogen-bonded G-C base pairs, which leads to SA aptamer sequence activation, resulting in their binding to SA coated magnetic beads. These Pt-DNA coordination events were monitored by a simple and direct luminol-peroxide CL reaction through horseradish peroxidase (HRP) catalysis with a strong chemiluminescence emission. The validated ranges of quantification were 0.12-240 μM with a limit of detection of 60 nM and selectivity over other metal ions. This assay was also successfully used in urine sample determination. It will be a promising candidate for the detection of platinum in biomedical and environmental samples.
NASA Astrophysics Data System (ADS)
Chassefiere, E.; Jambon, A.; Berthelier, J.-J.; Sarda, Ph.; Agrinier, P.
2003-04-01
The technique of GCMS analysis, which has been used with a great success on several past planetary missions, is not adapted for precise measurements of the isotopic composition of planetary atmospheres (noble gases, stable isotopes), and volatile outgassed products from solid sample pyrolysis. Static mass spectrometry, coupled with gas separation by cryo-separation, and chemical trapping, is commonly used in the laboratory to study volatiles extracted from terrestrial and meteoritic samples. This technique allows to reach a precision on isotopic ratios of the order of a few 0.1 ppm for a typical amount of gas of a few micromoles. We are presently studying an instrument based on the same principle for space exploration applications. The PALOMA instrument (PAyload for Local Observation of Mars Atmosphere) will be proposed in response to the AO for the instrumentation of the NASA Mars Smart Lander mission, planned to be launched in 2009. It might be part as well of the EXOMARS mission presently studied at ESA in the frame of the Aurora program. The miniaturization of major key elements, like the cryogenic device, the mass spectrometer, the line and its ensemble of valves, is presently led in our laboratories under CNES funding. The instrument consists of : (i) a gas purification and separation line, using techniques of cryogenic and chemical trapping, and possibly membrane permeation for molecular hydrogen analysis, (ii) a mass spectrometer working in static mode, without carrier gas (both time-of-flight and magnetic solutions are studied), (iii) a turbo-molecular pump that provides the required level of vacuum in the separation line and in the spectrometer. In the specific case of Mars, it is designed to work during typically 2 years (about 1000 measurement cycles), in order to perform accurate measurements of molecular, elemental and isotopic composition and of their diurnal/seasonal variations. The gas is sampled directly from the ambient atmosphere, without need for an external sample distribution system. The general characteristics of the instrument are as following . The mass is 6 kg, for a size of 30 x 30 x 20 cm. The required power, averaged over a complete measurement cycle, is 20 W (peak value : 30 W). The total energy required for one sequence is 100 Wh. This number must be considered as an upper limit, and corresponds to the most complex sequence (noble gas isotope analysis). Sequences used for stable isotopes measurement, and atmospheric molecular composition (trace gases of geological and/or astrobiological interest), are expected to be simpler, and less power-consuming. The anticipated volume of data produced by one observation sequence is estimated to be in the 3-6 kb range. The gas is sampled directly from the ambient atmosphere.
Wong, Danny Ka-Ho; Tsoi, Ottilia; Huang, Fung-Yu; Seto, Wai-Kay; Fung, James; Lai, Ching-Lung
2014-01-01
Nucleoside/nucleotide analogue for the treatment of chronic hepatitis B virus (HBV) infection is hampered by the emergence of drug resistance mutations. Conventional PCR sequencing cannot detect minor variants of <20%. We developed a modified co-amplification at lower denaturation temperature-PCR (COLD-PCR) method for the detection of HBV minority drug resistance mutations. The critical denaturation temperature for COLD-PCR was determined to be 78°C. Sensitivity of COLD-PCR sequencing was determined using serially diluted plasmids containing mixed proportions of HBV reverse transcriptase (rt) wild-type and mutant sequences. Conventional PCR sequencing detected mutations only if they existed in ≥25%, whereas COLD-PCR sequencing detected mutations when they existed in 5 to 10% of the viral population. The performance of COLD-PCR was compared to conventional PCR sequencing and a line probe assay (LiPA) using 215 samples obtained from 136 lamivudine- or telbivudine-treated patients with virological breakthrough. Among these 215 samples, drug resistance mutations were detected in 155 (72%), 148 (69%), and 113 samples (53%) by LiPA, COLD-PCR, and conventional PCR sequencing, respectively. Nineteen (9%) samples had mutations detectable by COLD-PCR but not LiPA, while 26 (12%) samples had mutations detectable by LiPA but not COLD-PCR, indicating both methods were comparable (P = 0.371). COLD-PCR was more sensitive than conventional PCR sequencing. Thirty-five (16%) samples had mutations detectable by COLD-PCR but not conventional PCR sequencing, while none had mutations detected by conventional PCR sequencing but not COLD-PCR (P < 0.0001). COLD-PCR sequencing is a simple method which is comparable to LiPA and superior to conventional PCR sequencing in detecting minor lamivudine/telbivudine resistance mutations. PMID:24951803
Kilpatrick, David R.; Nakamura, Tomofumi; Burns, Cara C.; Bukbuk, David; Oderinde, Soji B.; Oberste, M. Steven; Kew, Olen M.; Pallansch, Mark A.; Shimizu, Hiroyuki
2014-01-01
Laboratory diagnosis has played a critical role in the Global Polio Eradication Initiative since 1988, by isolating and identifying poliovirus (PV) from stool specimens by using cell culture as a highly sensitive system to detect PV. In the present study, we aimed to develop a molecular method to detect PV directly from stool extracts, with a high efficiency comparable to that of cell culture. We developed a method to efficiently amplify the entire capsid coding region of human enteroviruses (EVs) including PV. cDNAs of the entire capsid coding region (3.9 kb) were obtained from as few as 50 copies of PV genomes. PV was detected from the cDNAs with an improved PV-specific real-time reverse transcription-PCR system and nucleotide sequence analysis of the VP1 coding region. For assay validation, we analyzed 84 stool extracts that were positive for PV in cell culture and detected PV genomes from 100% of the extracts (84/84 samples) with this method in combination with a PV-specific extraction method. PV could be detected in 2/4 stool extract samples that were negative for PV in cell culture. In PV-positive samples, EV species C viruses were also detected with high frequency (27% [23/86 samples]). This method would be useful for direct detection of PV from stool extracts without using cell culture. PMID:25339406
Mallory, Melanie A; Lucic, Danijela; Ebbert, Mark T W; Cloherty, Gavin A; Toolsie, Dan; Hillyard, David R
2017-05-01
HCV genotyping remains a critical tool for guiding initiation of therapy and selecting the most appropriate treatment regimen. Current commercial genotyping assays may have difficulty identifying 1a, 1b and genotype 6. To evaluate the concordance for identifying 1a, 1b, and genotype 6 between two methods: the PLUS assay and core/NS5B sequencing. This study included 236 plasma and serum samples previously genotyped by core/NS5B sequencing. Of these, 25 samples were also previously tested by the Abbott RealTime HCV GT II Research Use Only (RUO) assay and yielded ambiguous results. The remaining 211 samples were routine genotype 1 (n=169) and genotype 6 (n=42). Genotypes obtained from sequence data were determined using a laboratory-developed HCV sequence analysis tool and the NCBI non-redundant database. Agreement between the PLUS assay and core/NS5B sequencing for genotype 1 samples was 95.8% (162/169), with 96% (127/132) and 95% (35/37) agreement for 1a and 1b samples respectively. PLUS results agreed with core/NS5B sequencing for 83% (35/42) of unselected genotype 6 samples, with the remaining seven "not detected" by the PLUS assay. Among the 25 samples with ambiguous GT II results, 15 were concordant by PLUS and core/NS5B sequencing, nine were not detected by PLUS, and one sample had an internal control failure. The PLUS assay is an automated method that identifies 1a, 1b and genotype 6 with good agreement with gold-standard core/NS5B sequencing and can aid in the resolution of certain genotype samples with ambiguous GT II results. Copyright © 2017 Elsevier B.V. All rights reserved.
Jun, Goo; Flickinger, Matthew; Hetrick, Kurt N.; Romm, Jane M.; Doheny, Kimberly F.; Abecasis, Gonçalo R.; Boehnke, Michael; Kang, Hyun Min
2012-01-01
DNA sample contamination is a serious problem in DNA sequencing studies and may result in systematic genotype misclassification and false positive associations. Although methods exist to detect and filter out cross-species contamination, few methods to detect within-species sample contamination are available. In this paper, we describe methods to identify within-species DNA sample contamination based on (1) a combination of sequencing reads and array-based genotype data, (2) sequence reads alone, and (3) array-based genotype data alone. Analysis of sequencing reads allows contamination detection after sequence data is generated but prior to variant calling; analysis of array-based genotype data allows contamination detection prior to generation of costly sequence data. Through a combination of analysis of in silico and experimentally contaminated samples, we show that our methods can reliably detect and estimate levels of contamination as low as 1%. We evaluate the impact of DNA contamination on genotype accuracy and propose effective strategies to screen for and prevent DNA contamination in sequencing studies. PMID:23103226
Fast converging minimum probability of error neural network receivers for DS-CDMA communications.
Matyjas, John D; Psaromiligkos, Ioannis N; Batalama, Stella N; Medley, Michael J
2004-03-01
We consider a multilayer perceptron neural network (NN) receiver architecture for the recovery of the information bits of a direct-sequence code-division-multiple-access (DS-CDMA) user. We develop a fast converging adaptive training algorithm that minimizes the bit-error rate (BER) at the output of the receiver. The adaptive algorithm has three key features: i) it incorporates the BER, i.e., the ultimate performance evaluation measure, directly into the learning process, ii) it utilizes constraints that are derived from the properties of the optimum single-user decision boundary for additive white Gaussian noise (AWGN) multiple-access channels, and iii) it embeds importance sampling (IS) principles directly into the receiver optimization process. Simulation studies illustrate the BER performance of the proposed scheme.
The contribution of 700,000 ORF sequence tags to the definition of the human transcriptome
Camargo, Anamaria A.; Samaia, Helena P. B.; Dias-Neto, Emmanuel; Simão, Daniel F.; Migotto, Italo A.; Briones, Marcelo R. S.; Costa, Fernando F.; Aparecida Nagai, Maria; Verjovski-Almeida, Sergio; Zago, Marco A.; Andrade, Luis Eduardo C.; Carrer, Helaine; El-Dorry, Hamza F. A.; Espreafico, Enilza M.; Habr-Gama, Angelita; Giannella-Neto, Daniel; Goldman, Gustavo H.; Gruber, Arthur; Hackel, Christine; Kimura, Edna T.; Maciel, Rui M. B.; Marie, Suely K. N.; Martins, Elizabeth A. L.; Nóbrega, Marina P.; Paçó-Larson, Maria Luisa; Pardini, Maria Inês M. C.; Pereira, Gonçalo G.; Pesquero, João Bosco; Rodrigues, Vanderlei; Rogatto, Silvia R.; da Silva, Ismael D. C. G.; Sogayar, Mari C.; Sonati, Maria de Fátima; Tajara, Eloiza H.; Valentini, Sandro R.; Alberto, Fernando L.; Amaral, Maria Elisabete J.; Aneas, Ivy; Arnaldi, Liliane A. T.; de Assis, Angela M.; Bengtson, Mário Henrique; Bergamo, Nadia Aparecida; Bombonato, Vanessa; de Camargo, Maria E. R.; Canevari, Renata A.; Carraro, Dirce M.; Cerutti, Janete M.; Corrêa, Maria Lucia C.; Corrêa, Rosana F. R.; Costa, Maria Cristina R.; Curcio, Cyntia; Hokama, Paula O. M.; Ferreira, Ari J. S.; Furuzawa, Gilberto K.; Gushiken, Tsieko; Ho, Paulo L.; Kimura, Elza; Krieger, José E.; Leite, Luciana C. C.; Majumder, Paromita; Marins, Mozart; Marques, Everaldo R.; Melo, Analy S. A.; Melo, Monica; Mestriner, Carlos Alberto; Miracca, Elisabete C.; Miranda, Daniela C.; Nascimento, Ana Lucia T. O.; Nóbrega, Francisco G.; Ojopi, Élida P. B.; Pandolfi, José Rodrigo C.; Pessoa, Luciana G.; Prevedel, Aline C.; Rahal, Paula; Rainho, Claudia A.; Reis, Eduardo M. R.; Ribeiro, Marcelo L.; da Rós, Nancy; de Sá, Renata G.; Sales, Magaly M.; Sant'anna, Simone Cristina; dos Santos, Mariana L.; da Silva, Aline M.; da Silva, Neusa P.; Silva, Wilson A.; da Silveira, Rosana A.; Sousa, Josane F.; Stecconi, Daniella; Tsukumo, Fernando; Valente, Valéria; Soares, Fernando; Moreira, Eloisa S.; Nunes, Diana N.; Correa, Ricardo G.; Zalcberg, Heloisa; Carvalho, Alex F.; Reis, Luis F. L.; Brentani, Ricardo R.; Simpson, Andrew J. G.; de Souza, Sandro J.
2001-01-01
Open reading frame expressed sequences tags (ORESTES) differ from conventional ESTs by providing sequence data from the central protein coding portion of transcripts. We generated a total of 696,745 ORESTES sequences from 24 human tissues and used a subset of the data that correspond to a set of 15,095 full-length mRNAs as a means of assessing the efficiency of the strategy and its potential contribution to the definition of the human transcriptome. We estimate that ORESTES sampled over 80% of all highly and moderately expressed, and between 40% and 50% of rarely expressed, human genes. In our most thoroughly sequenced tissue, the breast, the 130,000 ORESTES generated are derived from transcripts from an estimated 70% of all genes expressed in that tissue, with an equally efficient representation of both highly and poorly expressed genes. In this respect, we find that the capacity of the ORESTES strategy both for gene discovery and shotgun transcript sequence generation significantly exceeds that of conventional ESTs. The distribution of ORESTES is such that many human transcripts are now represented by a scaffold of partial sequences distributed along the length of each gene product. The experimental joining of the scaffold components, by reverse transcription–PCR, represents a direct route to transcript finishing that may represent a useful alternative to full-length cDNA cloning. PMID:11593022
The contribution of 700,000 ORF sequence tags to the definition of the human transcriptome.
Camargo, A A; Samaia, H P; Dias-Neto, E; Simão, D F; Migotto, I A; Briones, M R; Costa, F F; Nagai, M A; Verjovski-Almeida, S; Zago, M A; Andrade, L E; Carrer, H; El-Dorry, H F; Espreafico, E M; Habr-Gama, A; Giannella-Neto, D; Goldman, G H; Gruber, A; Hackel, C; Kimura, E T; Maciel, R M; Marie, S K; Martins, E A; Nobrega, M P; Paco-Larson, M L; Pardini, M I; Pereira, G G; Pesquero, J B; Rodrigues, V; Rogatto, S R; da Silva, I D; Sogayar, M C; Sonati, M F; Tajara, E H; Valentini, S R; Alberto, F L; Amaral, M E; Aneas, I; Arnaldi, L A; de Assis, A M; Bengtson, M H; Bergamo, N A; Bombonato, V; de Camargo, M E; Canevari, R A; Carraro, D M; Cerutti, J M; Correa, M L; Correa, R F; Costa, M C; Curcio, C; Hokama, P O; Ferreira, A J; Furuzawa, G K; Gushiken, T; Ho, P L; Kimura, E; Krieger, J E; Leite, L C; Majumder, P; Marins, M; Marques, E R; Melo, A S; Melo, M B; Mestriner, C A; Miracca, E C; Miranda, D C; Nascimento, A L; Nobrega, F G; Ojopi, E P; Pandolfi, J R; Pessoa, L G; Prevedel, A C; Rahal, P; Rainho, C A; Reis, E M; Ribeiro, M L; da Ros, N; de Sa, R G; Sales, M M; Sant'anna, S C; dos Santos, M L; da Silva, A M; da Silva, N P; Silva, W A; da Silveira, R A; Sousa, J F; Stecconi, D; Tsukumo, F; Valente, V; Soares, F; Moreira, E S; Nunes, D N; Correa, R G; Zalcberg, H; Carvalho, A F; Reis, L F; Brentani, R R; Simpson, A J; de Souza, S J; Melo, M
2001-10-09
Open reading frame expressed sequences tags (ORESTES) differ from conventional ESTs by providing sequence data from the central protein coding portion of transcripts. We generated a total of 696,745 ORESTES sequences from 24 human tissues and used a subset of the data that correspond to a set of 15,095 full-length mRNAs as a means of assessing the efficiency of the strategy and its potential contribution to the definition of the human transcriptome. We estimate that ORESTES sampled over 80% of all highly and moderately expressed, and between 40% and 50% of rarely expressed, human genes. In our most thoroughly sequenced tissue, the breast, the 130,000 ORESTES generated are derived from transcripts from an estimated 70% of all genes expressed in that tissue, with an equally efficient representation of both highly and poorly expressed genes. In this respect, we find that the capacity of the ORESTES strategy both for gene discovery and shotgun transcript sequence generation significantly exceeds that of conventional ESTs. The distribution of ORESTES is such that many human transcripts are now represented by a scaffold of partial sequences distributed along the length of each gene product. The experimental joining of the scaffold components, by reverse transcription-PCR, represents a direct route to transcript finishing that may represent a useful alternative to full-length cDNA cloning.
Sokolova, O I; Dem'ianov, A V; Bovers, L S; Did'e, E S; Sokolova, Iu Ia
2011-01-01
The FTA technology was applied for sampling, archiving, and molecular analysis of the DNA isolated from stool samples to diagnose and identify microsporidia, the intracellular opportunistic parasites which induce malabsortion syndrome in immunosuppressed humans, particularly in patients with AIDS. Microsporidia DNA was successfully amplified in 6 of 50 stool samples of HIV-positive patients of the S. P. Botkin Memorial Infectious Disease Hospital (St. Petersburg) applied to FTA cards (FTA-Cars, Whatman Inc. Florham Park, NJ, USA). Amplicons (the fragments of rDNA) were directly sequenced, and microsporidia species--Encephalitozoon intestinalis, E. cuniculi, E. hellem, and Enterocytozoon bieneusi--were identified in Genbank by NCBI BLAST program. The FTA method of DNA immobilization is especially promising for epidemiological and field population studies which involve genotyping of microsporidia species and isolates.
A micro-rheological method for determination of blood type.
Makulska, Sylwia; Jakiela, Slawomir; Garstecki, Piotr
2013-07-21
The measurement of time and distance can be used for determining agglutination in small (nL) samples of liquid. We demonstrate the use of this new scheme of detection in typing and subtyping blood in a simple microfluidic system that monitors the speed of flow of microdroplets. The system (i) accepts small samples of liquids deposited directly onto the chip, (ii) forms droplets on demand from these samples, (iii) merges the droplets, and (iv) measures their speed in a microchannel. A sequence of measurements on different combinations of blood and antibodies can thus be used to determine blood type with the estimated probability of mistyping being less than 1 in a million tests. In addition, in the agglutinated samples, red blood cells concentrate at the rear of the droplets yielding an additional vista for detection and suggesting a possible mechanism for separations.
Vollmer, Tanja; Störmer, Melanie; Kleesiek, Knut; Dreier, Jens
2008-01-01
In the present study, a novel broad-range real-time PCR was developed for the rapid detection of human pathogenic fungi. The assay targets a part of the 28S large-subunit ribosomal RNA (rDNA) gene. We investigated its application for the most important human pathogenic fungal genera, including Aspergillus, Candida, Cryptococcus, Mucor, Penicillium, Pichia, Microsporum, Trichophyton, and Scopulariopsis. Species were identified in PCR-positive reactions by direct DNA sequencing. A noncompetitive internal control was applied to prevent false-negative results due to PCR inhibition. The minimum detection limit for the PCR was determined to be one 28S rDNA copy per PCR, and the 95% detection limit was calculated to 15 copies per PCR. To assess the clinical applicability of the PCR method, intensive-care patients with artificial respiration and patients with infective endocarditis were investigated. For this purpose, 76 tracheal secretion samples and 70 heart valve tissues were analyzed in parallel by real-time PCR and cultivation. No discrepancies in results were observed between PCR analysis and cultivation methods. Furthermore, the application of the PCR method was investigated for other clinical specimens, including cervical swabs, nail and horny skin scrapings, and serum, blood, and urine samples. The combination of a broad-range real-time PCR and direct sequencing facilitates rapid screening for fungal infection in various clinical specimens. PMID:18385440
Vollmer, Tanja; Störmer, Melanie; Kleesiek, Knut; Dreier, Jens
2008-06-01
In the present study, a novel broad-range real-time PCR was developed for the rapid detection of human pathogenic fungi. The assay targets a part of the 28S large-subunit ribosomal RNA (rDNA) gene. We investigated its application for the most important human pathogenic fungal genera, including Aspergillus, Candida, Cryptococcus, Mucor, Penicillium, Pichia, Microsporum, Trichophyton, and Scopulariopsis. Species were identified in PCR-positive reactions by direct DNA sequencing. A noncompetitive internal control was applied to prevent false-negative results due to PCR inhibition. The minimum detection limit for the PCR was determined to be one 28S rDNA copy per PCR, and the 95% detection limit was calculated to 15 copies per PCR. To assess the clinical applicability of the PCR method, intensive-care patients with artificial respiration and patients with infective endocarditis were investigated. For this purpose, 76 tracheal secretion samples and 70 heart valve tissues were analyzed in parallel by real-time PCR and cultivation. No discrepancies in results were observed between PCR analysis and cultivation methods. Furthermore, the application of the PCR method was investigated for other clinical specimens, including cervical swabs, nail and horny skin scrapings, and serum, blood, and urine samples. The combination of a broad-range real-time PCR and direct sequencing facilitates rapid screening for fungal infection in various clinical specimens.
NASA Technical Reports Server (NTRS)
Birmele, Michele N.
2011-01-01
The Regenerative, Environmental Control and Life Support System (ECLSS) on the International Space Station (ISS) includes the the Water Recovery System (WRS) and the Oxygen Generation System (OGS). The WRS consists of a Urine Processor Assembly (UPA) and Water Processor Assembly (WPA). This report describes microbial characterization of wastewater and surface samples collected from the WRS and OGS subsystems, returned to KSC, JSC, and MSFC on consecutive shuttle flights (STS-129 and STS-130) in 2009-10. STS-129 returned two filters that contained fluid samples from the WPA Waste Tank Orbital Recovery Unit (ORU), one from the waste tank and the other from the ISS humidity condensate. Direct count by microscopic enumeration revealed 8.38 x 104 cells per mL in the humidity condensate sample, but none of those cells were recoverable on solid agar media. In contrast, 3.32 x lOs cells per mL were measured from a surface swab of the WRS waste tank, including viable bacteria and fungi recovered after S12 days of incubation on solid agar media. Based on rDNA sequencing and phenotypic characterization, a fungus recovered from the filter was determined to be Lecythophora mutabilis. The bacterial isolate was identified by rDNA sequence data to be Methylobacterium radiotolerans. Additional UPA subsystem samples were returned on STS-130 for analysis. Both liquid and solid samples were collected from the Russian urine container (EDV), Distillation Assembly (DA) and Recycle Filter Tank Assembly (RFTA) for post-flight analysis. The bacterium Pseudomonas aeruginosa and fungus Chaetomium brasiliense were isolated from the EDV samples. No viable bacteria or fungi were recovered from RFTA brine samples (N= 6), but multiple samples (N = 11) from the DA and RFTA were found to contain fungal and bacterial cells. Many recovered cells have been identified to genus by rDNA sequencing and carbon source utilization profiling (BiOLOG Gen III). The presence of viable bacteria and fungi from WRS and OGS subsystems demonstrates the need for continued monitoring of ECLSS during future ISS operations and investigation of advanced antimicrobial controls.
Drummond, A; Rodrigo, A G
2000-12-01
Reconstruction of evolutionary relationships from noncontemporaneous molecular samples provides a new challenge for phylogenetic reconstruction methods. With recent biotechnological advances there has been an increase in molecular sequencing throughput, and the potential to obtain serial samples of sequences from populations, including rapidly evolving pathogens, is fast being realized. A new method called the serial-sample unweighted pair grouping method with arithmetic means (sUPGMA) is presented that reconstructs a genealogy or phylogeny of sequences sampled serially in time using a matrix of pairwise distances. The resulting tree depicts the terminal lineages of each sample ending at a different level consistent with the sample's temporal order. Since sUPGMA is a variant of UPGMA, it will perform best when sequences have evolved at a constant rate (i.e., according to a molecular clock). On simulated data, this new method performs better than standard cluster analysis under a variety of longitudinal sampling strategies. Serial-sample UPGMA is particularly useful for analysis of longitudinal samples of viruses and bacteria, as well as ancient DNA samples, with the minimal requirement that samples of sequences be ordered in time.
Cuddy, L L; Thompson, W F
1992-01-01
In a probe-tone experiment, two groups of listeners--one trained, the other untrained, in traditional music theory--rated the goodness of fit of each of the 12 notes of the chromatic scale to four-voice harmonic sequences. Sequences were 12 simplified excerpts from Bach chorales, 4 nonmodulating, and 8 modulating. Modulations occurred either one or two steps in either the clockwise or the counterclockwise direction on the cycle of fifths. A consistent pattern of probe-tone ratings was obtained for each sequence, with no significant differences between listener groups. Two methods of analysis (Fourier analysis and regression analysis) revealed a directional asymmetry in the perceived key movement conveyed by modulating sequences. For a given modulation distance, modulations in the counterclockwise direction effected a clearer shift in tonal organization toward the final key than did clockwise modulations. The nature of the directional asymmetry was consistent with results reported for identification and rating of key change in the sequences (Thompson & Cuddy, 1989a). Further, according to the multiple-regression analysis, probe-tone ratings did not merely reflect the distribution of tones in the sequence. Rather, ratings were sensitive to the temporal structure of the tonal organization in the sequence.
Magnetostratigraphic correlations of Permian-Triassic marine-to-terrestrial sections from China
Glen, J.M.G.; Nomade, S.; Lyons, J.J.; Metcalfe, I.; Mundil, R.; Renne, P.R.
2009-01-01
We have studied three Permian–Triassic (PT) localities from China as part of a combined magnetostratigraphic, 40Ar/39Ar and U–Pb radioisotopic, and biostratigraphic study aimed at resolving the temporal relations between terrestrial and marine records across the Permo-Triassic boundary, as well as the rate of the biotic recovery in the Early Triassic. The studied sections from Shangsi (Sichuan Province), Langdai (Guihzou Province), and the Junggar basin (Xinjiang Province), span marine, paralic, and terrestrial PT environments, respectively. Each of these sections was logged in detail in order to place geochronologic, paleomagnetic, geochemical, conodont and palynologic samples within a common stratigraphic context. Here we present rock-magnetic, paleomagnetic and magnetostratigraphic results from the three localities.At Shangsi, northern Sichuan Province, we sampled three sections spanning Permo-Triassic marine carbonates. Magnetostratigraphic results from the three sections indicate that the composite section contains at least eight polarity chrons and that the PT boundary occurs within a normal polarity chron a short distance above the mass extinction level and a reversed-to-normal (R-N) polarity reversal. Furthermore, the onset of the Illawarra mixed interval lies below the sampled section indicating that the uppermost Permian Changhsingian and at least part of the Wuchiapingian stages postdate the end of the Kiaman Permo-Carboniferous Reversed Superchron.At Langdai, Guizhou Province, we studied magnetostratigraphy of PT paralic mudstone and carbonate sediments in two sections. The composite section spans an R-N polarity sequence. Section-mean directions pass a fold test at the 95% confidence level, and the section-mean poles are close to the mean PT pole for the South China block. Based on biostratigraphic constraints, the R-N transition recorded at Langdai is consistent with that at Shangsi and demonstrates that the PT boundary occurred within a normal polarity chron a short distance above the mass extinction level.In the southern Junggar basin, Xinjiang Province, in northwest China, we determined the magnetostratigraphy of three sections of a terrestrial sequence. Normal and reversed polarity directions are roughly antipodal, and magnetostratigraphies from the three sections are highly consistent. Combined bio- and magneto-stratigraphy used to correlate this sequence to other PT sequences suggests that the previously-proposed biostratigraphic PT boundary in the Junggar sections was most likely misplaced by earlier workers suggesting that further work is necessary to confidently place the PT boundary there.
Targeted Capture and High-Throughput Sequencing Using Molecular Inversion Probes (MIPs).
Cantsilieris, Stuart; Stessman, Holly A; Shendure, Jay; Eichler, Evan E
2017-01-01
Molecular inversion probes (MIPs) in combination with massively parallel DNA sequencing represent a versatile, yet economical tool for targeted sequencing of genomic DNA. Several thousand genomic targets can be selectively captured using long oligonucleotides containing unique targeting arms and universal linkers. The ability to append sequencing adaptors and sample-specific barcodes allows large-scale pooling and subsequent high-throughput sequencing at relatively low cost per sample. Here, we describe a "wet bench" protocol detailing the capture and subsequent sequencing of >2000 genomic targets from 192 samples, representative of a single lane on the Illumina HiSeq 2000 platform.
Low-Latency Telerobotic Sample Return and Biomolecular Sequencing for Deep Space Gateway
NASA Astrophysics Data System (ADS)
Lupisella, M.; Bleacher, J.; Lewis, R.; Dworkin, J.; Wright, M.; Burton, A.; Rubins, K.; Wallace, S.; Stahl, S.; John, K.; Archer, D.; Niles, P.; Regberg, A.; Smith, D.; Race, M.; Chiu, C.; Russell, J.; Rampe, E.; Bywaters, K.
2018-02-01
Low-latency telerobotics, crew-assisted sample return, and biomolecular sequencing can be used to acquire and analyze lunar farside and/or Apollo landing site samples. Sequencing can also be used to monitor and study Deep Space Gateway environment and crew health.
Infant rolling abilities--the same or different 20 years after the back to sleep campaign?
Darrah, Johanna; Bartlett, Doreen J
2013-05-01
To compare the order and age of emergence of rolling prone to supine and supine to prone before the introduction of back to sleep guidelines and 20 years after their introduction. The original normative data for the Alberta Infant Motor Scale (AIMS) were collected just prior to the introduction of back to sleep guidelines in 1992. Currently these norms are being re-evaluated. Data of rolling patterns of infants 36 weeks of age or younger from the original sample (n=1114) and the contemporary sample (n=351) were evaluated to compare the sequence of appearance of prone to supine and supine to prone rolls (proportion of infants passing each roll) and the ages of emergence (estimated age when 50% of infants passed each roll). The sequence of emergence and estimated age of appearance of both rolling directions were similar between the two time periods. The introduction of the supine sleep position to reduce the prevalence of Sudden Infant Death Syndrome (SIDS) has not altered the timing or sequence of infant rolling abilities. This information is valuable to health care providers involved in the surveillance of infants' development. Original normative age estimates for these two motor abilities are still appropriate. Copyright © 2012 Elsevier Ltd. All rights reserved.
Mass and age of red giant branch stars observed with LAMOST and Kepler
NASA Astrophysics Data System (ADS)
Wu, Yaqian; Xiang, Maosheng; Bi, Shaolan; Liu, Xiaowei; Yu, Jie; Hon, Marc; Sharma, Sanjib; Li, Tanda; Huang, Yang; Liu, Kang; Zhang, Xianfei; Li, Yaguang; Ge, Zhishuai; Tian, Zhijia; Zhang, Jinghua; Zhang, Jianwei
2018-04-01
Obtaining accurate and precise masses and ages for large numbers of giant stars is of great importance for unraveling the assemblage history of the Galaxy. In this paper, we estimate masses and ages of 6940 red giant branch (RGB) stars with asteroseismic parameters deduced from Kepler photometry and stellar atmospheric parameters derived from LAMOST spectra. The typical uncertainties of mass is a few per cent, and that of age is ˜20 per cent. The sample stars reveal two separate sequences in the age-[α/Fe] relation - a high-α sequence with stars older than ˜8 Gyr and a low-α sequence composed of stars with ages ranging from younger than 1 Gyr to older than 11 Gyr. We further investigate the feasibility of deducing ages and masses directly from LAMOST spectra with a machine learning method based on kernel based principal component analysis, taking a sub-sample of these RGB stars as a training data set. We demonstrate that ages thus derived achieve an accuracy of ˜24 per cent. We also explored the feasibility of estimating ages and masses based on the spectroscopically measured carbon and nitrogen abundances. The results are quite satisfactory and significantly improved compared to the previous studies.
Newsham, Kevin K; Bridge, Paul D
2010-06-01
The leafy liverwort Lophozia excisa, which is colonised by basidiomycete fungi in other biomes and which evidence suggests may be colonised by mycorrhizal fungi in Antarctica, was sampled from Léonie Island in the southern maritime Antarctic (67 degrees 36' S, 68 degrees 21' W). Microscopic examination of plants indicated that fungal hyphae colonised 78% of the rhizoids of the liverwort, apparently by entering the tips of rhizoids prior to growing into their bases, where they formed hyphal coils. Extensive colonisation of stem medullary cells by hyphae was also observed. DNA was extracted from surface-sterilised liverwort tissues and sequenced following nested PCR, using the primer set ITS1F/TW14, followed by a second round of amplification using the ITSSeb3/TW13 primer set. Neighbour-joining analyses showed that the sequences obtained nested in Sebacinales clade B as a 100% supported sister group to Sebacinales sequences from the leafy liverworts Lophozia sudetica, L. incisa and Calypogeia muelleriana sampled from Europe. Direct PCR using the fungal specific primer set ITS1F/ITS4 similarly identified fungi belonging to Sebacinales clade B as the principal colonists of L. excisa tissues. These observations indicate the presence of a second mycothallus in Antarctica and support the previous suggestion that the Sebacinales has a wide geographical distribution.
Evaluating the accuracy of SHAPE-directed RNA secondary structure predictions
Sükösd, Zsuzsanna; Swenson, M. Shel; Kjems, Jørgen; Heitsch, Christine E.
2013-01-01
Recent advances in RNA structure determination include using data from high-throughput probing experiments to improve thermodynamic prediction accuracy. We evaluate the extent and nature of improvements in data-directed predictions for a diverse set of 16S/18S ribosomal sequences using a stochastic model of experimental SHAPE data. The average accuracy for 1000 data-directed predictions always improves over the original minimum free energy (MFE) structure. However, the amount of improvement varies with the sequence, exhibiting a correlation with MFE accuracy. Further analysis of this correlation shows that accurate MFE base pairs are typically preserved in a data-directed prediction, whereas inaccurate ones are not. Thus, the positive predictive value of common base pairs is consistently higher than the directed prediction accuracy. Finally, we confirm sequence dependencies in the directability of thermodynamic predictions and investigate the potential for greater accuracy improvements in the worst performing test sequence. PMID:23325843
Bender, Kelly S; Rice, Melissa R; Fugate, William H; Coates, John D; Achenbach, Laurie A
2004-09-01
Natural attenuation of the environmental contaminant perchlorate is a cost-effective alternative to current removal methods. The success of natural perchlorate remediation is dependent on the presence and activity of dissimilatory (per)chlorate-reducing bacteria (DPRB) within a target site. To detect DPRB in the environment, two degenerate primer sets targeting the chlorite dismutase (cld) gene were developed and optimized. A nested PCR approach was used in conjunction with these primer sets to increase the sensitivity of the molecular detection method. Screening of environmental samples indicated that all products amplified by this method were cld gene sequences. These sequences were obtained from pristine sites as well as contaminated sites from which DPRB were isolated. More than one cld phylotype was also identified from some samples, indicating the presence of more than one DPRB strain at those sites. The use of these primer sets represents a direct and sensitive molecular method for the qualitative detection of (per)chlorate-reducing bacteria in the environment, thus offering another tool for monitoring natural attenuation. Sequences of cld genes isolated in the course of this project were also generated from various DPRB and provided the first opportunity for a phylogenetic treatment of this metabolic gene. Comparisons of the cld and 16S ribosomal DNA (rDNA) gene trees indicated that the cld gene does not track 16S rDNA phylogeny, further implicating the possible role of horizontal transfer in the evolution of (per)chlorate respiration.
Ivy, Morgan I; Thoendel, Matthew J; Jeraldo, Patricio R; Greenwood-Quaintance, Kerryl E; Hanssen, Arlen D; Abdel, Matthew P; Chia, Nicholas; Yao, Janet Z; Tande, Aaron J; Mandrekar, Jayawant N; Patel, Robin
2018-05-30
Background: Metagenomic shotgun sequencing has the potential to transform how serious infections are diagnosed by offering universal, culture-free pathogen detection. This may be especially advantageous for microbial diagnosis of prosthetic joint infection (PJI) by synovial fluid analysis, since synovial fluid cultures are not universally positive, and synovial fluid is easily obtained pre-operatively. We applied a metagenomics-based approach to synovial fluid in an attempt to detect microorganisms in 168 failed total knee arthroplasties. Results: Genus- and species-level analysis of metagenomic sequencing yielded the known pathogen in 74 (90%) and 68 (83%) of the 82 culture-positive PJIs analyzed, respectively, with testing of two (2%) and three (4%) samples, respectively, yielding additional pathogens not detected by culture. For the 25 culture-negative PJIs tested, genus- and species-level analysis yielded 19 (76%) and 21 (84%) samples with insignificant findings, respectively, and 6 (24%) and 4 (16%) with potential pathogens detected, respectively. Genus- and species-level analysis of the 60 culture-negative aseptic failure cases yielded 53 (88.3%) and 56 (93.3%) cases with insignificant findings, and 7 (11.7%) and 4 (6.7%) with potential clinically-significant organisms detected, respectively. There was one case of aseptic failure with synovial fluid culture growth; metagenomic analysis showed insignificant findings, suggesting possible synovial fluid culture contamination. Conclusion: Metagenomic shotgun sequencing can detect pathogens involved in PJI when applied to synovial fluid and may be particularly useful for culture-negative cases. Copyright © 2018 American Society for Microbiology.
Jurka, Jerzy W.
1997-01-01
Enhanced homologous recombination is obtained by employing a consensus sequence which has been found to be associated with integration of repeat sequences, such as Alu and ID. The consensus sequence or sequence having a single transition mutation determines one site of a double break which allows for high efficiency of integration at the site. By introducing single or double stranded DNA having the consensus sequence flanking region joined to a sequence of interest, one can reproducibly direct integration of the sequence of interest at one or a limited number of sites. In this way, specific sites can be identified and homologous recombination achieved at the site by employing a second flanking sequence associated with a sequence proximal to the 3'-nick.
Porwollik, Steffen; Mottaz-Brewer, Heather; Petritis, Brianne O.; Jaitly, Navdeep; Adkins, Joshua N.; McClelland, Michael; Heffron, Fred; Smith, Richard D.
2009-01-01
Using sample-matched transcriptomics and proteomics measurements it is now possible to begin to understand the impact of post-transcriptional regulatory programs in Enterobacteria. In bacteria post-transcriptional regulation is mediated by relatively few identified RNA-binding protein factors including CsrA, Hfq and SmpB. A mutation in any one of these three genes, csrA, hfq, and smpB, in Salmonella is attenuated for mouse virulence and unable to survive in macrophages. CsrA has a clearly defined specificity based on binding to a specific mRNA sequence to inhibit translation. However, the proteins regulated by Hfq and SmpB are not as clearly defined. Previous work identified proteins regulated by hfq using purification of the RNA-protein complex with direct sequencing of the bound RNAs and found binding to a surprisingly large number of transcripts. In this report we have used global proteomics to directly identify proteins regulated by Hfq or SmpB by comparing protein abundance in the parent and isogenic hfq or smpB mutant. From these same samples we also prepared RNA for microarray analysis to determine if alteration of protein expression was mediated post-transcriptionally. Samples were analyzed from bacteria grown under four different conditions; two laboratory conditions and two that are thought to mimic the intracellular environment. We show that mutants of hfq and smpB directly or indirectly modulate at least 20% and 4% of all possible Salmonella proteins, respectively, with limited correlation between transcription and protein expression. These proteins represent a broad spectrum of Salmonella proteins required for many biological processes including host cell invasion, motility, central metabolism, LPS biosynthesis, two-component regulatory systems, and fatty acid metabolism. Our results represent one of the first global analyses of post-transcriptional regulons in any organism and suggest that regulation at the translational level is widespread and plays an important role in virulence regulation and environmental adaptation for Salmonella. PMID:19277208
NASA Astrophysics Data System (ADS)
Dick, G. J.; Andersson, A.; Banfield, J. F.
2007-12-01
Our understanding of environmental microbiology has been greatly enhanced by community genome sequencing of DNA recovered directly the environment. Community genomics provides insights into the diversity, community structure, metabolic function, and evolution of natural populations of uncultivated microbes, thereby revealing dynamics of how microorganisms interact with each other and their environment. Recent studies have demonstrated the potential for reconstructing near-complete genomes from natural environments while highlighting the challenges of analyzing community genomic sequence, especially from diverse environments. A major challenge of shotgun community genome sequencing is identification of DNA fragments from minor community members for which only low coverage of genomic sequence is present. We analyzed community genome sequence retrieved from biofilms in an acid mine drainage (AMD) system in the Richmond Mine at Iron Mountain, CA, with an emphasis on identification and assembly of DNA fragments from low-abundance community members. The Richmond mine hosts an extensive, relatively low diversity subterranean chemolithoautotrophic community that is sustained entirely by oxidative dissolution of pyrite. The activity of these microorganisms greatly accelerates the generation of AMD. Previous and ongoing work in our laboratory has focused on reconstrucing genomes of dominant community members, including several bacteria and archaea. We binned contigs from several samples (including one new sample and two that had been previously analyzed) by tetranucleotide frequency with clustering by Self-Organizing Maps (SOM). The binning, evaluated by comparison with information from the manually curated assembly of the dominant organisms, was found to be very effective: fragments were correctly assigned with 95% accuracy. Improperly assigned fragments often contained sequences that are either evolutionarily constrained (e.g. 16S rRNA genes) or mobile elements that are not expected to reflect the tetranucleotide frequency signature of the host genome. Four unknown tetranucleotide frequency clusters with significant sequence (6 Mb total) were noted and analyzed further. Based on phylogenetic markers and BLAST results, these clusters represent low abundance bacteria including Acintobacteria, Firmicutes, and Proteobacteria. Functional analysis of these clusters revealved that the low- abundance bacteria harbor genes that could potentially encode important ecosystem functions such as sulfur utilization (e.g. polysulfide reductase) and polymer degradation (e.g. chitinase and glycoside hydrolase). We conclude that ESOM clustering of tetranucleotide frequency patterns is an effective method for rapidly binning shotgun community genomic sequences and a valuable tool for analyzing minor community members, which despite their low abundance may play crucial ecological roles.
Adams, Cynthia R.; Kohn, Joshua R.; Fisher, Robert N.; Brehme, Cheryl S.
2016-01-01
Understanding the diet of an endangered species illuminates the animal’s ecology, habitat requirements, and conservation needs. However, direct observation of diet can be difficult, particularly for small, nocturnal animals such as the Pacific pocket mouse (Heteromyidae: Perognathus longimembris pacificus). Very little is known of the dietary habits of this federally endangered rodent, hindering management and restoration efforts. We used a metabarcoding approach to identify source plants in fecal samples (N = 52) from the three remaining populations known. The internal transcribed spacers (ITS) of the nuclear ribosomal loci were sequenced following the Illumina MiSeq amplicon strategy and processed reads were mapped to reference databases. We evaluated a range of threshold mapping criteria and found the best-performing setting generally recovered two distinct mock communities in proportions similar to expectation. We tested our method on captive animals fed a known diet and recovered almost all plant sources, but found substantial heterogeneity among fecal pellets collected from the same individual at the same time. Observed richness did not increase with pooling of pellets from the same individual. In field-collected samples, we identified 4–14 plant genera in individual samples and 74 genera overall, but over 50 percent of reads mapped to just six species in five genera. We simulated the effects of sequencing error, variable read length, and chimera formation to infer taxon-specific rates of misassignment for the local flora, which were generally low with some exceptions. Richness at the species and genus levels did not reach a clear asymptote, suggesting that diet breadth remained underestimated in the current pool of samples. Large numbers of scat samples are therefore needed to make inferences about diet and resource selection in future studies of the Pacific pocket mouse. We conclude that our minimally invasive method is promising for determining herbivore diets given a library of sequences from local plants. PMID:27851756
Iwanowicz, Deborah D; Vandergast, Amy G; Cornman, Robert S; Adams, Cynthia R; Kohn, Joshua R; Fisher, Robert N; Brehme, Cheryl S
2016-01-01
Understanding the diet of an endangered species illuminates the animal's ecology, habitat requirements, and conservation needs. However, direct observation of diet can be difficult, particularly for small, nocturnal animals such as the Pacific pocket mouse (Heteromyidae: Perognathus longimembris pacificus). Very little is known of the dietary habits of this federally endangered rodent, hindering management and restoration efforts. We used a metabarcoding approach to identify source plants in fecal samples (N = 52) from the three remaining populations known. The internal transcribed spacers (ITS) of the nuclear ribosomal loci were sequenced following the Illumina MiSeq amplicon strategy and processed reads were mapped to reference databases. We evaluated a range of threshold mapping criteria and found the best-performing setting generally recovered two distinct mock communities in proportions similar to expectation. We tested our method on captive animals fed a known diet and recovered almost all plant sources, but found substantial heterogeneity among fecal pellets collected from the same individual at the same time. Observed richness did not increase with pooling of pellets from the same individual. In field-collected samples, we identified 4-14 plant genera in individual samples and 74 genera overall, but over 50 percent of reads mapped to just six species in five genera. We simulated the effects of sequencing error, variable read length, and chimera formation to infer taxon-specific rates of misassignment for the local flora, which were generally low with some exceptions. Richness at the species and genus levels did not reach a clear asymptote, suggesting that diet breadth remained underestimated in the current pool of samples. Large numbers of scat samples are therefore needed to make inferences about diet and resource selection in future studies of the Pacific pocket mouse. We conclude that our minimally invasive method is promising for determining herbivore diets given a library of sequences from local plants.
Iwanowicz, Deborah; Vandergast, Amy; Cornman, Robert S.; Adams, Cynthia; Kohn, Joshua R.; Fisher, Robert N.; Brehme, Cheryl S.
2016-01-01
Understanding the diet of an endangered species illuminates the animal’s ecology, habitat requirements, and conservation needs. However, direct observation of diet can be difficult, particularly for small, nocturnal animals such as the Pacific pocket mouse (Heteromyidae: Perognathus longimembris pacificus). Very little is known of the dietary habits of this federally endangered rodent, hindering management and restoration efforts. We used a metabarcoding approach to identify source plants in fecal samples (N = 52) from the three remaining populations known. The internal transcribed spacers (ITS) of the nuclear ribosomal loci were sequenced following the Illumina MiSeq amplicon strategy and processed reads were mapped to reference databases. We evaluated a range of threshold mapping criteria and found the best-performing setting generally recovered two distinct mock communities in proportions similar to expectation. We tested our method on captive animals fed a known diet and recovered almost all plant sources, but found substantial heterogeneity among fecal pellets collected from the same individual at the same time. Observed richness did not increase with pooling of pellets from the same individual. In field-collected samples, we identified 4–14 plant genera in individual samples and 74 genera overall, but over 50 percent of reads mapped to just six species in five genera. We simulated the effects of sequencing error, variable read length, and chimera formation to infer taxon-specific rates of misassignment for the local flora, which were generally low with some exceptions. Richness at the species and genus levels did not reach a clear asymptote, suggesting that diet breadth remained underestimated in the current pool of samples. Large numbers of scat samples are therefore needed to make inferences about diet and resource selection in future studies of the Pacific pocket mouse. We conclude that our minimally invasive method is promising for determining herbivore diets given a library of sequences from local plants.
A waterborne norovirus gastroenteritis outbreak in a school, eastern China.
Zhou, N; Zhang, H; Lin, X; Hou, P; Wang, S; Tao, Z; Bi, Z; Xu, A
2016-04-01
In late 2014, a gastroenteritis outbreak occurred in a school in Shandong Province, eastern China. Hundreds of individuals developed the symptoms of diarrhoea and vomiting. Epidemiological investigation showed that food consumption was not linked to this outbreak, and unboiled direct drinking water was identified as the independent risk factor with a relative risk of 1·37 (95% confidence interval 1·03-1·83). Furthermore, examination of common bacterial and viral gastroenteritis pathogens was conducted on different specimens. Norovirus GI.1, GI.2, GI.6, GII.4, GII.6 and GII.13 were detected in clinical specimens and a water sample. GII.4 sequences between clinical specimens and the water sample displayed a close relationship and belonged to GII.4 variant Sydney 2012. These results indicate that direct drinking water contaminated by norovirus was responsible for this gastroenteritis outbreak. This study enriches our knowledge of waterborne norovirus outbreaks in China, and presents valuable prevention and control practices for policy-makers. In future, strengthened surveillance and supervision of direct drinking-water systems is needed.
Rabies in the arctic fox population, Svalbard, Norway.
Mørk, Torill; Bohlin, Jon; Fuglei, Eva; Åsbakk, Kjetil; Tryland, Morten
2011-10-01
Arctic foxes, 620 that were trapped and 22 found dead on Svalbard, Norway (1996-2004), as well as 10 foxes trapped in Nenets, North-West Russia (1999), were tested for rabies virus antigen in brain tissue by standard direct fluorescent antibody test. Rabies antigen was found in two foxes from Svalbard and in three from Russia. Blood samples from 515 of the fox carcasses were screened for rabies antibodies with negative result. Our results, together with a previous screening (1980-1989, n=817) indicate that the prevalence of rabies in Svalbard has remained low or that the virus has not been enzootic in the arctic fox population since the first reported outbreak in 1980. Brain tissues from four arctic foxes (one from Svalbard, three from Russia) in which rabies virus antigen was detected were further analyzed by reverse-transcriptase polymerase chain reaction direct amplicon sequencing and phylogenetic analysis. Sequences were compared to corresponding sequences from rabies virus isolates from other arctic regions. The Svalbard isolate and two of the Russian isolates were identical (310 nucleotides), whereas the third Russian isolate differed in six nucleotide positions. However, when translated into amino acid sequences, none of these substitutions produced changes in the amino acid sequence. These findings suggest that the spread of rabies virus to Svalbard was likely due to migration of arctic foxes over sea ice from Russia to Svalbard. Furthermore, when compared to other Arctic rabies virus isolates, a high degree of homology was found, suggesting a high contact rate between arctic fox populations from different arctic regions. The high degree of homology also indicates that other, and more variable, regions of the genome than this part of the nucleoprotein gene should be used to distinguish Arctic rabies virus isolates for epidemiologic purposes.
Mitsuya, Yumi; Varghese, Vici; Wang, Chunlin; Liu, Tommy F.; Holmes, Susan P.; Jayakumar, Prerana; Gharizadeh, Baback; Ronaghi, Mostafa; Klein, Daniel; Fessel, W. Jeffrey; Shafer, Robert W.
2008-01-01
T215 revertant mutations such as T215C/D/E/S that evolve from the nucleoside reverse transcriptase (RT) inhibitor mutations T215Y/F have been found in about 3% of human immunodeficiency virus type 1 (HIV-1) isolates from newly diagnosed HIV-1-infected persons. We used a newly developed sequencing method—ultradeep pyrosequencing (UDPS; 454 Life Sciences)—to determine the frequency with which T215Y/F or other RT inhibitor resistance mutations could be detected as minority variants in samples from untreated persons that contain T215 revertants (“revertant” samples) compared with samples from untreated persons that lack such revertants (“control” samples). Among the 22 revertant and 29 control samples, UDPS detected a mean of 3.8 and 4.8 additional RT amino acid mutations, respectively. In 6 of 22 (27%) revertant samples and in 4 of 29 control samples (14%; P = 0.4), UDPS detected one or more RT inhibitor resistance mutations. T215Y or T215F was not detected in any of the revertant or control samples; however, 4 of 22 revertant samples had one or more T215 revertants that were detected by UDPS but not by direct PCR sequencing. The failure to detect viruses with T215Y/F in the 22 revertant samples in this study may result from the overwhelming replacement of transmitted T215Y variants by the more fit T215 revertants or from the primary transmission of a T215 revertant in a subset of persons with T215 revertants. PMID:18715933
Madi, Nada; Al-Nakib, Widad; Mustafa, Abu Salim; Habibi, Nazima
2018-03-01
A metagenomic approach based on target independent next-generation sequencing has become a known method for the detection of both known and novel viruses in clinical samples. This study aimed to use the metagenomic sequencing approach to characterize the viral diversity in respiratory samples from patients with respiratory tract infections. We have investigated 86 respiratory samples received from various hospitals in Kuwait between 2015 and 2016 for the diagnosis of respiratory tract infections. A metagenomic approach using the next-generation sequencer to characterize viruses was used. According to the metagenomic analysis, an average of 145, 019 reads were identified, and 2% of these reads were of viral origin. Also, metagenomic analysis of the viral sequences revealed many known respiratory viruses, which were detected in 30.2% of the clinical samples. Also, sequences of non-respiratory viruses were detected in 14% of the clinical samples, while sequences of non-human viruses were detected in 55.8% of the clinical samples. The average genome coverage of the viruses was 12% with the highest genome coverage of 99.2% for respiratory syncytial virus, and the lowest was 1% for torque teno midi virus 2. Our results showed 47.7% agreement between multiplex Real-Time PCR and metagenomics sequencing in the detection of respiratory viruses in the clinical samples. Though there are some difficulties in using this method to clinical samples such as specimen quality, these observations are indicative of the promising utility of the metagenomic sequencing approach for the identification of respiratory viruses in patients with respiratory tract infections. © 2017 Wiley Periodicals, Inc.
Vacca, Davide; Cancila, Valeria; Gulino, Alessandro; Lo Bosco, Giosuè; Belmonte, Beatrice; Di Napoli, Arianna; Florena, Ada Maria; Tripodo, Claudio; Arancio, Walter
2018-02-01
The MinION is a miniaturized high-throughput next generation sequencing platform of novel conception. The use of nucleic acids derived from formalin-fixed paraffin-embedded samples is highly desirable, but their adoption for molecular assays is hurdled by the high degree of fragmentation and by the chemical-induced mutations stemming from the fixation protocols. In order to investigate the suitability of MinION sequencing on formalin-fixed paraffin-embedded samples, the presence and frequency of BRAF c.1799T > A mutation was investigated in two archival tissue specimens of Hairy cell leukemia and Hairy cell leukemia Variant. Despite the poor quality of the starting DNA, BRAF mutation was successfully detected in the Hairy cell leukemia sample with around 50% of the reads obtained within 2 h of the sequencing start. Notably, the mutational burden of the Hairy cell leukemia sample as derived from nanopore sequencing proved to be comparable to a sensitive method for the detection of point mutations, namely the Digital PCR, using a validated assay. Nanopore sequencing can be adopted for targeted sequencing of genetic lesions on critical DNA samples such as those extracted from archival routine formalin-fixed paraffin-embedded samples. This result let speculating about the possibility that the nanopore sequencing could be trustably adopted for the real-time targeted sequencing of genetic lesions. Our report opens the window for the adoption of nanopore sequencing in molecular pathology for research and diagnostics.
García-Garcerà, Marc; Gigli, Elena; Sanchez-Quinto, Federico; Ramirez, Oscar; Calafell, Francesc; Civit, Sergi; Lalueza-Fox, Carles
2011-01-01
Despite the successful retrieval of genomes from past remains, the prospects for human palaeogenomics remain unclear because of the difficulty of distinguishing contaminant from endogenous DNA sequences. Previous sequence data generated on high-throughput sequencing platforms indicate that fragmentation of ancient DNA sequences is a characteristic trait primarily arising due to depurination processes that create abasic sites leading to DNA breaks. METHODOLOGY/PRINCIPALS FINDINGS: To investigate whether this pattern is present in ancient remains from a temperate environment, we have 454-FLX pyrosequenced different samples dated between 5,500 and 49,000 years ago: a bone from an extinct goat (Myotragus balearicus) that was treated with a depurinating agent (bleach), an Iberian lynx bone not subjected to any treatment, a human Neolithic sample from Barcelona (Spain), and a Neandertal sample from the El Sidrón site (Asturias, Spain). The efficiency of retrieval of endogenous sequences is below 1% in all cases. We have used the non-human samples to identify human sequences (0.35 and 1.4%, respectively), that we positively know are contaminants. We observed that bleach treatment appears to create a depurination-associated fragmentation pattern in resulting contaminant sequences that is indistinguishable from previously described endogenous sequences. Furthermore, the nucleotide composition pattern observed in 5' and 3' ends of contaminant sequences is much more complex than the flat pattern previously described in some Neandertal contaminants. Although much research on samples with known contaminant histories is needed, our results suggest that endogenous and contaminant sequences cannot be distinguished by the fragmentation pattern alone.
Mankinen, Edward A.; Prevot, M.; Gromme, C. Sherman; Coe, Robert S.
1985-01-01
The thick sequence of Miocene lava flows exposed on Steens Mountain in southeastern Oregon is well known for containing a detailed record of a reversed‐to‐normal geomagnetic polarity transition. Paleomagnetic samples were obtained from the sequence for a combined study of the directional and intensity variations recorded; the paleointensity study is reported in a companion paper. This effort has resulted in the first detailed history of total geomagnetic field behavior during a reversal of polarity. A comparison of the directional variation history of the reversed and normal polarity intervals on either side of the transition with the Holocene record has allowed an estimate of the duration of these periods to be made. These time estimates were then used to calculate accumulation rates for the volcanic sequence and thereby provide a means for estimating time periods within the transition itself. The polarity transition was found to consist of two phases, each with quite different characteristics. At the onset of the first phase, a one‐third decrease in magnetic field intensity may have preceded the first intermediate field directions by about 600 years. Changes in field direction were confined near the local north‐south vertical plane when the actual reversal in direction occurred and normal polarity directions may have been attained within 550±150 years. The end of the first phase of the transition was marked by a brief (possibly 100–300 years) period with normal polarity and a pretransitional intensity which suggests a quasi‐normal dipole field structure existed during this interval. The second phase of the transition was characterized by a return to very low field intensities with the changes in direction describing a long counterclockwise loop in contrast to the earlier narrowly constrained changes. This second phase lasted 2900±300 years, and both normal directions and intensities were recovered at the same time. Both directional and intensity data document very erratic geomagnetic field behavior during the polarity transition. Changes in magnetic field direction were variable and occurred either (1) in a regular, progressive manner, (2) with sudden, extremely rapid angular changes (58°±21°/year), or (3) with little or no movement for periods of the order of 600±200 years. Changes in magnetic intensity occurred in a like manner and were sometimes correlated with changes in direction, but during other periods both directional and intensity changes occurred independently. Directional changes following the polarity transition occurred in a seemingly normal manner, although intensity fluctuations attest to some instability of the newly reestablished dipole.
Siah, Ahmed; Morrison, Diane B.; Fringuelli, Elena; Savage, Paul S.; Richmond, Zina; Purcell, Maureen K.; Johns, Robert; Johnson, Stewart C.; Sakasida, Sonja M.
2015-01-01
Piscine reovirus (PRV) is a double stranded non-enveloped RNA virus detected in farmed and wild salmonids. This study examined the phylogenetic relationships among different PRV sequence types present in samples from salmonids in Western Canada and the US, including Alaska (US), British Columbia (Canada) and Washington State (US). Tissues testing positive for PRV were partially sequenced for segment S1, producing 71 sequences that grouped into 10 unique sequence types. Sequence analysis revealed no identifiable geographical or temporal variation among the sequence types. Identical sequence types were found in fish sampled in 2001, 2005 and 2014. In addition, PRV positive samples from fish derived from Alaska, British Columbia and Washington State share identical sequence types. Comparative analysis of the phylogenetic tree indicated that Canada/US Pacific Northwest sequences formed a subgroup with some Norwegian sequence types (group II), distinct from other Norwegian and Chilean sequences (groups I, III and IV). Representative PRV positive samples from farmed and wild fish in British Columbia and Washington State were subjected to genome sequencing using next generation sequencing methods. Individual analysis of each of the 10 partial segments indicated that the Canadian and US PRV sequence types clustered separately from available whole genome sequences of some Norwegian and Chilean sequences for all segments except the segment S4. In summary, PRV was genetically homogenous over a large geographic distance (Alaska to Washington State), and the sequence types were relatively stable over a 13 year period.
Siah, Ahmed; Morrison, Diane B.; Fringuelli, Elena; Savage, Paul; Richmond, Zina; Johns, Robert; Purcell, Maureen K.; Johnson, Stewart C.; Saksida, Sonja M.
2015-01-01
Piscine reovirus (PRV) is a double stranded non-enveloped RNA virus detected in farmed and wild salmonids. This study examined the phylogenetic relationships among different PRV sequence types present in samples from salmonids in Western Canada and the US, including Alaska (US), British Columbia (Canada) and Washington State (US). Tissues testing positive for PRV were partially sequenced for segment S1, producing 71 sequences that grouped into 10 unique sequence types. Sequence analysis revealed no identifiable geographical or temporal variation among the sequence types. Identical sequence types were found in fish sampled in 2001, 2005 and 2014. In addition, PRV positive samples from fish derived from Alaska, British Columbia and Washington State share identical sequence types. Comparative analysis of the phylogenetic tree indicated that Canada/US Pacific Northwest sequences formed a subgroup with some Norwegian sequence types (group II), distinct from other Norwegian and Chilean sequences (groups I, III and IV). Representative PRV positive samples from farmed and wild fish in British Columbia and Washington State were subjected to genome sequencing using next generation sequencing methods. Individual analysis of each of the 10 partial segments indicated that the Canadian and US PRV sequence types clustered separately from available whole genome sequences of some Norwegian and Chilean sequences for all segments except the segment S4. In summary, PRV was genetically homogenous over a large geographic distance (Alaska to Washington State), and the sequence types were relatively stable over a 13 year period. PMID:26536673
Gallium plasmonic nanoparticles for label-free DNA and single nucleotide polymorphism sensing
NASA Astrophysics Data System (ADS)
Marín, Antonio García; García-Mendiola, Tania; Bernabeu, Cristina Navio; Hernández, María Jesús; Piqueras, Juan; Pau, Jose Luis; Pariente, Félix; Lorenzo, Encarnación
2016-05-01
A label-free DNA and single nucleotide polymorphism (SNP) sensing method is described. It is based on the use of the pseudodielectric function of gallium plasmonic nanoparticles (GaNPs) deposited on Si (100) substrates under reversal of the polarization handedness condition. Under this condition, the pseudodielectric function is extremely sensitive to changes in the surrounding medium of the nanoparticle surface providing an excellent sensing platform competitive to conventional surface plasmon resonance. DNA sensing has been carried out by immobilizing a thiolated capture probe sequence from Helicobacter pylori onto GaNP/Si substrates; complementary target sequences of Helicobacter pylori can be quantified over the range of 10 pM to 3.0 nM with a detection limit of 6.0 pM and a linear correlation coefficient of R2 = 0.990. The selectivity of the device allows the detection of a single nucleotide polymorphism (SNP) in a specific sequence of Helicobacter pylori, without the need for a hybridization suppressor in solution such as formamide. Furthermore, it also allows the detection of this sequence in the presence of other pathogens, such as Escherichia coli in the sample. The broad applicability of the system was demonstrated by the detection of a specific gene mutation directly associated with cystic fibrosis in large genomic DNA isolated from blood cells.A label-free DNA and single nucleotide polymorphism (SNP) sensing method is described. It is based on the use of the pseudodielectric function of gallium plasmonic nanoparticles (GaNPs) deposited on Si (100) substrates under reversal of the polarization handedness condition. Under this condition, the pseudodielectric function is extremely sensitive to changes in the surrounding medium of the nanoparticle surface providing an excellent sensing platform competitive to conventional surface plasmon resonance. DNA sensing has been carried out by immobilizing a thiolated capture probe sequence from Helicobacter pylori onto GaNP/Si substrates; complementary target sequences of Helicobacter pylori can be quantified over the range of 10 pM to 3.0 nM with a detection limit of 6.0 pM and a linear correlation coefficient of R2 = 0.990. The selectivity of the device allows the detection of a single nucleotide polymorphism (SNP) in a specific sequence of Helicobacter pylori, without the need for a hybridization suppressor in solution such as formamide. Furthermore, it also allows the detection of this sequence in the presence of other pathogens, such as Escherichia coli in the sample. The broad applicability of the system was demonstrated by the detection of a specific gene mutation directly associated with cystic fibrosis in large genomic DNA isolated from blood cells. Electronic supplementary information (ESI) available. See DOI: 10.1039/c6nr00926c
Direct fluorescence anisotropy assay for cocaine using tetramethylrhodamine-labeled aptamer.
Liu, Yingxiong; Zhao, Qiang
2017-06-01
Development of simple, sensitive, and rapid method for cocaine detection is important in medicine and drug abuse monitoring. Taking advantage of fluorescence anisotropy and aptamer, this study reports a direct fluorescence anisotropy (FA) assay for cocaine by employing an aptamer probe with tetramethylrhodamine (TMR) labeled on a specific position. The binding of cocaine and the aptamer causes a structure change of the TMR-labeled aptamer, leading to changes of the interaction between labeled TMR and adjacent G bases in aptamer sequence, so FA of TMR varies with increasing of cocaine. After screening different labeling positions of the aptamer, including thymine (T) bases and terminals of the aptamer, we obtained a favorable aptamer probe with TMR labeled on the 25th base T in the sequence, which exhibited sensitive and significant FA-decreasing responses upon cocaine. Under optimized assay conditions, this TMR-labeled aptamer allowed for direct FA detection of cocaine as low as 5 μM. The maximum FA change reached about 0.086. This FA method also enabled the detection of cocaine spiked in diluted serum and urine samples, showing potential for applications. Graphical Abstract The binding of cocaine to the TMR-labeled aptamer causes conformation change and alteration of the intramolecular interaction between TMR and bases of aptamer, leading to variance of fluorescence anisotropy (FA) of TMR, so direct FA analyis of cocaine is achieved.
Phylogenetic Network for European mtDNA
Finnilä, Saara; Lehtonen, Mervi S.; Majamaa, Kari
2001-01-01
The sequence in the first hypervariable segment (HVS-I) of the control region has been used as a source of evolutionary information in most phylogenetic analyses of mtDNA. Population genetic inference would benefit from a better understanding of the variation in the mtDNA coding region, but, thus far, complete mtDNA sequences have been rare. We determined the nucleotide sequence in the coding region of mtDNA from 121 Finns, by conformation-sensitive gel electrophoresis and subsequent sequencing and by direct sequencing of the D loop. Furthermore, 71 sequences from our previous reports were included, so that the samples represented all the mtDNA haplogroups present in the Finnish population. We found a total of 297 variable sites in the coding region, which allowed the compilation of unambiguous phylogenetic networks. The D loop harbored 104 variable sites, and, in most cases, these could be localized within the coding-region networks, without discrepancies. Interestingly, many homoplasies were detected in the coding region. Nucleotide variation in the rRNA and tRNA genes was 6%, and that in the third nucleotide positions of structural genes amounted to 22% of that in the HVS-I. The complete networks enabled the relationships between the mtDNA haplogroups to be analyzed. Phylogenetic networks based on the entire coding-region sequence in mtDNA provide a rich source for further population genetic studies, and complete sequences make it easier to differentiate between disease-causing mutations and rare polymorphisms. PMID:11349229
Friis-Nielsen, Jens; Vinner, Lasse; Hansen, Thomas Arn; Richter, Stine Raith; Fridholm, Helena; Herrera, Jose Alejandro Romero; Lund, Ole; Brunak, Søren; Izarzugaza, Jose M. G.; Mourier, Tobias; Nielsen, Lars Peter
2016-01-01
Propionibacterium acnes is the most abundant bacterium on human skin, particularly in sebaceous areas. P. acnes is suggested to be an opportunistic pathogen involved in the development of diverse medical conditions but is also a proven contaminant of human clinical samples and surgical wounds. Its significance as a pathogen is consequently a matter of debate. In the present study, we investigated the presence of P. acnes DNA in 250 next-generation sequencing data sets generated from 180 samples of 20 different sample types, mostly of cancerous origin. The samples were subjected to either microbial enrichment, involving nuclease treatment to reduce the amount of host nucleic acids, or shotgun sequencing. We detected high proportions of P. acnes DNA in enriched samples, particularly skin tissue-derived and other tissue samples, with the levels being higher in enriched samples than in shotgun-sequenced samples. P. acnes reads were detected in most samples analyzed, though the proportions in most shotgun-sequenced samples were low. Our results show that P. acnes can be detected in practically all sample types when molecular methods, such as next-generation sequencing, are employed. The possibility of contamination from the patient or other sources, including laboratory reagents or environment, should therefore always be considered carefully when P. acnes is detected in clinical samples. We advocate that detection of P. acnes always be accompanied by experiments validating the association between this bacterium and any clinical condition. PMID:26818667
Pyrin gene and mutants thereof, which cause familial Mediterranean fever
Kastner, Daniel L [Bethesda, MD; Aksentijevichh, Ivona [Bethesda, MD; Centola, Michael [Tacoma Park, MD; Deng, Zuoming [Gaithersburg, MD; Sood, Ramen [Rockville, MD; Collins, Francis S [Rockville, MD; Blake, Trevor [Laytonsville, MD; Liu, P Paul [Ellicott City, MD; Fischel-Ghodsian, Nathan [Los Angeles, CA; Gumucio, Deborah L [Ann Arbor, MI; Richards, Robert I [North Adelaide, AU; Ricke, Darrell O [San Diego, CA; Doggett, Norman A [Santa Cruz, NM; Pras, Mordechai [Tel-Hashomer, IL
2003-09-30
The invention provides the nucleic acid sequence encoding the protein associated with familial Mediterranean fever (FMF). The cDNA sequence is designated as MEFV. The invention is also directed towards fragments of the DNA sequence, as well as the corresponding sequence for the RNA transcript and fragments thereof. Another aspect of the invention provides the amino acid sequence for a protein (pyrin) associated with FMF. The invention is directed towards both the full length amino acid sequence, fusion proteins containing the amino acid sequence and fragments thereof. The invention is also directed towards mutants of the nucleic acid and amino acid sequences associated with FMF. In particular, the invention discloses three missense mutations, clustered in within about 40 to 50 amino acids, in the highly conserved rfp (B30.2) domain at the C-terminal of the protein. These mutants include M6801, M694V, K695R, and V726A. Additionally, the invention includes methods for diagnosing a patient at risk for having FMF and kits therefor.
Struniawski, R; Szpechcinski, A; Poplawska, B; Skronski, M; Chorostowska-Wynimko, J
2013-01-01
The dried blood spot (DBS) specimens have been successfully employed for the large-scale diagnostics of α1-antitrypsin (AAT) deficiency as an easy to collect and transport alternative to plasma/serum. In the present study we propose a fast, efficient, and cost effective protocol of DNA extraction from dried blood spot (DBS) samples that provides sufficient quantity and quality of DNA and effectively eliminates any natural PCR inhibitors, allowing for successful AAT genotyping by real-time PCR and direct sequencing. DNA extracted from 84 DBS samples from chronic obstructive pulmonary disease patients was genotyped for AAT deficiency variants by real-time PCR. The results of DBS AAT genotyping were validated by serum IEF phenotyping and AAT concentration measurement. The proposed protocol allowed successful DNA extraction from all analyzed DBS samples. Both quantity and quality of DNA were sufficient for further real-time PCR and, if necessary, for genetic sequence analysis. A 100% concordance between AAT DBS genotypes and serum phenotypes in positive detection of two major deficiency S- and Z- alleles was achieved. Both assays, DBS AAT genotyping by real-time PCR and serum AAT phenotyping by IEF, positively identified PI*S and PI*Z allele in 8 out of the 84 (9.5%) and 16 out of 84 (19.0%) patients, respectively. In conclusion, the proposed protocol noticeably reduces the costs and the hand-on-time of DBS samples preparation providing genomic DNA of sufficient quantity and quality for further real-time PCR or genetic sequence analysis. Consequently, it is ideally suited for large-scale AAT deficiency screening programs and should be method of choice.
Exploring the temporal structure of heterochronous sequences using TempEst (formerly Path-O-Gen).
Rambaut, Andrew; Lam, Tommy T; Max Carvalho, Luiz; Pybus, Oliver G
2016-01-01
Gene sequences sampled at different points in time can be used to infer molecular phylogenies on a natural timescale of months or years, provided that the sequences in question undergo measurable amounts of evolutionary change between sampling times. Data sets with this property are termed heterochronous and have become increasingly common in several fields of biology, most notably the molecular epidemiology of rapidly evolving viruses. Here we introduce the cross-platform software tool, TempEst (formerly known as Path-O-Gen), for the visualization and analysis of temporally sampled sequence data. Given a molecular phylogeny and the dates of sampling for each sequence, TempEst uses an interactive regression approach to explore the association between genetic divergence through time and sampling dates. TempEst can be used to (1) assess whether there is sufficient temporal signal in the data to proceed with phylogenetic molecular clock analysis, and (2) identify sequences whose genetic divergence and sampling date are incongruent. Examination of the latter can help identify data quality problems, including errors in data annotation, sample contamination, sequence recombination, or alignment error. We recommend that all users of the molecular clock models implemented in BEAST first check their data using TempEst prior to analysis.
Mucosal and Cutaneous Human Papillomaviruses Detected in Raw Sewages
La Rosa, Giuseppina; Fratini, Marta; Accardi, Luisa; D'Oro, Graziana; Della Libera, Simonetta; Muscillo, Michele; Di Bonito, Paola
2013-01-01
Epitheliotropic viruses can find their way into sewage. The aim of the present study was to investigate the occurrence, distribution, and genetic diversity of Human Papillomaviruses (HPVs) in urban wastewaters. Sewage samples were collected from treatment plants distributed throughout Italy. The DNA extracted from these samples was analyzed by PCR using five PV-specific sets of primers targeting the L1 (GP5/GP6, MY09/MY11, FAP59/64, SKF/SKR) and E1 regions (PM-A/PM-B), according to the protocols previously validated for the detection of mucosal and cutaneous HPV genotypes. PCR products underwent sequencing analysis and the sequences were aligned to reference genomes from the Papillomavirus Episteme database. Phylogenetic analysis was then performed to assess the genetic relationships among the different sequences and between the sequences of the samples and those of the prototype strains. A broad spectrum of sequences related to mucosal and cutaneous HPV types was detected in 81% of the sewage samples analyzed. Surprisingly, sequences related to the anogenital HPV6 and 11 were detected in 19% of the samples, and sequences related to the “high risk” oncogenic HPV16 were identified in two samples. Sequences related to HPV9, HPV20, HPV25, HPV76, HPV80, HPV104, HPV110, HPV111, HPV120 and HPV145 beta Papillomaviruses were detected in 76% of the samples. In addition, similarity searches and phylogenetic analysis of some sequences suggest that they could belong to putative new genotypes of the beta genus. In this study, for the first time, the presence of HPV viruses strongly related to human cancer is reported in sewage samples. Our data increases the knowledge of HPV genomic diversity and suggests that virological analysis of urban sewage can provide key information useful in supporting epidemiological studies. PMID:23341898
Ruhlman, Tracey; Lee, Seung-Bum; Jansen, Robert K; Hostetler, Jessica B; Tallon, Luke J; Town, Christopher D; Daniell, Henry
2006-08-31
Carrot (Daucus carota) is a major food crop in the US and worldwide. Its capacity for storage and its lifecycle as a biennial make it an attractive species for the introduction of foreign genes, especially for oral delivery of vaccines and other therapeutic proteins. Until recently efforts to express recombinant proteins in carrot have had limited success in terms of protein accumulation in the edible tap roots. Plastid genetic engineering offers the potential to overcome this limitation, as demonstrated by the accumulation of BADH in chromoplasts of carrot taproots to confer exceedingly high levels of salt resistance. The complete plastid genome of carrot provides essential information required for genetic engineering. Additionally, the sequence data add to the rapidly growing database of plastid genomes for assessing phylogenetic relationships among angiosperms. The complete carrot plastid genome is 155,911 bp in length, with 115 unique genes and 21 duplicated genes within the IR. There are four ribosomal RNAs, 30 distinct tRNA genes and 18 intron-containing genes. Repeat analysis reveals 12 direct and 2 inverted repeats > or = 30 bp with a sequence identity > or = 90%. Phylogenetic analysis of nucleotide sequences for 61 protein-coding genes using both maximum parsimony (MP) and maximum likelihood (ML) were performed for 29 angiosperms. Phylogenies from both methods provide strong support for the monophyly of several major angiosperm clades, including monocots, eudicots, rosids, asterids, eurosids II, euasterids I, and euasterids II. The carrot plastid genome contains a number of dispersed direct and inverted repeats scattered throughout coding and non-coding regions. This is the first sequenced plastid genome of the family Apiaceae and only the second published genome sequence of the species-rich euasterid II clade. Both MP and ML trees provide very strong support (100% bootstrap) for the sister relationship of Daucus with Panax in the euasterid II clade. These results provide the best taxon sampling of complete chloroplast genomes and the strongest support yet for the sister relationship of Caryophyllales to the asterids. The availability of the complete plastid genome sequence should facilitate improved transformation efficiency and foreign gene expression in carrot through utilization of endogenous flanking sequences and regulatory elements.
Comprehensive comparative analysis of 5'-end RNA-sequencing methods.
Adiconis, Xian; Haber, Adam L; Simmons, Sean K; Levy Moonshine, Ami; Ji, Zhe; Busby, Michele A; Shi, Xi; Jacques, Justin; Lancaster, Madeline A; Pan, Jen Q; Regev, Aviv; Levin, Joshua Z
2018-06-04
Specialized RNA-seq methods are required to identify the 5' ends of transcripts, which are critical for studies of gene regulation, but these methods have not been systematically benchmarked. We directly compared six such methods, including the performance of five methods on a single human cellular RNA sample and a new spike-in RNA assay that helps circumvent challenges resulting from uncertainties in annotation and RNA processing. We found that the 'cap analysis of gene expression' (CAGE) method performed best for mRNA and that most of its unannotated peaks were supported by evidence from other genomic methods. We applied CAGE to eight brain-related samples and determined sample-specific transcription start site (TSS) usage, as well as a transcriptome-wide shift in TSS usage between fetal and adult brain.
Rapid detection of human fecal Eubacterium species and related genera by nested PCR method.
Kageyama, A; Benno, Y
2001-01-01
PCR procedures based on 16S rDNA gene sequence specific for seven Eubacterium spp. and Eggerthella lenta that predominate in the human intestinal tract were developed, and used for direct detection of these species in seven human feces samples. Three species of Eggerthella lenta, Eubacterium rectale, and Eubacterium eligens were detected from seven fecal samples. Eubacterium biforme was detected from six samples. It was reported that E. rectale, E. eligens, and E. biforme were difficult to detect by traditional culture method, but the nested PCR method is available for the detection of these species. This result shows that the nested PCR method utilizing a universal primer pair, followed by amplification with species-specific primers, would allow rapid detection of Eubacterium species in human feces.
Extremely low Plasmodium prevalence in wild plovers and coursers from Cape Verde and Madagascar.
Martínez-de la Puente, Josué; Eberhart-Phillips, Luke J; Cristina Carmona-Isunza, M; Zefania, Sama; Navarro, María José; Kruger, Oliver; Hoffman, Joseph Ivan; Székely, Tamás; Figuerola, Jordi
2017-06-08
Relatively little is known about the prevalence of blood parasites in shorebirds, especially those breeding in the tropics. The prevalence of blood parasites of the genera Plasmodium, Haemoproteus and Leucocytozoon was assessed in blood samples from Kentish plovers and cream-coloured coursers in Cape Verde, and samples of Kittlitz's plovers, Madagascar plovers and white-fronted plovers in Madagascar. Only two of these samples were positive for Plasmodium: a Kittlitz's plover was infected by a generalist lineage of Plasmodium that has already been reported in Europe and Africa, while in a white-fronted plover direct sequencing revealed a previously un-described Plasmodium lineage. Potential explanations for the low prevalence of blood parasites include the scarcity of vectors in habitats used by these bird species and their resistance to parasitic infections.
Jeong, Ji Hun; Park, Soon Ho; Park, Mi Jung; Kim, Moon Jin; Kim, Kyung Hee; Park, Pil Whan; Seo, Yiel Hea; Lee, Jae Hoon; Park, Jinny; Hong, Junshik
2013-01-01
Background N-ras mutations are one of the most commonly detected abnormalities of myeloid origin. N-ras mutations result in a constitutively active N-ras protein that induces uncontrolled cell proliferation and inhibits apoptosis. We analyzed N-ras mutations in adult patients with AML at a particular institution and compared pyrosequencing analysis with a direct sequencing method for the detection of N-ras mutations. Methods We analyzed 90 bone marrow samples from 83 AML patients. We detected N-ras mutations in codons 12, 13, and 61 using the pyrosequencing method and subsequently confirmed all data by direct sequencing. Using these methods, we screened the N-ras mutation quantitatively and determined the incidence and characteristic of N-ras mutation. Results The incidence of N-ras mutation was 7.2% in adult AML patients. The patients with N-ras mutations showed significant higher hemoglobin levels (P=0.022) and an increased incidence of FLT3 mutations (P=0.003). We observed 3 cases with N-ras mutations in codon 12 (3.6%), 2 cases in codon 13 (2.4%), and 1 case in codon 61 (1.2%). All the mutations disappeared during chemotherapy. Conclusions There is a low incidence (7.2%) of N-ras mutations in AML patients compared with other populations. Similar data is obtained by both pyrosequencing and direct sequencing. This study showed the correlation between the N-ras mutation and the therapeutic response. However, pyrosequencing provides quantitative data and is useful for monitoring therapeutic responses. PMID:23667841
Hajibabaei, Mehrdad; Shokralla, Shadi; Zhou, Xin; Singer, Gregory A. C.; Baird, Donald J.
2011-01-01
Timely and accurate biodiversity analysis poses an ongoing challenge for the success of biomonitoring programs. Morphology-based identification of bioindicator taxa is time consuming, and rarely supports species-level resolution especially for immature life stages. Much work has been done in the past decade to develop alternative approaches for biodiversity analysis using DNA sequence-based approaches such as molecular phylogenetics and DNA barcoding. On-going assembly of DNA barcode reference libraries will provide the basis for a DNA-based identification system. The use of recently introduced next-generation sequencing (NGS) approaches in biodiversity science has the potential to further extend the application of DNA information for routine biomonitoring applications to an unprecedented scale. Here we demonstrate the feasibility of using 454 massively parallel pyrosequencing for species-level analysis of freshwater benthic macroinvertebrate taxa commonly used for biomonitoring. We designed our experiments in order to directly compare morphology-based, Sanger sequencing DNA barcoding, and next-generation environmental barcoding approaches. Our results show the ability of 454 pyrosequencing of mini-barcodes to accurately identify all species with more than 1% abundance in the pooled mixture. Although the approach failed to identify 6 rare species in the mixture, the presence of sequences from 9 species that were not represented by individuals in the mixture provides evidence that DNA based analysis may yet provide a valuable approach in finding rare species in bulk environmental samples. We further demonstrate the application of the environmental barcoding approach by comparing benthic macroinvertebrates from an urban region to those obtained from a conservation area. Although considerable effort will be required to robustly optimize NGS tools to identify species from bulk environmental samples, our results indicate the potential of an environmental barcoding approach for biomonitoring programs. PMID:21533287
Mitochondrial DNA variant at HVI region as a candidate of genetic markers of type 2 diabetes
NASA Astrophysics Data System (ADS)
Gumilar, Gun Gun; Purnamasari, Yunita; Setiadi, Rahmat
2016-02-01
Mitochondrial DNA (mtDNA) is maternally inherited. mtDNA mutations which can contribute to the excess of maternal inheritance of type 2 diabetes. Due to the high mutation rate, one of the areas in the mtDNA that is often associated with the disease is the hypervariable region I (HVI). Therefore, this study was conducted to determine the genetic variants of human mtDNA HVI that related to the type 2 diabetes in four samples that were taken from four generations in one lineage. Steps being taken include the lyses of hair follicles, amplification of mtDNA HVI fragment using Polymerase Chain Reaction (PCR), detection of PCR products through agarose gel electrophoresis technique, the measurement of the concentration of mtDNA using UV-Vis spectrophotometer, determination of the nucleotide sequence via direct sequencing method and analysis of the sequencing results using SeqMan DNASTAR program. Based on the comparison between nucleotide sequence of samples and revised Cambridge Reference Sequence (rCRS) obtained six same mutations that these are C16147T, T16189C, C16193del, T16127C, A16235G, and A16293C. After comparing the data obtained to the secondary data from Mitomap and NCBI, it were found that two mutations, T16189C and T16217C, become candidates as genetic markers of type 2 diabetes even the mutations were found also in the generations of undiagnosed type 2 diabetes. The results of this study are expected to give contribution to the collection of human mtDNA database of genetic variants that associated to metabolic diseases, so that in the future it can be utilized in various fields, especially in medicine.
Molecular taxonomy of phytopathogenic fungi: a case study in Peronospora.
Göker, Markus; García-Blázquez, Gema; Voglmayr, Hermann; Tellería, M Teresa; Martín, María P
2009-07-29
Inappropriate taxon definitions may have severe consequences in many areas. For instance, biologically sensible species delimitation of plant pathogens is crucial for measures such as plant protection or biological control and for comparative studies involving model organisms. However, delimiting species is challenging in the case of organisms for which often only molecular data are available, such as prokaryotes, fungi, and many unicellular eukaryotes. Even in the case of organisms with well-established morphological characteristics, molecular taxonomy is often necessary to emend current taxonomic concepts and to analyze DNA sequences directly sampled from the environment. Typically, for this purpose clustering approaches to delineate molecular operational taxonomic units have been applied using arbitrary choices regarding the distance threshold values, and the clustering algorithms. Here, we report on a clustering optimization method to establish a molecular taxonomy of Peronospora based on ITS nrDNA sequences. Peronospora is the largest genus within the downy mildews, which are obligate parasites of higher plants, and includes various economically important pathogens. The method determines the distance function and clustering setting that result in an optimal agreement with selected reference data. Optimization was based on both taxonomy-based and host-based reference information, yielding the same outcome. Resampling and permutation methods indicate that the method is robust regarding taxon sampling and errors in the reference data. Tests with newly obtained ITS sequences demonstrate the use of the re-classified dataset in molecular identification of downy mildews. A corrected taxonomy is provided for all Peronospora ITS sequences contained in public databases. Clustering optimization appears to be broadly applicable in automated, sequence-based taxonomy. The method connects traditional and modern taxonomic disciplines by specifically addressing the issue of how to optimally account for both traditional species concepts and genetic divergence.
Molecular Taxonomy of Phytopathogenic Fungi: A Case Study in Peronospora
Göker, Markus; García-Blázquez, Gema; Voglmayr, Hermann; Tellería, M. Teresa; Martín, María P.
2009-01-01
Background Inappropriate taxon definitions may have severe consequences in many areas. For instance, biologically sensible species delimitation of plant pathogens is crucial for measures such as plant protection or biological control and for comparative studies involving model organisms. However, delimiting species is challenging in the case of organisms for which often only molecular data are available, such as prokaryotes, fungi, and many unicellular eukaryotes. Even in the case of organisms with well-established morphological characteristics, molecular taxonomy is often necessary to emend current taxonomic concepts and to analyze DNA sequences directly sampled from the environment. Typically, for this purpose clustering approaches to delineate molecular operational taxonomic units have been applied using arbitrary choices regarding the distance threshold values, and the clustering algorithms. Methodology Here, we report on a clustering optimization method to establish a molecular taxonomy of Peronospora based on ITS nrDNA sequences. Peronospora is the largest genus within the downy mildews, which are obligate parasites of higher plants, and includes various economically important pathogens. The method determines the distance function and clustering setting that result in an optimal agreement with selected reference data. Optimization was based on both taxonomy-based and host-based reference information, yielding the same outcome. Resampling and permutation methods indicate that the method is robust regarding taxon sampling and errors in the reference data. Tests with newly obtained ITS sequences demonstrate the use of the re-classified dataset in molecular identification of downy mildews. Conclusions A corrected taxonomy is provided for all Peronospora ITS sequences contained in public databases. Clustering optimization appears to be broadly applicable in automated, sequence-based taxonomy. The method connects traditional and modern taxonomic disciplines by specifically addressing the issue of how to optimally account for both traditional species concepts and genetic divergence. PMID:19641601
Lee, Andie S; White, Elizabeth; Monahan, Leigh G; Jensen, Slade O; Chan, Raymond; van Hal, Sebastiaan J
2018-06-01
OBJECTIVETo describe the transmission dynamics of the emergence and persistence of vanA vancomycin-resistant enterococcus (VRE) in an intensive care unit (ICU) using whole-genome sequencing of patient and environmental isolates.DESIGNRetrospective cohort study.SETTINGICU in a tertiary referral center.PARTICIPANTSPatients admitted to the ICU over an 11-month period.METHODS VanA VRE isolated from patients (n=31) were sequenced using the Illumina MiSeq platform. Environmental samples from bed spaces, equipment, and waste rooms were collected. All vanA VRE-positive environmental samples (n=14) were also sequenced. Data were collected regarding patient ward and bed movements.RESULTSThe 31 patient vanA VRE isolates were from screening (n=19), urine (n=4), bloodstream (n=3), skin/wound (n=3), and intra-abdominal (n=2) sources. The phylogeny from sequencing data confirmed several VRE clusters, with 1 group accounting for 38 of 45 isolates (84%). Within this cluster, cross-transmission was extensive and complex across the ICU. Directionality indicated that colonized patients contaminated environmental sites. Similarly, environmental sources not only led to patient colonization but also to infection. Notably, shared equipment acted as a conduit for transmission between different ICU areas. Infected patients, however, were not linked to further VRE transmission.CONCLUSIONSGenomic sequencing confirmed a predominantly clonal outbreak of VRE with complex transmission dynamics. The environmental reservoir, particularly from shared equipment, played a key role in ongoing VRE spread. This study provides evidence to support the use of multifaceted strategies, with an emphasis on measures to reduce bacterial burden in the environment, for successful VRE control.Infect Control Hosp Epidemiol 2018;39:668-675.
Zepeda-Mendoza, Marie Lisandra; Bohmann, Kristine; Carmona Baez, Aldo; Gilbert, M Thomas P
2016-05-03
DNA metabarcoding is an approach for identifying multiple taxa in an environmental sample using specific genetic loci and taxa-specific primers. When combined with high-throughput sequencing it enables the taxonomic characterization of large numbers of samples in a relatively time- and cost-efficient manner. One recent laboratory development is the addition of 5'-nucleotide tags to both primers producing double-tagged amplicons and the use of multiple PCR replicates to filter erroneous sequences. However, there is currently no available toolkit for the straightforward analysis of datasets produced in this way. We present DAMe, a toolkit for the processing of datasets generated by double-tagged amplicons from multiple PCR replicates derived from an unlimited number of samples. Specifically, DAMe can be used to (i) sort amplicons by tag combination, (ii) evaluate PCR replicates dissimilarity, and (iii) filter sequences derived from sequencing/PCR errors, chimeras, and contamination. This is attained by calculating the following parameters: (i) sequence content similarity between the PCR replicates from each sample, (ii) reproducibility of each unique sequence across the PCR replicates, and (iii) copy number of the unique sequences in each PCR replicate. We showcase the insights that can be obtained using DAMe prior to taxonomic assignment, by applying it to two real datasets that vary in their complexity regarding number of samples, sequencing libraries, PCR replicates, and used tag combinations. Finally, we use a third mock dataset to demonstrate the impact and importance of filtering the sequences with DAMe. DAMe allows the user-friendly manipulation of amplicons derived from multiple samples with PCR replicates built in a single or multiple sequencing libraries. It allows the user to: (i) collapse amplicons into unique sequences and sort them by tag combination while retaining the sample identifier and copy number information, (ii) identify sequences carrying unused tag combinations, (iii) evaluate the comparability of PCR replicates of the same sample, and (iv) filter tagged amplicons from a number of PCR replicates using parameters of minimum length, copy number, and reproducibility across the PCR replicates. This enables an efficient analysis of complex datasets, and ultimately increases the ease of handling datasets from large-scale studies.
Oono, Ryoko
2017-01-01
High-throughput sequencing technology has helped microbial community ecologists explore ecological and evolutionary patterns at unprecedented scales. The benefits of a large sample size still typically outweigh that of greater sequencing depths per sample for accurate estimations of ecological inferences. However, excluding or not sequencing rare taxa may mislead the answers to the questions 'how and why are communities different?' This study evaluates the confidence intervals of ecological inferences from high-throughput sequencing data of foliar fungal endophytes as case studies through a range of sampling efforts, sequencing depths, and taxonomic resolutions to understand how technical and analytical practices may affect our interpretations. Increasing sampling size reliably decreased confidence intervals across multiple community comparisons. However, the effects of sequencing depths on confidence intervals depended on how rare taxa influenced the dissimilarity estimates among communities and did not significantly decrease confidence intervals for all community comparisons. A comparison of simulated communities under random drift suggests that sequencing depths are important in estimating dissimilarities between microbial communities under neutral selective processes. Confidence interval analyses reveal important biases as well as biological trends in microbial community studies that otherwise may be ignored when communities are only compared for statistically significant differences.
2017-01-01
High-throughput sequencing technology has helped microbial community ecologists explore ecological and evolutionary patterns at unprecedented scales. The benefits of a large sample size still typically outweigh that of greater sequencing depths per sample for accurate estimations of ecological inferences. However, excluding or not sequencing rare taxa may mislead the answers to the questions ‘how and why are communities different?’ This study evaluates the confidence intervals of ecological inferences from high-throughput sequencing data of foliar fungal endophytes as case studies through a range of sampling efforts, sequencing depths, and taxonomic resolutions to understand how technical and analytical practices may affect our interpretations. Increasing sampling size reliably decreased confidence intervals across multiple community comparisons. However, the effects of sequencing depths on confidence intervals depended on how rare taxa influenced the dissimilarity estimates among communities and did not significantly decrease confidence intervals for all community comparisons. A comparison of simulated communities under random drift suggests that sequencing depths are important in estimating dissimilarities between microbial communities under neutral selective processes. Confidence interval analyses reveal important biases as well as biological trends in microbial community studies that otherwise may be ignored when communities are only compared for statistically significant differences. PMID:29253889
Kellogg, Christina A.; Goldsmith, Dawn B.; Gray, Michael A.
2017-01-01
Over the last decade, publications on deep-sea corals have tripled. Most attention has been paid to Lophelia pertusa, a globally distributed scleractinian coral that creates critical three-dimensional habitat in the deep ocean. The bacterial community associated with L. pertusa has been previously described by a number of studies at sites in the Mediterranean Sea, Norwegian fjords, off Great Britain, and in the Gulf of Mexico (GOM). However, use of different methodologies prevents direct comparisons in most cases. Our objectives were to address intra-regional variation and to identify any conserved bacterial core community. We collected samples from three distinct colonies of L. pertusa at each of four locations within the western Atlantic: three sites within the GOM and one off the east coast of the United States. Amplicon libraries of 16S rRNA genes were generated using primers targeting the V4–V5 hypervariable region and 454 pyrosequencing. The dominant phylum was Proteobacteria (75–96%). At the family level, 80–95% of each sample was comprised of five groups: Pirellulaceae, Pseudonocardiaceae, Rhodobacteraceae, Sphingomonadaceae, and unclassified Oceanospirillales. Principal coordinate analysis based on weighted UniFrac distances showed a clear distinction between the GOM and Atlantic samples. Interestingly, the replicate samples from each location did not always cluster together, indicating there is not a strong site-specific influence. The core bacterial community, conserved in 100% of the samples, was dominated by the operational taxonomic units of genera Novosphingobium and Pseudonocardia, both known degraders of aromatic hydrocarbons. The sequence of another core member, Propionibacterium, was also found in prior studies of L. pertusa from Norway and Great Britain, suggesting a role as a conserved symbiont. By examining more than 40,000 sequences per sample, we found that GOM samples were dominated by the identified conserved core sequences, whereas open Atlantic samples had a much higher proportion of locally consistent bacteria. Further, predictive functional profiling highlights the potential for the L. pertusa microbiome to contribute to chemoautotrophy, nutrient cycling, and antibiotic production. PMID:28522997
Kellogg, Christina A; Goldsmith, Dawn B; Gray, Michael A
2017-01-01
Over the last decade, publications on deep-sea corals have tripled. Most attention has been paid to Lophelia pertusa , a globally distributed scleractinian coral that creates critical three-dimensional habitat in the deep ocean. The bacterial community associated with L. pertusa has been previously described by a number of studies at sites in the Mediterranean Sea, Norwegian fjords, off Great Britain, and in the Gulf of Mexico (GOM). However, use of different methodologies prevents direct comparisons in most cases. Our objectives were to address intra-regional variation and to identify any conserved bacterial core community. We collected samples from three distinct colonies of L. pertusa at each of four locations within the western Atlantic: three sites within the GOM and one off the east coast of the United States. Amplicon libraries of 16S rRNA genes were generated using primers targeting the V4-V5 hypervariable region and 454 pyrosequencing. The dominant phylum was Proteobacteria (75-96%). At the family level, 80-95% of each sample was comprised of five groups: Pirellulaceae, Pseudonocardiaceae, Rhodobacteraceae, Sphingomonadaceae, and unclassified Oceanospirillales. Principal coordinate analysis based on weighted UniFrac distances showed a clear distinction between the GOM and Atlantic samples. Interestingly, the replicate samples from each location did not always cluster together, indicating there is not a strong site-specific influence. The core bacterial community, conserved in 100% of the samples, was dominated by the operational taxonomic units of genera Novosphingobium and Pseudonocardia , both known degraders of aromatic hydrocarbons. The sequence of another core member, Propionibacterium , was also found in prior studies of L. pertusa from Norway and Great Britain, suggesting a role as a conserved symbiont. By examining more than 40,000 sequences per sample, we found that GOM samples were dominated by the identified conserved core sequences, whereas open Atlantic samples had a much higher proportion of locally consistent bacteria. Further, predictive functional profiling highlights the potential for the L. pertusa microbiome to contribute to chemoautotrophy, nutrient cycling, and antibiotic production.
Kellogg, Christina A.; Goldsmith, Dawn; Gray, Michael A.
2017-01-01
Over the last decade, publications on deep-sea corals have tripled. Most attention has been paid to Lophelia pertusa, a globally distributed scleractinian coral that creates critical three-dimensional habitat in the deep ocean. The bacterial community associated with L. pertusa has been previously described by a number of studies at sites in the Mediterranean Sea, Norwegian fjords, off Great Britain, and in the Gulf of Mexico (GOM). However, use of different methodologies prevents direct comparisons in most cases. Our objectives were to address intra-regional variation and to identify any conserved bacterial core community. We collected samples from three distinct colonies of L. pertusa at each of four locations within the western Atlantic: three sites within the GOM and one off the east coast of the United States. Amplicon libraries of 16S rRNA genes were generated using primers targeting the V4–V5 hypervariable region and 454 pyrosequencing. The dominant phylum was Proteobacteria (75–96%). At the family level, 80–95% of each sample was comprised of five groups: Pirellulaceae, Pseudonocardiaceae, Rhodobacteraceae, Sphingomonadaceae, and unclassified Oceanospirillales. Principal coordinate analysis based on weighted UniFrac distances showed a clear distinction between the GOM and Atlantic samples. Interestingly, the replicate samples from each location did not always cluster together, indicating there is not a strong site-specific influence. The core bacterial community, conserved in 100% of the samples, was dominated by the operational taxonomic units of genera Novosphingobium and Pseudonocardia, both known degraders of aromatic hydrocarbons. The sequence of another core member, Propionibacterium, was also found in prior studies of L. pertusa from Norway and Great Britain, suggesting a role as a conserved symbiont. By examining more than 40,000 sequences per sample, we found that GOM samples were dominated by the identified conserved core sequences, whereas open Atlantic samples had a much higher proportion of locally consistent bacteria. Further, predictive functional profiling highlights the potential for the L. pertusa microbiome to contribute to chemoautotrophy, nutrient cycling, and antibiotic production.
Myelin protein zero gene sequencing diagnoses Charcot-Marie-Tooth Type 1B disease
DOE Office of Scientific and Technical Information (OSTI.GOV)
Su, Y.; Zhang, H.; Madrid, R.
1994-09-01
Charcot-Marie-Tooth disease (CMT), the most common genetic neuropathy, affects about 1 in 2600 people in Norway and is found worldwide. CMT Type 1 (CMT1) has slow nerve conduction with demyelinated Schwann cells. Autosomal dominant CMT Type 1B (CMT1B) results from mutations in the myelin protein zero gene which directs the synthesis of more than half of all Schwann cell protein. This gene was mapped to the chromosome 1q22-1q23.1 borderline by fluorescence in situ hybridization. The first 7 of 7 reported CMT1B mutations are unique. Thus the most effective means to identify CMT1B mutations in at-risk family members and fetuses ismore » to sequence the entire coding sequence in dominant or sporadic CMT patients without the CMT1A duplication. Of the 19 primers used in 16 pars to uniquely amplify the entire MPZ coding sequence, 6 primer pairs were used to amplify and sequence the 6 exons. The DyeDeoxy Terminator cycle sequencing method used with four different color fluorescent lables was superior to manual sequencing because it sequences more bases unambiguously from extracted genomic DNA samples within 24 hours. This protocol was used to test 28 CMT and Dejerine-Sottas patients without CMT1A gene duplication. Sequencing MPZ gene-specific amplified fragments identified 9 polymorphic sites within the 6 exons that encode the 248 amino acid MPZ protein. The large number of major CMT1B mutations identified by single strand sequencing are being verified by reverse strand sequencing and when possible, by restriction enzyme analysis. This protocol can be used to distringuish CMT1B patients from othre CMT phenotypes and to determine the CMT1B status of relatives both presymptomatically and prenatally.« less
High-throughput automated microfluidic sample preparation for accurate microbial genomics
Kim, Soohong; De Jonghe, Joachim; Kulesa, Anthony B.; Feldman, David; Vatanen, Tommi; Bhattacharyya, Roby P.; Berdy, Brittany; Gomez, James; Nolan, Jill; Epstein, Slava; Blainey, Paul C.
2017-01-01
Low-cost shotgun DNA sequencing is transforming the microbial sciences. Sequencing instruments are so effective that sample preparation is now the key limiting factor. Here, we introduce a microfluidic sample preparation platform that integrates the key steps in cells to sequence library sample preparation for up to 96 samples and reduces DNA input requirements 100-fold while maintaining or improving data quality. The general-purpose microarchitecture we demonstrate supports workflows with arbitrary numbers of reaction and clean-up or capture steps. By reducing the sample quantity requirements, we enabled low-input (∼10,000 cells) whole-genome shotgun (WGS) sequencing of Mycobacterium tuberculosis and soil micro-colonies with superior results. We also leveraged the enhanced throughput to sequence ∼400 clinical Pseudomonas aeruginosa libraries and demonstrate excellent single-nucleotide polymorphism detection performance that explained phenotypically observed antibiotic resistance. Fully-integrated lab-on-chip sample preparation overcomes technical barriers to enable broader deployment of genomics across many basic research and translational applications. PMID:28128213
1,003 reference genomes of bacterial and archaeal isolates expand coverage of the tree of life
Mukherjee, Supratim; Seshadri, Rekha; Varghese, Neha J.; ...
2017-06-12
We present 1,003 reference genomes that were sequenced as part of the Genomic Encyclopedia of Bacteria and Archaea (GEBA) initiative, selected to maximize sequence coverage of phylogenetic space. These genomes double the number of existing type strains and expand their overall phylogenetic diversity by 25%. Comparative analyses with previously available finished and draft genomes reveal a 10.5% increase in novel protein families as a function of phylogenetic diversity. The GEBA genomes recruit 25 million previously unassigned metagenomic proteins from 4,650 samples, improving their phylogenetic and functional interpretation. We identify numerous biosynthetic clusters and experimentally validate a divergent phenazine cluster withmore » potential new chemical structure and antimicrobial activity. This Resource is the largest single release of reference genomes to date. Bacterial and archaeal isolate sequence space is still far from saturated, and future endeavors in this direction will continue to be a valuable resource for scientific discovery.« less
Nucleotide-Specific Contrast for DNA Sequencing by Electron Spectroscopy
DOE Office of Scientific and Technical Information (OSTI.GOV)
Mankos, Marian; Persson, Henrik H. J.; N’Diaye, Alpha T.
DNA sequencing by imaging in an electron microscope is an approach that holds promise to deliver long reads with low error rates and without the need for amplification. Earlier work using transmission electron microscopes, which use high electron energies on the order of 100 keV, has shown that low contrast and radiation damage necessitates the use of heavy atom labeling of individual nucleotides, which increases the read error rates. Other prior work using scattering electrons with much lower energy has shown to suppress beam damage on DNA. Here we explore possibilities to increase contrast by employing two methods, X-ray photoelectronmore » and Auger electron spectroscopy. Using bulk DNA samples with monomers of each base, both methods are shown to provide contrast mechanisms that can distinguish individual nucleotides without labels. In conclusion, both spectroscopic techniques can be readily implemented in a low energy electron microscope, which may enable label-free DNA sequencing by direct imaging.« less
1,003 reference genomes of bacterial and archaeal isolates expand coverage of the tree of life
DOE Office of Scientific and Technical Information (OSTI.GOV)
Mukherjee, Supratim; Seshadri, Rekha; Varghese, Neha J.
We present 1,003 reference genomes that were sequenced as part of the Genomic Encyclopedia of Bacteria and Archaea (GEBA) initiative, selected to maximize sequence coverage of phylogenetic space. These genomes double the number of existing type strains and expand their overall phylogenetic diversity by 25%. Comparative analyses with previously available finished and draft genomes reveal a 10.5% increase in novel protein families as a function of phylogenetic diversity. The GEBA genomes recruit 25 million previously unassigned metagenomic proteins from 4,650 samples, improving their phylogenetic and functional interpretation. We identify numerous biosynthetic clusters and experimentally validate a divergent phenazine cluster withmore » potential new chemical structure and antimicrobial activity. This Resource is the largest single release of reference genomes to date. Bacterial and archaeal isolate sequence space is still far from saturated, and future endeavors in this direction will continue to be a valuable resource for scientific discovery.« less