Sample records for targets at-rich sequences

  1. Special AT-rich sequence binding protein 1 promotes tumor growth and metastasis of esophageal squamous cell carcinoma.

    PubMed

    Ma, Jun; Wu, Kaiming; Zhao, Zhenxian; Miao, Rong; Xu, Zhe

    2017-03-01

    Esophageal squamous cell carcinoma is one of the most aggressive malignancies worldwide. Special AT-rich sequence binding protein 1 is a nuclear matrix attachment region binding protein which participates in higher order chromatin organization and tissue-specific gene expression. However, the role of special AT-rich sequence binding protein 1 in esophageal squamous cell carcinoma remains unknown. In this study, western blot and quantitative real-time polymerase chain reaction analysis were performed to identify differentially expressed special AT-rich sequence binding protein 1 in a series of esophageal squamous cell carcinoma tissue samples. The effects of special AT-rich sequence binding protein 1 silencing by two short-hairpin RNAs on cell proliferation, migration, and invasion were assessed by the CCK-8 assay and transwell assays in esophageal squamous cell carcinoma in vitro. Special AT-rich sequence binding protein 1 was significantly upregulated in esophageal squamous cell carcinoma tissue samples and cell lines. Silencing of special AT-rich sequence binding protein 1 inhibited the proliferation of KYSE450 and EC9706 cells which have a relatively high level of special AT-rich sequence binding protein 1, and the ability of migration and invasion of KYSE450 and EC9706 cells was distinctly suppressed. Special AT-rich sequence binding protein 1 could be a potential target for the treatment of esophageal squamous cell carcinoma and inhibition of special AT-rich sequence binding protein 1 may provide a new strategy for the prevention of esophageal squamous cell carcinoma invasion and metastasis.

  2. Xenopus origin recognition complex (ORC) initiates DNA replication preferentially at sequences targeted by Schizosaccharomyces pombe ORC

    PubMed Central

    Kong, Daochun; Coleman, Thomas R.; DePamphilis, Melvin L.

    2003-01-01

    Budding yeast (Saccharomyces cerevisiae) origin recognition complex (ORC) requires ATP to bind specific DNA sequences, whereas fission yeast (Schizosaccharomyces pombe) ORC binds to specific, asymmetric A:T-rich sites within replication origins, independently of ATP, and frog (Xenopus laevis) ORC seems to bind DNA non-specifically. Here we show that despite these differences, ORCs are functionally conserved. Firstly, SpOrc1, SpOrc4 and SpOrc5, like those from other eukaryotes, bound ATP and exhibited ATPase activity, suggesting that ATP is required for pre-replication complex (pre-RC) assembly rather than origin specificity. Secondly, SpOrc4, which is solely responsible for binding SpORC to DNA, inhibited up to 70% of XlORC-dependent DNA replication in Xenopus egg extract by preventing XlORC from binding to chromatin and assembling pre-RCs. Chromatin-bound SpOrc4 was located at AT-rich sequences. XlORC in egg extract bound preferentially to asymmetric A:T-sequences in either bare DNA or in sperm chromatin, and it recruited XlCdc6 and XlMcm proteins to these sequences. These results reveal that XlORC initiates DNA replication preferentially at the same or similar sites to those targeted in S.pombe. PMID:12840006
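
As a rough illustration of what "asymmetric A:T-rich" could mean computationally, the sketch below scores fixed-size windows of a DNA strand by AT fraction and by A/T skew. It assumes "asymmetric" refers to A residues being concentrated on one strand and T residues on the other. The function name, window size, step, and skew formula are illustrative choices and are not taken from the paper.

```python
def at_windows(dna: str, size: int = 10, step: int = 5):
    """Yield (start, AT fraction, A/T skew) for each full window.

    Skew is (A - T) / (A + T) on the given strand: values near +1 or -1
    mean the window's A:T pairs put mostly A (or mostly T) on this strand.
    """
    dna = dna.upper()
    for start in range(0, len(dna) - size + 1, step):
        win = dna[start:start + size]
        a, t = win.count("A"), win.count("T")
        at = a + t
        skew = (a - t) / at if at else 0.0  # no A or T in window: define skew as 0
        yield start, at / size, skew


# Made-up sequence: an A-only stretch followed by a GC-only stretch.
for start, at_frac, skew in at_windows("AAAAAAAAAAGCGCGCGCGC", size=10, step=10):
    print(start, at_frac, skew)
```

A window with a high AT fraction and a skew far from zero would be a candidate asymmetric site under this assumption. Any real threshold would have to come from the experimental data.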

  3. The nonamer UUAUUUAUU is the key AU-rich sequence motif that mediates mRNA degradation.

    PubMed Central

    Zubiaga, A M; Belasco, J G; Greenberg, M E

    1995-01-01

    Labile mRNAs that encode cytokine and immediate-early gene products often contain AU-rich sequences within their 3' untranslated region (UTR). These AU-rich sequences appear to be key determinants of the short half-lives of these mRNAs, although the sequence features of these elements and the mechanism by which they target mRNAs for rapid decay have not been fully defined. We have examined the features of AU-rich elements (AREs) that are crucial for their function as determinants of mRNA instability in mammalian cells by testing the ability of various mutant c-fos AREs and synthetic AREs to direct rapid mRNA deadenylation and decay when inserted within the 3' UTR of the normally stable beta-globin mRNA. Evidence is presented that the pentamer AUUUA, which previously was suggested to be the minimal determinant of instability present in mammalian AREs, cannot direct rapid mRNA deadenylation and decay. Instead, the nonamer UUAUUUAUU is the elemental AU-rich sequence motif that destabilizes mRNA. Removal of one uridine residue from either end of the nonamer (UUAUUUAU or UAUUUAUU) results in a decrease of potency of the element, while removal of a uridine residue from both ends of the nonamer (UAUUUAU) eliminates detectable destabilizing activity. The inclusion of an additional uridine residue at both ends of the nonamer (UUUAUUUAUUU) does not further increase the efficacy of the element. Taken together, these findings suggest that the nonamer UUAUUUAUU is the minimal AU-rich motif that effectively destabilizes mRNA. Additional ARE potency is achieved by combining multiple copies of this nonamer in a single mRNA 3' UTR. Furthermore, analysis of poly(A) shortening rates for ARE-containing mRNAs reveals that the UUAUUUAUU sequence also accelerates mRNA deadenylation and suggests that the UUAUUUAUU motif targets mRNA for rapid deadenylation as an early step in the mRNA decay process. PMID:7891716
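
The distinction drawn in this abstract between the nonamer and the shorter pentamer can be explored with a simple motif scan. The sketch below locates every occurrence, including overlapping ones, of UUAUUUAUU and AUUUA in an RNA string. The helper name and the example 3' UTR fragment are made up for illustration. Counting hits says nothing by itself about decay rates.

```python
import re

NONAMER = "UUAUUUAUU"  # motif the abstract identifies as the minimal destabilizing element
PENTAMER = "AUUUA"     # shorter motif the abstract reports as insufficient on its own


def find_motif(rna: str, motif: str) -> list[int]:
    """Return 0-based start positions of every (possibly overlapping) motif hit."""
    rna = rna.upper().replace("T", "U")  # accept DNA-style input as well
    # A zero-width lookahead lets finditer report overlapping occurrences.
    return [m.start() for m in re.finditer(f"(?={re.escape(motif)})", rna)]


utr = "GCCUUAUUUAUUUAUUGGCAUUUAGC"  # made-up 3' UTR fragment
print(find_motif(utr, NONAMER))   # [3, 7]
print(find_motif(utr, PENTAMER))  # [5, 9, 19]
```

In this made-up fragment the last AUUUA (position 19) sits outside any nonamer, which is the kind of site the abstract reports as insufficient for rapid decay.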

  4. Characterization of intronic uridine-rich sequence elements acting as possible targets for nuclear proteins during pre-mRNA splicing in Nicotiana plumbaginifolia.

    PubMed

    Gniadkowski, M; Hemmings-Mieszczak, M; Klahre, U; Liu, H X; Filipowicz, W

    1996-02-15

    Introns of nuclear pre-mRNAs in dicotyledonous plants, unlike introns in vertebrates or yeast, are distinctly rich in A+U nucleotides and this feature is essential for their processing. In order to define more precisely sequence elements important for intron recognition in plants, we investigated the effects of short insertions, either U-rich or A-rich, on splicing of synthetic introns in transfected protoplasts of Nicotiana plumbaginifolia. It was found that insertions of U-rich (sequence UUUUUAU) but not A-rich (AUAAAAA) segments can activate splicing of a GC-rich synthetic intron, and that U-rich segments, or multimers thereof, can function irrespective of the site of insertion within the intron. Insertions of multiple U-rich segments, either at the same or different locations, generally had an additive, stimulatory effect on splicing. Mutational analysis showed that replacement of one or two U residues in the UUUUUAU sequence with A or C residues had only a small effect on splicing, but replacement with G residues was strongly inhibitory. Proteins that interact with fragments of natural and synthetic pre-mRNAs in vitro were identified in nuclear extracts of N. plumbaginifolia by UV cross-linking. The profile of cross-linked plant proteins was considerably less complex than that obtained with a HeLa cell nuclear extract. Two major cross-linkable plant proteins had apparent molecular masses of 50 and 54 kDa and showed affinity for oligouridylates present in synGC introns or for poly(U).

  5. Characterization of intronic uridine-rich sequence elements acting as possible targets for nuclear proteins during pre-mRNA splicing in Nicotiana plumbaginifolia.

    PubMed Central

    Gniadkowski, M; Hemmings-Mieszczak, M; Klahre, U; Liu, H X; Filipowicz, W

    1996-01-01

    Introns of nuclear pre-mRNAs in dicotyledonous plants, unlike introns in vertebrates or yeast, are distinctly rich in A+U nucleotides and this feature is essential for their processing. In order to define more precisely sequence elements important for intron recognition in plants, we investigated the effects of short insertions, either U-rich or A-rich, on splicing of synthetic introns in transfected protoplasts of Nicotiana plumbaginifolia. It was found that insertions of U-rich (sequence UUUUUAU) but not A-rich (AUAAAAA) segments can activate splicing of a GC-rich synthetic intron, and that U-rich segments, or multimers thereof, can function irrespective of the site of insertion within the intron. Insertions of multiple U-rich segments, either at the same or different locations, generally had an additive, stimulatory effect on splicing. Mutational analysis showed that replacement of one or two U residues in the UUUUUAU sequence with A or C residues had only a small effect on splicing, but replacement with G residues was strongly inhibitory. Proteins that interact with fragments of natural and synthetic pre-mRNAs in vitro were identified in nuclear extracts of N. plumbaginifolia by UV cross-linking. The profile of cross-linked plant proteins was considerably less complex than that obtained with a HeLa cell nuclear extract. Two major cross-linkable plant proteins had apparent molecular masses of 50 and 54 kDa and showed affinity for oligouridylates present in synGC introns or for poly(U). PMID:8604302

  6. The binding of TIA-1 to RNA C-rich sequences is driven by its C-terminal RRM domain.

    PubMed

    Cruz-Gallardo, Isabel; Aroca, Ángeles; Gunzburg, Menachem J; Sivakumaran, Andrew; Yoon, Je-Hyun; Angulo, Jesús; Persson, Cecilia; Gorospe, Myriam; Karlsson, B Göran; Wilce, Jacqueline A; Díaz-Moreno, Irene

    2014-01-01

    T-cell intracellular antigen-1 (TIA-1) is a key DNA/RNA binding protein that regulates translation by sequestering target mRNAs in stress granules (SG) in response to stress conditions. TIA-1 possesses three RNA recognition motifs (RRM) along with a glutamine-rich domain, with the central domains (RRM2 and RRM3) acting as RNA binding platforms. While the RRM2 domain, which displays high affinity for U-rich RNA sequences, is primarily responsible for interaction with RNA, the contribution of RRM3 to RNA binding as well as the target RNA sequences that it binds preferentially are still unknown. Here we combined nuclear magnetic resonance (NMR) and surface plasmon resonance (SPR) techniques to elucidate the sequence specificity of TIA-1 RRM3. With a novel approach using saturation transfer difference NMR (STD-NMR) to quantify protein-nucleic acid interactions, we demonstrate that isolated RRM3 binds to both C- and U-rich stretches with micromolar affinity. In combination with RRM2 and in the context of full-length TIA-1, RRM3 significantly enhanced the binding to RNA, particularly to cytosine-rich RNA oligos, as assessed by biotinylated RNA pull-down analysis. Our findings provide new insight into the role of RRM3 in regulating TIA-1 binding to C-rich stretches, which are abundant at the 5' TOPs (5' terminal oligopyrimidine tracts) of mRNAs whose translation is repressed under stress situations.

  7. The binding of TIA-1 to RNA C-rich sequences is driven by its C-terminal RRM domain

    PubMed Central

    Cruz-Gallardo, Isabel; Aroca, Ángeles; Gunzburg, Menachem J; Sivakumaran, Andrew; Yoon, Je-Hyun; Angulo, Jesús; Persson, Cecilia; Gorospe, Myriam; Karlsson, B Göran; Wilce, Jacqueline A; Díaz-Moreno, Irene

    2014-01-01

    T-cell intracellular antigen-1 (TIA-1) is a key DNA/RNA binding protein that regulates translation by sequestering target mRNAs in stress granules (SG) in response to stress conditions. TIA-1 possesses three RNA recognition motifs (RRM) along with a glutamine-rich domain, with the central domains (RRM2 and RRM3) acting as RNA binding platforms. While the RRM2 domain, which displays high affinity for U-rich RNA sequences, is primarily responsible for interaction with RNA, the contribution of RRM3 to RNA binding as well as the target RNA sequences that it binds preferentially are still unknown. Here we combined nuclear magnetic resonance (NMR) and surface plasmon resonance (SPR) techniques to elucidate the sequence specificity of TIA-1 RRM3. With a novel approach using saturation transfer difference NMR (STD-NMR) to quantify protein–nucleic acid interactions, we demonstrate that isolated RRM3 binds to both C- and U-rich stretches with micromolar affinity. In combination with RRM2 and in the context of full-length TIA-1, RRM3 significantly enhanced the binding to RNA, particularly to cytosine-rich RNA oligos, as assessed by biotinylated RNA pull-down analysis. Our findings provide new insight into the role of RRM3 in regulating TIA-1 binding to C-rich stretches, which are abundant at the 5′ TOPs (5′ terminal oligopyrimidine tracts) of mRNAs whose translation is repressed under stress situations. PMID:24824036

  8. Molecular determinants of nucleosome retention at CpG-rich sequences in mouse spermatozoa.

    PubMed

    Erkek, Serap; Hisano, Mizue; Liang, Ching-Yeu; Gill, Mark; Murr, Rabih; Dieker, Jürgen; Schübeler, Dirk; van der Vlag, Johan; Stadler, Michael B; Peters, Antoine H F M

    2013-07-01

    In mammalian spermatozoa, most but not all of the genome is densely packaged by protamines. Here we reveal the molecular logic underlying the retention of nucleosomes in mouse spermatozoa, which contain only 1% residual histones. We observe high enrichment throughout the genome of nucleosomes at CpG-rich sequences that lack DNA methylation. Residual nucleosomes are largely composed of the histone H3.3 variant and are trimethylated at Lys4 of histone H3 (H3K4me3). Canonical H3.1 and H3.2 histones are also enriched at CpG-rich promoters marked by Polycomb-mediated H3K27me3, a modification predictive of gene repression in preimplantation embryos. Histone variant-specific nucleosome retention in sperm is strongly associated with nucleosome turnover in round spermatids. Our data show evolutionary conservation of the basic principles of nucleosome retention in mouse and human sperm, supporting a model of epigenetic inheritance by nucleosomes between generations.

  9. Isolation and characterization of target sequences of the chicken CdxA homeobox gene.

    PubMed Central

    Margalit, Y; Yarus, S; Shapira, E; Gruenbaum, Y; Fainsod, A

    1993-01-01

    The DNA binding specificity of the chicken homeodomain protein CDXA was studied. Using a CDXA-glutathione-S-transferase fusion protein, DNA fragments containing the binding site for this protein were isolated. The sources of DNA were oligonucleotides with random sequence and chicken genomic DNA. The DNA fragments isolated were sequenced and tested in DNA binding assays. Sequencing revealed that most DNA fragments are AT rich, which is a common feature of homeodomain binding sites. By electrophoretic mobility shift assays it was shown that the different target sequences isolated bind to the CDXA protein with different affinities. The specific sequences bound by the CDXA protein in the isolated genomic fragments were determined by DNase I footprinting. From the footprinted sequences, the CDXA consensus binding site was determined. The CDXA protein binds the consensus sequence A, A/T, T, A/T, A, T, A/G. The CAUDAL binding site in the ftz promoter is also included in this consensus sequence. When tested, some of the genomic target sequences were capable of enhancing the transcriptional activity of reporter plasmids when introduced into CDXA expressing cells. This study determined the DNA sequence specificity of the CDXA protein and it also shows that this protein can further activate transcription in cells in culture. PMID:7909943
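
The consensus given above (A, A/T, T, A/T, A, T, A/G) maps directly onto a regular expression with one character class per degenerate position. The sketch below scans one DNA strand for candidate sites. The function name and the example sequence are hypothetical, and a regex match only indicates agreement with the consensus, not measured binding affinity.

```python
import re

# Consensus from the abstract: A, A/T, T, A/T, A, T, A/G.
# The capture group sits inside a lookahead so overlapping sites are reported.
CDXA_SITE = re.compile(r"(?=(A[AT]T[AT]AT[AG]))")


def find_sites(dna: str) -> list[tuple[int, str]]:
    """Return (0-based start, matched 7-mer) for each consensus hit on this strand."""
    dna = dna.upper()
    return [(m.start(), m.group(1)) for m in CDXA_SITE.finditer(dna)]


seq = "GGCATTTATGCCAATAATAGG"  # made-up sequence containing two candidate sites
print(find_sites(seq))  # [(3, 'ATTTATG'), (12, 'AATAATA')]
```

Only the strand as written is scanned here. Checking the reverse complement as well would be a natural extension.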

  10. Unlocking hidden genomic sequence

    PubMed Central

    Keith, Jonathan M.; Cochran, Duncan A. E.; Lala, Gita H.; Adams, Peter; Bryant, Darryn; Mitchelson, Keith R.

    2004-01-01

    Despite the success of conventional Sanger sequencing, significant regions of many genomes still present major obstacles to sequencing. Here we propose a novel approach with the potential to alleviate a wide range of sequencing difficulties. The technique involves extracting target DNA sequence from variants generated by introduction of random mutations. The introduction of mutations does not destroy original sequence information, but distributes it amongst multiple variants. Some of these variants lack problematic features of the target and are more amenable to conventional sequencing. The technique has been successfully demonstrated with mutation levels up to an average 18% base substitution and has been used to read previously intractable poly(A), AT-rich and GC-rich motifs. PMID:14973330

  11. Analysis and Visualization Tool for Targeted Amplicon Bisulfite Sequencing on Ion Torrent Sequencers

    PubMed Central

    Pabinger, Stephan; Ernst, Karina; Pulverer, Walter; Kallmeyer, Rainer; Valdes, Ana M.; Metrustry, Sarah; Katic, Denis; Nuzzo, Angelo; Kriegner, Albert; Vierlinger, Klemens; Weinhaeusel, Andreas

    2016-01-01

    Targeted sequencing of PCR amplicons generated from bisulfite deaminated DNA is a flexible, cost-effective way to study methylation of a sample at single CpG resolution and perform subsequent multi-target, multi-sample comparisons. Currently, no platform specific protocol, support, or analysis solution is provided to perform targeted bisulfite sequencing on a Personal Genome Machine (PGM). Here, we present a novel tool, called TABSAT, for analyzing targeted bisulfite sequencing data generated on Ion Torrent sequencers. The workflow starts with raw sequencing data, performs quality assessment, and uses a tailored version of Bismark to map the reads to a reference genome. The pipeline visualizes results as lollipop plots and is able to deduce specific methylation-patterns present in a sample. The obtained profiles are then summarized and compared between samples. In order to assess the performance of the targeted bisulfite sequencing workflow, 48 samples were used to generate 53 different Bisulfite-Sequencing PCR amplicons from each sample, resulting in 2,544 amplicon targets. We obtained a mean coverage of 282X using 1,196,822 aligned reads. Next, we compared the sequencing results of these targets to the methylation level of the corresponding sites on an Illumina 450k methylation chip. The calculated average Pearson correlation coefficient of 0.91 confirms the sequencing results with one of the industry-leading CpG methylation platforms and shows that targeted amplicon bisulfite sequencing provides an accurate and cost-efficient method for DNA methylation studies, e.g., to provide platform-independent confirmation of Illumina Infinium 450k methylation data. TABSAT offers a novel way to analyze data generated by Ion Torrent instruments and can also be used with data from the Illumina MiSeq platform. It can be easily accessed via the Platomics platform, which offers a web-based graphical user interface along with sample and parameter storage. TABSAT is freely

  12. Microfluidic droplet enrichment for targeted sequencing

    PubMed Central

    Eastburn, Dennis J.; Huang, Yong; Pellegrino, Maurizio; Sciambi, Adam; Ptáček, Louis J.; Abate, Adam R.

    2015-01-01

    Targeted sequence enrichment enables better identification of genetic variation by providing increased sequencing coverage for genomic regions of interest. Here, we report the development of a new target enrichment technology that is highly differentiated from other approaches currently in use. Our method, MESA (Microfluidic droplet Enrichment for Sequence Analysis), isolates genomic DNA fragments in microfluidic droplets and performs TaqMan PCR reactions to identify droplets containing a desired target sequence. The TaqMan positive droplets are subsequently recovered via dielectrophoretic sorting, and the TaqMan amplicons are removed enzymatically prior to sequencing. We demonstrated the utility of this approach by generating an average 31.6-fold sequence enrichment across 250 kb of targeted genomic DNA from five unique genomic loci. Significantly, this enrichment enabled a more comprehensive identification of genetic polymorphisms within the targeted loci. MESA requires low amounts of input DNA, minimal prior locus sequence information and enriches the target region without PCR bias or artifacts. These features make it well suited for the study of genetic variation in a number of research and diagnostic applications. PMID:25873629

  13. A programmable method for massively parallel targeted sequencing

    PubMed Central

    Hopmans, Erik S.; Natsoulis, Georges; Bell, John M.; Grimes, Susan M.; Sieh, Weiva; Ji, Hanlee P.

    2014-01-01

    We have developed a targeted resequencing approach referred to as Oligonucleotide-Selective Sequencing. In this study, we report a series of significant improvements and novel applications of this method whereby the surface of a sequencing flow cell is modified in situ to capture specific genomic regions of interest from a sample and then sequenced. These improvements include a fully automated targeted sequencing platform through the use of a standard Illumina cBot fluidics station. Targeting optimization increased the yield of total on-target sequencing data 2-fold compared to the previous iteration, while simultaneously increasing the percentage of reads that could be mapped to the human genome. The described assays cover up to 1421 genes with a total coverage of 5.5 Megabases (Mb). We demonstrate a 10-fold abundance uniformity of greater than 90% in 1 log distance from the median and a targeting rate of up to 95%. We also sequenced continuous genomic loci up to 1.5 Mb while simultaneously genotyping SNPs and genes. Variants with low minor allele fraction were sensitively detected at levels of 5%. Finally, we determined the exact breakpoint sequence of cancer rearrangements. Overall, this approach has high performance for selective sequencing of genome targets, configuration flexibility and variant calling accuracy. PMID:24782526

  14. Targeted therapy according to next generation sequencing-based panel sequencing.

    PubMed

    Saito, Motonobu; Momma, Tomoyuki; Kono, Koji

    2018-04-17

    Targeted therapy against actionable gene mutations shows a significantly higher response rate as well as longer survival compared to conventional chemotherapy, and has become a standard therapy for many cancers. Recent progress in next-generation sequencing (NGS) has enabled the identification of a huge number of genetic aberrations. Based on sequencing results, patients are recommended to undergo targeted therapy or immunotherapy. In cases where there are no approved drugs available for the genetic mutations detected in a patient, it is recommended to facilitate registration for clinical trials. For that purpose, an NGS-based sequencing panel that can simultaneously target multiple genes in a single investigation has been used in daily clinical practice. To date, various types of sequencing panels have been developed to investigate genetic aberrations with tumor somatic genome variants (gain-of-function or loss-of-function mutations, high-level copy number alterations, and gene fusions) through comprehensive bioinformatics. Because sequencing panels are efficient and cost-effective, they are quickly being adopted outside the lab, in hospitals and clinics, in order to identify personal targeted therapy for individual cancer patients.

  15. G-quadruplex and G-rich sequence stimulate Pif1p-catalyzed downstream duplex DNA unwinding through reducing waiting time at ss/dsDNA junction

    PubMed Central

    Zhang, Bo; Wu, Wen-Qiang; Liu, Na-Nv; Duan, Xiao-Lei; Li, Ming; Dou, Shuo-Xing; Hou, Xi-Miao; Xi, Xu-Guang

    2016-01-01

    Alternative DNA structures that deviate from B-form double-stranded DNA such as G-quadruplex (G4) DNA can be formed by G-rich sequences that are widely distributed throughout the human genome. We have previously shown that Pif1p not only unfolds G4, but also unwinds the downstream duplex DNA in a G4-stimulated manner. In the present study, we further characterized the G4-stimulated duplex DNA unwinding phenomenon by means of single-molecule fluorescence resonance energy transfer. It was found that Pif1p did not unwind the partial duplex DNA immediately after unfolding the upstream G4 structure, but rather, it would dwell at the ss/dsDNA junction with a ‘waiting time’. Further studies revealed that the waiting time was in fact related to a protein dimerization process that was sensitive to ssDNA sequence and would become rapid if the sequence is G-rich. Furthermore, we identified that the G-rich sequence, as the G4 structure, equally stimulates duplex DNA unwinding. The present work sheds new light on the molecular mechanism by which G4-unwinding helicase Pif1p resolves physiological G4/duplex DNA structures in cells. PMID:27471032

  16. Targeted Capture and High-Throughput Sequencing Using Molecular Inversion Probes (MIPs).

    PubMed

    Cantsilieris, Stuart; Stessman, Holly A; Shendure, Jay; Eichler, Evan E

    2017-01-01

    Molecular inversion probes (MIPs) in combination with massively parallel DNA sequencing represent a versatile, yet economical tool for targeted sequencing of genomic DNA. Several thousand genomic targets can be selectively captured using long oligonucleotides containing unique targeting arms and universal linkers. The ability to append sequencing adaptors and sample-specific barcodes allows large-scale pooling and subsequent high-throughput sequencing at relatively low cost per sample. Here, we describe a "wet bench" protocol detailing the capture and subsequent sequencing of >2000 genomic targets from 192 samples, representative of a single lane on the Illumina HiSeq 2000 platform.

  17. Molecular equilibria and condensation sequences in carbon rich gases

    NASA Technical Reports Server (NTRS)

    Sharp, C. M.; Wasserburg, G. J.

    1993-01-01

    Chemical equilibria in stellar atmospheres have been investigated by many authors. Lattimer, Schramm, and Grossman presented calculations in both O rich and C rich environments and predicted possible presolar condensates. A recent paper by Cherchneff and Barker considered a C rich composition with PAH's included in the calculations. However, the condensation sequences of C bearing species have not been investigated in detail. In a carbon rich gas surrounding an AGB star, it is often assumed that graphite (or diamond) condenses out before TiC and SiC. However, Lattimer et al. found some conditions under which TiC condenses before graphite. We have performed molecular equilibrium calculations to establish the stability fields of C(s), TiC(s), and SiC(s) and other high temperature phases under conditions of different pressures and C/O. The preserved presolar interstellar dust grains so far discovered in meteorites are graphite, diamond, SiC, TiC, and possibly Al2O3.

  18. Human CST Facilitates Genome-wide RAD51 Recruitment to GC-Rich Repetitive Sequences in Response to Replication Stress.

    PubMed

    Chastain, Megan; Zhou, Qing; Shiva, Olga; Fadri-Moskwik, Maria; Whitmore, Leanne; Jia, Pingping; Dai, Xueyu; Huang, Chenhui; Ye, Ping; Chai, Weihang

    2016-08-02

    The telomeric CTC1/STN1/TEN1 (CST) complex has been implicated in promoting replication recovery under replication stress at genomic regions, yet its precise role is unclear. Here, we report that STN1 is enriched at GC-rich repetitive sequences genome-wide in response to hydroxyurea (HU)-induced replication stress. STN1 deficiency exacerbates the fragility of these sequences under replication stress, resulting in chromosome fragmentation. We find that upon fork stalling, CST proteins form distinct nuclear foci that colocalize with RAD51. Furthermore, replication stress induces physical association of CST with RAD51 in an ATR-dependent manner. Strikingly, CST deficiency diminishes HU-induced RAD51 foci formation and reduces RAD51 recruitment to telomeres and non-telomeric GC-rich fragile sequences. Collectively, our findings establish that CST promotes RAD51 recruitment to GC-rich repetitive sequences in response to replication stress to facilitate replication restart, thereby providing insights into the mechanism underlying genome stability maintenance. Copyright © 2016 The Author(s). Published by Elsevier Inc. All rights reserved.

  19. A tale of two sequences: microRNA-target chimeric reads.

    PubMed

    Broughton, James P; Pasquinelli, Amy E

    2016-04-04

    In animals, a functional interaction between a microRNA (miRNA) and its target RNA requires only partial base pairing. The limited number of base pair interactions required for miRNA targeting provides miRNAs with broad regulatory potential and also makes target prediction challenging. Computational approaches to target prediction have focused on identifying miRNA target sites based on known sequence features that are important for canonical targeting and may miss non-canonical targets. Current state-of-the-art experimental approaches, such as CLIP-seq (cross-linking immunoprecipitation with sequencing), PAR-CLIP (photoactivatable-ribonucleoside-enhanced CLIP), and iCLIP (individual-nucleotide resolution CLIP), require inference of which miRNA is bound at each site. Recently, the development of methods to ligate miRNAs to their target RNAs during the preparation of sequencing libraries has provided a new tool for the identification of miRNA target sites. The chimeric, or hybrid, miRNA-target reads that are produced by these methods unambiguously identify the miRNA bound at a specific target site. The information provided by these chimeric reads has revealed extensive non-canonical interactions between miRNAs and their target mRNAs, and identified many novel interactions between miRNAs and noncoding RNAs.

  20. Accurate and exact CNV identification from targeted high-throughput sequence data.

    PubMed

    Nord, Alex S; Lee, Ming; King, Mary-Claire; Walsh, Tom

    2011-04-12

    Massively parallel sequencing of barcoded DNA samples significantly increases screening efficiency for clinically important genes. Short read aligners are well suited to single nucleotide and indel detection. However, methods for CNV detection from targeted enrichment are lacking. We present a method combining coverage with map information for the identification of deletions and duplications in targeted sequence data. Sequencing data is first scanned for gains and losses using a comparison of normalized coverage data between samples. CNV calls are confirmed by testing for a signature of sequences that span the CNV breakpoint. With our method, CNVs can be identified regardless of whether breakpoints are within regions targeted for sequencing. For CNVs where at least one breakpoint is within targeted sequence, exact CNV breakpoints can be identified. In a test data set of 96 subjects sequenced across ~1 Mb genomic sequence using multiplexing technology, our method detected mutations as small as 31 bp, predicted quantitative copy count, and had a low false-positive rate. Application of this method allows for identification of gains and losses in targeted sequence data, providing comprehensive mutation screening when combined with a short read aligner.
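
The first step described above, scanning for gains and losses by comparing normalized coverage between samples, can be sketched as follows. This is a minimal illustration, not the authors' method: the function names, the use of a per-target median over control samples, and the 0.7 and 1.3 ratio thresholds are all assumptions. The breakpoint-confirmation step is not covered.

```python
from statistics import median


def normalize(depths: list[float]) -> list[float]:
    """Scale per-target depths to sum to 1, removing library-size differences."""
    total = sum(depths)
    return [d / total for d in depths]


def call_cnv(sample: list[float], controls: list[list[float]],
             loss: float = 0.7, gain: float = 1.3) -> list[str]:
    """Label each target 'loss', 'gain', or 'normal' from the ratio of the
    sample's normalized coverage to the per-target median of the controls.
    Assumes every control has nonzero coverage on every target."""
    s = normalize(sample)
    ctrl = [normalize(c) for c in controls]
    calls = []
    for i, value in enumerate(s):
        ratio = value / median(c[i] for c in ctrl)
        calls.append("loss" if ratio < loss else "gain" if ratio > gain else "normal")
    return calls


# Made-up read depths over four targets; the sample has half depth on target 2.
controls = [[100, 100, 100, 100], [200, 200, 200, 200], [150, 150, 150, 150]]
print(call_cnv([100, 50, 100, 100], controls))
```

Because normalization uses the sample's own total, a large event shifts the ratios of the unaffected targets slightly. A real pipeline would need to account for that.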

  1. Highly multiplexed targeted DNA sequencing from single nuclei.

    PubMed

    Leung, Marco L; Wang, Yong; Kim, Charissa; Gao, Ruli; Jiang, Jerry; Sei, Emi; Navin, Nicholas E

    2016-02-01

    Single-cell DNA sequencing methods are challenged by poor physical coverage, high technical error rates and low throughput. To address these issues, we developed a single-cell DNA sequencing protocol that combines flow-sorting of single nuclei, time-limited multiple-displacement amplification (MDA), low-input library preparation, DNA barcoding, targeted capture and next-generation sequencing (NGS). This approach represents a major improvement over our previous single nucleus sequencing (SNS) Nature Protocols paper in terms of generating higher-coverage data (>90%), thereby enabling the detection of genome-wide variants in single mammalian cells at base-pair resolution. Furthermore, by pooling 48-96 single-cell libraries together for targeted capture, this approach can be used to sequence many single-cell libraries in parallel in a single reaction. This protocol greatly reduces the cost of single-cell DNA sequencing, and it can be completed in 5-6 d by advanced users. This single-cell DNA sequencing protocol has broad applications for studying rare cells and complex populations in diverse fields of biological research and medicine.

  2. Targeted RNA-Sequencing with Competitive Multiplex-PCR Amplicon Libraries

    PubMed Central

    Blomquist, Thomas M.; Crawford, Erin L.; Lovett, Jennie L.; Yeo, Jiyoun; Stanoszek, Lauren M.; Levin, Albert; Li, Jia; Lu, Mei; Shi, Leming; Muldrew, Kenneth; Willey, James C.

    2013-01-01

    Whole transcriptome RNA-sequencing is a powerful tool, but is costly and yields complex data sets that limit its utility in molecular diagnostic testing. A targeted quantitative RNA-sequencing method that is reproducible and reduces the number of sequencing reads required to measure transcripts over the full range of expression would be better suited to diagnostic testing. Toward this goal, we developed a competitive multiplex PCR-based amplicon sequencing library preparation method that a) targets only the sequences of interest and b) controls for inter-target variation in PCR amplification during library preparation by measuring each transcript native template relative to a known number of synthetic competitive template internal standard copies. To determine the utility of this method, we intentionally selected PCR conditions that would cause transcript amplification products (amplicons) to converge toward equimolar concentrations (normalization) during library preparation. We then tested whether this approach would enable accurate and reproducible quantification of each transcript across multiple library preparations, and at the same time reduce (through normalization) total sequencing reads required for quantification of transcript targets across a large range of expression. We demonstrate excellent reproducibility (R^2 = 0.997) with 97% accuracy to detect 2-fold change using External RNA Controls Consortium (ERCC) reference materials; high inter-day, inter-site and inter-library concordance (R^2 = 0.97–0.99) using FDA Sequencing Quality Control (SEQC) reference materials; and cross-platform concordance with both TaqMan qPCR (R^2 = 0.96) and whole transcriptome RNA-sequencing following “traditional” library preparation using Illumina NGS kits (R^2 = 0.94). Using this method, sequencing reads required to accurately quantify more than 100 targeted transcripts expressed over a 10^7-fold range was reduced more than 10,000-fold, from 2.3×10^9 to 1
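    The quantification principle in point b), measuring each native template relative to a known number of internal standard copies, reduces to a simple ratio. The sketch below assumes that native and competitive templates amplify with the same efficiency; the function name and the read counts are hypothetical and are not taken from the paper.

```python
def native_copies(native_reads, standard_reads, standard_copies):
    """Estimate native template copies from the native : internal-standard read ratio.

    Assumes both templates amplify with equal efficiency, so any shared
    amplification factor cancels out of the ratio.
    """
    if standard_reads <= 0:
        raise ValueError("internal standard must have at least one read")
    return native_reads / standard_reads * standard_copies

# Hypothetical counts: 1000 internal-standard copies were spiked in.
estimate = native_copies(native_reads=300, standard_reads=100, standard_copies=1000)
# The same target after 7-fold more amplification of both templates.
estimate_amplified = native_copies(native_reads=2100, standard_reads=700, standard_copies=1000)
print(estimate, estimate_amplified)
```

    Because only the ratio matters, driving amplicons toward equimolar concentrations changes how many reads each target receives without changing the estimate.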

  3. Single molecule targeted sequencing for cancer gene mutation detection.

    PubMed

    Gao, Yan; Deng, Liwei; Yan, Qin; Gao, Yongqian; Wu, Zengding; Cai, Jinsen; Ji, Daorui; Li, Gailing; Wu, Ping; Jin, Huan; Zhao, Luyang; Liu, Song; Ge, Liangjin; Deem, Michael W; He, Jiankui

    2016-05-19

    With the rapid decline in cost of sequencing, it is now affordable to examine multiple genes in a single disease-targeted clinical test using next generation sequencing. Current targeted sequencing methods require a separate step of targeted capture enrichment during sample preparation before sequencing. Although there are fast sample preparation methods available on the market, the library preparation process is still relatively complicated for physicians to use routinely. Here, we introduced an amplification-free Single Molecule Targeted Sequencing (SMTS) technology, which combined targeted capture and sequencing in one step. We demonstrated that this technology can detect low-frequency mutations using an artificially synthesized DNA sample. SMTS has several potential advantages, including simple sample preparation, so that no biases or errors are introduced by a PCR reaction. SMTS has the potential to be an easy and quick sequencing technology for clinical diagnosis such as cancer gene mutation detection, infectious disease detection, inherited condition screening and noninvasive prenatal diagnosis.

  4. A long-term target detection approach in infrared image sequence

    NASA Astrophysics Data System (ADS)

    Li, Hang; Zhang, Qi; Wang, Xin; Hu, Chao

    2016-10-01

    An automatic target detection method used in long-term infrared (IR) image sequence from a moving platform is proposed. Firstly, based on POME (the principle of maximum entropy), target candidates are iteratively segmented. Then the real target is captured via two different selection approaches. At the beginning of image sequence, the genuine target with little texture is discriminated from other candidates by using contrast-based confidence measure. On the other hand, when the target becomes larger, we apply online EM method to estimate and update the distributions of target's size and position based on the prior detection results, and then recognize the genuine one which satisfies both the constraints of size and position. Experimental results demonstrate that the presented method is accurate, robust and efficient.

  5. A novel class of plant-specific zinc-dependent DNA-binding protein that binds to A/T-rich DNA sequences

    PubMed Central

    Nagano, Yukio; Furuhashi, Hirofumi; Inaba, Takehito; Sasaki, Yukiko

    2001-01-01

    Complementary DNA encoding a DNA-binding protein, designated PLATZ1 (plant AT-rich sequence- and zinc-binding protein 1), was isolated from peas. The amino acid sequence of the protein is similar to those of other uncharacterized proteins predicted from the genome sequences of higher plants. However, no paralogous sequences have been found outside the plant kingdom. Multiple alignments among these paralogous proteins show that several cysteine and histidine residues are invariant, suggesting that these proteins are a novel class of zinc-dependent DNA-binding proteins with two distantly located regions, C-x2-H-x11-C-x2-C-x(4–5)-C-x2-C-x(3–7)-H-x2-H and C-x2-C-x(10–11)-C-x3-C. In an electrophoretic mobility shift assay, the zinc chelator 1,10-o-phenanthroline inhibited DNA binding, and two distant zinc-binding regions were required for DNA binding. A protein blot with 65ZnCl2 showed that both regions are required for zinc-binding activity. The PLATZ1 protein non-specifically binds to A/T-rich sequences, including the upstream region of the pea GTPase pra2 and plastocyanin petE genes. Expression of the PLATZ1 repressed those of the reporter constructs containing the coding sequence of luciferase gene driven by the cauliflower mosaic virus (CaMV) 35S90 promoter fused to the tandem repeat of the A/T-rich sequences. These results indicate that PLATZ1 is a novel class of plant-specific zinc-dependent DNA-binding protein responsible for A/T-rich sequence-mediated transcriptional repression. PMID:11600698

  6. HybPiper: Extracting coding sequence and introns for phylogenetics from high-throughput sequencing reads using target enrichment

    PubMed Central

    Johnson, Matthew G.; Gardner, Elliot M.; Liu, Yang; Medina, Rafael; Goffinet, Bernard; Shaw, A. Jonathan; Zerega, Nyree J. C.; Wickett, Norman J.

    2016-01-01

    Premise of the study: Using sequence data generated via target enrichment for phylogenetics requires reassembly of high-throughput sequence reads into loci, presenting a number of bioinformatics challenges. We developed HybPiper as a user-friendly platform for assembly of gene regions, extraction of exon and intron sequences, and identification of paralogous gene copies. We test HybPiper using baits designed to target 333 phylogenetic markers and 125 genes of functional significance in Artocarpus (Moraceae). Methods and Results: HybPiper implements parallel execution of sequence assembly in three phases: read mapping, contig assembly, and target sequence extraction. The pipeline was able to recover nearly complete gene sequences for all genes in 22 species of Artocarpus. HybPiper also recovered more than 500 bp of nontargeted intron sequence in over half of the phylogenetic markers and identified paralogous gene copies in Artocarpus. Conclusions: HybPiper was designed for Linux and Mac OS X and is freely available at https://github.com/mossmatters/HybPiper. PMID:27437175

  7. Physics with Heavy Neutron Rich RIBs at the HRIBF

    NASA Astrophysics Data System (ADS)

    Radford, David

    2002-10-01

    The Holifield Radioactive Ion Beam Facility at the Oak Ridge National Laboratory has recently produced the world's first post-accelerated beams of heavy neutron-rich nuclei. B(E2; 0^+ → 2^+) values for neutron-rich ^126,128Sn and ^132,134,136Te isotopes have been measured by Coulomb excitation of radioactive ion beams in inverse kinematics. The results for ^132Te and ^134Te (N=80,82) show excellent agreement with systematics of lighter Te isotopes, but the B(E2) value for ^136Te (N=84) is unexpectedly small. Single-neutron transfer reactions leading to ^135Te were identified using a ^134Te beam on ^natBe and ^13C targets at energies just above the Coulomb barrier. The use of the Be target provided an unambiguous signature for neutron transfer through the detection of two correlated α particles, arising from the breakup of unstable ^8Be. The results of these experiments will be discussed, together with plans for future experiments with these heavy n-rich RIBs.

  8. A long-term target detection approach in infrared image sequence

    NASA Astrophysics Data System (ADS)

    Li, Hang; Zhang, Qi; Li, Yuanyuan; Wang, Liqiang

    2015-12-01

    An automatic target detection method used in long-term infrared (IR) image sequence from a moving platform is proposed. Firstly, based on non-linear histogram equalization, target candidates are coarse-to-fine segmented by using two self-adapt thresholds generated in the intensity space. Then the real target is captured via two different selection approaches. At the beginning of image sequence, the genuine target with little texture is discriminated from other candidates by using contrast-based confidence measure. On the other hand, when the target becomes larger, we apply online EM method to iteratively estimate and update the distributions of target's size and position based on the prior detection results, and then recognize the genuine one which satisfies both the constraints of size and position. Experimental results demonstrate that the presented method is accurate, robust and efficient.

  9. Increased targeting of adenine-rich sequences by (2-amino-2-methyl-3-butanone oxime)dichloroplatinum(II) and investigations into its low cytotoxicity.

    PubMed

    Hambley, T W; Ling, E C; O'Mara, S; McKeage, M J; Russell, P J

    2000-12-01

    Using assays based on the inhibition of restriction enzyme cleavage of plasmid and synthetic DNA, the complex (2-amino-2-methyl-3-butanone oxime)dichloroplatinum(II), [PtCl2(ambo)], has been shown to have an increased tendency for binding to adenine-rich sequences when compared to cis[PtCl2(NH3)2] (cisplatin). [PtCl2(ambo)] was found to form substantially fewer interstrand adducts than does cisplatin. The in vitro cytotoxicity of [PtCl2(ambo)] against a human bladder cancer cell line was determined and found to be more than two orders of magnitude lower than that of cisplatin, yet it was also found to be equally effective at passing into cells and binding to isolated DNA.

  10. [Target gene sequence capture and next generation sequencing technology to diagnose four children with Alagille syndrome].

    PubMed

    Gao, M L; Zhong, X M; Ma, X; Ning, H J; Zhu, D; Zou, J Z

    2016-06-02

    To make genetic diagnosis of Alagille syndrome (ALGS) patients using target gene sequence capture and next generation sequencing technology. Target gene sequence capture and next generation sequencing were used to detect ALGS gene of 4 patients. They were hospitalized at the Affiliated Hospital, Capital Institute of Pediatrics between January 2014 and December 2015, referred to clinical diagnosis of ALGS typical and atypical respectively in 2 cases. Blood samples were collected from patients and their parents and genomic DNA was extracted from lymphocytes. Target gene sequence capture and next generation sequencing was detected. Sanger sequencing was used to confirm the results of the patients and their parents. Cholestasis, heart defects, inverted triangular face and butterfly vertebrae were presented as main clinical features in 4 male patients. The first hospital visiting ages ranged from 3 months and 14 days to 3 years and 1 month. The age of onset ranged from 3 days to 42 days (median 23 days). According to the clinical diagnostic criteria of ALGS, patient 1 and patient 2 were considered as typical ALGS. The other 2 patients were considered as atypical ALGS. Four Jagged 1(JAG1) pathogenic mutations were detected. Three different missense mutations were detected in patient 1 to patient 3 with ALGS(c.839C>T(p.W280X), c. 703G>A(p.R235X), c. 1720C>T(p.V574M)). The JAG1 mutation of patient 3 was first reported. Patient 4 had one novel insertion mutation (c.1779_1780insA(p.Ile594AsnfsTer23)). Parental analysis verified that the JAG1 missense mutation of 3 patients were de novo. The results of sanger sequencing was consistent with the results of the next generation sequencing. Target gene sequence capture combined with next generation sequencing can detect two pathogenic genes in ALGS and test genes of other related diseases in infantile cholestatic diseases simultaneously and presents a high throughput, high efficiency and low cost. It may provide molecular

  11. Intravenous phage display identifies peptide sequences that target the burn-injured intestine.

    PubMed

    Costantini, Todd W; Eliceiri, Brian P; Putnam, James G; Bansal, Vishal; Baird, Andrew; Coimbra, Raul

    2012-11-01

    The injured intestine is responsible for significant morbidity and mortality after severe trauma and burn; however, targeting the intestine with therapeutics aimed at decreasing injury has proven difficult. We hypothesized that we could use intravenous phage display technology to identify peptide sequences that target the injured intestinal mucosa in a murine model, and then confirm the cross-reactivity of this peptide sequence with ex vivo human gut. Four hours following 30% TBSA burn we performed an in vivo, intravenous systemic administration of phage library containing 10^12 phage in BALB/c mice to biopan for gut-targeting peptides. In vivo assessment of the candidate peptide sequences identified after 4 rounds of internalization was performed by injecting 1×10^12 copies of each selected phage clone into sham or burned animals. Internalization into the gut was assessed using quantitative polymerase chain reaction. We then incubated this gut-targeting peptide sequence with human intestine and visualized fluorescence using confocal microscopy. We identified 3 gut-targeting peptide sequences which caused collapse of the phage library (4-1: SGHQLLLNKMP, 4-5: ILANDLTAPGPR, 4-11: SFKPSGLPAQSL). Sequence 4-5 was internalized into the intestinal mucosa of burned animals 9.3-fold higher than sham animals injected with the same sequence (2.9×10^5 vs. 3.1×10^4 particles per mg tissue). Sequences 4-1 and 4-11 were both internalized into the gut, but did not demonstrate specificity for the injured mucosa. Phage sequence 4-11 demonstrated cross-reactivity with human intestine. In the future, this gut-targeting peptide sequence could serve as a platform for the delivery of biotherapeutics. Copyright © 2012 Elsevier Inc. All rights reserved.

  12. Targeted exome sequencing of suspected mitochondrial disorders

    PubMed Central

    Lieber, Daniel S.; Calvo, Sarah E.; Shanahan, Kristy; Slate, Nancy G.; Liu, Shangtao; Hershman, Steven G.; Gold, Nina B.; Chapman, Brad A.; Thorburn, David R.; Berry, Gerard T.; Schmahmann, Jeremy D.; Borowsky, Mark L.; Mueller, David M.; Sims, Katherine B.

    2013-01-01

    Objective: To evaluate the utility of targeted exome sequencing for the molecular diagnosis of mitochondrial disorders, which exhibit marked phenotypic and genetic heterogeneity. Methods: We considered a diverse set of 102 patients with suspected mitochondrial disorders based on clinical, biochemical, and/or molecular findings, and whose disease ranged from mild to severe, with varying age at onset. We sequenced the mitochondrial genome (mtDNA) and the exons of 1,598 nuclear-encoded genes implicated in mitochondrial biology, mitochondrial disease, or monogenic disorders with phenotypic overlap. We prioritized variants likely to underlie disease and established molecular diagnoses in accordance with current clinical genetic guidelines. Results: Targeted exome sequencing yielded molecular diagnoses in established disease loci in 22% of cases, including 17 of 18 (94%) with prior molecular diagnoses and 5 of 84 (6%) without. The 5 new diagnoses implicated 2 genes associated with canonical mitochondrial disorders (NDUFV1, POLG2), and 3 genes known to underlie other neurologic disorders (DPYD, KARS, WFS1), underscoring the phenotypic and biochemical overlap with other inborn errors. We prioritized variants in an additional 26 patients, including recessive, X-linked, and mtDNA variants that were enriched 2-fold over background and await further support of pathogenicity. In one case, we modeled patient mutations in yeast to provide evidence that recessive mutations in ATP5A1 can underlie combined respiratory chain deficiency. Conclusion: The results demonstrate that targeted exome sequencing is an effective alternative to the sequential testing of mtDNA and individual nuclear genes as part of the investigation of mitochondrial disease. Our study underscores the ongoing challenge of variant interpretation in the clinical setting. PMID:23596069

  13. Genetic mutations in human rectal cancers detected by targeted sequencing.

    PubMed

    Bai, Jun; Gao, Jinglong; Mao, Zhijun; Wang, Jianhua; Li, Jianhui; Li, Wensheng; Lei, Yu; Li, Shuaishuai; Wu, Zhuo; Tang, Chuanning; Jones, Lindsey; Ye, Hua; Lou, Feng; Liu, Zhiyuan; Dong, Zhishou; Guo, Baishuai; Huang, Xue F; Chen, Si-Yi; Zhang, Enke

    2015-10-01

    Colorectal cancer (CRC) is widespread with significant mortality. Both inherited and sporadic mutations in various signaling pathways influence the development and progression of the cancer. Identifying genetic mutations in CRC is important for optimal patient treatment and many approaches currently exist to uncover these mutations, including next-generation sequencing (NGS) and commercially available kits. In the present study, we used a semiconductor-based targeted DNA-sequencing approach to sequence and identify genetic mutations in 91 human rectal cancer samples. Analysis revealed frequent mutations in KRAS (58.2%), TP53 (28.6%), APC (16.5%), FBXW7 (9.9%) and PIK3CA (9.9%), and additional mutations in BRAF, CTNNB1, ERBB2 and SMAD4 were also detected at lesser frequencies. Thirty-eight samples (41.8%) also contained two or more mutations, with common combination mutations occurring between KRAS and TP53 (42.1%), and KRAS and APC (31.6%). DNA sequencing for individual cancers is of clinical importance for targeted drug therapy and the advantages of such targeted gene sequencing over other NGS platforms or commercially available kits in sensitivity, cost and time effectiveness may aid clinicians in treating CRC patients in the near future.

  14. The environmental impacts on the star formation main sequence: An Hα study of the newly discovered rich cluster at z = 1.52

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Koyama, Yusei; Kodama, Tadayuki; Tadaki, Ken-ichi

    2014-07-01

    We report the discovery of a strong over-density of galaxies in the field of a radio galaxy at z = 1.52 (4C 65.22) based on our broadband and narrow-band (Hα) photometry with the Subaru Telescope. We find that Hα emitters are located in the outskirts of the density peak (cluster core) dominated by passive red-sequence galaxies. This resembles the situation in lower-redshift clusters, suggesting that the newly discovered structure is a well-evolved rich galaxy cluster at z = 1.5. Our data suggest that the color-density and stellar mass-density relations are already in place at z ∼ 1.5, mostly driven by the passive red massive galaxies residing within r_c ≲ 200 kpc from the cluster core. These environmental trends almost disappear when we consider only star-forming (SF) galaxies. We do not find SFR-density or SSFR-density relations amongst SF galaxies, and the location of the SF main sequence does not significantly change with environment. Nevertheless, we find a tentative hint that star-bursting galaxies (up-scattered objects from the main sequence) are preferentially located in a small group at ∼1 Mpc away from the main body of the cluster. We also argue that the scatter of the SF main sequence could be dependent on the distance to the nearest neighboring galaxy.

  15. Neutron-rich isotope production using the uranium carbide multi-foil SPES target prototype

    NASA Astrophysics Data System (ADS)

    Scarpa, D.; Biasetto, L.; Corradetti, S.; Manzolaro, M.; Andrighetto, A.; Carturan, S.; Prete, G.; Zanonato, P.; Stracener, D. W.

    2011-03-01

    In the framework of the R&D program for the SPES (Selective Production of Exotic Species) project of the Istituto Nazionale di Fisica Nucleare (INFN), production yields of neutron-rich isotopes have been measured at the Holifield Radioactive Ion Beam Facility (HRIBF, Oak Ridge National Laboratory, USA). This experiment makes use of the multi-foil SPES target prototype composed of 7 uranium carbide discs, with excess of graphite (ratio C/U = 4). 77 isotopes of medium mass (between 72 and 141 amu), produced via proton-induced fission of uranium using a 40 MeV proton beam, have been collected and analyzed for the target heated at 2000 °C.

  16. Structure and DNA-Binding Sites of the SWI1 AT-rich Interaction Domain (ARID) Suggest Determinants for Sequence-Specific DNA Recognition

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Kim, Suhkmann; Zhang, Ziming; Upchurch, Sean

    2004-04-16

    ARID is a homologous family of DNA-binding domains that occur in DNA binding proteins from a wide variety of species, ranging from yeast to nematodes, insects, mammals and plants. SWI1, a member of the SWI/SNF protein complex that is involved in chromatin remodeling during transcription, contains the ARID motif. The ARID domain of human SWI1 (also known as p270) does not select for a specific DNA sequence from a random sequence pool. The lack of sequence specificity shown by the SWI1 ARID domain stands in contrast to the other characterized ARID domains, which recognize specific AT-rich sequences. We have solved the three-dimensional structure of human SWI1 ARID using solution NMR methods. In addition, we have characterized non-specific DNA-binding by the SWI1 ARID domain. Results from this study indicate that a flexible long internal loop in ARID motif is likely to be important for sequence specific DNA-recognition. The structure of human SWI1 ARID domain also represents a distinct structural subfamily. Studies of ARID indicate that boundary of the DNA binding structural and functional domains can extend beyond the sequence homologous region in a homologous family of proteins. Structural studies of homologous domains such as ARID family of DNA-binding domains should provide information to better predict the boundary of structural and functional domains in structural genomic studies. Key Words: ARID, SWI1, NMR, structural genomics, protein-DNA interaction.

  17. The siRNA Non-seed Region and Its Target Sequences Are Auxiliary Determinants of Off-Target Effects.

    PubMed

    Kamola, Piotr J; Nakano, Yuko; Takahashi, Tomoko; Wilson, Paul A; Ui-Tei, Kumiko

    2015-12-01

    RNA interference (RNAi) is a powerful tool for post-transcriptional gene silencing. However, the siRNA guide strand may bind unintended off-target transcripts via partial sequence complementarity by a mechanism closely mirroring micro RNA (miRNA) silencing. To better understand these off-target effects, we investigated the correlation between sequence features within various subsections of siRNA guide strands, and its corresponding target sequences, with off-target activities. Our results confirm previous reports that strength of base-pairing in the siRNA seed region is the primary factor determining the efficiency of off-target silencing. However, the degree of downregulation of off-target transcripts with shared seed sequence is not necessarily similar, suggesting that there are additional auxiliary factors that influence the silencing potential. Here, we demonstrate that both the melting temperature (Tm) in a subsection of siRNA non-seed region, and the GC contents of its corresponding target sequences, are negatively correlated with the efficiency of off-target effect. Analysis of experimentally validated miRNA targets demonstrated a similar trend, indicating a putative conserved mechanistic feature of seed region-dependent targeting mechanism. These observations may prove useful as parameters for off-target prediction algorithms and improve siRNA 'specificity' design rules.
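    The two sequence features named above, GC content and melting temperature, can be computed for a short oligonucleotide along the following lines. This is a sketch under stated assumptions: the abstract does not say how Tm was calculated, and the Wallace rule used here (2 °C per A/T, 4 °C per G/C) is a crude approximation intended for short DNA oligos, swapped in purely for illustration with U counted like T. The function names and the example sequence are hypothetical.

```python
def gc_content(seq):
    """Fraction of G and C bases in a DNA or RNA sequence."""
    seq = seq.upper()
    if not seq:
        raise ValueError("sequence must not be empty")
    return sum(seq.count(base) for base in "GC") / len(seq)

def wallace_tm(seq):
    """Rough melting temperature in degrees C by the Wallace rule.

    Tm = 2 * (A + T) + 4 * (G + C); U is counted like T. This is an
    illustrative stand-in, not the model used in the study.
    """
    seq = seq.upper()
    weak = sum(seq.count(base) for base in "ATU")
    strong = sum(seq.count(base) for base in "GC")
    return 2 * weak + 4 * strong

# Hypothetical 8-nt stretch, for illustration only.
region = "GCAUGCAU"
print(gc_content(region), wallace_tm(region))
```

    For real design work a nearest-neighbor thermodynamic model would be the more usual choice; the rule above only shows why GC-rich stretches pair more strongly.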

  18. Structural polymorphism of a cytosine-rich DNA sequence forming i-motif structure: Exploring pH based biosensors.

    PubMed

    Ahmed, Saami; Kaushik, Mahima; Chaudhary, Swati; Kukreti, Shrikant

    2018-05-01

    Sequence recognition and conformational polymorphism enable DNA to emerge out as a substantial tool in fabricating the devices within nano-dimensions. These DNA associated nano devices work on the principle of conformational switches, which can be facilitated by many factors like sequence of DNA/RNA strand, change in pH or temperature, enzyme or ligand interactions etc. Thus, controlling these DNA conformational changes to acquire the desired function is significant for evolving DNA hybridization biosensor, used in genetic screening and molecular diagnosis. For exploring this conformational switching ability of cytosine-rich DNA oligonucleotides as a function of pH for their potential usage as biosensors, this study has been designed. A C-rich stretch of DNA sequence (5'-TCCCCCAATTAATTCCCCCA-3'; SG20c) has been investigated using UV-Thermal denaturation, poly-acrylamide gel electrophoresis and CD spectroscopy. The SG20c sequence is shown to adopt various topologies of i-motif structure at low pH. This pH dependent transition of SG20c from unstructured single strand to unimolecular and bimolecular i-motif structures can further be exploited for its utilization as switching on/off pH-based biosensors. Copyright © 2018. Published by Elsevier B.V.

  19. Optimizing Illumina next-generation sequencing library preparation for extremely AT-biased genomes.

    PubMed

    Oyola, Samuel O; Otto, Thomas D; Gu, Yong; Maslen, Gareth; Manske, Magnus; Campino, Susana; Turner, Daniel J; Macinnis, Bronwyn; Kwiatkowski, Dominic P; Swerdlow, Harold P; Quail, Michael A

    2012-01-03

    Massively parallel sequencing technology is revolutionizing approaches to genomic and genetic research. Since its advent, the scale and efficiency of Next-Generation Sequencing (NGS) has rapidly improved. In spite of this success, sequencing genomes or genomic regions with extremely biased base composition is still a great challenge to the currently available NGS platforms. The genomes of some important pathogenic organisms like Plasmodium falciparum (high AT content) and Mycobacterium tuberculosis (high GC content) display extremes of base composition. The standard library preparation procedures that employ PCR amplification have been shown to cause uneven read coverage particularly across AT and GC rich regions, leading to problems in genome assembly and variation analyses. Alternative library-preparation approaches that omit PCR amplification require large quantities of starting material and hence are not suitable for small amounts of DNA/RNA such as those from clinical isolates. We have developed and optimized library-preparation procedures suitable for low quantity starting material and tolerant to extremely high AT content sequences. We have used our optimized conditions in parallel with standard methods to prepare Illumina sequencing libraries from a non-clinical and a clinical isolate (containing ~53% host contamination). By analyzing and comparing the quality of sequence data generated, we show that our optimized conditions that involve a PCR additive (TMAC), produces amplified libraries with improved coverage of extremely AT-rich regions and reduced bias toward GC neutral templates. We have developed a robust and optimized Next-Generation Sequencing library amplification method suitable for extremely AT-rich genomes. The new amplification conditions significantly reduce bias and retain the complexity of either extremes of base composition. This development will greatly benefit sequencing clinical samples that often require amplification due to low mass of
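    To see where coverage problems of the kind described above might be expected, one can flag windows of a reference sequence whose AT fraction exceeds a chosen level. The sketch below is a generic sliding-window scan, not part of the published method; the function name, window size, threshold and toy sequence are all assumptions made for the example.

```python
def at_rich_windows(seq, window=10, threshold=0.8):
    """Return (start, AT fraction) for every window at or above the threshold.

    `window` and `threshold` are illustrative defaults, not values from
    the study.
    """
    seq = seq.upper()
    hits = []
    for start in range(len(seq) - window + 1):
        chunk = seq[start:start + window]
        at_fraction = sum(chunk.count(base) for base in "AT") / window
        if at_fraction >= threshold:
            hits.append((start, at_fraction))
    return hits

# Toy sequence: a GC-only block followed by an AT-only block.
toy = "GCGCGCGCGC" + "ATATATATAT"
print(at_rich_windows(toy, window=10, threshold=1.0))
```

    Plotting read depth against the AT fraction of each window is one simple way to compare library preparations for composition bias.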

  20. Identification of miRNAs and their targets in wild tomato at moderately and acutely elevated temperatures by high-throughput sequencing and degradome analysis

    PubMed Central

    Zhou, Rong; Wang, Qian; Jiang, Fangling; Cao, Xue; Sun, Mintao; Liu, Min; Wu, Zhen

    2016-01-01

    MicroRNAs (miRNAs) are 19–24 nucleotide (nt) noncoding RNAs that play important roles in abiotic stress responses in plants. High temperatures have been the subject of considerable attention due to their negative effects on plant growth and development. Heat-responsive miRNAs have been identified in some plants. However, there have been no reports on the global identification of miRNAs and their targets in tomato at high temperatures, especially at different elevated temperatures. Here, three small-RNA libraries and three degradome libraries were constructed from the leaves of the heat-tolerant tomato at normal, moderately and acutely elevated temperatures (26/18 °C, 33/33 °C and 40/40 °C, respectively). Following high-throughput sequencing, 662 conserved and 97 novel miRNAs were identified in total with 469 conserved and 91 novel miRNAs shared in the three small-RNA libraries. Of these miRNAs, 96 and 150 miRNAs were responsive to the moderately and acutely elevated temperature, respectively. Following degradome sequencing, 349 sequences were identified as targets of 138 conserved miRNAs, and 13 sequences were identified as targets of eight novel miRNAs. The expression levels of seven miRNAs and six target genes obtained by quantitative real-time PCR (qRT-PCR) were largely consistent with the sequencing results. This study enriches the number of heat-responsive miRNAs and lays a foundation for the elucidation of the miRNA-mediated regulatory mechanism in tomatoes at elevated temperatures. PMID:27653374

  1. Targeted next-generation sequencing at copy-number breakpoints for personalized analysis of rearranged ends in solid tumors.

    PubMed

    Kim, Hyun-Kyoung; Park, Won Cheol; Lee, Kwang Man; Hwang, Hai-Li; Park, Seong-Yeol; Sorn, Sungbin; Chandra, Vishal; Kim, Kwang Gi; Yoon, Woong-Bae; Bae, Joon Seol; Shin, Hyoung Doo; Shin, Jong-Yeon; Seoh, Ju-Young; Kim, Jong-Il; Hong, Kyeong-Man

    2014-01-01

    The concept of the utilization of rearranged ends for development of personalized biomarkers has attracted much attention owing to its clinical applicability. Although targeted next-generation sequencing (NGS) for recurrent rearrangements has been successful in hematologic malignancies, its application to solid tumors is problematic due to the paucity of recurrent translocations. However, copy-number breakpoints (CNBs), which are abundant in solid tumors, can be utilized for identification of rearranged ends. As a proof of concept, we performed targeted next-generation sequencing at copy-number breakpoints (TNGS-CNB) in nine colon cancer cases including seven primary cancers and two cell lines, COLO205 and SW620. For deduction of CNBs, we developed a novel competitive single-nucleotide polymorphism (cSNP) microarray method entailing CNB-region refinement by competitor DNA. Using TNGS-CNB, 19 specific rearrangements out of 91 CNBs (20.9%) were identified, and two polymerase chain reaction (PCR)-amplifiable rearrangements were obtained in six cases (66.7%). And significantly, TNGS-CNB, with its high positive identification rate (82.6%) of PCR-amplifiable rearrangements at candidate sites (19/23), just from filtering of aligned sequences, requires little effort for validation. Our results indicate that TNGS-CNB, with its utility for identification of rearrangements in solid tumors, can be successfully applied in the clinical laboratory for cancer-relapse and therapy-response monitoring.

  2. Targeted Re-Sequencing Emulsion PCR Panel for Myopathies: Results in 94 Cases.

    PubMed

    Punetha, Jaya; Kesari, Akanchha; Uapinyoying, Prech; Giri, Mamta; Clarke, Nigel F; Waddell, Leigh B; North, Kathryn N; Ghaoui, Roula; O'Grady, Gina L; Oates, Emily C; Sandaradura, Sarah A; Bönnemann, Carsten G; Donkervoort, Sandra; Plotz, Paul H; Smith, Edward C; Tesi-Rocha, Carolina; Bertorini, Tulio E; Tarnopolsky, Mark A; Reitter, Bernd; Hausmanowa-Petrusewicz, Irena; Hoffman, Eric P

    2016-05-27

    Molecular diagnostics in the genetic myopathies often requires testing of the largest and most complex transcript units in the human genome (DMD, TTN, NEB). Iteratively targeting single genes for sequencing has traditionally entailed high costs and long turnaround times. Exome sequencing has begun to supplant single targeted genes, but there are concerns regarding coverage and needed depth of the very large and complex genes that frequently cause myopathies. We aimed to evaluate the efficiency of next-generation sequencing technologies in providing molecular diagnostics for patients with previously undiagnosed myopathies. We tested a targeted re-sequencing approach, using a 45 gene emulsion PCR myopathy panel, with subsequent sequencing on the Illumina platform in 94 undiagnosed patients. We compared the targeted re-sequencing approach to exome sequencing for 10 of these patients. We detected likely pathogenic mutations in 33 out of 94 patients, a molecular diagnostic rate of approximately 35%. The remaining patients showed variants of unknown significance (35/94 patients) or no mutations detected in the 45 genes tested (26/94 patients). Mutation detection rates were similar for targeted re-sequencing and whole exome sequencing; however, exome sequencing showed better distribution of reads and fewer exon dropouts. Given that costs of highly parallel re-sequencing and whole exome sequencing are similar, and that exome sequencing now takes considerably less laboratory processing time than targeted re-sequencing, we recommend exome sequencing as the standard approach for molecular diagnostics of myopathies.

  3. Electron beam plasma ionizing target for the production of neutron-rich nuclides

    NASA Astrophysics Data System (ADS)

    Panteleev, V. N.; Barzakh, A. E.; Essabaa, S.; Fedorov, D. V.; Ionan, A. M.; Ivanov, V. S.; Lau, C.; Leroy, R.; Lhersonneau, G.; Mezilev, K. A.; Molkanov, P. L.; Moroz, F. V.; Orlov, S. Yu.; Stroe, L.; Tecchio, L. B.; Villari, A. C. C.; Volkov, Yu. M.

    2008-10-01

    The production of neutron-rich Ag, In and Sn isotopes from a high-density uranium carbide target has been investigated at the IRIS facility in the PLOG (PNPI-Legnaro-GANIL-Orsay) collaboration. The UC target material, with a density of 12 g/cm³, was prepared by powder metallurgy in the form of pellets 2 mm thick and 11 mm in diameter, with grain dimensions of about 20 μm. A uranium target mass of 31 g was exposed to a 1 GeV proton beam of intensity 0.05-0.07 μA. The produced species were ionized by electron beam-plasma ionization inside the target container (ionizing target). This was the first experiment in which the new high-density UC target material was used with electron-plasma ionization. Yields of Sn isotopes were measured in the target temperature range 1900-2100 °C. The yields of some Pd, In and Cd isotopes were also measured for comparison with those previously measured from a high-density uranium carbide target with a ceramic-like structure. For the first time, a nickel isotope was obtained from a high-density UC target.

  4. The eukaryotic signal sequence, YGRL, targets the chlamydial inclusion

    PubMed Central

    Kabeiseman, Emily J.; Cichos, Kyle H.; Moore, Elizabeth R.

    2014-01-01

    Understanding how host proteins are targeted to pathogen-specified organelles, like the chlamydial inclusion, is fundamentally important to understanding the biogenesis of these unique subcellular compartments and how they maintain autonomy within the cell. Syntaxin 6, which localizes to the chlamydial inclusion, contains a YGRL signal sequence. The YGRL functions to return syntaxin 6 to the trans-Golgi from the plasma membrane, and deletion of the YGRL signal sequence from syntaxin 6 also prevents the protein from localizing to the chlamydial inclusion. YGRL is one of three YXXL (YGRL, YQRL, and YKGL) signal sequences which target proteins to the trans-Golgi. We designed various constructs of eukaryotic proteins to test the specificity and propensity of YXXL sequences to target the inclusion. The YGRL signal sequence redirects proteins (e.g., Tgn38, furin, syntaxin 4) that normally do not localize to the chlamydial inclusion. Further, the requirement of the YGRL signal sequence for syntaxin 6 localization to inclusions formed by different species of Chlamydia is conserved. These data indicate that there is an inherent property of the chlamydial inclusion, which allows it to recognize the YGRL signal sequence. To examine whether this “inherent property” was protein or lipid in nature, we asked if deletion of the YGRL signal sequence from syntaxin 6 altered the ability of the protein to interact with proteins or lipids. Deletion or alteration of the YGRL from syntaxin 6 does not appreciably impact syntaxin 6-protein interactions, but does decrease syntaxin 6-lipid interactions. Intriguingly, data also demonstrate that YKGL or YQRL can successfully substitute for YGRL in localization of syntaxin 6 to the chlamydial inclusion. Importantly and for the first time, we establish that a eukaryotic signal sequence targets the chlamydial inclusion. PMID:25309881

  5. Targeted Analysis of Whole Genome Sequence Data to Diagnose Genetic Cardiomyopathy

    DOE PAGES

    Golbus, Jessica R.; Puckelwartz, Megan J.; Dellefave-Castillo, Lisa; ...

    2014-09-01

    Background: Cardiomyopathy is highly heritable but genetically diverse. At present, genetic testing for cardiomyopathy uses targeted sequencing to simultaneously assess the coding regions of more than 50 genes. New genes are routinely added to panels to improve the diagnostic yield. With the anticipated $1000 genome, it is expected that genetic testing will shift towards comprehensive genome sequencing accompanied by targeted gene analysis. Therefore, we assessed the reliability of whole genome sequencing and targeted analysis to identify cardiomyopathy variants in 11 subjects with cardiomyopathy. Methods and Results: Whole genome sequencing with an average of 37× coverage was combined with targeted analysis focused on 204 genes linked to cardiomyopathy. Genetic variants were scored using multiple prediction algorithms combined with frequency data from public databases. This pipeline yielded 1-14 potentially pathogenic variants per individual. Variants were further analyzed using clinical criteria and/or segregation analysis. Three of three previously identified primary mutations were detected by this analysis. In six subjects for whom the primary mutation was previously unknown, we identified mutations that segregated with disease, had clinical correlates, and/or had additional pathological correlation to provide evidence for causality. For two subjects with previously known primary mutations, we identified additional variants that may act as modifiers of disease severity. In total, we identified the likely pathological mutation in 9 of 11 (82%) subjects. We conclude that these pilot data demonstrate that ~30-40× coverage whole genome sequencing combined with targeted analysis is feasible and sensitive to identify rare variants in cardiomyopathy-associated genes.

  6. Improved bioactivity of G-rich triplex-forming oligonucleotides containing modified guanine bases

    PubMed Central

    Rogers, Faye A; Lloyd, Janice A; Tiwari, Meetu Kaushik

    2014-01-01

    Triplex structures generated by sequence-specific triplex-forming oligonucleotides (TFOs) have proven to be promising tools for gene targeting strategies. In addition, triplex technology has been widely used to study the molecular mechanisms of DNA repair, recombination and mutagenesis. However, triplex formation utilizing guanine-rich oligonucleotides as third strands can be inhibited by potassium-induced self-association resulting in G-quadruplex formation. We report here that guanine-rich TFOs partially substituted with 8-aza-7-deaza-guanine (PPG) have improved target site binding in potassium compared with TFOs containing the natural guanine base. We designed PPG-substituted TFOs to bind to a polypurine sequence in the supFG1 reporter gene. The binding efficiency of PPG-substituted TFOs to the target sequence was analyzed using electrophoresis mobility gel shift assays. We have determined that in the presence of potassium, the non-substituted TFO, AG30, did not bind to its target sequence; however, binding was observed with the PPG-substituted AG30 under conditions with up to 140 mM KCl. The PPG-TFOs were able to maintain their ability to induce genomic modifications as measured by an assay for gene-targeted mutagenesis. In addition, these compounds were capable of triplex-induced DNA double strand breaks, which resulted in activation of apoptosis. PMID:25483840

  7. Genome-wide evidence for local DNA methylation spreading from small RNA-targeted sequences in Arabidopsis.

    PubMed

    Ahmed, Ikhlak; Sarazin, Alexis; Bowler, Chris; Colot, Vincent; Quesneville, Hadi

    2011-09-01

    Transposable elements (TEs) and their relics play major roles in genome evolution. However, mobilization of TEs is usually deleterious and strongly repressed. In plants and mammals, this repression is typically associated with DNA methylation, but the relationship between this epigenetic mark and TE sequences has not been investigated systematically. Here, we present an improved annotation of TE sequences and use it to analyze genome-wide DNA methylation maps obtained at single-nucleotide resolution in Arabidopsis. We show that although the majority of TE sequences are methylated, ∼26% are not. Moreover, a significant fraction of TE sequences densely methylated at CG, CHG and CHH sites (where H = A, T or C) have no or few matching small interfering RNA (siRNAs) and are therefore unlikely to be targeted by the RNA-directed DNA methylation (RdDM) machinery. We provide evidence that these TE sequences acquire DNA methylation through spreading from adjacent siRNA-targeted regions. Further, we show that although both methylated and unmethylated TE sequences located in euchromatin tend to be more abundant closer to genes, this trend is least pronounced for methylated, siRNA-targeted TE sequences located 5' to genes. Based on these and other findings, we propose that spreading of DNA methylation through promoter regions explains at least in part the negative impact of siRNA-targeted TE sequences on neighboring gene expression.
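
    The three cytosine sequence contexts named in this abstract (CG, CHG and CHH, where H = A, T or C) can be told apart with a few lines of code. The following is a minimal sketch, not the authors' pipeline: the function names and the toy sequence are assumptions, and only the forward strand is examined.

```python
H_BASES = set("ATC")  # H = A, T or C, as defined in the abstract

def cytosine_context(seq, i):
    """Classify the cytosine at index i of seq as 'CG', 'CHG' or 'CHH'.

    Returns None if seq[i] is not C or if too few downstream bases
    remain to decide. Forward strand only (an assumption of this sketch).
    """
    seq = seq.upper()
    if seq[i] != "C":
        return None
    downstream = seq[i + 1:i + 3]
    if len(downstream) >= 1 and downstream[0] == "G":
        return "CG"
    if len(downstream) == 2 and downstream[0] in H_BASES:
        if downstream[1] == "G":
            return "CHG"
        if downstream[1] in H_BASES:
            return "CHH"
    return None

def count_contexts(seq):
    """Count the cytosines of seq that fall in each context."""
    counts = {"CG": 0, "CHG": 0, "CHH": 0}
    for i in range(len(seq)):
        context = cytosine_context(seq, i)
        if context is not None:
            counts[context] += 1
    return counts

# Toy sequence invented for illustration; it is not from the study.
example_counts = count_contexts("ACGTCAGCTTC")
print(example_counts)
```

    A full analysis would also classify cytosines on the reverse strand, which amounts to running the same function on the reverse complement.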

  8. Conserved Sequences at the Origin of Adenovirus DNA Replication

    PubMed Central

    Stillman, Bruce W.; Topp, William C.; Engler, Jeffrey A.

    1982-01-01

    The origin of adenovirus DNA replication lies within an inverted sequence repetition at either end of the linear, double-stranded viral DNA. Initiation of DNA replication is primed by a deoxynucleoside that is covalently linked to a protein, which remains bound to the newly synthesized DNA. We demonstrate that virion-derived DNA-protein complexes from five human adenovirus serological subgroups (A to E) can act as a template for both the initiation and the elongation of DNA replication in vitro, using nuclear extracts from adenovirus type 2 (Ad2)-infected HeLa cells. The heterologous template DNA-protein complexes were not as active as the homologous Ad2 DNA, most probably due to inefficient initiation by Ad2 replication factors. In an attempt to identify common features which may permit this replication, we have also sequenced the inverted terminal repeated DNA from human adenovirus serotypes Ad4 (group E), Ad9 and Ad10 (group D), and Ad31 (group A), and we have compared these to previously determined sequences from Ad2 and Ad5 (group C), Ad7 (group B), and Ad12 and Ad18 (group A) DNA. In all cases, the sequence around the origin of DNA replication can be divided into two structural domains: a proximal A · T-rich region which is partially conserved among these serotypes, and a distal G · C-rich region which is less well conserved. The G · C-rich region contains sequences similar to sequences present in papovavirus replication origins. The two domains may reflect a dual mechanism for initiation of DNA replication: adenovirus-specific protein priming of replication, and subsequent utilization of this primer by host replication factors for completion of DNA synthesis. PMID:7143575

  9. Deciphering the genomic targets of alkylating polyamide conjugates using high-throughput sequencing

    PubMed Central

    Chandran, Anandhakumar; Syed, Junetha; Taylor, Rhys D.; Kashiwazaki, Gengo; Sato, Shinsuke; Hashiya, Kaori; Bando, Toshikazu; Sugiyama, Hiroshi

    2016-01-01

    Chemically engineered small molecules targeting specific genomic sequences play an important role in drug development research. Pyrrole-imidazole polyamides (PIPs) are a group of molecules that can bind to the DNA minor-groove and can be engineered to target specific sequences. Their biological effects rely primarily on their selective DNA binding. However, the binding mechanism of PIPs at the chromatinized genome level is poorly understood. Herein, we report a method using high-throughput sequencing to identify the DNA-alkylating sites of PIP-indole-seco-CBI conjugates. High-throughput sequencing analysis of conjugate 2 showed highly similar DNA-alkylating sites on synthetic oligos (histone-free DNA) and on human genomes (chromatinized DNA context). To our knowledge, this is the first report identifying alkylation sites across genomic DNA by alkylating PIP conjugates using high-throughput sequencing. PMID:27098039

  10. TIA-1 RRM23 binding and recognition of target oligonucleotides

    PubMed Central

    Waris, Saboora; García-Mauriño, Sofía M.; Sivakumaran, Andrew; Beckham, Simone A.; Loughlin, Fionna E.; Gorospe, Myriam; Díaz-Moreno, Irene; Wilce, Matthew C.J.

    2017-01-01

    TIA-1 (T-cell restricted intracellular antigen-1) is an RNA-binding protein involved in splicing and translational repression. It mainly interacts with RNA via its second and third RNA recognition motifs (RRMs), with specificity for U-rich sequences directed by RRM2. It has recently been shown that RRM3 also contributes to binding, with preferential binding for C-rich sequences. Here we designed UC-rich and CU-rich 10-nt sequences for engagement of both RRM2 and RRM3 and demonstrated that the TIA-1 RRM23 construct preferentially binds the UC-rich RNA ligand (5′-UUUUUACUCC-3′). Interestingly, this binding depends on the presence of Lys274 that is C-terminal to RRM3 and binding to equivalent DNA sequences occurs with similar affinity. Small-angle X-ray scattering was used to demonstrate that, upon complex formation with target RNA or DNA, TIA-1 RRM23 adopts a compact structure, showing that both RRMs engage with the target 10-nt sequences to form the complex. We also report the crystal structure of TIA-1 RRM2 in complex with DNA to 2.3 Å resolution providing the first atomic resolution structure of any TIA protein RRM in complex with oligonucleotide. Together our data support a specific mode of TIA-1 RRM23 interaction with target oligonucleotides consistent with the role of TIA-1 in binding RNA to regulate gene expression. PMID:28184449

  11. RISC RNA sequencing for context-specific identification of in vivo miR targets

    PubMed Central

    Matkovich, Scot J; Van Booven, Derek J; Eschenbacher, William H; Dorn, Gerald W

    2010-01-01

    (RISC)-associated RNAs (the RISCome), called RISC sequencing. We developed methods that did not require cross-linking of RNAs to RISCs or amplification of mRNA prior to sequencing, making it possible to rapidly perform RISC sequencing from intact tissue while avoiding amplification bias. Comparison of RISCome with transcriptome expression defined the degree of RISC enrichment for each mRNA. The majority of the mRNAs enriched in wild-type cardiac RISComes compared to transcriptomes were bioinformatically predicted to be targets of at least 1 of 139 cardiac-expressed miRs. Programming cardiomyocyte RISCs via transgenic overexpression in adult hearts of miR-133a or miR-499, two miRs that contain entirely different ‘seed’ sequences, elicited differing profiles of RISC-targeted mRNAs. Thus, RISC sequencing represents a highly sensitive method for general RISC profiling and individual miR target identification in biological context. PMID:21030712

  12. RNase H-assisted RNA-primed rolling circle amplification for targeted RNA sequence detection.

    PubMed

    Takahashi, Hirokazu; Ohkawachi, Masahiko; Horio, Kyohei; Kobori, Toshiro; Aki, Tsunehiro; Matsumura, Yukihiko; Nakashimada, Yutaka; Okamura, Yoshiko

    2018-05-17

    RNA-primed rolling circle amplification (RPRCA) is a useful laboratory method for RNA detection; however, the detection of RNA is limited by the lack of information on 3'-terminal sequences. We found that conventional RPRCA using pre-circularized probes could potentially detect the internal sequence of target RNA molecules in combination with RNase H. However, the specificity for mRNA detection was low, presumably due to non-specific hybridization of non-target RNA with the circular probe. To overcome this technical problem, we developed a method for detecting a sequence of interest in target RNA molecules via RNase H-assisted RPRCA using padlock probes. When padlock probes are hybridized to the target RNA molecule, they are converted to the circular form by SplintR ligase. Subsequently, RNase H creates nick sites only in the hybridized RNA sequence, and single-stranded DNA is finally synthesized from the nick site by phi29 DNA polymerase. This method could specifically detect at least 10 fmol of the target RNA molecule without reverse transcription. Moreover, this method detected GFP mRNA present in 10 ng of total RNA isolated from Escherichia coli without background DNA amplification. Therefore, this method can potentially detect almost all types of RNA molecules without reverse transcription and reveal full-length sequence information.

  13. A RICH detector for hadron identification at Jlab

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Mammoliti, Francesco; Cisbani, Evaristo; Cusanno, Francesco

    2011-08-01

    The “standard” Hall A apparatus at Jefferson Lab (TOF and aerogel threshold Cherenkov detectors) does not provide complete identification of protons, kaons and pions. To this end, a proximity-focusing C6F14/CsI RICH (Ring Imaging Cherenkov) detector has been designed, built, tested and operated to separate kaons from pions with a pion contamination of a few percent up to 2.4 GeV/c. Two quite different experimental investigations have benefited from the RICH identification. On one side, the high-resolution hypernuclear spectroscopy series of experiments on carbon, beryllium and oxygen is devoted to the study of the lambda-nucleon potential. On the other side, the measurements of the single spin asymmetries of pions and kaons on a transversely polarized 3He target are of utmost interest in understanding QCD dynamics in the nucleon. We present the technical features of such a RICH detector and comment on the presently achieved performance in hadron identification.

  14. Identification of Optimal Epitopes for Plasmodium falciparum Rapid Diagnostic Tests That Target Histidine-Rich Proteins 2 and 3

    PubMed Central

    Lee, Nelson; Gatton, Michelle L.; Pelecanos, Anita; Bubb, Martin; Gonzalez, Iveth; Bell, David; Cheng, Qin

    2012-01-01

    Rapid diagnostic tests (RDTs) represent important tools to diagnose malaria infection. To improve understanding of the variable performance of RDTs that detect the major target in Plasmodium falciparum, namely, histidine-rich protein 2 (HRP2), and to inform the design of better tests, we undertook detailed mapping of the epitopes recognized by eight HRP-specific monoclonal antibodies (MAbs). To investigate the geographic skewing of this polymorphic protein, we analyzed the distribution of these epitopes in parasites from geographically diverse areas. To identify an ideal amino acid motif for a MAb to target in HRP2 and in the related protein HRP3, we used a purpose-designed script to perform bioinformatic analysis of 448 distinct gene sequences from pfhrp2 and 99 sequences from the closely related gene pfhrp3. The frequency and distribution of these motifs were also compared to the MAb epitopes. Heat stability testing of MAbs immobilized on nitrocellulose membranes was also performed. Results of these experiments enabled the identification of MAbs with the most desirable characteristics for inclusion in RDTs, including copy number and coverage of target epitopes, geographic skewing, heat stability, and match with the most abundant amino acid motifs identified. This study therefore informs the selection of MAbs to include in malaria RDTs as well as the generation of improved MAbs that should improve the performance of HRP-detecting malaria RDTs. PMID:22259210

  15. Target sites for the transposition of rat long interspersed repeated DNA elements (LINEs) are not random.

    PubMed Central

    Furano, A V; Somerville, C C; Tsichlis, P N; D'Ambrosio, E

    1986-01-01

    The long interspersed repeated DNA family of rats (LINE or L1Rn family) contains about 40,000 6.7-kilobase (kb) long members (1). LINE members may be currently mobile since their presence or absence causes allelic variation at three single copy loci (2, 3): insulin 1, Moloney leukemia virus integration 2 (Mlvi-2) (4), and immunoglobulin heavy chain (Igh). To characterize target sites for LINE insertion, we compared the DNA sequences of the unoccupied Mlvi-2 target site, its LINE-containing allele, and several other LINE-containing sites. Although not homologous overall, the target sites share three characteristics: First, depending on the site, they are from 68% to 86% (A+T) compared to 58% (A+T) for total rat DNA (5). Depending on the site, a 7- to 15-bp target site sequence becomes duplicated and flanks the inserted LINE member. The second is a version (0 or 1 mismatch) of the hexanucleotide, TACTCA, which is also present in the LINE member, in a highly conserved region located just before the A-rich right end of the LINE member. The third is a stretch of alternating purine/pyrimidine (PQ). The A-rich right ends of different LINE members vary in length and composition, and the sequence of a particularly long one suggests that it contains the A-rich target site from a previous transposition. PMID:3012480
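
    Two of the target-site features described in this abstract, the (A+T) content and a copy of the hexanucleotide TACTCA with at most one mismatch, are easy to screen for computationally. The sketch below is a hedged illustration only: the function names and the toy sequence are assumptions and do not come from the paper.

```python
def at_fraction(seq):
    """Return the fraction of bases in seq that are A or T."""
    seq = seq.upper()
    return sum(base in "AT" for base in seq) / len(seq)

def find_motif(seq, motif="TACTCA", max_mismatches=1):
    """Return (start, window, mismatches) for every window of seq that
    matches motif with at most max_mismatches substitutions."""
    seq = seq.upper()
    hits = []
    for start in range(len(seq) - len(motif) + 1):
        window = seq[start:start + len(motif)]
        mismatches = sum(a != b for a, b in zip(window, motif))
        if mismatches <= max_mismatches:
            hits.append((start, window, mismatches))
    return hits

# Toy target site invented for illustration; it is not a sequence from the study.
toy_site = "AATTTACTCAGGATACTGATT"
toy_at = at_fraction(toy_site)
toy_hits = find_motif(toy_site)
print(round(100 * toy_at, 1), toy_hits)
```

    The toy sequence was chosen so that its (A+T) content falls inside the 68% to 86% range reported for the target sites, and so that it contains one exact and one single-mismatch copy of the hexanucleotide.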

  16. Physics with heavy neutron-rich RIBs at the HRIBF

    NASA Astrophysics Data System (ADS)

    Radford, D. C.; Baktash, C.; Galindo-Uribarri, A.; Gross, C. J.; Lewis, T. A.; Mueller, P. E.; Hausladen, P. A.; Shapira, D.; Stracener, D. W.; Yu, C.-H.; Fuentes, B.; Padilla, E.; Hartley, D. J.; Barton, C. J.; Caprio, M.; Zamfir, N. V.

    The Holifield Radioactive Ion Beam Facility at Oak Ridge National Laboratory has recently produced the world's first post-accelerated beams of heavy neutron-rich nuclei. The first experiments with these beams are described, and the results discussed. B(E2;0+ --> 2+) values for neutron-rich 126,128Sn and 132,134,136Te isotopes have been measured by Coulomb excitation in inverse kinematics. The results for 132Te and 134Te (N = 80, 82) show excellent agreement with systematics of lighter Te isotopes, but the B(E2) value for 136Te (N = 84) is unexpectedly small. Single-neutron transfer reactions with a 134Te beam on natBe and 13C targets at energies just above the Coulomb barrier have also been studied.

  17. Profiling of potential driver mutations in sarcomas by targeted next generation sequencing.

    PubMed

    Andersson, Carola; Fagman, Henrik; Hansson, Magnus; Enlund, Fredrik

    2016-04-01

    Comprehensive genetic profiling by massively parallel sequencing, commonly known as next generation sequencing (NGS), is becoming the foundation of personalized oncology. For sarcomas very few targeted treatments are currently in routine use. In clinical practice the preoperative diagnostic workup of soft tissue tumours largely relies on core needle biopsies. Although mostly sufficient for histopathological diagnosis, these biopsies often leave only very limited amounts of formalin-fixed paraffin-embedded tissue available for predictive mutation analysis. Targeted NGS may thus open up new possibilities for comprehensive characterization of scarce biopsies. We therefore set out to search for driver mutations by NGS in a cohort of 55 clinically and morphologically well characterized sarcomas using low input of DNA from formalin-fixed paraffin-embedded tissues. The aim was to investigate if there are any recurrent or targetable aberrations in cancer driver genes in addition to known chromosome translocations in different types of sarcomas. We employed a panel covering 207 mutation hotspots in 50 cancer-associated genes to analyse DNA from nine gastrointestinal stromal tumours, 14 synovial sarcomas, seven myxoid liposarcomas, 22 Ewing sarcomas and three Ewing-like small round cell tumours at a large sequencing depth, so as to also detect mutations that are subclonal or occur at low allele frequencies. We found nine mutations in eight different potential driver genes, some of which are potentially actionable by currently existing targeted therapies. Even though no recurrent mutations in driver genes were found in the different sarcoma groups, we show that targeted NGS-based sequencing is clearly feasible in a diagnostic setting with very limited amounts of paraffin-embedded tissue and may provide novel insights into mesenchymal cell signalling and potentially druggable targets. Interestingly, we also identify five non-synonymous sequence variants in four established cancer driver genes in DNA

  18. Strong transcription blockage mediated by R-loop formation within a G-rich homopurine–homopyrimidine sequence localized in the vicinity of the promoter

    PubMed Central

    Soo Shin, Jane Hae

    2017-01-01

    Guanine-rich (G-rich) homopurine–homopyrimidine nucleotide sequences can block transcription with an efficiency that depends upon their orientation, composition and length, as well as the presence of negative supercoiling or breaks in the non-template DNA strand. We report that a G-rich sequence in the non-template strand reduces the yield of T7 RNA polymerase transcription by more than an order of magnitude when positioned close (9 bp) to the promoter, in comparison to that for a distal (∼250 bp) location of the same sequence. This transcription blockage is much less pronounced for a C-rich sequence, and is not significant for an A-rich sequence. Remarkably, the blockage is not pronounced if transcription is performed in the presence of RNase H, which specifically digests the RNA strands within RNA–DNA hybrids. The blockage also becomes less pronounced upon reduced RNA polymerase concentration. Based upon these observations and those from control experiments, we conclude that the blockage is primarily due to the formation of stable RNA–DNA hybrids (R-loops), which inhibit successive rounds of transcription. Our results could be relevant to transcription dynamics in vivo (e.g. transcription ‘bursting’) and may also have practical implications for the design of expression vectors. PMID:28498974

  19. Enrichment of target sequences for next-generation sequencing applications in research and diagnostics.

    PubMed

    Altmüller, Janine; Budde, Birgit S; Nürnberg, Peter

    2014-02-01

    Targeted re-sequencing such as gene panel sequencing (GPS) has become very popular in medical genetics, both for research projects and in diagnostic settings. The technical principles of the different enrichment methods have been reviewed several times before; however, new enrichment products are constantly entering the market, and researchers are often puzzled by the need to make decisions about long-term commitments, both for the enrichment product and the sequencing technology. This review summarizes important considerations for the experimental design and provides helpful recommendations in choosing the best sequencing strategy for various research projects and diagnostic applications.

  20. TIA-1 RRM23 binding and recognition of target oligonucleotides.

    PubMed

    Waris, Saboora; García-Mauriño, Sofía M; Sivakumaran, Andrew; Beckham, Simone A; Loughlin, Fionna E; Gorospe, Myriam; Díaz-Moreno, Irene; Wilce, Matthew C J; Wilce, Jacqueline A

    2017-05-05

    TIA-1 (T-cell restricted intracellular antigen-1) is an RNA-binding protein involved in splicing and translational repression. It mainly interacts with RNA via its second and third RNA recognition motifs (RRMs), with specificity for U-rich sequences directed by RRM2. It has recently been shown that RRM3 also contributes to binding, with preferential binding for C-rich sequences. Here we designed UC-rich and CU-rich 10-nt sequences for engagement of both RRM2 and RRM3 and demonstrated that the TIA-1 RRM23 construct preferentially binds the UC-rich RNA ligand (5′-UUUUUACUCC-3′). Interestingly, this binding depends on the presence of Lys274 that is C-terminal to RRM3 and binding to equivalent DNA sequences occurs with similar affinity. Small-angle X-ray scattering was used to demonstrate that, upon complex formation with target RNA or DNA, TIA-1 RRM23 adopts a compact structure, showing that both RRMs engage with the target 10-nt sequences to form the complex. We also report the crystal structure of TIA-1 RRM2 in complex with DNA to 2.3 Å resolution providing the first atomic resolution structure of any TIA protein RRM in complex with oligonucleotide. Together our data support a specific mode of TIA-1 RRM23 interaction with target oligonucleotides consistent with the role of TIA-1 in binding RNA to regulate gene expression. © The Author(s) 2017. Published by Oxford University Press on behalf of Nucleic Acids Research.

  1. A family of long intergenic non-coding RNA genes in human chromosomal region 22q11.2 carry a DNA translocation breakpoint/AT-rich sequence

    PubMed Central

    2018-01-01

    FAM230C, a long intergenic non-coding RNA (lincRNA) gene in human chromosome 13 (chr13), is a member of the lincRNA gene family termed family with sequence similarity 230. An analysis using bioinformatics search tools and alignment programs was undertaken to determine properties of FAM230C and its related genes. Results reveal that the DNA translocation element, the Translocation Breakpoint Type A (TBTA) sequence, which consists of satellite DNA, Alu elements, and AT-rich sequences, is embedded in the FAM230C gene. Eight lincRNA genes related to FAM230C also carry the TBTA sequences. These genes were formed from a large segment of the 3’ half of the FAM230C sequence duplicated in chr22, and are located specifically in low copy repeat regions (LCR22s), in or close to the 22q11.2 region. 22q11.2 is a chromosomal segment that undergoes a high rate of DNA translocation and is prone to genetic deletions. FAM230C-related genes present in other chromosomes do not carry the TBTA motif and were formed from the 5’ half region of the FAM230C sequence. These findings identify a high specificity in lincRNA gene formation by gene sequence duplication in different chromosomes. PMID:29668722

  2. Design of the hairpin ribozyme for targeting specific RNA sequences.

    PubMed

    Hampel, A; DeYoung, M B; Galasinski, S; Siwkowski, A

    1997-01-01

    The following steps should be taken when designing the hairpin ribozyme to cleave a specific target sequence: 1. Select a target sequence containing BN*GUC where B is C, G, or U. 2. Select the target sequence in areas least likely to have extensive interfering structure. 3. Design the conventional hairpin ribozyme as shown in Fig. 1, such that it can form a 4 bp helix 2 and helix 1 lengths up to 10 bp. 4. Synthesize this ribozyme from single-stranded DNA templates with a double-stranded T7 promoter. 5. Prepare a series of short substrates capable of forming a range of helix 1 lengths of 5-10 bp. 6. Identify these by direct RNA sequencing. 7. Assay the extent of cleavage of each substrate to identify the optimal length of helix 1. 8. Prepare the hairpin tetraloop ribozyme to determine if catalytic efficiency can be improved.
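    Step 1 above (selecting a target containing BN*GUC, where B is C, G, or U) can be illustrated with a short scan. This is a minimal sketch, not the authors' procedure: the function name `find_hairpin_sites` is hypothetical, N is assumed to mean any of A, C, G, or U, and the cleavage position (*) is assumed to fall immediately before the G of GUC. It does not assess interfering secondary structure (step 2).

```python
import re

# B = C, G, or U; N = any base (assumption); the cut (*) sits between N and GUC.
# A lookahead is used so that overlapping candidate sites are all reported.
_SITE = re.compile(r"(?=([CGU][ACGU]GUC))")


def find_hairpin_sites(rna):
    """Return (start, motif, cut) tuples for every BN*GUC site.

    `start` is the 0-based index of B and `cut` is the 0-based index of
    the G that follows the assumed cleavage position. DNA input is
    accepted and converted to RNA (T -> U) before scanning.
    """
    rna = rna.upper().replace("T", "U")
    return [(m.start(), m.group(1), m.start() + 2) for m in _SITE.finditer(rna)]
```

    For example, `find_hairpin_sites("AAUGCGUCAA")` reports one candidate site starting at index 3 (GCGUC), while a sequence with A in the B position, such as "AAGUC", yields no sites.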

  3. Chromosomal translocations and palindromic AT-rich repeats

    PubMed Central

    Kato, Takema; Kurahashi, Hiroki; Emanuel, Beverly S.

    2012-01-01

    Repetitive DNA sequences constitute 30% of the human genome, and are often sites of genomic rearrangement. Recently, it has been found that several constitutional translocations, especially those that involve chromosome 22, take place utilizing palindromic sequences on 22q11 and on the partner chromosome. Analysis of translocation junction fragments shows that the breakpoints of such palindrome-mediated translocations are localized at the center of palindromic AT-rich repeats (PATRRs). The presence of PATRRs at the breakpoints indicates that a palindrome-mediated mechanism is involved in the generation of these constitutional translocations. Identification of these PATRR-mediated translocations suggests a universal pathway for gross chromosomal rearrangement in the human genome. De novo occurrences of PATRR-mediated translocations can be detected by PCR in normal sperm samples but not in somatic cells. Polymorphisms of various PATRRs influence their propensity for adopting a secondary structure, which in turn affects de novo translocation frequency. We propose that the PATRRs form an unstable secondary structure, which leads to double-strand breaks at the center of the PATRR. The double-strand breaks appear to be followed by a non-homologous end-joining repair pathway, ultimately leading to the translocations. This review considers recent findings concerning the mechanism of meiosis-specific, PATRR-mediated translocations. PMID:22402448

  4. PCR Assays for Identification of Coccidioides posadasii Based on the Nucleotide Sequence of the Antigen 2/Proline-Rich Antigen

    PubMed Central

    Bialek, Ralf; Kern, Jan; Herrmann, Tanja; Tijerina, Rolando; Ceceñas, Luis; Reischl, Udo; González, Gloria M.

    2004-01-01

    A conventional nested PCR and a real-time LightCycler PCR assay for detection of Coccidioides posadasii DNA were designed and tested in 120 clinical strains. These had been isolated from 114 patients within 10 years in Monterrey, Nuevo Leon, Mexico, known to be endemic for coccidioidomycosis. The gene encoding the specific antigen 2/proline-rich antigen (Ag2/PRA) was used as a target. All strains were correctly identified, whereas DNA from related members of the family Onygenaceae remained negative. Melting curve analysis by LightCycler and sequencing of the 526-bp product of the first PCR demonstrated either 100% identity to the GenBank sequence of the Silveira strain, now known to be C. posadasii (accession number AF013256), or a single silent mutation at position 1228. Length determination of two microsatellite-containing loci (GAC and 621) identified all 120 isolates as C. posadasii. Specific DNA was amplified by conventional nested PCR from three microscopically spherule-positive paraffin-embedded tissue samples, whereas 20 human tissue samples positive for other dimorphic fungi remained negative. Additionally, the safety of each step of a modified commercially available DNA extraction procedure was evaluated by using 10 strains. At least three steps of the protocol were demonstrated to sufficiently kill arthroconidia. This safe procedure is applicable to cultures and to clinical specimens. PMID:14766853

  5. Identification of a novel LMF1 nonsense mutation responsible for severe hypertriglyceridemia by targeted next-generation sequencing.

    PubMed

    Cefalù, Angelo B; Spina, Rossella; Noto, Davide; Ingrassia, Valeria; Valenti, Vincenza; Giammanco, Antonina; Fayer, Francesca; Misiano, Gabriella; Cocorullo, Gianfranco; Scrimali, Chiara; Palesano, Ornella; Altieri, Grazia I; Ganci, Antonina; Barbagallo, Carlo M; Averna, Maurizio R

    Severe hypertriglyceridemia (HTG) may result from mutations in genes affecting the intravascular lipolysis of triglyceride (TG)-rich lipoproteins. The aim of this study was to develop a targeted next-generation sequencing panel for the molecular diagnosis of disorders characterized by severe HTG. We developed a targeted customized panel for next-generation sequencing on the Ion Torrent Personal Genome Machine to capture the coding exons and intron/exon boundaries of 18 genes affecting the main pathways of TG synthesis and metabolism. We sequenced 11 samples of patients with severe HTG (TG >885 mg/dL; 10 mmol/L): 4 positive controls in whom pathogenic mutations had previously been identified by Sanger sequencing and 7 patients in whom the molecular defect was still unknown. The customized panel was accurate, and it confirmed the genetic variants previously identified in all positive controls with primary severe HTG. Only 1 patient of 7 with HTG was found to be a carrier of a homozygous pathogenic mutation, the third novel mutation of the LMF1 gene (c.1380C>G-p.Y460X). The clinical and molecular familial cascade screening allowed the identification of 2 additional affected siblings and 7 heterozygous carriers of the mutation. We showed that our targeted resequencing approach for genetic diagnosis of severe HTG appears to be accurate, less time consuming, and more economical compared with traditional Sanger resequencing. The identification of pathogenic mutations in candidate genes remains challenging, and clinical resequencing should mainly be intended for patients with strong clinical criteria for monogenic severe HTG. Copyright © 2017 National Lipid Association. Published by Elsevier Inc. All rights reserved.

  6. Strong transcription blockage mediated by R-loop formation within a G-rich homopurine-homopyrimidine sequence localized in the vicinity of the promoter.

    PubMed

    Belotserkovskii, Boris P; Soo Shin, Jane Hae; Hanawalt, Philip C

    2017-06-20

    Guanine-rich (G-rich) homopurine-homopyrimidine nucleotide sequences can block transcription with an efficiency that depends upon their orientation, composition and length, as well as the presence of negative supercoiling or breaks in the non-template DNA strand. We report that a G-rich sequence in the non-template strand reduces the yield of T7 RNA polymerase transcription by more than an order of magnitude when positioned close (9 bp) to the promoter, in comparison to that for a distal (∼250 bp) location of the same sequence. This transcription blockage is much less pronounced for a C-rich sequence, and is not significant for an A-rich sequence. Remarkably, the blockage is not pronounced if transcription is performed in the presence of RNase H, which specifically digests the RNA strands within RNA-DNA hybrids. The blockage also becomes less pronounced upon reduced RNA polymerase concentration. Based upon these observations and those from control experiments, we conclude that the blockage is primarily due to the formation of stable RNA-DNA hybrids (R-loops), which inhibit successive rounds of transcription. Our results could be relevant to transcription dynamics in vivo (e.g. transcription 'bursting') and may also have practical implications for the design of expression vectors. © The Author(s) 2017. Published by Oxford University Press on behalf of Nucleic Acids Research.

  7. Recognition of AT-Rich DNA Binding Sites by the MogR Repressor

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Shen, Aimee; Higgins, Darren E.; Panne, Daniel

    2009-07-22

    The MogR transcriptional repressor of the intracellular pathogen Listeria monocytogenes recognizes AT-rich binding sites in promoters of flagellar genes to downregulate flagellar gene expression during infection. We describe here the 1.8 Å resolution crystal structure of MogR bound to the recognition sequence 5' ATTTTTTAAAAAAAT 3' present within the flaA promoter region. Our structure shows that MogR binds as a dimer. Each half-site is recognized in the major groove by a helix-turn-helix motif and in the minor groove by a loop from the symmetry-related molecule, resulting in a 'crossover' binding mode. This oversampling through minor groove interactions is important for specificity. The MogR binding site has structural features of A-tract DNA and is bent by approximately 52 degrees away from the dimer. The structure explains how MogR achieves binding specificity in the AT-rich genome of L. monocytogenes and explains the evolutionary conservation of A-tract sequence elements within promoter regions of MogR-regulated flagellar genes.

  8. Possible Ni-Rich Mafic-Ultramafic Magmatic Sequence in the Columbia Hills: Evidence from the Spirit Rover

    NASA Technical Reports Server (NTRS)

    Mittlefehldt, David W.; Gellert, R.; McCoy, T.; McSween, H. Y., Jr.; Li, R.

    2006-01-01

    The Spirit rover landed on geologic units of Hesperian age in Gusev Crater. The Columbia Hills rise above the surrounding plains materials, but orbital images show that the Columbia Hills are older [1, 2]. Spirit has recently descended the southeast slope of the Columbia Hills doing detailed measurements of a series of outcrops. The mineralogical and compositional data on these rocks are consistent with an interpretation as a magmatic sequence becoming increasingly olivine-rich down slope. The outcrop sequence is Larry's Bench, Seminole, Algonquin and Comanche. The "teeth" on the Rock Abrasion Tool (RAT) wore away prior to arrival at Larry's Bench; the data discussed are for RAT brushed surfaces.

  9. How proteins bind to DNA: target discrimination and dynamic sequence search by the telomeric protein TRF1

    PubMed Central

    2017-01-01

    Target search as performed by DNA-binding proteins is a complex process, in which multiple factors contribute to both thermodynamic discrimination of the target sequence from overwhelmingly abundant off-target sites and kinetic acceleration of dynamic sequence interrogation. TRF1, the protein that binds to telomeric tandem repeats, faces an intriguing variant of the search problem where target sites are clustered within short fragments of chromosomal DNA. In this study, we use extensive (>0.5 ms in total) MD simulations to study the dynamical aspects of sequence-specific binding of TRF1 at both telomeric and non-cognate DNA. For the first time, we describe the spontaneous formation of a sequence-specific native protein–DNA complex in atomistic detail, and study the mechanism by which proteins avoid off-target binding while retaining high affinity for target sites. Our calculated free energy landscapes reproduce the thermodynamics of sequence-specific binding, while statistical approaches allow for a comprehensive description of intermediate stages of complex formation. PMID:28633355

  10. Genome and transcriptome sequencing identifies breeding targets in the orphan crop tef (Eragrostis tef).

    PubMed

    Cannarozzi, Gina; Plaza-Wüthrich, Sonia; Esfeld, Korinna; Larti, Stéphanie; Wilson, Yi Song; Girma, Dejene; de Castro, Edouard; Chanyalew, Solomon; Blösch, Regula; Farinelli, Laurent; Lyons, Eric; Schneider, Michel; Falquet, Laurent; Kuhlemeier, Cris; Assefa, Kebebew; Tadele, Zerihun

    2014-07-09

    Tef (Eragrostis tef), an indigenous cereal critical to food security in the Horn of Africa, is rich in minerals and protein, resistant to many biotic and abiotic stresses and safe for diabetics as well as sufferers of immune reactions to wheat gluten. We present the genome of tef, the first species in the grass subfamily Chloridoideae and the first allotetraploid assembled de novo. We sequenced the tef genome for marker-assisted breeding, to shed light on the molecular mechanisms conferring tef's desirable nutritional and agronomic properties, and to make its genome publicly available as a community resource. The draft genome contains 672 Mbp representing 87% of the genome size estimated from flow cytometry. We also sequenced two transcriptomes, one from a normalized RNA library and another from unnormalized RNASeq data. The normalized RNA library revealed around 38000 transcripts that were then annotated by the SwissProt group. The CoGe comparative genomics platform was used to compare the tef genome to other genomes, notably sorghum. Scaffolds comprising approximately half of the genome size were ordered by syntenic alignment to sorghum producing tef pseudo-chromosomes, which were sorted into A and B genomes as well as compared to the genetic map of tef. The draft genome was used to identify novel SSR markers, investigate target genes for abiotic stress resistance studies, and understand the evolution of the prolamin family of proteins that are responsible for the immune response to gluten. It is highly plausible that breeding targets previously identified in other cereal crops will also be valuable breeding targets in tef. The draft genome and transcriptome will be of great use for identifying these targets for genetic improvement of this orphan crop that is vital for feeding 50 million people in the Horn of Africa.

  11. AT-rich sequence elements promote nascent transcript cleavage leading to RNA polymerase II termination

    PubMed Central

    White, Eleanor; Kamieniarz-Gdula, Kinga; Dye, Michael J.; Proudfoot, Nick J.

    2013-01-01

    RNA Polymerase II (Pol II) termination is dependent on RNA processing signals as well as specific terminator elements located downstream of the poly(A) site. One of the two major terminator classes described so far is the Co-Transcriptional Cleavage (CoTC) element. We show that homopolymer A/T tracts within the human β-globin CoTC-mediated terminator element play a critical role in Pol II termination. These short A/T tracts, dispersed within seemingly random sequences, are strong terminator elements, and bioinformatics analysis confirms the presence of such sequences in 70% of the putative terminator regions (PTRs) genome-wide. PMID:23258704
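    The homopolymer A/T tracts described above can be located in a sequence with a simple scan. The following is a minimal sketch only: the function name `find_at_tracts` and the default minimum tract length of 4 are assumptions for illustration, since the abstract does not state a length threshold, and the sketch is not the bioinformatics analysis used in the study.

```python
import re


def find_at_tracts(dna, min_len=4):
    """Return (start, end, tract) for each run of A's or run of T's.

    A tract is a homopolymer run (all A or all T) of at least `min_len`
    bases. `min_len` is an illustrative assumption, not a value taken
    from the abstract. Coordinates are 0-based, end-exclusive.
    """
    pattern = re.compile(r"A{%d,}|T{%d,}" % (min_len, min_len))
    return [(m.start(), m.end(), m.group()) for m in pattern.finditer(dna.upper())]
```

    Applied to "GCAAAAAGCTTTTGCAT", the function reports the A5 run at positions 2 to 7 and the T4 run at positions 9 to 13; the isolated A and T near the end are ignored because they fall below the threshold.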

  12. Inhibiting nucleation of amyloid structure in a huntingtin fragment by targeting α-helix rich oligomeric intermediates

    PubMed Central

    Mishra, Rakesh; Jayaraman, Murali; Roland, Bartholomew P.; Landrum, Elizabeth; Fullam, Timothy; Kodali, Ravindra; Thakur, Ashwani K.; Arduini, Irene; Wetzel, Ronald

    2011-01-01

    Although oligomeric intermediates are transiently formed in almost all known amyloid assembly reactions, their mechanistic roles are poorly understood. Recently we demonstrated a critical role for the 17 amino acid N-terminal segment (httNT) of huntingtin (htt) in oligomer-mediated amyloid assembly of htt N-terminal fragments. In this mechanism, the httNT segment forms the α-helix rich core of the oligomers, leaving most or all of each polyglutamine (polyQ) segment disordered and solvent-exposed. Nucleation of amyloid structure occurs within this local high concentration of disordered polyQ. Here we demonstrate the kinetic importance of httNT self-assembly by describing inhibitory httNT-containing peptides that appear to work by targeting nucleation within the oligomer fraction. These molecules inhibit amyloid nucleation by forming mixed oligomers with the httNT domains of polyQ-containing htt N-terminal fragments. In one class of inhibitor, nucleation is passively suppressed due to the reduced local concentration of polyQ within the mixed oligomer. In the other class, nucleation is actively suppressed by a proline-rich polyQ segment covalently attached to httNT. Studies with D-amino acid and scrambled sequence versions of httNT suggest that inhibition activity is strongly linked to the propensity of inhibitory peptides to make amphipathic α-helices. HttNT derivatives with C-terminal cell penetrating peptide segments, also exhibit excellent inhibitory activity. The httNT-based peptides described here, especially those with protease-resistant D-amino acids and/or with cell penetrating sequences, may prove useful as lead therapeutics for inhibiting nucleation of amyloid formation in Huntington’s disease. PMID:22178478

  13. BreaKmer: detection of structural variation in targeted massively parallel sequencing data using kmers.

    PubMed

    Abo, Ryan P; Ducar, Matthew; Garcia, Elizabeth P; Thorner, Aaron R; Rojas-Rudilla, Vanesa; Lin, Ling; Sholl, Lynette M; Hahn, William C; Meyerson, Matthew; Lindeman, Neal I; Van Hummelen, Paul; MacConaill, Laura E

    2015-02-18

    Genomic structural variation (SV), a common hallmark of cancer, has important predictive and therapeutic implications. However, accurately detecting SV using high-throughput sequencing data remains challenging, especially for 'targeted' resequencing efforts. This is critically important in the clinical setting where targeted resequencing is frequently being applied to rapidly assess clinically actionable mutations in tumor biopsies in a cost-effective manner. We present BreaKmer, a novel approach that uses a 'kmer' strategy to assemble misaligned sequence reads for predicting insertions, deletions, inversions, tandem duplications and translocations at base-pair resolution in targeted resequencing data. Variants are predicted by realigning an assembled consensus sequence created from sequence reads that were abnormally aligned to the reference genome. Using targeted resequencing data from tumor specimens with orthogonally validated SV, non-tumor samples and whole-genome sequencing data, BreaKmer had a 97.4% overall sensitivity for known events and predicted 17 positively validated, novel variants. Relative to four publicly available algorithms, BreaKmer detected SV with increased sensitivity and limited calls in non-tumor samples, key features for variant analysis of tumor specimens in both the clinical and research settings. © The Author(s) 2014. Published by Oxford University Press on behalf of Nucleic Acids Research.

  14. Structural and sequencing analysis of local target DNA recognition by MLV integrase.

    PubMed

    Aiyer, Sriram; Rossi, Paolo; Malani, Nirav; Schneider, William M; Chandar, Ashwin; Bushman, Frederic D; Montelione, Gaetano T; Roth, Monica J

    2015-06-23

    Target-site selection by retroviral integrase (IN) proteins profoundly affects viral pathogenesis. We describe the solution nuclear magnetic resonance structure of the Moloney murine leukemia virus IN (M-MLV) C-terminal domain (CTD) and a structural homology model of the catalytic core domain (CCD). In solution, the isolated MLV IN CTD adopts an SH3 domain fold flanked by a C-terminal unstructured tail. We generated a concordant MLV IN CCD structural model using SWISS-MODEL, MMM-tree and I-TASSER. Using the X-ray crystal structure of the prototype foamy virus IN target capture complex together with our MLV domain structures, residues within the CCD α2 helical region and the CTD β1-β2 loop were predicted to bind target DNA. The role of these residues was analyzed in vivo through point mutants and motif interchanges. Viable viruses with substitutions at the IN CCD α2 helical region and the CTD β1-β2 loop were tested for effects on integration target site selection. Next-generation sequencing and analysis of integration target sequences indicate that the CCD α2 helical region, in particular P187, interacts with the sequences distal to the scissile bonds whereas the CTD β1-β2 loop binds to residues proximal to it. These findings validate our structural model and disclose IN-DNA interactions relevant to target site selection. © The Author(s) 2015. Published by Oxford University Press on behalf of Nucleic Acids Research.

  15. Target-Rich Environment

    ERIC Educational Resources Information Center

    Perna, Mark C.

    2005-01-01

    Target marketing is defining school enrollment goals and then developing a strategic plan to accomplish those goals through the use of specific communication vehicles and community focus. It is critical to reach the right audience, with the right message, at the right time, for the right cost. In this brief article, the author describes several…

  16. Dubinett - Targeted Sequencing 2012 — EDRN Public Portal

    Cancer.gov

    We propose to use targeted massively parallel DNA sequencing to identify somatic alterations within mutational hotspots in matched sets of primary lung tumors, premalignant lesions, and adjacent, histologically normal lung tissue.

  17. Eukaryotic gene regulation by targeted chromatin re-modeling at dispersed, middle-repetitive sequence elements.

    PubMed

    Hodgetts, Ross

    2004-12-01

    RNA interference might have evolved to minimize the deleterious impact of transposable elements and viruses on eukaryotic genomes, because mutations in genes within the RNAi pathway cause mobilization of transposons in nematodes and flies. Although the first examples of RNAi involved post-transcriptional gene silencing, recently the pathway has been shown to act at the transcriptional level. It does so by establishing a chromatin configuration on the target DNA that has many of the hallmarks of heterochromatin, thus preventing its transcription. Members of dispersed, repeated sequence families appear to have been utilized by the RNAi machinery to regulate nearby genes in yeast. The unusual genomic distribution of three repeated element families in the chicken, fruit-fly and nematode genomes prompts speculation that some of these repeats have been co-opted to control gene expression, either locally or over extended chromosomal domains.

  18. A Bioinformatic Pipeline for Monitoring of the Mutational Stability of Viral Drug Targets with Deep-Sequencing Technology.

    PubMed

    Kravatsky, Yuri; Chechetkin, Vladimir; Fedoseeva, Daria; Gorbacheva, Maria; Kravatskaya, Galina; Kretova, Olga; Tchurikov, Nickolai

    2017-11-23

    The efficient development of antiviral drugs, including efficient antiviral small interfering RNAs (siRNAs), requires continuous monitoring of the strict correspondence between a drug and the related highly variable viral DNA/RNA target(s). Deep sequencing is able to provide an assessment of both the general target conservation and the frequency of particular mutations in the different target sites. The aim of this study was to develop a reliable bioinformatic pipeline for the analysis of millions of short, deep sequencing reads corresponding to selected highly variable viral sequences that are drug target(s). The suggested bioinformatic pipeline combines the available programs and the ad hoc scripts based on an original algorithm of the search for the conserved targets in the deep sequencing data. We also present the statistical criteria for the threshold of reliable mutation detection and for the assessment of variations between corresponding data sets. These criteria are robust against the possible sequencing errors in the reads. As an example, the bioinformatic pipeline is applied to the study of the conservation of RNA interference (RNAi) targets in human immunodeficiency virus 1 (HIV-1) subtype A. The developed pipeline is freely available to download at the website http://virmut.eimb.ru/. Brief comments and comparisons between VirMut and other pipelines are also presented.

  19. Nucleosome exclusion from the interspecies-conserved central AT-rich region of the Ars insulator.

    PubMed

    Takagi, Haruna; Inai, Yuta; Watanabe, Shun-ichiro; Tatemoto, Sayuri; Yajima, Mamiko; Akasaka, Koji; Yamamoto, Takashi; Sakamoto, Naoaki

    2012-01-01

    The Ars insulator is a boundary element identified in the upstream region of the arylsulfatase (HpArs) gene in the sea urchin, Hemicentrotus pulcherrimus, and possesses the ability to both block enhancer-promoter communications and protect transgenes from silent chromatin. To understand the molecular mechanism of the Ars insulator, we investigated the correlation between chromatin structure, DNA structure and insulator activity. Nuclease digestion of nuclei isolated from sea urchin embryos revealed the presence of a nuclease-hypersensitive site within the Ars insulator. Analysis of micrococcal nuclease-sensitive sites in the Ars insulator, reconstituted with nucleosomes, showed the exclusion of nucleosomes from the central AT-rich region. Furthermore, the central AT-rich region in naked DNA was sensitive to nucleotide base modification by diethylpyrocarbonate (DEPC). These observations suggest that non-B-DNA structures in the central AT-rich region may inhibit nucleosomal formation, which leads to nuclease hypersensitivity. Furthermore, comparison of nucleotide sequences between the HpArs gene and its ortholog in Strongylocentrotus purpuratus revealed that the central AT-rich region of the Ars insulator is conserved, and this conserved region showed significant enhancer blocking activity. These results suggest that the central AT-rich nucleosome-free region plays an important role in the function of the Ars insulator.

  20. GWASeq: targeted re-sequencing follow up to GWAS.

    PubMed

    Salomon, Matthew P; Li, Wai Lok Sibon; Edlund, Christopher K; Morrison, John; Fortini, Barbara K; Win, Aung Ko; Conti, David V; Thomas, Duncan C; Duggan, David; Buchanan, Daniel D; Jenkins, Mark A; Hopper, John L; Gallinger, Steven; Le Marchand, Loïc; Newcomb, Polly A; Casey, Graham; Marjoram, Paul

    2016-03-03

    For the last decade the conceptual framework of the Genome-Wide Association Study (GWAS) has dominated the investigation of human disease and other complex traits. While GWAS have been successful in identifying a large number of variants associated with various phenotypes, the overall amount of heritability explained by these variants remains small. This raises the question of how best to follow up on a GWAS, localize causal variants accounting for GWAS hits, and as a consequence explain more of the so-called "missing" heritability. Advances in high throughput sequencing technologies now allow for the efficient and cost-effective collection of vast amounts of fine-scale genomic data to complement GWAS. We investigate these issues using a colon cancer dataset. After QC, our data consisted of 1993 cases, 899 controls. Using marginal tests of associations, we identify 10 variants distributed among six targeted regions that are significantly associated with colorectal cancer, with eight of the variants being novel to this study. Additionally, we perform so-called 'SNP-set' tests of association and identify two sets of variants that implicate both common and rare variants in the etiology of colorectal cancer. Here we present a large-scale targeted re-sequencing resource focusing on genomic regions implicated in colorectal cancer susceptibility previously identified in several GWAS, which aims to 1) provide fine-scale targeted sequencing data for fine-mapping and 2) provide data resources to address methodological questions regarding the design of sequencing-based follow-up studies to GWAS. Additionally, we show that this strategy successfully identifies novel variants associated with colorectal cancer susceptibility and can implicate both common and rare variants.

  1. A screen of cell-surface molecules identifies leucine-rich repeat proteins as key mediators of synaptic target selection in the Drosophila neuromuscular system

    PubMed Central

    Kurusu, Mitsuhiko; Cording, Amy; Taniguchi, Misako; Menon, Kaushiki; Suzuki, Emiko; Zinn, Kai

    2008-01-01

    In Drosophila embryos and larvae, a small number of identified motor neurons innervate body wall muscles in a highly stereotyped pattern. Although genetic screens have identified many proteins that are required for axon guidance and synaptogenesis in this system, little is known about the mechanisms by which muscle fibers are defined as targets for specific motor axons. To identify potential target labels, we screened 410 genes encoding cell-surface and secreted proteins, searching for those whose overexpression on all muscle fibers causes motor axons to make targeting errors. Thirty such genes were identified, and a number of these were members of a large gene family encoding proteins whose extracellular domains contain leucine-rich repeat (LRR) sequences, which are protein interaction modules. By manipulating gene expression in muscle 12, we showed that four LRR proteins participate in the selection of this muscle as the appropriate synaptic target for the RP5 motor neuron. PMID:18817735

  2. Phosphoenolpyruvate carboxykinase of Trypanosoma brucei is targeted to the glycosomes by a C-terminal sequence.

    PubMed

    Sommer, J M; Nguyen, T T; Wang, C C

    1994-08-15

    Import of proteins into the glycosomes of T. brucei resembles the peroxisomal protein import in that C-terminal SKL-like tripeptide sequences can function as targeting signals. Many of the glycosomal proteins do not, however, possess such C-terminal tripeptide signals. Among these, phosphoenolpyruvate carboxykinase (PEPCK (ATP)) was thought to be targeted to the glycosomes by an N-terminal or an internal targeting signal. A limited similarity to the N-terminal targeting signal of rat peroxisomal thiolase exists at the N-terminus of T. brucei PEPCK. However, we found that this peroxisomal targeting signal does not function for glycosomal protein import in T. brucei. Further studies of the PEPCK gene revealed that the C-terminus of the predicted protein does not correspond to the previously deduced protein sequence of 472 amino acids due to a -1 frame shift error in the original DNA sequence. Readjusting the reading frame of the sequence results in a predicted protein of 525 amino acids in length ending in a tripeptide serine-arginine-leucine (SRL), which is a potential targeting signal for import into the glycosomes. A fusion protein of firefly luciferase, without its own C-terminal SKL targeting signal, and T. brucei PEPCK is efficiently imported into the glycosomes when expressed in procyclic trypanosomes. Deletion of the C-terminal SRL tripeptide or the last 29 amino acids of PEPCK reduced the import only by about 50%, while a deletion of the last 47 amino acids completely abolished the import. These results suggest that T. brucei PEPCK may contain a second, internal glycosomal targeting signal upstream of the C-terminal SRL sequence.

  3. Protospacer Adjacent Motif (PAM)-Distal Sequences Engage CRISPR Cas9 DNA Target Cleavage

    PubMed Central

    Ethier, Sylvain; Schmeing, T. Martin; Dostie, Josée; Pelletier, Jerry

    2014-01-01

    The clustered regularly interspaced short palindromic repeat (CRISPR)-associated enzyme Cas9 is an RNA-guided nuclease that has been widely adapted for genome editing in eukaryotic cells. However, the in vivo target specificity of Cas9 is poorly understood and most studies rely on in silico predictions to define the potential off-target editing spectrum. Using chromatin immunoprecipitation followed by sequencing (ChIP-seq), we delineate the genome-wide binding panorama of catalytically inactive Cas9 directed by two different single guide (sg) RNAs targeting the Trp53 locus. Cas9:sgRNA complexes are able to load onto multiple sites with short seed regions adjacent to 5′NGG3′ protospacer adjacent motifs (PAM). Yet among 43 ChIP-seq sites harboring seed regions analyzed for mutational status, we find editing only at the intended on-target locus and one off-target site. In vitro analysis of target site recognition revealed that interactions between the 5′ end of the guide and PAM-distal target sequences are necessary to efficiently engage Cas9 nucleolytic activity, providing an explanation for why off-target editing is significantly lower than expected from ChIP-seq data. PMID:25275497
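    The notion of a short seed region adjacent to a 5′NGG3′ PAM can be made concrete with a small scan. This is a hedged sketch, not the analysis used in the study: the function name `find_seed_pam_sites` is hypothetical, the seed is assumed to be the PAM-proximal 3′ end of the guide, the default seed length of 5 is an illustrative choice, and only the forward strand is searched. The guide is expected as a DNA-alphabet sequence (A, C, G, T).

```python
import re


def find_seed_pam_sites(dna, guide, seed_len=5):
    """Return (seed_start, pam_start) for seed matches followed by NGG.

    The seed is taken as the last `seed_len` bases of the guide
    (PAM-proximal end; an assumption of this sketch). Only the forward
    strand is scanned; a fuller search would also scan the reverse
    complement. Coordinates are 0-based.
    """
    dna = dna.upper()
    seed = guide.upper()[-seed_len:]
    # Lookahead so that overlapping seed + PAM occurrences are all found.
    pattern = re.compile("(?=%s[ACGT]GG)" % re.escape(seed))
    return [(m.start(), m.start() + seed_len) for m in pattern.finditer(dna)]
```

    As the abstract notes, a seed-plus-PAM match of this kind indicates only where Cas9 might bind; cleavage additionally depends on pairing between the 5′ end of the guide and PAM-distal target sequence, which this sketch does not evaluate.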

  4. Transcription blockage by homopurine DNA sequences: role of sequence composition and single-strand breaks

    PubMed Central

    Belotserkovskii, Boris P.; Neil, Alexander J.; Saleh, Syed Shayon; Shin, Jane Hae Soo; Mirkin, Sergei M.; Hanawalt, Philip C.

    2013-01-01

    The ability of DNA to adopt non-canonical structures can affect transcription and has broad implications for genome functioning. We have recently reported that guanine-rich (G-rich) homopurine-homopyrimidine sequences cause significant blockage of transcription in vitro in a strictly orientation-dependent manner: when the G-rich strand serves as the non-template strand [Belotserkovskii et al. (2010) Mechanisms and implications of transcription blockage by guanine-rich DNA sequences., Proc. Natl Acad. Sci. USA, 107, 12816–12821]. We have now systematically studied the effect of the sequence composition and single-stranded breaks on this blockage. Although substitution of guanine by any other base reduced the blockage, cytosine and thymine reduced the blockage more significantly than adenine substitutions, affirming the importance of both G-richness and the homopurine-homopyrimidine character of the sequence for this effect. A single-strand break in the non-template strand adjacent to the G-rich stretch dramatically increased the blockage. Breaks in the non-template strand result in much weaker blockage signals extending downstream from the break even in the absence of the G-rich stretch. Our combined data support the notion that transcription blockage at homopurine-homopyrimidine sequences is caused by R-loop formation. PMID:23275544

  5. StarScan: a web server for scanning small RNA targets from degradome sequencing data.

    PubMed

    Liu, Shun; Li, Jun-Hao; Wu, Jie; Zhou, Ke-Ren; Zhou, Hui; Yang, Jian-Hua; Qu, Liang-Hu

    2015-07-01

    Endogenous small non-coding RNAs (sRNAs), including microRNAs, PIWI-interacting RNAs and small interfering RNAs, play important gene regulatory roles in animals and plants by pairing to the protein-coding and non-coding transcripts. However, computationally assigning these various sRNAs to their regulatory target genes remains technically challenging. Recently, a high-throughput degradome sequencing method was applied to identify biologically relevant sRNA cleavage sites. In this study, an integrated web-based tool, StarScan (sRNA target Scan), was developed for scanning sRNA targets using degradome sequencing data from 20 species. Given a sRNA sequence from plants or animals, our web server performs an ultrafast and exhaustive search for potential sRNA-target interactions in annotated and unannotated genomic regions. The interactions between small RNAs and target transcripts were further evaluated using a novel tool, alignScore. A novel tool, degradomeBinomTest, was developed to quantify the abundance of degradome fragments located at the 9-11th nucleotide from the sRNA 5' end. This is the first web server for discovering potential sRNA-mediated RNA cleavage events in plants and animals, which affords mechanistic insights into the regulatory roles of sRNAs. The StarScan web server is available at http://mirlab.sysu.edu.cn/starscan/. © The Author(s) 2015. Published by Oxford University Press on behalf of Nucleic Acids Research.

  6. Ar-40/Ar-39 Ages for Maskelynites and K-Rich Melt from Olivine-Rich Lithology in (Kanagawa) Zagami

    NASA Technical Reports Server (NTRS)

    Park, J.; Herzog, G. F.; Nyquist, L. E.; Lindsay, F.; Turrin, B.; Swisher, C. C., III; Delaney, J. S.; Shih, C.-Y.; Niihara, T.; Misawa, K.

    2013-01-01

    We report Ar/Ar release patterns for small maskelynite grains and samples of a K-rich phase separated from the basaltic shergottite Zagami. The purpose of the work is to investigate the well-known discrepancy between published Ar/Ar ages of Zagami, >200 Ma, and its age of approx. 170 Ma as determined by other methods [1-6]. Niihara et al. [7] divide less abundant darker material present in Zagami into an olivine-rich lithology (ORL), from which most of our samples came, and a pyroxene-rich one (Dark Mottled-Lithology: DML) [8, 9]. ORL consists of vermicular fayalitic olivine, coarse-grained pyroxene, maskelynite, and a glassy phase exceptionally rich in K (up to 8.5 wt%), Al, and Si, but poor in Fe and Mg. The elemental composition suggests a late-stage melt, i.e., residual material that solidified late in a fractional crystallization sequence. Below we refer to it as "K-rich melt." The K-rich melt contains laths of captured olivine, Ca-rich pyroxene, plagioclase, and opaques. It seemed to offer an especially promising target for Ar-40/Ar-39 dating.

  7. Noninvasive Prenatal Detection of Trisomy 21 by Targeted Semiconductor Sequencing: A Technical Feasibility Study.

    PubMed

    Xi, Yanwei; Arbabi, Aryan; McNaughton, Amy J M; Hamilton, Alison; Hull, Danna; Perras, Helene; Chiu, Tillie; Morrison, Shawna; Goldsmith, Claire; Creede, Emilie; Anger, Gregory J; Honeywell, Christina; Cloutier, Mireille; Macchio, Natasha; Kiss, Courtney; Liu, Xudong; Crocker, Susan; Davies, Gregory A; Brudno, Michael; Armour, Christine M

    2017-01-01

    To develop an alternate noninvasive prenatal testing method for the assessment of trisomy 21 (T21) using a targeted semiconductor sequencing approach. A customized AmpliSeq panel was designed with 1,067 primer pairs targeting specific regions on chromosomes 21, 18, 13, and others. A total of 235 samples, including 30 affected with T21, were sequenced with an Ion Torrent Proton sequencer, and a method was developed for assessing the probability of fetal aneuploidy via derivation of a risk score. Application of the derived risk score yields a bimodal distribution, with the affected samples clustering near 1.0 and the unaffected near 0. For a risk score cutoff of 0.345, above which all would be considered at "high risk," all 30 T21-positive pregnancies were correctly predicted to be affected, and 199 of the 205 non-T21 samples were correctly predicted. The average hands-on time spent on library preparation and sequencing was 19 h in total, and the average number of reads of sequence obtained was 3.75 million per sample. With the described targeted sequencing approach on the semiconductor platform using a custom-designed library and a probabilistic statistical approach, we have demonstrated the feasibility of an alternate method of assessment for fetal T21. © 2017 S. Karger AG, Basel.
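    The classification counts reported above translate directly into sensitivity and specificity at the 0.345 cutoff. The short Python sketch below works through that arithmetic; the function and variable names are illustrative, and the only inputs are the counts stated in the abstract (30 of 30 T21 samples and 199 of 205 non-T21 samples correctly predicted).

```python
def sensitivity(true_pos: int, false_neg: int) -> float:
    """Fraction of affected samples called high risk."""
    return true_pos / (true_pos + false_neg)

def specificity(true_neg: int, false_pos: int) -> float:
    """Fraction of unaffected samples called low risk."""
    return true_neg / (true_neg + false_pos)

# Counts taken from the abstract.
tp, fn = 30, 0          # all 30 T21-positive pregnancies predicted affected
tn, fp = 199, 205 - 199 # 199 of 205 non-T21 samples predicted correctly

sens = sensitivity(tp, fn)
spec = specificity(tn, fp)
print(f"sensitivity = {sens:.3f}, specificity = {spec:.3f}")
```

    With these counts the six misclassified non-T21 samples are the only source of error, so sensitivity is 30/30 and specificity is 199/205.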

  8. MPN estimation of qPCR target sequence recoveries from whole cell calibrator samples.

    PubMed

    Sivaganesan, Mano; Siefring, Shawn; Varma, Manju; Haugland, Richard A

    2011-12-01

    DNA extracts from enumerated target organism cells (calibrator samples) have been used for estimating Enterococcus cell equivalent densities in surface waters by a comparative cycle threshold (Ct) qPCR analysis method. To compare surface water Enterococcus density estimates from different studies by this approach, either a consistent source of calibrator cells must be used or the estimates must account for any differences in target sequence recoveries from different sources of calibrator cells. In this report we describe two methods for estimating target sequence recoveries from whole cell calibrator samples based on qPCR analyses of their serially diluted DNA extracts and most probable number (MPN) calculation. The first method employed a traditional MPN calculation approach. The second method employed a Bayesian hierarchical statistical modeling approach and a Markov chain Monte Carlo (MCMC) simulation method to account for the uncertainty in these estimates associated with different individual samples of the cell preparations, different dilutions of the DNA extracts and different qPCR analytical runs. The two methods were applied to estimate mean target sequence recoveries per cell from two different lots of a commercially available source of enumerated Enterococcus cell preparations. The mean target sequence recovery estimates (and standard errors) per cell from Lot A and B cell preparations by the Bayesian method were 22.73 (3.4) and 11.76 (2.4), respectively, when the data were adjusted for potential false positive results. Means were similar for the traditional MPN approach which cannot comparably assess uncertainty in the estimates. Cell numbers and estimates of recoverable target sequences in calibrator samples prepared from the two cell sources were also used to estimate cell equivalent and target sequence quantities recovered from surface water samples in a comparative Ct method. Our results illustrate the utility of the Bayesian method in accounting for

  9. RISC RNA sequencing for context-specific identification of in vivo microRNA targets.

    PubMed

    Matkovich, Scot J; Van Booven, Derek J; Eschenbacher, William H; Dorn, Gerald W

    2011-01-07

    MicroRNAs (miRs) are expanding our understanding of cardiac disease and have the potential to transform cardiovascular therapeutics. One miR can target hundreds of individual mRNAs, but existing methodologies are not sufficient to accurately and comprehensively identify these mRNA targets in vivo. To develop methods permitting identification of in vivo miR targets in an unbiased manner, using massively parallel sequencing of mouse cardiac transcriptomes in combination with sequencing of mRNA associated with mouse cardiac RNA-induced silencing complexes (RISCs). We optimized techniques for expression profiling small amounts of RNA without introducing amplification bias and applied this to anti-Argonaute 2 immunoprecipitated RISCs (RISC-Seq) from mouse hearts. By comparing RNA-sequencing results of cardiac RISC and transcriptome from the same individual hearts, we defined 1645 mRNAs consistently targeted to mouse cardiac RISCs. We used this approach in hearts overexpressing miRs from Myh6 promoter-driven precursors (programmed RISC-Seq) to identify 209 in vivo targets of miR-133a and 81 in vivo targets of miR-499. Consistent with the fact that miR-133a and miR-499 have widely differing "seed" sequences and belong to different miR families, only 6 targets were common to miR-133a- and miR-499-programmed hearts. RISC-sequencing is a highly sensitive method for general RISC profiling and individual miR target identification in biological context and is applicable to any tissue and any disease state.

  10. Long-period oxygen-rich optical Miras in the solar neighborhood

    NASA Technical Reports Server (NTRS)

    Jura, M.; Yamamoto, A.; Kleinmann, S. G.

    1993-01-01

    The spatial distribution of the oxygen-rich Miras with periods longer than 400 days in the neighborhood of the sun was determined using available survey data and the K-band period-luminosity relationship. It is found that the exponential scale height of these stars is near 240 pc. There is a marked contrast with the Mira population at about 1 kpc from the Galactic center, where there are nearly as many long-period oxygen-rich Miras as intermediate-period oxygen-rich Miras. It is hypothesized that, at about 1 kpc from the Galactic center, the main-sequence stars with masses larger than 1 solar mass have higher metallicities than main-sequence stars with the same masses in the solar neighborhood. In the solar neighborhood such main-sequence stars become carbon-rich on the AGB, and in the region near the Galactic center they become long-period oxygen-rich Miras.

  11. Histidine-rich stabilized polyplexes for cMet-directed tumor-targeted gene transfer

    NASA Astrophysics Data System (ADS)

    Kos, Petra; Lächelt, Ulrich; Herrmann, Annika; Mickler, Frauke Martina; Döblinger, Markus; He, Dongsheng; Krhač Levačić, Ana; Morys, Stephan; Bräuchle, Christoph; Wagner, Ernst

    2015-03-01

    Overexpression of the hepatocyte growth factor receptor/c-Met proto oncogene on the surface of a variety of tumor cells gives an opportunity to specifically target cancerous tissues. Herein, we report the first use of c-Met as receptor for non-viral tumor-targeted gene delivery. Sequence-defined oligomers comprising the c-Met binding peptide ligand cMBP2 for targeting, a monodisperse polyethylene glycol (PEG) for polyplex surface shielding, and various cationic (oligoethanamino) amide cores containing terminal cysteines for redox-sensitive polyplex stabilization, were assembled by solid-phase supported syntheses. The resulting oligomers exhibited a greatly enhanced cellular uptake and gene transfer over non-targeted control sequences, confirming the efficacy and target-specificity of the formed polyplexes. Implementation of endosomal escape-promoting histidines in the cationic core was required for gene expression without additional endosomolytic agent. The histidine-enriched polyplexes demonstrated stability in serum as well as receptor-specific gene transfer in vivo upon intratumoral injection. The co-formulation with an analogous PEG-free cationic oligomer led to a further compaction of pDNA polyplexes with an obvious change of shape as demonstrated by transmission electron microscopy. Such compaction was critically required for efficient intravenous gene delivery which resulted in greatly enhanced, cMBP2 ligand-dependent gene expression in the distant tumor.

  12. Sequencing of a QTL-rich region of the Theobroma cacao genome using pooled BACs and the identification of trait specific candidate genes.

    PubMed

    Feltus, Frank A; Saski, Christopher A; Mockaitis, Keithanne; Haiminen, Niina; Parida, Laxmi; Smith, Zachary; Ford, James; Staton, Margaret E; Ficklin, Stephen P; Blackmon, Barbara P; Cheng, Chun-Huai; Schnell, Raymond J; Kuhn, David N; Motamayor, Juan-Carlos

    2011-07-27

    BAC-based physical maps provide for sequencing across an entire genome or a selected sub-genomic region of biological interest. Such a region can be approached with next-generation whole-genome sequencing and assembly as if it were an independent small genome. Using the minimum tiling path as a guide, specific BAC clones representing the prioritized genomic interval are selected, pooled, and used to prepare a sequencing library. This pooled BAC approach was taken to sequence and assemble a QTL-rich region, of ~3 Mbp and represented by twenty-seven BACs, on linkage group 5 of the Theobroma cacao cv. Matina 1-6 genome. Using various mixtures of read coverages from paired-end and linear 454 libraries, multiple assemblies of varied quality were generated. Quality was assessed by comparing the assembly of 454 reads with a subset of ten BACs individually sequenced and assembled using Sanger reads. A mixture of reads optimal for assembly was identified. We found, furthermore, that a quality assembly suitable for serving as a reference genome template could be obtained even with a reduced depth of sequencing coverage. Annotation of the resulting assembly revealed several genes potentially responsible for three T. cacao traits: black pod disease resistance, bean shape index, and pod weight. Our results, as with other pooled BAC sequencing reports, suggest that pooling portions of a minimum tiling path derived from a BAC-based physical map is an effective method to target sub-genomic regions for sequencing. While we focused on a single QTL region, other QTL regions of importance could be similarly sequenced allowing for biological discovery to take place before a high quality whole-genome assembly is completed.

  13. Bioinformatics by Example: From Sequence to Target

    NASA Astrophysics Data System (ADS)

    Kossida, Sophia; Tahri, Nadia; Daizadeh, Iraj

    2002-12-01

    With the completion of the human genome, and the imminent completion of other large-scale sequencing and structure-determination projects, computer-assisted bioscience is poised to become the new paradigm for conducting basic and applied research. The presence of these additional bioinformatics tools stirs great anxiety for experimental researchers (as well as for pedagogues), since they are now faced with a wider and deeper knowledge of differing disciplines (biology, chemistry, physics, mathematics, and computer science). This review targets those individuals who are interested in using computational methods in their teaching or research. By analyzing a real-life, pharmaceutical, multicomponent, target-based example, the reader will experience this fascinating new discipline.

  14. Performance evaluation of Sanger sequencing for the diagnosis of primary hyperoxaluria and comparison with targeted next generation sequencing

    PubMed Central

    Williams, Emma L; Bagg, Eleanor A L; Mueller, Michael; Vandrovcova, Jana; Aitman, Timothy J; Rumsby, Gill

    2015-01-01

    Definitive diagnosis of primary hyperoxaluria (PH) currently utilizes sequential Sanger sequencing of the AGXT, GRHPR, and HOGA1 genes but efficacy is unproven. This analysis is time-consuming, relatively expensive, and delays in diagnosis and inappropriate treatment can occur if not pursued early in the diagnostic work-up. We reviewed testing outcomes of Sanger sequencing in 200 consecutive patient samples referred for analysis. In addition, the Illumina TruSeq custom amplicon system was evaluated for paralleled next-generation sequencing (NGS) of AGXT, GRHPR, and HOGA1 in 90 known PH patients. AGXT sequencing was requested in all patients, permitting a diagnosis of PH1 in 50%. All remaining patients underwent targeted exon sequencing of GRHPR and HOGA1 with 8% diagnosed with PH2 and 8% with PH3. Complete sequencing of both GRHPR and HOGA1 was not requested in 25% of patients referred leaving their diagnosis in doubt. NGS analysis showed 98% agreement with Sanger sequencing and both approaches had 100% diagnostic specificity. Diagnostic sensitivity of Sanger sequencing was 98% and for NGS it was 97%. NGS has comparable diagnostic performance to Sanger sequencing for the diagnosis of PH and, if implemented, would screen for all forms of PH simultaneously ensuring prompt diagnosis at decreased cost. PMID:25629080

  15. Targeted next-generation sequencing in monogenic dyslipidemias.

    PubMed

    Hegele, Robert A; Ban, Matthew R; Cao, Henian; McIntyre, Adam D; Robinson, John F; Wang, Jian

    2015-04-01

    To evaluate the potential clinical translation of high-throughput next-generation sequencing (NGS) methods in diagnosis and management of dyslipidemia. Recent NGS experiments indicate that most causative genes for monogenic dyslipidemias are already known. Thus, monogenic dyslipidemias can now be diagnosed using targeted NGS. Targeting of dyslipidemia genes can be achieved by either: designing custom reagents for a dyslipidemia-specific NGS panel; or performing genome-wide NGS and focusing on genes of interest. Advantages of the former approach are lower cost and limited potential to detect incidental pathogenic variants unrelated to dyslipidemia. However, the latter approach is more flexible because masking criteria can be altered as knowledge advances, with no need for re-design of reagents or follow-up sequencing runs. Also, the cost of genome-wide analysis is decreasing and ethical concerns can likely be mitigated. DNA-based diagnosis is already part of the clinical diagnostic algorithms for familial hypercholesterolemia. Furthermore, DNA-based diagnosis is supplanting traditional biochemical methods to diagnose chylomicronemia caused by deficiency of lipoprotein lipase or its co-factors. The increasing availability and decreasing cost of clinical NGS for dyslipidemia means that its potential benefits can now be evaluated on a larger scale.

  16. [Detection of pathogenic mutations in Marfan syndrome by targeted next-generation semiconductor sequencing].

    PubMed

    Lu, Chaoxia; Wu, Wei; Xiao, Jifang; Meng, Yan; Zhang, Shuyang; Zhang, Xue

    2013-06-01

    To detect pathogenic mutations in Marfan syndrome (MFS) using an Ion Torrent Personal Genome Machine (PGM) and to validate the result of targeted next-generation semiconductor sequencing for the diagnosis of genetic disorders. Peripheral blood samples were collected from three MFS patients and a normal control with informed consent. Genomic DNA was isolated by a standard method and then subjected to targeted sequencing using an Ion AmpliSeq(TM) Inherited Disease Panel. Three multiplex PCR reactions were carried out to amplify the coding exons of 328 genes including FBN1, TGFBR1 and TGFBR2. DNA fragments from different samples were ligated with barcoded sequencing adaptors. Template preparation, emulsion PCR, and Ion Sphere Particle enrichment were carried out using an Ion OneTouch system. The Ion Sphere Particles were sequenced on a 318 chip using the PGM platform. Data from the PGM runs were processed using Ion Torrent Suite 3.2 software to generate sequence reads. After sequence alignment and extraction of SNPs and indels, all the variants were filtered against dbSNP137. DNA sequences were visualized with the Integrative Genomics Viewer. The most likely disease-causing variants were analyzed by Sanger sequencing. PGM sequencing yielded an output of 855.80 Mb, with a > 100 × median sequencing depth and a coverage of > 98% for the targeted regions in all the four samples. After data analysis and database filtering, one known missense mutation (p.E1811K) and two novel premature termination mutations (p.E2264X and p.L871FfsX23) in the FBN1 gene were identified in the three MFS patients. All mutations were verified by conventional Sanger sequencing. Pathogenic FBN1 mutations have been identified in all patients with MFS, indicating that the targeted next-generation sequencing on the PGM sequencers can be applied for accurate and high-throughput testing of genetic disorders.

  17. Next-generation sequencing for targeted discovery of rare mutations in rice

    USDA-ARS's Scientific Manuscript database

    Advances in DNA sequencing (i.e., next-generation sequencing, NGS) have greatly increased the power and efficiency of detecting rare mutations in large mutant populations. Targeting Induced Local Lesions in Genomes (TILLING) is a reverse genetics approach for identifying gene mutations resulting fro...

  18. Details on Silica-Rich Elk Target near Marias Pass

    NASA Image and Video Library

    2015-12-17

    This image from the Chemistry and Camera (ChemCam) instrument on NASA's Curiosity Mars rover shows detailed texture of a rock target called "Elk" on Mars' Mount Sharp, revealing laminations that are present in much of the Murray Formation geological unit of lower Mount Sharp. Researchers also used ChemCam's laser and spectrometers to assess Elk's composition and found it to be rich in silica. The image covers a patch of rock surface about 2.8 inches (7 centimeters) across. It was taken on May 22, 2015, during the mission's 992nd Martian day, or sol. ChemCam's Remote Micro-Imager camera, on top of Curiosity's mast, captured the image from a distance of about 9 feet (2.75 meters). Annotations in red identify five points on Elk that were hit with ChemCam's laser. Each of the highlighted points is a location where ChemCam fired its laser 30 times to ablate a tiny amount of target material. By analyzing the light emitted from this laser-ablation, researchers can deduce the composition of that point. For some purposes, composition is presented as a combination of the information from multiple points on the same rock. However, using the points individually can track fine-scale variations in targets. http://photojournal.jpl.nasa.gov/catalog/PIA20267

  19. pH-Modulated Watson-Crick duplex-quadruplex equilibria of guanine-rich and cytosine-rich DNA sequences 140 base pairs upstream of the c-kit transcription initiation site.

    PubMed

    Bucek, Pavel; Jaumot, Joaquim; Aviñó, Anna; Eritja, Ramon; Gargallo, Raimundo

    2009-11-23

    Guanine-rich regions of DNA are sequences capable of forming G-quadruplex structures. The formation of a G-quadruplex structure in a region 140 base pairs (bp) upstream of the c-kit transcription initiation site was recently proposed (Fernando et al., Biochemistry, 2006, 45, 7854). In the present study, the acid-base equilibria and the thermally induced unfolding of the structures formed by a guanine-rich region and by its complementary cytosine-rich strand in c-kit were studied by means of circular dichroism and molecular absorption spectroscopies. In addition, competition between the Watson-Crick duplex and the isolated structures was studied as a function of pH value and temperature. Multivariate data analysis methods based on both hard and soft modeling were used to allow accurate quantification of the various acid-base species present in the mixtures. Results showed that the G-quadruplex and i-motif coexist with the Watson-Crick duplex over the pH range from 3.0 to 6.5, approximately, under the experimental conditions tested in this study. At pH 7.0, the duplex is practically the only species present.

  20. Analyses of Sporocarps, Morphotyped Ectomycorrhizae, Environmental ITS and LSU Sequences Identify Common Genera that Occur at a Periglacial Site

    PubMed Central

    Jumpponen, Ari; Brown, Shawn P.; Trappe, James M.; Cázares, Efrén; Strömmer, Rauni

    2015-01-01

    Periglacial substrates exposed by retreating glaciers represent extreme and sensitive environments defined by a variety of abiotic stressors that challenge organismal establishment and survival. The simple communities often residing at these sites enable their analyses in depth. We utilized existing data and mined published sporocarp, morphotyped ectomycorrhizae (ECM), as well as environmental sequence data of internal transcribed spacer (ITS) and large subunit (LSU) regions of the ribosomal RNA gene to identify taxa that occur at a glacier forefront in the North Cascades Mountains in Washington State in the USA. The discrete data types consistently identified several common and widely distributed genera, perhaps best exemplified by Inocybe and Laccaria. Although we expected low diversity and richness, our environmental sequence data included 37 ITS and 26 LSU operational taxonomic units (OTUs) that likely form ECM. While environmental surveys of metabarcode markers detected large numbers of targeted ECM taxa, both the fruiting body and the morphotype datasets included genera that were undetected in either of the metabarcode datasets. These included hypogeous (Hymenogaster) and epigeous (Lactarius) taxa, some of which may produce large sporocarps but may possess small and/or spatially patchy genets. We highlight the importance of combining various data types to provide a comprehensive view of a fungal community, even in an environment assumed to host communities of low species richness and diversity. PMID:29376900

  1. A MicroRNA Superfamily Regulates Nucleotide Binding Site–Leucine-Rich Repeats and Other mRNAs

    PubMed Central

    Shivaprasad, Padubidri V.; Chen, Ho-Ming; Patel, Kanu; Bond, Donna M.; Santos, Bruno A.C.M.; Baulcombe, David C.

    2012-01-01

    Analysis of tomato (Solanum lycopersicum) small RNA data sets revealed the presence of a regulatory cascade affecting disease resistance. The initiators of the cascade are microRNA members of an unusually diverse superfamily in which miR482 and miR2118 are prominent members. Members of this superfamily are variable in sequence and abundance in different species, but all variants target the coding sequence for the P-loop motif in the mRNA sequences for disease resistance proteins with nucleotide binding site (NBS) and leucine-rich repeat (LRR) motifs. We confirm, using transient expression in Nicotiana benthamiana, that miR482 targets mRNAs for NBS-LRR disease resistance proteins with coiled-coil domains at their N terminus. The targeting causes mRNA decay and production of secondary siRNAs in a manner that depends on RNA-dependent RNA polymerase 6. At least one of these secondary siRNAs targets other mRNAs of a defense-related protein. The miR482-mediated silencing cascade is suppressed in plants infected with viruses or bacteria so that expression of mRNAs with miR482 or secondary siRNA target sequences is increased. We propose that this process allows pathogen-inducible expression of NBS-LRR proteins and that it contributes to a novel layer of defense against pathogen attack. PMID:22408077

  2. Detection of Somatic Mutations in Gastroenteropancreatic Neuroendocrine Tumors Using Targeted Deep Sequencing.

    PubMed

    Backman, Samuel; Norlén, Olov; Eriksson, Barbro; Skogseid, Britt; Stålberg, Peter; Crona, Joakim

    2017-02-01

    Mutations affecting the mechanistic target of rapamycin (MTOR) signalling pathway are frequent in human cancer and have been identified in up to 15% of pancreatic neuroendocrine tumours (NETs). Grade A evidence supports the efficacy of MTOR inhibition with everolimus in pancreatic NETs. Although a significant proportion of patients experience disease stabilization, only a minority will show objective tumour responses. It has been proposed that genomic mutations resulting in activation of MTOR signalling could be used to predict sensitivity to everolimus. Patients with NETs that underwent treatment with everolimus at our Institution were identified and those with available tumour tissue were selected for further analysis. Targeted next-generation sequencing (NGS) was used to re-sequence 22 genes that were selected on the basis of documented involvement in the MTOR signalling pathway or in the tumourigenesis of gastroenteropancreatic NETs. Radiological responses were documented using Response Evaluation Criteria in Solid Tumours. Six patients were identified, one had a partial response and four had stable disease. Sequencing of tumour tissue resulted in a median sequence depth of 667.1 (range=404-1301) with 1-fold coverage of 95.9-96.5% and 10-fold coverage of 87.6-92.2%. A total of 494 genetic variants were discovered, four of which were identified as pathogenic. All pathogenic variants were validated using Sanger sequencing and were found exclusively in menin 1 (MEN1) and death domain associated protein (DAXX) genes. No mutations in the MTOR pathway-related genes were observed. Targeted NGS is a feasible method with high diagnostic yield for genetic characterization of pancreatic NETs. A potential association between mutations in NETs and response to everolimus should be investigated by future studies. Copyright© 2017, International Institute of Anticancer Research (Dr. George J. Delinasios), All rights reserved.

  3. Targeted amplicon sequencing (TAS): a scalable next-gen approach to multilocus, multitaxa phylogenetics.

    PubMed

    Bybee, Seth M; Bracken-Grissom, Heather; Haynes, Benjamin D; Hermansen, Russell A; Byers, Robert L; Clement, Mark J; Udall, Joshua A; Wilcox, Edward R; Crandall, Keith A

    2011-01-01

    Next-gen sequencing technologies have revolutionized data collection in genetic studies and advanced genome biology to novel frontiers. However, to date, next-gen technologies have been used principally for whole genome sequencing and transcriptome sequencing. Yet many questions in population genetics and systematics rely on sequencing specific genes of known function or diversity levels. Here, we describe a targeted amplicon sequencing (TAS) approach capitalizing on next-gen capacity to sequence large numbers of targeted gene regions from a large number of samples. Our TAS approach is easily scalable, simple in execution, neither time- nor labor-intensive, relatively inexpensive, and can be applied to a broad diversity of organisms and/or genes. Our TAS approach includes a bioinformatic application, BarcodeCrucher, to take raw next-gen sequence reads and perform quality control checks and convert the data into FASTA format organized by gene and sample, ready for phylogenetic analyses. We demonstrate our approach by sequencing targeted genes of known phylogenetic utility to estimate a phylogeny for the Pancrustacea. We generated data from 44 taxa using 68 different 10-bp multiplexing identifiers. The overall quality of data produced was robust and was informative for phylogeny estimation. The potential for this method to produce copious amounts of data from a single 454 plate (e.g., 325 taxa for 24 loci) significantly reduces sequencing expenses incurred from traditional Sanger sequencing. We further discuss the advantages and disadvantages of this method, while offering suggestions to enhance the approach.
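    The step described above, in which pooled reads are sorted by their 10-bp multiplexing identifier and written out as FASTA, can be sketched generically in Python. This is not the BarcodeCrucher implementation, whose internals the abstract does not describe; the function names, barcodes, sample labels, and reads are all hypothetical, and only exact barcode matches at the start of a read are accepted.

```python
from collections import defaultdict

def demultiplex(reads, barcode_to_sample, barcode_len=10):
    """Group reads by the sample whose barcode prefixes them; strip the barcode."""
    by_sample = defaultdict(list)
    for read in reads:
        tag, insert = read[:barcode_len], read[barcode_len:]
        sample = barcode_to_sample.get(tag)  # exact match only (assumption)
        if sample is not None:
            by_sample[sample].append(insert)
    return dict(by_sample)

def to_fasta(by_sample):
    """Render grouped reads as FASTA text, ordered by sample name."""
    lines = []
    for sample in sorted(by_sample):
        for i, seq in enumerate(by_sample[sample], start=1):
            lines.append(f">{sample}_read{i}")
            lines.append(seq)
    return "\n".join(lines)

# Hypothetical 10-bp identifiers and toy reads.
barcodes = {"ACGTACGTAC": "taxonA", "TGCATGCATG": "taxonB"}
reads = [
    "ACGTACGTACGGGTTTAAA",  # assigned to taxonA
    "TGCATGCATGCCCAAATTT",  # assigned to taxonB
    "NNNNNNNNNNAAAA",       # unknown barcode, discarded
]

grouped = demultiplex(reads, barcodes)
print(to_fasta(grouped))
```

    A production tool would also need quality filtering and some tolerance for sequencing errors in the identifier, both of which are omitted here.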

  4. Targeted Amplicon Sequencing (TAS): A Scalable Next-Gen Approach to Multilocus, Multitaxa Phylogenetics

    PubMed Central

    Bybee, Seth M.; Bracken-Grissom, Heather; Haynes, Benjamin D.; Hermansen, Russell A.; Byers, Robert L.; Clement, Mark J.; Udall, Joshua A.; Wilcox, Edward R.; Crandall, Keith A.

    2011-01-01

    Next-gen sequencing technologies have revolutionized data collection in genetic studies and advanced genome biology to novel frontiers. However, to date, next-gen technologies have been used principally for whole genome sequencing and transcriptome sequencing. Yet many questions in population genetics and systematics rely on sequencing specific genes of known function or diversity levels. Here, we describe a targeted amplicon sequencing (TAS) approach capitalizing on next-gen capacity to sequence large numbers of targeted gene regions from a large number of samples. Our TAS approach is easily scalable, simple in execution, neither time- nor labor-intensive, relatively inexpensive, and can be applied to a broad diversity of organisms and/or genes. Our TAS approach includes a bioinformatic application, BarcodeCrucher, to take raw next-gen sequence reads and perform quality control checks and convert the data into FASTA format organized by gene and sample, ready for phylogenetic analyses. We demonstrate our approach by sequencing targeted genes of known phylogenetic utility to estimate a phylogeny for the Pancrustacea. We generated data from 44 taxa using 68 different 10-bp multiplexing identifiers. The overall quality of data produced was robust and was informative for phylogeny estimation. The potential for this method to produce copious amounts of data from a single 454 plate (e.g., 325 taxa for 24 loci) significantly reduces sequencing expenses incurred from traditional Sanger sequencing. We further discuss the advantages and disadvantages of this method, while offering suggestions to enhance the approach. PMID:22002916

  5. An N-terminal glycine-rich sequence contributes to retrovirus trimer of hairpins stability

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Wilson, Kirilee A.; Maerz, Anne L.; Baer, Severine

    2007-08-10

    Retroviral transmembrane proteins (TMs) contain a glycine-rich segment linking the N-terminal fusion peptide and coiled coil core. Previously, we reported that the glycine-rich segment (Met-326-Ser-337) of the human T-cell leukemia virus type 1 (HTLV-1) TM, gp21, is a determinant of membrane fusion function [K.A. Wilson, S. Baer, A.L. Maerz, M. Alizon, P. Poumbourios, The conserved glycine-rich segment linking the N-terminal fusion peptide to the coiled coil of human T-cell leukemia virus type 1 transmembrane glycoprotein gp21 is a determinant of membrane fusion function, J. Virol. 79 (2005) 4533-4539]. Here we show that the reduced fusion activity of an I334A mutant correlated with a decrease in stability of the gp21 trimer of hairpins conformation, in the context of a maltose-binding protein-gp21 chimera. The stabilizing influence of Ile-334 required the C-terminal membrane-proximal sequence Trp-431-Ser-436. Proline substitution of four of five Gly residues altered gp21 trimer of hairpins stability. Our data indicate that flexibility within and hydrophobic interactions mediated by this region are determinants of gp21 stability and membrane fusion function.

  6. Haloarcula hispanica CRISPR authenticates PAM of a target sequence to prime discriminative adaptation

    PubMed Central

    Li, Ming; Wang, Rui; Xiang, Hua

    2014-01-01

    The prokaryotic immune system CRISPR/Cas (Clustered Regularly Interspaced Short Palindromic Repeats/CRISPR-associated genes) adapts to foreign invaders by acquiring their short deoxyribonucleic acid (DNA) fragments as spacers, which guide subsequent interference to foreign nucleic acids based on sequence matching. The adaptation mechanism avoiding acquiring ‘self’ DNA fragments is poorly understood. In Haloarcula hispanica, we previously showed that CRISPR adaptation requires being primed by a pre-existing spacer partially matching the invader DNA. Here, we further demonstrate that flanking a fully-matched target sequence, a functional PAM (protospacer adjacent motif) is still required to prime adaptation. Interestingly, interference utilizes only four PAM sequences, whereas adaptation-priming tolerates as many as 23 PAM sequences. This relaxed PAM selectivity explains how adaptation-priming maximizes its tolerance of PAM mutations (that escape interference) while avoiding mis-targeting the spacer DNA within CRISPR locus. We propose that the primed adaptation, which hitches and cooperates with the interference pathway, distinguishes target from non-target by CRISPR ribonucleic acid guidance and PAM recognition. PMID:24803673

  7. Targeted Next-generation Sequencing and Bioinformatics Pipeline to Evaluate Genetic Determinants of Constitutional Disease.

    PubMed

    Dilliott, Allison A; Farhan, Sali M K; Ghani, Mahdi; Sato, Christine; Liang, Eric; Zhang, Ming; McIntyre, Adam D; Cao, Henian; Racacho, Lemuel; Robinson, John F; Strong, Michael J; Masellis, Mario; Bulman, Dennis E; Rogaeva, Ekaterina; Lang, Anthony; Tartaglia, Carmela; Finger, Elizabeth; Zinman, Lorne; Turnbull, John; Freedman, Morris; Swartz, Rick; Black, Sandra E; Hegele, Robert A

    2018-04-04

    Next-generation sequencing (NGS) is quickly revolutionizing how research into the genetic determinants of constitutional disease is performed. The technique is highly efficient with millions of sequencing reads being produced in a short time span and at relatively low cost. Specifically, targeted NGS is able to focus investigations to genomic regions of particular interest based on the disease of study. Not only does this further reduce costs and increase the speed of the process, but it lessens the computational burden that often accompanies NGS. Although targeted NGS is restricted to certain regions of the genome, preventing identification of potential novel loci of interest, it can be an excellent technique when faced with a phenotypically and genetically heterogeneous disease, for which there are previously known genetic associations. Because of the complex nature of the sequencing technique, it is important to closely adhere to protocols and methodologies in order to achieve sequencing reads of high coverage and quality. Further, once sequencing reads are obtained, a sophisticated bioinformatics workflow is utilized to accurately map reads to a reference genome, to call variants, and to ensure the variants pass quality metrics. Variants must also be annotated and curated based on their clinical significance, which can be standardized by applying the American College of Medical Genetics and Genomics Pathogenicity Guidelines. The methods presented herein will display the steps involved in generating and analyzing NGS data from a targeted sequencing panel, using the ONDRISeq neurodegenerative disease panel as a model, to identify variants that may be of clinical significance.

  8. Application of Quaternion in improving the quality of global sequence alignment scores for an ambiguous sequence target in Streptococcus pneumoniae DNA

    NASA Astrophysics Data System (ADS)

    Lestari, D.; Bustamam, A.; Novianti, T.; Ardaneswari, G.

    2017-07-01

    A DNA sequence can be defined as a succession of letters representing the order of nucleotides within DNA, written with the four base codes adenine (A), guanine (G), cytosine (C), and thymine (T). The precise sequence is determined using DNA sequencing methods and technologies, which have been developed since the 1970s and have now become advanced, high-throughput technologies. DNA sequencing has greatly accelerated biological and medical research and discovery. However, in some cases DNA sequencing produces ambiguous results in which it is difficult to determine whether a given code is A, T, G, or C. To address this problem, in this study we introduce another representation of DNA codes, the Quaternion Q = (PA, PT, PG, PC), where PA, PT, PG, and PC are the probabilities that the bases A, T, G, and C appear in Q, and PA + PT + PG + PC = 1. Using Quaternion representations, we construct an improved scoring matrix for global sequence alignment by applying a dot product method. This scoring matrix produces better-quality match and mismatch scores between two DNA base codes. We implemented the Needleman-Wunsch global sequence alignment algorithm in Octave to analyze our target sequence, which contains some ambiguous sequence data. The subject sequences are DNA sequences of Streptococcus pneumoniae families obtained from GenBank, while the target DNA sequence was received from our collaborator's database. We found that the Quaternion representations improve the quality of the sequence alignment score, and we conclude that the target DNA sequence has maximum similarity with Streptococcus pneumoniae.
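
    The idea of scoring ambiguous bases through a dot product of probability vectors can be sketched in Python (rather than the Octave used in the study). The abstract does not give the exact scoring matrix, so the mapping from dot product to score (2 * dot - 1, which yields +1 for identical certain bases and -1 for different certain bases) and the gap penalty of -1 are assumptions made here for illustration, as are all function names.

```python
# Each base is a vector (PA, PT, PG, PC) of probabilities summing to 1.
CERTAIN = {
    "A": (1.0, 0.0, 0.0, 0.0),
    "T": (0.0, 1.0, 0.0, 0.0),
    "G": (0.0, 0.0, 1.0, 0.0),
    "C": (0.0, 0.0, 0.0, 1.0),
    "N": (0.25, 0.25, 0.25, 0.25),  # fully ambiguous base
}

def encode(sequence):
    """Turn a string over A/T/G/C/N into a list of probability vectors."""
    return [CERTAIN[symbol] for symbol in sequence.upper()]

def dot_score(p, q):
    """Assumed substitution score: rescale the dot product to [-1, 1]."""
    dot = sum(a * b for a, b in zip(p, q))
    return 2.0 * dot - 1.0

def needleman_wunsch_score(seq_a, seq_b, gap=-1.0):
    """Global alignment score of two lists of probability vectors."""
    rows, cols = len(seq_a) + 1, len(seq_b) + 1
    score = [[0.0] * cols for _ in range(rows)]
    for i in range(1, rows):
        score[i][0] = i * gap
    for j in range(1, cols):
        score[0][j] = j * gap
    for i in range(1, rows):
        for j in range(1, cols):
            diagonal = score[i - 1][j - 1] + dot_score(seq_a[i - 1], seq_b[j - 1])
            score[i][j] = max(diagonal, score[i - 1][j] + gap, score[i][j - 1] + gap)
    return score[-1][-1]

# Hypothetical target whose third base is equally likely to be A or G.
target = encode("AC") + [(0.5, 0.0, 0.5, 0.0)] + encode("T")
subject = encode("ACGT")
print(needleman_wunsch_score(target, subject))
```

    A traceback step would be needed to recover the alignment itself; only the score is computed here.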

  9. Sequencing of a QTL-rich region of the Theobroma cacao genome using pooled BACs and the identification of trait specific candidate genes

    PubMed Central

    2011-01-01

    Background: BAC-based physical maps provide for sequencing across an entire genome or a selected sub-genomic region of biological interest. Such a region can be approached with next-generation whole-genome sequencing and assembly as if it were an independent small genome. Using the minimum tiling path as a guide, specific BAC clones representing the prioritized genomic interval are selected, pooled, and used to prepare a sequencing library. Results: This pooled BAC approach was taken to sequence and assemble a QTL-rich region, of ~3 Mbp and represented by twenty-seven BACs, on linkage group 5 of the Theobroma cacao cv. Matina 1-6 genome. Using various mixtures of read coverages from paired-end and linear 454 libraries, multiple assemblies of varied quality were generated. Quality was assessed by comparing the assembly of 454 reads with a subset of ten BACs individually sequenced and assembled using Sanger reads. A mixture of reads optimal for assembly was identified. We found, furthermore, that a quality assembly suitable for serving as a reference genome template could be obtained even with a reduced depth of sequencing coverage. Annotation of the resulting assembly revealed several genes potentially responsible for three T. cacao traits: black pod disease resistance, bean shape index, and pod weight. Conclusions: Our results, as with other pooled BAC sequencing reports, suggest that pooling portions of a minimum tiling path derived from a BAC-based physical map is an effective method to target sub-genomic regions for sequencing. While we focused on a single QTL region, other QTL regions of importance could be similarly sequenced allowing for biological discovery to take place before a high quality whole-genome assembly is completed. PMID:21794110

  10. Efficient Identification of Murine M2 Macrophage Peptide Targeting Ligands by Phage Display and Next-Generation Sequencing.

    PubMed

    Liu, Gary W; Livesay, Brynn R; Kacherovsky, Nataly A; Cieslewicz, Maryelise; Lutz, Emi; Waalkes, Adam; Jensen, Michael C; Salipante, Stephen J; Pun, Suzie H

    2015-08-19

    Peptide ligands are used to increase the specificity of drug carriers to their target cells and to facilitate intracellular delivery. One method to identify such peptide ligands, phage display, enables high-throughput screening of peptide libraries for ligands binding to therapeutic targets of interest. However, conventional methods for identifying target binders in a library by Sanger sequencing are low-throughput, labor-intensive, and provide a limited perspective (<0.01%) of the complete sequence space. Moreover, the small sample space can be dominated by nonspecific, preferentially amplifying "parasitic sequences" and plastic-binding sequences, which may lead to the identification of false positives or exclude the identification of target-binding sequences. To overcome these challenges, we employed next-generation Illumina sequencing to couple high-throughput screening and high-throughput sequencing, enabling more comprehensive access to the phage display library sequence space. In this work, we define the hallmarks of binding sequences in next-generation sequencing data, and develop a method that identifies several target-binding phage clones for murine, alternatively activated M2 macrophages with a high (100%) success rate: sequences and binding motifs were reproducibly present across biological replicates; binding motifs were identified across multiple unique sequences; and an unselected, amplified library accurately filtered out parasitic sequences. In addition, we validate the Multiple Em for Motif Elicitation tool as an efficient and principled means of discovering binding sequences.
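
    Two of the hallmarks listed in this abstract (presence across biological replicates, and filtering against an unselected, amplified library) can be sketched as a simple Python filter. The function names, the peptide strings, the counts, and the 2-fold enrichment threshold are hypothetical; the abstract does not state how the authors quantified enrichment, and motif discovery is not covered here.

```python
def frequencies(counts):
    """Convert raw read counts per sequence into relative frequencies."""
    total = sum(counts.values())
    if total == 0:
        raise ValueError("library has no reads")
    return {seq: n / total for seq, n in counts.items()}

def candidate_binders(replicates, amplified_unselected, min_fold=2.0):
    """Keep sequences seen in every replicate that are not explained by
    preferential amplification in the unselected library."""
    if not replicates:
        raise ValueError("at least one replicate is required")
    rep_freqs = [frequencies(rep) for rep in replicates]
    background = frequencies(amplified_unselected)
    shared = set.intersection(*(set(freq) for freq in rep_freqs))
    hits = []
    for seq in sorted(shared):
        mean_freq = sum(freq[seq] for freq in rep_freqs) / len(rep_freqs)
        bg = background.get(seq, 0.0)
        # Absent from the background, or enriched at least min_fold over it.
        if bg == 0.0 or mean_freq / bg >= min_fold:
            hits.append(seq)
    return hits

# Hypothetical read counts, for illustration only.
replicate_1 = {"PEPTIDEA": 50, "PARASITE": 40, "RARESEQ": 10}
replicate_2 = {"PEPTIDEA": 60, "PARASITE": 40}
amplified = {"PARASITE": 80, "ANYSEQ": 20}
print(candidate_binders([replicate_1, replicate_2], amplified))
```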

  11. Molecular Analysis of Methanogen Richness in Landfill and Marshland Targeting 16S rDNA Sequences

    PubMed Central

    Yadav, Shailendra; Kundu, Sharbadeb; Ghosh, Sankar K.; Maitra, S. S.

    2015-01-01

    Methanogens, a key contributor in global carbon cycling, methane emission, and alternative energy production, generate methane gas via anaerobic digestion of organic matter. The methane emission potential depends upon methanogenic diversity and activity. Since they are anaerobes and difficult to isolate and culture, their diversity present in the landfill sites of Delhi and marshlands of Southern Assam, India, was analyzed using molecular techniques like 16S rDNA sequencing, DGGE, and qPCR. The sequencing results indicated the presence of methanogens belonging to the seventh order and also the order Methanomicrobiales in the Ghazipur and Bhalsawa landfill sites of Delhi. Sequences, related to the phyla Crenarchaeota (thermophilic) and Thaumarchaeota (mesophilic), were detected from marshland sites of Southern Assam, India. Jaccard analysis of DGGE gel using Gel2K showed three main clusters depending on the number and similarity of band patterns. The copy number analysis of hydrogenotrophic methanogens using qPCR indicates higher abundance in landfill sites of Delhi as compared to the marshlands of Southern Assam. The knowledge about “methanogenic archaea composition” and “abundance” in the contrasting ecosystems like “landfill” and “marshland” may reorient our understanding of the Archaea inhabitants. This study could shed light on the relationship between methane-dynamics and the global warming process. PMID:26568700
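
    The Jaccard comparison of DGGE band patterns mentioned above can be illustrated with a short Python sketch. It assumes each lane has been reduced to a set of band positions; the lane names and positions are hypothetical, and the clustering performed by Gel2K is not reproduced.

```python
from itertools import combinations

def jaccard(bands_a, bands_b):
    """Jaccard similarity of two band-presence sets: shared bands / all bands."""
    a, b = set(bands_a), set(bands_b)
    union = a | b
    if not union:
        return 1.0  # two empty lanes are treated as identical
    return len(a & b) / len(union)

# Hypothetical band positions per lane, for illustration only.
lanes = {
    "landfill_1": {1, 2, 3, 5},
    "landfill_2": {1, 2, 3, 6},
    "marshland_1": {4, 7, 8},
}
for first, second in combinations(sorted(lanes), 2):
    print(f"{first} vs {second}: {jaccard(lanes[first], lanes[second]):.2f}")
```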

  12. Genome-wide sequencing of longan (Dimocarpus longan Lour.) provides insights into molecular basis of its polyphenol-rich characteristics

    PubMed Central

    Lin, Yuling; Min, Jiumeng; Lai, Ruilian; Wu, Zhangyan; Chen, Yukun; Yu, Lili; Cheng, Chunzhen; Jin, Yuanchun; Tian, Qilin; Liu, Qingfeng; Liu, Weihua; Zhang, Chengguang; Lin, Lixia; Hu, Yan; Zhang, Dongmin; Thu, Minkyaw; Zhang, Zihao; Liu, Shengcai; Zhong, Chunshui; Fang, Xiaodong; Wang, Jian; Yang, Huanming

    2017-01-01

    Longan (Dimocarpus longan Lour.), an important subtropical fruit in the family Sapindaceae, is grown in more than 10 countries. Longan is an edible drupe fruit and a source of traditional medicine with polyphenol-rich traits. Tree size, alternate bearing, and witches' broom disease still pose serious problems. To gain insights into the genomic basis of longan traits, a draft genome sequence was assembled. The draft genome (about 471.88 Mb) of a Chinese longan cultivar, “Honghezi,” was estimated to contain 31 007 genes and 261.88 Mb of repetitive sequences. No recent whole-genome-wide duplication event was detected in the genome. Whole-genome resequencing and analysis of 13 cultivated D. longan accessions revealed the extent of genetic diversity. Comparative transcriptome studies combined with genome-wide analysis revealed polyphenol-rich and pathogen resistance characteristics. Genes involved in secondary metabolism, especially those from significantly expanded (DHS, SDH, F3′H, ANR, and UFGT) and contracted (PAL, CHS, and F3′5′H) gene families with tissue-specific expression, may be important contributors to the high accumulation levels of polyphenolic compounds observed in longan fruit. The high number of genes encoding nucleotide-binding site leucine-rich repeat (NBS-LRR) and leucine-rich repeat receptor-like kinase proteins, as well as the recent expansion and contraction of the NBS-LRR family, suggested a genomic basis for resistance to insects, fungus, and bacteria in this fruit tree. These data provide insights into the evolution and diversity of the longan genome. The comparative genomic and transcriptome analyses provided information about longan-specific traits, particularly genes involved in its polyphenol-rich and pathogen resistance characteristics. PMID:28368449

  13. The Induction of Recombinant Protein Bodies in Different Subcellular Compartments Reveals a Cryptic Plastid-Targeting Signal in the 27-kDa γ-Zein Sequence

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Hofbauer, Anna; Peters, Jenny; Arcalis, Elsa

    2014-12-11

    Naturally occurring storage proteins such as zeins are used as fusion partners for recombinant proteins because they induce the formation of ectopic storage organelles known as protein bodies (PBs) where the proteins are stabilized by intermolecular interactions and the formation of disulfide bonds. Endogenous PBs are derived from the endoplasmic reticulum (ER). Here, we have used different targeting sequences to determine whether ectopic PBs composed of the N-terminal portion of mature 27 kDa γ-zein added to a fluorescent protein could be induced to form elsewhere in the cell. The addition of a transit peptide for targeting to plastids causes PB formation in the stroma, whereas in the absence of any added targeting sequence PBs were typically associated with the plastid envelope, revealing the presence of a cryptic plastid-targeting signal within the γ-zein cysteine-rich domain. The subcellular localization of the PBs influences their morphology and the solubility of the stored recombinant fusion protein. Our results indicate that the biogenesis and budding of PBs does not require ER-specific factors and therefore, confirm that γ-zein is a versatile fusion partner for recombinant proteins offering unique opportunities for the accumulation and bioencapsulation of recombinant proteins in different subcellular compartments.

  14. Investigating possible biological targets of Bj-CRP, the first cysteine-rich secretory protein (CRISP) isolated from Bothrops jararaca snake venom.

    PubMed

    Lodovicho, Marina E; Costa, Tássia R; Bernardes, Carolina P; Menaldo, Danilo L; Zoccal, Karina F; Carone, Sante E; Rosa, José C; Pucca, Manuela B; Cerni, Felipe A; Arantes, Eliane C; Tytgat, Jan; Faccioli, Lúcia H; Pereira-Crott, Luciana S; Sampaio, Suely V

    2017-01-04

    Cysteine-rich secretory proteins (CRISPs) are commonly described as part of the protein content of snake venoms; nevertheless, little is known so far about their biological targets and functions. Our study describes the isolation and characterization of Bj-CRP, the first CRISP isolated from Bothrops jararaca snake venom, and also aims to identify possible targets for its actions. Bj-CRP was purified using three chromatographic steps (Sephacryl S-200, Source 15Q and C18) and was shown to be an acidic protein of 24.6 kDa with high sequence identity to other snake venom CRISPs. This CRISP was devoid of proteolytic, hemorrhagic or coagulant activities, and it did not affect the currents from 13 voltage-gated potassium channel isoforms. Conversely, Bj-CRP induced inflammatory responses characterized by an increase of leukocytes, mainly neutrophils, 1 and 4 h after its injection into the peritoneal cavity of mice, also stimulating the production of IL-6. Bj-CRP also acted on the human complement system, modulating some of the activation pathways and acting directly on important components (C3 and C4), thus inducing the generation of anaphylatoxins (C3a, C4a and C5a). Therefore, our results for Bj-CRP open up prospects for better understanding this class of toxins and its biological actions. Copyright © 2016 Elsevier Ireland Ltd. All rights reserved.

  15. Computer-based prediction of mitochondria-targeting peptides.

    PubMed

    Martelli, Pier Luigi; Savojardo, Castrense; Fariselli, Piero; Tasco, Gianluca; Casadio, Rita

    2015-01-01

    Computational methods are invaluable when protein sequences, directly derived from genomic data, need functional and structural annotation. Subcellular localization, a feature necessary for understanding a protein's role and the compartment where the mature protein is active, is very difficult to characterize experimentally. Mitochondrial proteins translated on cytosolic ribosomes carry specific patterns in the precursor sequence from which it is possible to recognize a peptide targeting the protein to its final destination. Here we discuss to what extent it is feasible to develop computational methods for detecting mitochondrial targeting peptides in the precursor sequences and benchmark our and other methods on the human mitochondrial proteins endowed with experimentally characterized targeting peptides. Furthermore, we illustrate our newly implemented web server and its usage on the whole human proteome in order to infer mitochondrial targeting peptides, their cleavage sites, and whether or not the targeting peptide regions contain arginine-rich recurrent motifs. In this way, we add some 2,800 further human proteins to the 124 already experimentally annotated with a mitochondrial targeting peptide.

  16. An Optimized Transient Dual Luciferase Assay for Quantifying MicroRNA Directed Repression of Targeted Sequences

    PubMed Central

    Moyle, Richard L.; Carvalhais, Lilia C.; Pretorius, Lara-Simone; Nowak, Ekaterina; Subramaniam, Gayathery; Dalton-Morgan, Jessica; Schenk, Peer M.

    2017-01-01

    Studies investigating the action of small RNAs on computationally predicted target genes require some form of experimental validation. Classical molecular methods of validating microRNA action on target genes are laborious, while approaches that tag predicted target sequences to qualitative reporter genes encounter technical limitations. The aim of this study was to address the challenge of experimentally validating large numbers of computationally predicted microRNA-target transcript interactions using an optimized, quantitative, cost-effective, and scalable approach. The presented method combines transient expression via agroinfiltration of Nicotiana benthamiana leaves with a quantitative dual luciferase reporter system, where firefly luciferase is used to report the microRNA-target sequence interaction and Renilla luciferase is used as an internal standard to normalize expression between replicates. We report the appropriate concentration of N. benthamiana leaf extracts and dilution factor to apply in order to avoid inhibition of firefly LUC activity. Furthermore, the optimal ratio of microRNA precursor expression construct to reporter construct and duration of the incubation period post-agroinfiltration were determined. The optimized dual luciferase assay provides an efficient, repeatable and scalable method to validate and quantify microRNA action on predicted target sequences. The optimized assay was used to validate five predicted targets of rice microRNA miR529b, with as few as six technical replicates. The assay can be extended to assess other small RNA-target sequence interactions, including assessing the functionality of an artificial miRNA or an RNAi construct on a targeted sequence. PMID:28979287
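
    The normalization at the core of a dual luciferase assay (firefly signal divided by the Renilla internal standard, then compared against a control) can be written out as a brief Python sketch. The function names and luminescence readings are hypothetical, and comparing against a no-microRNA control is an assumption about the experimental design rather than a detail stated in the abstract.

```python
from statistics import mean

def normalized_ratios(firefly, renilla):
    """Per-replicate firefly/Renilla ratios (Renilla is the internal standard)."""
    if len(firefly) != len(renilla):
        raise ValueError("firefly and renilla must have the same number of replicates")
    if any(r <= 0 for r in renilla):
        raise ValueError("Renilla readings must be positive")
    return [f / r for f, r in zip(firefly, renilla)]

def relative_expression(treated_ratios, control_ratios):
    """Mean normalized signal with the microRNA, relative to the control."""
    return mean(treated_ratios) / mean(control_ratios)

# Hypothetical luminescence readings, three replicates each.
control = normalized_ratios([1000, 1200, 1100], [500, 600, 550])
with_mirna = normalized_ratios([400, 500, 450], [400, 500, 450])
print(relative_expression(with_mirna, control))
```

    A value below 1 would indicate reduced reporter signal in the presence of the microRNA, which is the pattern expected for a repressed target.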

  17. Transcriptome-Wide Identification of RNA Targets of Arabidopsis SERINE/ARGININE-RICH45 Uncovers the Unexpected Roles of This RNA Binding Protein in RNA Processing

    PubMed Central

    Wang, Yajun; Hamilton, Michael; Ben-Hur, Asa; Reddy, Anireddy S.N.

    2015-01-01

    Plant SR45 and its metazoan ortholog RNPS1 are serine/arginine-rich (SR)-like RNA binding proteins that function in splicing/postsplicing events and regulate diverse processes in eukaryotes. Interactions of SR45 with both RNAs and proteins are crucial for regulating RNA processing. However, in vivo RNA targets of SR45 are currently unclear. Using RNA immunoprecipitation followed by high-throughput sequencing, we identified over 4000 Arabidopsis thaliana RNAs that directly or indirectly associate with SR45, designated as SR45-associated RNAs (SARs). Comprehensive analyses of these SARs revealed several roles for SR45. First, SR45 associates with and regulates the expression of 30% of abscisic acid (ABA) signaling genes at the postsplicing level. Second, although most SARs are derived from intron-containing genes, surprisingly, 340 SARs are derived from intronless genes. Expression analysis of the SARs suggests that SR45 differentially regulates intronless and intron-containing SARs. Finally, we identified four overrepresented RNA motifs in SARs that likely mediate SR45’s recognition of its targets. Therefore, SR45 plays an unexpected role in mRNA processing of intronless genes, and numerous ABA signaling genes are targeted for regulation at the posttranscriptional level. The diverse molecular functions of SR45 uncovered in this study are likely applicable to other species in view of its conservation across eukaryotes. PMID:26603559

  18. Colorimetric biosensing of targeted gene sequence using dual nanoparticle platforms

    PubMed Central

    Thavanathan, Jeevan; Huang, Nay Ming; Thong, Kwai Lin

    2015-01-01

    We have developed a colorimetric biosensor using a dual platform of gold nanoparticles and graphene oxide sheets for the detection of Salmonella enterica. The presence of the invA gene in S. enterica causes a change in color of the biosensor from its original pinkish-red to a light purplish solution. This occurs through the aggregation of the primary gold nanoparticles–conjugated DNA probe onto the surface of the secondary graphene oxide–conjugated DNA probe through DNA hybridization with the targeted DNA sequence. Spectrophotometry analysis showed a shift in wavelength from 525 nm to 600 nm with 1 μM of DNA target. Specificity testing revealed that the biosensor was able to detect various serovars of the S. enterica while no color change was observed with the other bacterial species. Sensitivity testing revealed the limit of detection was at 1 nM of DNA target. This proves the effectiveness of the biosensor in the detection of S. enterica through DNA hybridization. PMID:25897217

  19. Sequence-based design of bioactive small molecules that target precursor microRNAs.

    PubMed

    Velagapudi, Sai Pradeep; Gallo, Steven M; Disney, Matthew D

    2014-04-01

    Oligonucleotides are designed to target RNA using base pairing rules, but they can be hampered by poor cellular delivery and nonspecific stimulation of the immune system. Small molecules are preferred as lead drugs or probes but cannot be designed from sequence. Herein, we describe an approach termed Inforna that designs lead small molecules for RNA from solely sequence. Inforna was applied to all human microRNA hairpin precursors, and it identified bioactive small molecules that inhibit biogenesis by binding nuclease-processing sites (44% hit rate). Among 27 lead interactions, the most avid interaction is between a benzimidazole (1) and precursor microRNA-96. Compound 1 selectively inhibits biogenesis of microRNA-96, upregulating a protein target (FOXO1) and inducing apoptosis in cancer cells. Apoptosis is ablated when FOXO1 mRNA expression is knocked down by an siRNA, validating compound selectivity. Markedly, microRNA profiling shows that 1 only affects microRNA-96 biogenesis and is at least as selective as an oligonucleotide.

  20. Global sequence variation in the histidine-rich proteins 2 and 3 of Plasmodium falciparum: implications for the performance of malaria rapid diagnostic tests

    PubMed Central

    2010-01-01

    Background: Accurate diagnosis is essential for prompt and appropriate treatment of malaria. While rapid diagnostic tests (RDTs) offer great potential to improve malaria diagnosis, the sensitivity of RDTs has been reported to be highly variable. One possible factor contributing to variable test performance is the diversity of parasite antigens. This is of particular concern for Plasmodium falciparum histidine-rich protein 2 (PfHRP2)-detecting RDTs since PfHRP2 has been reported to be highly variable in isolates of the Asia-Pacific region. Methods: The pfhrp2 exon 2 fragment from 458 isolates of P. falciparum collected from 38 countries was amplified and sequenced. For a subset of 80 isolates, the exon 2 fragment of histidine-rich protein 3 (pfhrp3) was also amplified and sequenced. DNA sequence and statistical analysis of the variation observed in these genes was conducted. The potential impact of the pfhrp2 variation on RDT detection rates was examined by analysing the relationship between sequence characteristics of this gene and the results of the WHO product testing of malaria RDTs: Round 1 (2008), for 34 PfHRP2-detecting RDTs. Results: Sequence analysis revealed extensive variations in the number and arrangement of various repeats encoded by the genes in parasite populations world-wide. However, no statistically robust correlation between gene structure and RDT detection rate for P. falciparum parasites at 200 parasites per microlitre was identified. Conclusions: The results suggest that despite extreme sequence variation, diversity of PfHRP2 does not appear to be a major cause of RDT sensitivity variation. PMID:20470441

  1. Evolution of Hsp70 Gene Expression: A Role for Changes in AT-Richness within Promoters

    PubMed Central

    Ma, Ronghui; Zhang, Bo; Kang, Le

    2011-01-01

    In disparate organisms adaptation to thermal stress has been linked to changes in the expression of genes encoding heat-shock proteins (Hsp). The underlying genetics, however, remain elusive. We show here that two AT-rich sequence elements in the promoter region of the hsp70 gene of the fly Liriomyza sativae that are absent in the congeneric species, Liriomyza huidobrensis, have marked cis-regulatory consequences. We studied the cis-regulatory consequences of these elements (called ATRS1 and ATRS2) by measuring the constitutive and heat-shock-induced luciferase luminescence that they drive in cells transfected with constructs carrying them modified, deleted, or intact, in the hsp70 promoter fused to the luciferase gene. The elements affected expression level markedly and in different ways: Deleting ATRS1 augmented both the constitutive and the heat-shock-induced luminescence, suggesting that this element represses transcription. Interestingly, replacing the element with random sequences of the same length and A+T content delivered the wild-type luminescence pattern, proving that the element's high A+T content is crucial for its effects. Deleting ATRS2 decreased luminescence dramatically and almost abolished heat-shock inducibility and so did replacing the element with random sequences matching the element's length and A+T content, suggesting that ATRS2's effects on transcription and heat-shock inducibility involve a common mechanism requiring at least in part the element's specific primary structure. Finally, constitutive and heat-shock luminescence were reduced strongly when two putative binding sites for the Zeste transcription factor identified within ATRS2 were altered through site-directed mutagenesis, and the heat-shock-induced luminescence increased when Zeste was over-expressed, indicating that Zeste participates in the effects mapped to ATRS2 at least in part. AT-rich sequences are common in promoters and our results suggest that they should play important

  2. Implementation of an Autonomous Multi-Maneuver Targeting Sequence for Lunar Trans-Earth Injection

    NASA Technical Reports Server (NTRS)

    Whitley, Ryan J.; Williams, Jacob

    2010-01-01

    Using a fully analytic initial guess estimate as a first iterate, a targeting procedure that constructs a flyable burn maneuver sequence to transfer a spacecraft from any closed Moon orbit to a desired Earth entry state is developed and implemented. The algorithm is built to support the need for an anytime abort capability for Orion. Based on project requirements, the Orion spacecraft must be able to autonomously calculate the translational maneuver targets for an entire Lunar mission. Translational maneuver target sequences for the Orion spacecraft include Lunar Orbit Insertion (LOI), Trans-Earth Injection (TEI), and Trajectory Correction Maneuvers (TCMs). This onboard capability is generally assumed to be supplemental to redundant ground computation in nominal mission operations and considered as a viable alternative primarily in loss of communications contingencies. Of these maneuvers, the ability to accurately and consistently establish a flyable 3-burn TEI target sequence is especially critical. The TEI is the sole means by which the crew can successfully return from the Moon to a narrowly banded Earth Entry Interface (EI) state. This is made even more critical by the desire for global access on the lunar surface. Currently, the designed propellant load is based on fully optimized TEI solutions for the worst case geometries associated with the accepted range of epochs and landing sites. This presents two challenges for an autonomous algorithm: in addition to being feasible, the targets must include burn sequences that do not exceed the anticipated propellant load.

  3. Massively Parallel Sequencing of Patients with Intellectual Disability, Congenital Anomalies and/or Autism Spectrum Disorders with a Targeted Gene Panel

    PubMed Central

    Brett, Maggie; McPherson, John; Zang, Zhi Jiang; Lai, Angeline; Tan, Ee-Shien; Ng, Ivy; Ong, Lai-Choo; Cham, Breana; Tan, Patrick; Rozen, Steve; Tan, Ene-Choo

    2014-01-01

    Developmental delay and/or intellectual disability (DD/ID) affects 1–3% of all children. At least half of these are thought to have a genetic etiology. Recent studies have shown that massively parallel sequencing (MPS) using a targeted gene panel is particularly suited for diagnostic testing for genetically heterogeneous conditions. We report on our experiences with using massively parallel sequencing of a targeted gene panel of 355 genes for investigating the genetic etiology of eight patients with a wide range of phenotypes including DD/ID, congenital anomalies and/or autism spectrum disorder. Targeted sequence enrichment was performed using the Agilent SureSelect Target Enrichment Kit and sequenced on the Illumina HiSeq2000 using paired-end reads. For all eight patients, 81–84% of the targeted regions achieved read depths of at least 20×, with average read depths overlapping targets ranging from 322× to 798×. Causative variants were successfully identified in two of the eight patients: a nonsense mutation in the ATRX gene and a canonical splice site mutation in the L1CAM gene. In a third patient, a canonical splice site variant in the USP9X gene could likely explain all or some of her clinical phenotypes. These results confirm the value of targeted MPS for investigating DD/ID in children for diagnostic purposes. However, targeted gene MPS was less likely to provide a genetic diagnosis for children whose phenotype includes autism. PMID:24690944

  4. Sensitive and Specific Target Sequences Selected from Retrotransposons of Schistosoma japonicum for the Diagnosis of Schistosomiasis

    PubMed Central

    Xu, Jing; Zhu, Xing-Quan; Wang, Sheng-Yue; Xia, Chao-Ming

    2012-01-01

    Background: Schistosomiasis japonica is a serious debilitating and sometimes fatal disease. Accurate diagnostic tests play a key role in patient management and control of the disease. However, currently available diagnostic methods are not ideal, and the detection of the parasite DNA in blood samples has turned out to be one of the most promising tools for the diagnosis of schistosomiasis. In our previous investigations, a 230-bp sequence from the highly repetitive retrotransposon SjR2 was identified and it showed high sensitivity and specificity for detecting Schistosoma japonicum DNA in the sera of rabbit model and patients. Recently, 29 retrotransposons were found in S. japonicum genome by our group. The present study highlighted the key factors for selecting a new perspective sensitive target DNA sequence for the diagnosis of schistosomiasis, which can serve as example for other parasitic pathogens. Methodology/Principal Findings: In this study, we demonstrated that the key factors based on the bioinformatic analysis for selecting target sequence are the higher genome proportion, repetitive complete copies and partial copies, and active ESTs than the others in the chromosome genome. New primers based on 25 novel retrotransposons and SjR2 were designed and their sensitivity and specificity for detecting S. japonicum DNA were compared. The results showed that a new 303-bp sequence from non-long terminal repeat (LTR) retrotransposon (SjCHGCS19) had high sensitivity and specificity. The 303-bp target sequence was amplified from the sera of rabbit model at 3 d post-infection by nested-PCR and it became negative at 17 weeks post-treatment. Furthermore, the percentage sensitivity of the nested-PCR was 97.67% in 43 serum samples of S. japonicum-infected patients. Conclusions/Significance: Our findings highlighted the key factors based on the bioinformatic analysis for selecting target sequence from S. japonicum genome, which provide basis for establishing powerful

  5. Computational optimisation of targeted DNA sequencing for cancer detection

    NASA Astrophysics Data System (ADS)

    Martinez, Pierre; McGranahan, Nicholas; Birkbak, Nicolai Juul; Gerlinger, Marco; Swanton, Charles

    2013-12-01

    Despite recent progress thanks to next-generation sequencing technologies, personalised cancer medicine is still hampered by intra-tumour heterogeneity and drug resistance. As most patients with advanced metastatic disease face poor survival, there is need to improve early diagnosis. Analysing circulating tumour DNA (ctDNA) might represent a non-invasive method to detect mutations in patients, facilitating early detection. In this article, we define reduced gene panels from publicly available datasets as a first step to assess and optimise the potential of targeted ctDNA scans for early tumour detection. Dividing 4,467 samples into one discovery and two independent validation cohorts, we show that up to 76% of 10 cancer types harbour at least one mutation in a panel of only 25 genes, with high sensitivity across most tumour types. Our analyses demonstrate that targeting "hotspot" regions would introduce biases towards in-frame mutations and would compromise the reproducibility of tumour detection.

  6. Comparison of taxon-specific versus general locus sets for targeted sequence capture in plant phylogenomics.

    PubMed

    Chau, John H; Rahfeldt, Wolfgang A; Olmstead, Richard G

    2018-03-01

    Targeted sequence capture can be used to efficiently gather sequence data for large numbers of loci, such as single-copy nuclear loci. Most published studies in plants have used taxon-specific locus sets developed individually for a clade using multiple genomic and transcriptomic resources. General locus sets can also be developed from loci that have been identified as single-copy and have orthologs in large clades of plants. We identify and compare a taxon-specific locus set and three general locus sets (conserved ortholog set [COSII], shared single-copy nuclear [APVO SSC] genes, and pentatricopeptide repeat [PPR] genes) for targeted sequence capture in Buddleja (Scrophulariaceae) and outgroups. We evaluate their performance in terms of assembly success, sequence variability, and resolution and support of inferred phylogenetic trees. The taxon-specific locus set had the most target loci. Assembly success was high for all locus sets in Buddleja samples. For outgroups, general locus sets had greater assembly success. Taxon-specific and PPR loci had the highest average variability. The taxon-specific data set produced the best-supported tree, but all data sets showed improved resolution over previous non-sequence capture data sets. General locus sets can be a useful source of sequence capture targets, especially if multiple genomic resources are not available for a taxon.

  7. Captured metagenomics: large-scale targeting of genes based on ‘sequence capture’ reveals functional diversity in soils

    PubMed Central

    Manoharan, Lokeshwaran; Kushwaha, Sandeep K.; Hedlund, Katarina; Ahrén, Dag

    2015-01-01

    Microbial enzyme diversity is key to understanding many ecosystem processes. Whole metagenome sequencing (WMG) obtains information on functional genes, but it is costly and inefficient due to the large amount of sequencing that is required. In this study, we have applied a captured metagenomics technique for functional genes in soil microorganisms, as an alternative to WMG. Large-scale targeting of functional genes, coding for enzymes related to organic matter degradation, was applied to two agricultural soil communities through captured metagenomics. Captured metagenomics uses custom-designed, hybridization-based oligonucleotide probes that enrich functional genes of interest in metagenomic libraries where only probe-bound DNA fragments are sequenced. The captured metagenomes were highly enriched with targeted genes while maintaining their target diversity, and their taxonomic distribution correlated well with the traditional ribosomal sequencing. The captured metagenomes were highly enriched with genes related to organic matter degradation; at least five times more than similar, publicly available soil WMG projects. This target enrichment technique also preserves the functional representation of the soils, thereby facilitating comparative metagenomics projects. Here, we present the first study that applies the captured metagenomics approach at large scale, and this novel method allows deep investigations of central ecosystem processes by studying functional gene abundances. PMID:26490729

  8. Mapping a nucleolar targeting sequence of an RNA binding nucleolar protein, Nop25

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Fujiwara, Takashi; Suzuki, Shunji; Kanno, Motoko

    2006-06-10

    Nop25 is a putative RNA binding nucleolar protein associated with rRNA transcription. The present study was undertaken to determine the mechanism of Nop25 localization in the nucleolus. Deletion experiments of Nop25 amino acid sequence showed Nop25 to contain a nuclear targeting sequence in the N-terminal and a nucleolar targeting sequence in the C-terminal. By expressing derivative peptides from the C-terminal as GFP-fusion proteins in the cells, a lysine and arginine residue-enriched peptide (KRKHPRRAQDSTKKPPSATRTSKTQRRRR) allowed a GFP-fusion protein to be transported and fully retained in the nucleolus. When the peptide was fused with cMyc epitope and expressed in the cells, a cMyc epitope was then detected in the nucleolus. Nop25 did not localize in the nucleolus by deletion of the peptide from Nop25. Furthermore, deletion of a subdomain (KRKHPRRAQ) in the peptide or amino acid substitution of lysine and arginine residues in the subdomain resulted in the loss of Nop25 nucleolar localization. These results suggest that the lysine and arginine residue-enriched peptide is the most prominent nucleolar targeting sequence of Nop25 and that the long stretch of basic residues might play an important role in the nucleolar localization of Nop25. Although Nop25 contained putative SUMOylation, phosphorylation and glycosylation sites, the amino acid substitution in these sites had no effect on the nucleolar localization, thus suggesting that these post-translational modifications did not contribute to the localization of Nop25 in the nucleolus. The treatment of the cells, which expressed a GFP-fusion protein with a nucleolar targeting sequence of Nop25, with RNase A resulted in a complete dislocation of the protein from the nucleolus. These data suggested that the nucleolar targeting sequence might therefore play an important role in the binding of Nop25 to RNA molecules and that the RNA binding of Nop25 might be essential for the nucleolar localization of Nop25.

  9. The metal-rich abundance pattern - spectroscopic properties and abundances for 107 main-sequence stars

    NASA Astrophysics Data System (ADS)

    Ivanyuk, O. M.; Jenkins, J. S.; Pavlenko, Ya. V.; Jones, H. R. A.; Pinfield, D. J.

    2017-07-01

    We report results from the high-resolution spectral analysis of the 107 metal-rich (mostly [Fe/H] ≥ 7.67 dex) target stars from the Calan-Hertfordshire Extrasolar Planet Search programme observed with HARPS. Using our procedure of finding the best fit to the absorption line profiles in the observed spectra, we measure the abundances of Na, Mg, Al, Si, Ca, Ti, Cr, Mn, Fe, Ni, Cu and Zn, and then compare them with known results from different authors. Most of our abundances agree with these works at the level of ±0.05 dex or better for the stars we have in common. However, we do find systematic differences that make direct inferences difficult. Our analysis suggests that the selection of line lists and atomic line data along with the adopted continuum level influence these differences the most. At the same time, we confirm the positive trends of abundances versus metallicity for Na, Mn, Ni and, to a lesser degree, Al. A slight negative trend is observed for Ca, whereas Si and Cr tend to follow iron. Our analysis allows us to determine the positively skewed normal distribution of projected rotational velocities with a maximum peaking at 3 km s⁻¹. Finally, we obtained a Gaussian distribution of microturbulent velocities that has a maximum at 1.2 km s⁻¹ and a full width at half-maximum Δv₁/₂ = 0.35 km s⁻¹, indicating that metal-rich dwarfs and subgiants in our sample have a very restricted range in microturbulent velocity.

  10. Nucleotide sequences of Dictyostelium discoideum developmentally regulated cDNAs rich in (AAC) imply proteins that contain clusters of asparagine, glutamine, or threonine.

    PubMed

    Shaw, D R; Richter, H; Giorda, R; Ohmachi, T; Ennis, H L

    1989-09-01

    A Dictyostelium discoideum repetitive element composed of long repeats of the codon (AAC) is found in developmentally regulated transcripts. The concentration of (AAC) sequences is low in mRNA from dormant spores and growing cells and increases markedly during spore germination and multicellular development. The sequence hybridizes to many different sized Dictyostelium DNA restriction fragments indicating that it is scattered throughout the genome. Four cDNA clones isolated contain (AAC) sequences in the deduced coding region. Interestingly, the (AAC)-rich sequences are present in all three reading frames in the deduced proteins, i.e., AAC (asparagine), ACA (threonine) and CAA (glutamine). Three of the clones contain only one of these in-frame so that the individual proteins carry either asparagine, threonine, or glutamine clusters, not mixtures. However, one clone is both glutamine- and asparagine-rich. The (AAC) portion of the transcripts are reiterated 300 times in the haploid genome while the other portions of the cDNAs represent single copy genes, whose sequences show no similarity other than the (AAC) repeats. The repeated sequence is similar to the opa or M sequence found in Drosophila melanogaster notch and homeo box genes and in fly developmentally regulated transcripts. The transcripts are present on polysomes suggesting that they are translated. Although the function of these repeats is unknown, long amino acid repeats are a characteristic feature of extracellular proteins of lower eukaryotes.
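
    The reading-frame observation in this abstract can be checked with a short worked example. The sketch below is illustrative only: the 15-nucleotide repeat is a hypothetical stand-in, not a sequence from the study, and the function name and three-entry codon table are assumptions made for the example. It translates an (AAC)n stretch in each of the three forward frames using the standard genetic code assignments named in the abstract.

```python
# Only the three codons that an (AAC)n repeat can produce are needed here.
# Assignments follow the standard genetic code, as stated in the abstract.
CODON_TO_AA = {"AAC": "Asn", "ACA": "Thr", "CAA": "Gln"}


def translate_repeat(seq, frame):
    """Translate seq from offset `frame` (0, 1 or 2), dropping any trailing partial codon."""
    codons = [seq[i:i + 3] for i in range(frame, len(seq) - 2, 3)]
    return [CODON_TO_AA[codon] for codon in codons]


# Hypothetical 15-nt repeat used purely for illustration.
repeat = "AAC" * 5

for frame in range(3):
    print(frame, translate_repeat(repeat, frame))
```

    Frame 0 reads AAC codons (asparagine), frame 1 reads ACA (threonine) and frame 2 reads CAA (glutamine), which is why a single repeat can encode any one of the three homopolymeric clusters depending on the frame in which it is read.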

  11. Experience of targeted Usher exome sequencing as a clinical test

    PubMed Central

    Besnard, Thomas; García-García, Gema; Baux, David; Vaché, Christel; Faugère, Valérie; Larrieu, Lise; Léonard, Susana; Millan, Jose M; Malcolm, Sue; Claustres, Mireille; Roux, Anne-Françoise

    2014-01-01

    We show that massively parallel targeted sequencing of 19 genes provides a new and reliable strategy for molecular diagnosis of Usher syndrome (USH) and nonsyndromic deafness, particularly appropriate for these disorders characterized by a high clinical and genetic heterogeneity and a complex structure of several of the genes involved. A series of 71 patients including Usher patients previously screened by Sanger sequencing plus newly referred patients was studied. Ninety-eight percent of the variants previously identified by Sanger sequencing were found by next-generation sequencing (NGS). NGS proved to be efficient as it offers analysis of all relevant genes which is laborious to reach with Sanger sequencing. Among the 13 newly referred Usher patients, both mutations in the same gene were identified in 77% of cases (10 patients) and one candidate pathogenic variant in two additional patients. This work can be considered as pilot for implementing NGS for genetically heterogeneous diseases in clinical service. PMID:24498627

  12. Frugal Chemoprevention: Targeting Nrf2 with Foods Rich in Sulforaphane

    PubMed Central

    Yang, Li; Palliyaguru, Dushani L.; Kensler, Thomas W.

    2015-01-01

    With the properties of efficacy, safety, tolerability, practicability and low cost, foods containing bioactive phytochemicals are gaining significant attention as elements of chemoprevention strategies against cancer. Sulforaphane [1-isothiocyanato-4-(methylsulfinyl)butane], a naturally occurring isothiocyanate produced by cruciferous vegetables such as broccoli, is found to be a highly promising chemoprevention agent against not only a variety of cancers such as breast, prostate, colon, skin, lung, stomach or bladder carcinogenesis, but also cardiovascular disease, neurodegenerative diseases, and diabetes. For reasons of experimental exigency, pre-clinical studies have focused principally on sulforaphane itself, while clinical studies have relied on broccoli sprout preparations rich in either sulforaphane or its biogenic precursor, glucoraphanin. Substantive subsequent evaluation of sulforaphane pharmacokinetics and pharmacodynamics has been undertaken using either pure compound or food matrices. Sulforaphane affects multiple targets in cells. One key molecular mechanism of action for sulforaphane entails activation of the Nrf2-Keap1 signaling pathway although other actions contribute to the broad spectrum of efficacy in different animal models. This review summarizes the current status of pre-clinical chemoprevention studies with sulforaphane and highlights the progress and challenges for the application of foods rich in sulforaphane and/or glucoraphanin in the arena of clinical chemoprevention. PMID:26970133

  13. Frugal chemoprevention: targeting Nrf2 with foods rich in sulforaphane.

    PubMed

    Yang, Li; Palliyaguru, Dushani L; Kensler, Thomas W

    2016-02-01

    With the properties of efficacy, safety, tolerability, practicability and low cost, foods containing bioactive phytochemicals are gaining significant attention as elements of chemoprevention strategies against cancer. Sulforaphane [1-isothiocyanato-4-(methylsulfinyl)butane], a naturally occurring isothiocyanate produced by cruciferous vegetables such as broccoli, is found to be a highly promising chemoprevention agent against not only a variety of cancers such as breast, prostate, colon, skin, lung, stomach or bladder, but also cardiovascular disease, neurodegenerative diseases, and diabetes. For reasons of experimental exigency, preclinical studies have focused principally on sulforaphane itself, while clinical studies have relied on broccoli sprout preparations rich in either sulforaphane or its biogenic precursor, glucoraphanin. Substantive subsequent evaluation of sulforaphane pharmacokinetics and pharmacodynamics has been undertaken using either pure compound or food matrices. Sulforaphane affects multiple targets in cells. One key molecular mechanism of action for sulforaphane entails activation of the Nrf2-Keap1 signaling pathway although other actions contribute to the broad spectrum of efficacy in different animal models. This review summarizes the current status of pre-clinical chemoprevention studies with sulforaphane and highlights the progress and challenges for the application of foods rich in sulforaphane and/or glucoraphanin in the arena of clinical chemoprevention. Copyright © 2016 Elsevier Inc. All rights reserved.

  14. Cost-Effectiveness of Treatment Sequences of Chemotherapies and Targeted Biologics for Elderly Metastatic Colorectal Cancer Patients.

    PubMed

    Parikh, Rohan C; Du, Xianglin L; Robert, Morgan O; Lairson, David R

    2017-01-01

    Treatment patterns for metastatic colorectal cancer (mCRC) patients have changed considerably over the last decade with the introduction of new chemotherapies and targeted biologics. These treatments are often administered in various sequences with limited evidence regarding their cost-effectiveness. To conduct a pharmacoeconomic evaluation of commonly administered treatment sequences among elderly mCRC patients. A probabilistic discrete event simulation model assuming Weibull distribution was developed to evaluate the cost-effectiveness of the following common treatment sequences: (a) first-line oxaliplatin/irinotecan followed by second-line oxaliplatin/irinotecan + bevacizumab (OI-OIB); (b) first-line oxaliplatin/irinotecan + bevacizumab followed by second-line oxaliplatin/irinotecan + bevacizumab (OIB-OIB); (c) OI-OIB followed by a third-line targeted biologic (OI-OIB-TB); and (d) OIB-OIB followed by a third-line targeted biologic (OIB-OIB-TB). Input parameters for the model were primarily obtained from the Surveillance, Epidemiology, and End Results-Medicare linked dataset for incident mCRC patients aged 65 years and older diagnosed from January 2004 through December 2009. A probabilistic sensitivity analysis was performed to account for parameter uncertainty. Costs (2014 U.S. dollars) and effectiveness were discounted at an annual rate of 3%. In the base case analyses, at the willingness-to-pay (WTP) threshold of $100,000/quality-adjusted life-year (QALY) gained, the treatment sequence OIB-OIB (vs. OI-OIB) was not cost-effective with an incremental cost-effectiveness ratio (ICER) per patient of $119,007/QALY; OI-OIB-TB (vs. OIB-OIB) was dominated; and OIB-OIB-TB (vs. OIB-OIB) was not cost-effective with an ICER of $405,857/QALY. Results similar to the base case analysis were obtained assuming log-normal distribution. Cost-effectiveness acceptability curves derived from a probabilistic sensitivity analysis showed that at a WTP of $100,000/QALY gained, sequence

  15. EM connectomics reveals axonal target variation in a sequence-generating network

    PubMed Central

    Narayanan, Rajeevan T; Svara, Fabian; Egger, Robert; Oberlaender, Marcel; Denk, Winfried; Long, Michael A

    2017-01-01

    The sequential activation of neurons has been observed in various areas of the brain, but in no case is the underlying network structure well understood. Here we examined the circuit anatomy of zebra finch HVC, a cortical region that generates sequences underlying the temporal progression of the song. We combined serial block-face electron microscopy with light microscopy to determine the cell types targeted by HVC(RA) neurons, which control song timing. Close to their soma, axons almost exclusively targeted inhibitory interneurons, consistent with what had been found with electrical recordings from pairs of cells. Conversely, far from the soma the targets were mostly other excitatory neurons, about half of these being other HVC(RA) cells. Both observations are consistent with the notion that the neural sequences that pace the song are generated by global synaptic chains in HVC embedded within local inhibitory networks. DOI: http://dx.doi.org/10.7554/eLife.24364.001 PMID:28346140

  16. A statistical approach to detection of copy number variations in PCR-enriched targeted sequencing data.

    PubMed

    Demidov, German; Simakova, Tamara; Vnuchkova, Julia; Bragin, Anton

    2016-10-22

    Multiplex polymerase chain reaction (PCR) is a common enrichment technique for targeted massive parallel sequencing (MPS) protocols. MPS is widely used in biomedical research and clinical diagnostics as a fast and accurate tool for the detection of short genetic variations. However, identification of larger variations such as structural variants and copy number variations (CNV) remains a challenge for targeted MPS. Some approaches and tools for structural variant detection have been proposed, but they have limitations and often require datasets of a certain type, size and expected number of amplicons affected by CNVs. In the paper, we describe a novel algorithm for high-resolution germinal CNV detection in PCR-enriched targeted sequencing data and present an accompanying tool. We have developed a machine learning algorithm for the detection of large duplications and deletions in targeted sequencing data generated with a PCR-based enrichment step. We have performed verification studies and established the algorithm's sensitivity and specificity. We have compared the developed tool with other available methods applicable to the described data and revealed its higher performance. We showed that our method has high specificity and sensitivity for high-resolution copy number detection in targeted sequencing data using a large cohort of samples.

  17. Genetic diagnosis of familial hypercholesterolaemia by targeted next-generation sequencing

    PubMed Central

    Maglio, C; Mancina, R M; Motta, B M; Stef, M; Pirazzi, C; Palacios, L; Askaryar, N; Borén, J; Wiklund, O; Romeo, S

    2014-01-01

    Maglio C., Mancina R. M., Motta B. M., Stef M., Pirazzi C., Palacios L., Askaryar N., Borén J., Wiklund O., Romeo S. (University of Gothenburg, Gothenburg, Sweden; University Magna Graecia of Catanzaro, Italy; University of Milan, Italy; Progenika Biopharma SA, Derio, Spain). Genetic diagnosis of familial hypercholesterolaemia by targeted next-generation sequencing. Objectives: The aim of this study was to combine clinical criteria and next-generation sequencing (pyrosequencing) to establish a diagnosis of familial hypercholesterolaemia (FH). Design, setting and subjects: A total of 77 subjects with a Dutch Lipid Clinic Network score of ≥3 (possible, probable or definite FH clinical diagnosis) were recruited from the Lipid Clinic at Sahlgrenska Hospital, Gothenburg, Sweden. Next-generation sequencing was performed in all subjects using SEQPRO LIPO RS, a kit that detects mutations in the low-density lipoprotein receptor (LDLR), apolipoprotein B (APOB), proprotein convertase subtilisin/kexin type 9 (PCSK9) and LDLR adapter protein 1 (LDLRAP1) genes; copy-number variations in the LDLR gene were also examined. Results: A total of 26 mutations were detected in 50 subjects (65% success rate). Amongst these, 23 mutations were in the LDLR gene, two in the APOB gene and one in the PCSK9 gene. Four mutations with unknown pathogenicity were detected in LDLR. Of these, three mutations (Gly505Asp, Ile585Thr and Gln660Arg) have been previously reported in subjects with FH, but their pathogenicity has not been proved. The fourth, a mutation in LDLR affecting a splicing site (exon 6–intron 6) has not previously been reported; it was found to segregate with high cholesterol levels in the family of the proband. Conclusions: Using a combination of clinical criteria and targeted next-generation sequencing, we have achieved FH diagnosis with a high success rate. Furthermore, we identified a new splicing-site mutation in the LDLR gene. PMID:24785115

  18. Targeted cancer exome sequencing reveals recurrent mutations in myeloproliferative neoplasms

    PubMed Central

    Tenedini, E; Bernardis, I; Artusi, V; Artuso, L; Roncaglia, E; Guglielmelli, P; Pieri, L; Bogani, C; Biamonte, F; Rotunno, G; Mannarelli, C; Bianchi, E; Pancrazzi, A; Fanelli, T; Malagoli Tagliazucchi, G; Ferrari, S; Manfredini, R; Vannucchi, A M; Tagliafico, E

    2014-01-01

    With the intent of dissecting the molecular complexity of Philadelphia-negative myeloproliferative neoplasms (MPN), we designed a target enrichment panel to explore, using next-generation sequencing (NGS), the mutational status of an extensive list of 2000 cancer-associated genes and microRNAs. The genomic DNA of granulocytes and in vitro-expanded CD3+T-lymphocytes, as a germline control, was target-enriched and sequenced in a learning cohort of 20 MPN patients using Roche 454 technology. We identified 141 genuine somatic mutations, most of which were not previously described. To test the frequency of the identified variants, a larger validation cohort of 189 MPN patients was additionally screened for these mutations using Ion Torrent AmpliSeq NGS. Excluding the genes already described in MPN, for 8 genes (SCRIB, MIR662, BARD1, TCF12, FAT4, DAP3, POLG and NRAS), we demonstrated a mutation frequency between 3 and 8%. We also found that mutations at codon 12 of NRAS (NRASG12V and NRASG12D) were significantly associated, for primary myelofibrosis (PMF), with highest dynamic international prognostic scoring system (DIPSS)-plus score categories. This association was then confirmed in 66 additional PMF patients composing a final dataset of 168 PMF showing a NRAS mutation frequency of 4.7%, which was associated with a worse outcome, as defined by the DIPSS plus score. PMID:24150215

  19. Heterologous mitochondrial targeting sequences can deliver functional proteins into mitochondria.

    PubMed

    Marcus, Dana; Lichtenstein, Michal; Cohen, Natali; Hadad, Rita; Erlich-Hadad, Tal; Greif, Hagar; Lorberboum-Galski, Haya

    2016-12-01

    Mitochondrial Targeting Sequences (MTSs) are responsible for trafficking nuclear-encoded proteins into mitochondria. Once entering the mitochondria, the MTS is recognized and cleaved off. Some MTSs are long and undergo two-step processing, as in the case of the human frataxin (FXN) protein (80aa), implicated in Friedreich's ataxia (FA). Therefore, we chose the FXN protein to examine whether nuclear-encoded mitochondrial proteins can efficiently be targeted via a heterologous MTS (hMTS) and deliver a functional protein into mitochondria. We examined three hMTSs: those of citrate synthase (cs), lipoamide dehydrogenase (LAD) and C6ORF66 (ORF), as classical MTS sequences, known to be removed by one-step processing, to deliver FXN into mitochondria, in the form of fusion proteins. We demonstrate that using hMTSs for delivering FXN results in the production of 4-5-fold larger amounts of the fusion proteins, and at 4-5-fold higher concentrations. Moreover, hMTSs delivered a functional FXN protein into the mitochondria even more efficiently than the native MTSfxn, as evidenced by the rescue of FA patients' cells from oxidative stress; demonstrating an 18%-54% increase in cell survival; and a 13%-33% increase in ATP levels, as compared to the fusion protein carrying the native MTS. One fusion protein with MTScs increased aconitase activity within patients' cells, by 400-fold. The implications from our studies are of vast importance for both basic and translational research of mitochondrial proteins as any mitochondrial protein can be delivered efficiently by an hMTS. Moreover, effective targeting of functional proteins is important for restoration of mitochondrial function and treatment of related disorders. Copyright © 2016 Elsevier Ltd. All rights reserved.

  20. Interactions between the R2R3-MYB Transcription Factor, AtMYB61, and Target DNA Binding Sites

    PubMed Central

    Prouse, Michael B.; Campbell, Malcolm M.

    2013-01-01

    Despite the prominent roles played by R2R3-MYB transcription factors in the regulation of plant gene expression, little is known about the details of how these proteins interact with their DNA targets. For example, while Arabidopsis thaliana R2R3-MYB protein AtMYB61 is known to alter transcript abundance of a specific set of target genes, little is known about the specific DNA sequences to which AtMYB61 binds. To address this gap in knowledge, DNA sequences bound by AtMYB61 were identified using cyclic amplification and selection of targets (CASTing). The DNA targets identified using this approach corresponded to AC elements, sequences enriched in adenosine and cytosine nucleotides. The preferred target sequence that bound with the greatest affinity to AtMYB61 recombinant protein was ACCTAC, the AC-I element. Mutational analyses based on the AC-I element showed that ACC nucleotides in the AC-I element served as the core recognition motif, critical for AtMYB61 binding. Molecular modelling predicted interactions between AtMYB61 amino acid residues and corresponding nucleotides in the DNA targets. The affinity between AtMYB61 and specific target DNA sequences did not correlate with AtMYB61-driven transcriptional activation with each of the target sequences. CASTing-selected motifs were found in the regulatory regions of genes previously shown to be regulated by AtMYB61. Taken together, these findings are consistent with the hypothesis that AtMYB61 regulates transcription from specific cis-acting AC elements in vivo. The results shed light on the specifics of DNA binding by an important family of plant-specific transcriptional regulators. PMID:23741471
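
    As a rough illustration of how the AC-I element (ACCTAC) reported above might be located in a regulatory region, the following minimal sketch scans both strands of a DNA string for exact matches. The function names and the example promoter string are hypothetical and are not taken from the study; real binding-site prediction would also need to account for degenerate motifs and binding affinity.

```python
def reverse_complement(seq):
    """Return the reverse complement of an upper-case DNA string."""
    comp = {"A": "T", "C": "G", "G": "C", "T": "A"}
    return "".join(comp[base] for base in reversed(seq))


def find_motif(seq, motif="ACCTAC"):
    """Return sorted (position, strand) tuples for exact motif matches.

    Positions are 0-based indices on the given (forward) strand; a '-'
    hit means the reverse complement of the motif occurs at that index.
    """
    seq = seq.upper()
    hits = []
    for strand, pattern in (("+", motif), ("-", reverse_complement(motif))):
        start = seq.find(pattern)
        while start != -1:
            hits.append((start, strand))
            start = seq.find(pattern, start + 1)
    return sorted(hits)


# Made-up sequence for illustration only (not a real promoter).
promoter = "TTGACCTACGGATCGTAGGTCA"
hits = find_motif(promoter)
print(hits)
```

    Exact string matching is used here only because the abstract names a single preferred hexamer; a position weight matrix would be the more usual choice once several CASTing-selected variants are available.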

  1. In vivo gene correction with targeted sequence substitution through microhomology-mediated end joining.

    PubMed

    Shin, Jeong Hong; Jung, Soobin; Ramakrishna, Suresh; Kim, Hyongbum Henry; Lee, Junwon

    2018-07-07

    Genome editing technology using programmable nucleases has rapidly evolved in recent years. The primary mechanism to achieve precise integration of a transgene is mainly based on homology-directed repair (HDR). However, an HDR-based genome-editing approach is less efficient than non-homologous end-joining (NHEJ). Recently, a microhomology-mediated end-joining (MMEJ)-based transgene integration approach was developed, showing feasibility both in vitro and in vivo. We expanded this method to achieve targeted sequence substitution (TSS) of mutated sequences with normal sequences using double-guide RNAs (gRNAs), and a donor template flanking the microhomologies and target sequence of the gRNAs in vitro and in vivo. Our method could realize more efficient sequence substitution than the HDR-based method in vitro using a reporter cell line, and led to the survival of a hereditary tyrosinemia mouse model in vivo. The proposed MMEJ-based TSS approach could provide a novel therapeutic strategy, in addition to HDR, to achieve gene correction from a mutated sequence to a normal sequence. Copyright © 2018 Elsevier Inc. All rights reserved.

  2. Spes: An intense source of Neutron-Rich Radioactive Beams at Legnaro

    NASA Astrophysics Data System (ADS)

    Andrighetto, A.; Manzolaro, M.; Corradetti, S.; Scarpa, D.; Monetti, A.; Rossignoli, M.; Ballan, M.; Borgna, F.; D'Agostini, F.; Gramegna, F.; Prete, G.; Meneghetti, G.; Ferrari, M.; Zenoni, A.

    2018-02-01

    The Isotope Separation On-Line (ISOL) method for the production of Radioactive Ion Beams (RIB) is attracting significant interest in the worldwide nuclear physics community. Within this context the SPES (Selective Production of Exotic Species) RIB facility is now under construction at INFN LNL (Istituto Nazionale di Fisica Nucleare Laboratori Nazionali di Legnaro). This technique is established as one of the main techniques for high intensity and high quality beams production. The SPES facility will produce n-rich isotopes by means of a 40 MeV proton beam, emitted by a cyclotron, impinging on a uranium carbide multi-foil fission target. The aim of this work is to describe the most important results obtained by the study of the on-line behavior of the SPES production target assembly. This target system will produce RIBs at a rate of about 10^13 fissions per second; it will be able to dissipate a total power of up to 10 kW, and it is planned to work continuously for 2-week runs of irradiation. ISOL beams of 24 different elements will be produced, therefore a target and ion source development is ongoing to ensure a great variety of produced isotopes and to improve the beam intensity and purity.

  3. Production of neutron-rich nuclei approaching r-process by gamma-induced fission of 238U at ELI-NP

    NASA Astrophysics Data System (ADS)

    Mei, Bo; Balabanski, Dimiter; Constantin, Paul; Anh Le, Tuan; Viet Cuong, Phan

    2018-05-01

    The investigation of neutron-rich exotic nuclei is crucial not only for nuclear physics but also for nuclear astrophysics. Experimentally, only a few neutron-rich nuclei near stability have been studied; most neutron-rich nuclei have not been measured due to their small production cross sections as well as short half-lives. At ELI-NP, gamma beams with high intensities will open new opportunities to investigate very neutron-rich fragments produced by photofission of 238U targets in a gas cell. Based on simulations, a novel gas cell has been designed to produce, stop and extract 238U photofission fragments. The extraction time and efficiency of photofission fragments have been optimized by using SIMION simulations. According to these simulations, a high extraction efficiency and a short extraction time can be achieved for 238U photofission fragments in the gas cell, which will allow one to measure very neutron-rich fragments with short half-lives by using the IGISOL facility proposed at ELI-NP.

  4. A weighted sampling algorithm for the design of RNA sequences with targeted secondary structure and nucleotide distribution.

    PubMed

    Reinharz, Vladimir; Ponty, Yann; Waldispühl, Jérôme

    2013-07-01

    The design of RNA sequences folding into predefined secondary structures is a milestone for many synthetic biology and gene therapy studies. Most of the current software uses similar local search strategies (i.e. a random seed is progressively adapted to acquire the desired folding properties) and more importantly do not allow the user to control explicitly the nucleotide distribution such as the GC-content in their sequences. However, the latter is an important criterion for large-scale applications as it could presumably be used to design sequences with better transcription rates and/or structural plasticity. In this article, we introduce IncaRNAtion, a novel algorithm to design RNA sequences folding into target secondary structures with a predefined nucleotide distribution. IncaRNAtion uses a global sampling approach and weighted sampling techniques. We show that our approach is fast (i.e. running time comparable or better than local search methods), seedless (we remove the bias of the seed in local search heuristics) and successfully generates high-quality sequences (i.e. thermodynamically stable) for any GC-content. To complete this study, we develop a hybrid method combining our global sampling approach with local search strategies. Remarkably, our glocal methodology overcomes both local and global approaches for sampling sequences with a specific GC-content and target structure. IncaRNAtion is available at csb.cs.mcgill.ca/incarnation/. Supplementary data are available at Bioinformatics online.
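
    To make the idea of sampling sequences for a target structure with a controlled GC-content more concrete, here is a deliberately simplified sketch. It is not the IncaRNAtion algorithm: it ignores folding energies entirely and just draws each base pair and each unpaired base independently, with a single hypothetical parameter gc_weight that multiplies the weight of every G or C. All names in the block are assumptions made for illustration.

```python
import random

PAIRS = ["GC", "CG", "AU", "UA", "GU", "UG"]  # canonical and wobble pairs
BASES = ["A", "C", "G", "U"]


def pair_table(structure):
    """Map each opening position to its closing partner in a dot-bracket string."""
    stack, pairs = [], {}
    for i, ch in enumerate(structure):
        if ch == "(":
            stack.append(i)
        elif ch == ")":
            if not stack:
                raise ValueError("unbalanced structure")
            pairs[stack.pop()] = i
    if stack:
        raise ValueError("unbalanced structure")
    return pairs


def sample_sequence(structure, gc_weight=1.0, rng=None):
    """Draw one sequence compatible with the structure.

    Every G or C contributes a factor gc_weight to the sampling weight,
    so gc_weight > 1 biases toward GC-rich sequences and < 1 toward AU-rich.
    """
    rng = rng or random.Random()
    seq = [None] * len(structure)
    pair_weights = [gc_weight ** (p.count("G") + p.count("C")) for p in PAIRS]
    base_weights = [gc_weight if b in "GC" else 1.0 for b in BASES]
    for i, j in pair_table(structure).items():
        seq[i], seq[j] = rng.choices(PAIRS, weights=pair_weights)[0]
    for k in range(len(seq)):
        if seq[k] is None:  # unpaired position
            seq[k] = rng.choices(BASES, weights=base_weights)[0]
    return "".join(seq)


def gc_content(seq):
    """Fraction of G and C bases in the sequence."""
    return sum(base in "GC" for base in seq) / len(seq)


target = "((((...))))..."
for w in (0.2, 1.0, 5.0):
    s = sample_sequence(target, gc_weight=w, rng=random.Random(0))
    print(w, s, round(gc_content(s), 2))
```

    In practice one would tune the weight (for example by bisection) until the average GC-content of the samples matches the desired value, and the weights would come from an energy model rather than from independent per-position choices.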

  5. Deep sequencing methods for protein engineering and design.

    PubMed

    Wrenbeck, Emily E; Faber, Matthew S; Whitehead, Timothy A

    2017-08-01

    The advent of next-generation sequencing (NGS) has revolutionized protein science, and the development of complementary methods enabling NGS-driven protein engineering has followed. In general, these experiments address the functional consequences of thousands of protein variants in a massively parallel manner using genotype-phenotype linked high-throughput functional screens followed by DNA counting via deep sequencing. We highlight the use of information-rich datasets to engineer protein molecular recognition. Examples include the creation of multiple dual-affinity Fabs targeting structurally dissimilar epitopes and engineering of a broad germline-targeted anti-HIV-1 immunogen. Additionally, we highlight the generation of enzyme fitness landscapes for conducting fundamental studies of protein behavior and evolution. We conclude with discussion of technological advances. Copyright © 2016 Elsevier Ltd. All rights reserved.

  6. MicroRNAs Form Triplexes with Double Stranded DNA at Sequence-Specific Binding Sites; a Eukaryotic Mechanism via which microRNAs Could Directly Alter Gene Expression

    PubMed Central

    Grace, Christy R.; Ferreira, Antonio M.; Waddell, M. Brett; Ridout, Granger; Naeve, Deanna; Leuze, Michael; LoCascio, Philip F.; Panetta, John C.; Wilkinson, Mark R.; Pui, Ching-Hon; Naeve, Clayton W.; Uberbacher, Edward C.; Bonten, Erik J.; Evans, William E.

    2016-01-01

    MicroRNAs are important regulators of gene expression, acting primarily by binding to sequence-specific locations on already transcribed messenger RNAs (mRNA) and typically down-regulating their stability or translation. Recent studies indicate that microRNAs may also play a role in up-regulating mRNA transcription levels, although a definitive mechanism has not been established. Double-helical DNA is capable of forming triple-helical structures through Hoogsteen and reverse Hoogsteen interactions in the major groove of the duplex, and we show physical evidence (i.e., NMR, FRET, SPR) that purine or pyrimidine-rich microRNAs of appropriate length and sequence form triple-helical structures with purine-rich sequences of duplex DNA, and identify microRNA sequences that favor triplex formation. We developed an algorithm (Trident) to search genome-wide for potential triplex-forming sites and show that several mammalian and non-mammalian genomes are enriched for strong microRNA triplex binding sites. We show that those genes containing sequences favoring microRNA triplex formation are markedly enriched (3.3 fold, p < 2.2 × 10^−16) for genes whose expression is positively correlated with expression of microRNAs targeting triplex binding sequences. This work has thus revealed a new mechanism by which microRNAs could interact with gene promoter regions to modify gene transcription. PMID:26844769

  7. Molecular characterization of oral squamous cell carcinoma using targeted next-generation sequencing.

    PubMed

    Er, Tze-Kiong; Wang, Yen-Yun; Chen, Chih-Chieh; Herreros-Villanueva, Marta; Liu, Ta-Chih; Yuan, Shyng-Shiou F

    2015-10-01

    Many genetic factors play an important role in the development of oral squamous cell carcinoma. The aim of this study was to assess the mutational profile in oral squamous cell carcinoma using formalin-fixed, paraffin-embedded tumors from a Taiwanese population by performing targeted sequencing of 26 cancer-associated genes that are frequently mutated in solid tumors. Next-generation sequencing was performed in 50 formalin-fixed, paraffin-embedded tumor specimens obtained from patients with oral squamous cell carcinoma. Genetic alterations in the 26 cancer-associated genes were detected using a deep sequencing (>1000X) approach. TP53, PIK3CA, MET, APC, CDH1, and FBXW7 were the most frequently mutated genes. Most remarkably, TP53 mutations and PIK3CA mutations, which accounted for 68% and 18% of tumors, respectively, were more prevalent in a Taiwanese population. Mutations in other genes including MET (4%), APC (4%), CDH1 (2%), and FBXW7 (2%) were identified in our population. In summary, our study shows the feasibility of performing targeted sequencing using formalin-fixed, paraffin-embedded samples. Additionally, this study also reports the mutational landscape of oral squamous cell carcinoma in the Taiwanese population. We believe that this study will shed new light on fundamental aspects in understanding the molecular pathogenesis of oral squamous cell carcinoma and may aid in the development of new targeted therapies. © 2015 John Wiley & Sons A/S. Published by John Wiley & Sons Ltd.

  8. The Active Target Time Projection Chamber at NSCL

    NASA Astrophysics Data System (ADS)

    Bazin, D.; Bradt, J.; Ayyad, Y.; Mittig, W.; Ahn, T.; Beceiro-Novo, S.; Carpenter, L.; Cortesi, M.; Fritsch, A.; Kolata, J. J.; Lynch, W.; Watwood, N.

    2017-11-01

    Reactions in inverse kinematics close to the Coulomb barrier offer unique opportunities to study exotic nuclei, but they are plagued by the difficulty to efficiently and precisely measure the characteristics of the emerging particles. The Active Target Time Projection Chamber (AT-TPC) offers an elegant solution to this dilemma. In this device, the detector gas of the time projection chamber is at the same time the target in which nuclear reactions take place. The use of this new paradigm offers several advantages over conventional inert target methods, the most significant being the ability to increase the luminosity of experiments without loss of resolution. The AT-TPC and some results obtained on resonant α scattering to explore the clustering properties of neutron-rich nuclei are presented, as well as fusion cross section results using a 10Be radioactive beam. In addition, the first re-accelerated radioactive beam experiment using the fully commissioned ReA3 linac was conducted recently at the NSCL with the AT-TPC, where proton resonant scattering of a 4.6 MeV/u 46Ar beam was used to measure the neutron single-particle strength in 47Ar.

  9. Draft genome sequence of Lampropedia cohaerens strain CT6(T) isolated from arsenic rich microbial mats of a Himalayan hot water spring.

    PubMed

    Tripathi, Charu; Mahato, Nitish K; Rani, Pooja; Singh, Yogendra; Kamra, Komal; Lal, Rup

    2016-01-01

    Lampropedia cohaerens strain CT6(T), a non-motile, aerobic and coccoid strain was isolated from arsenic rich microbial mats (temperature ~45 °C) of a hot water spring located atop the Himalayan ranges at Manikaran, India. The present study reports the first genome sequence of type strain CT6(T) of genus Lampropedia cohaerens. Sequencing data was generated using the Illumina HiSeq 2000 platform and assembled with ABySS v 1.3.5. The 3,158,922 bp genome was assembled into 41 contigs with a mean GC content of 63.5 % and 2823 coding sequences. Strain CT6(T) was found to harbour genes involved in both the Entner-Doudoroff pathway and non-phosphorylated ED pathway. Strain CT6(T) also contained genes responsible for imparting resistance to arsenic, copper, cobalt, zinc, cadmium and magnesium, providing survival advantages at a thermal location. Additionally, the presence of genes associated with biofilm formation, pyrroloquinoline-quinone production, isoquinoline degradation and mineral phosphate solubilisation in the genome demonstrates the diverse genetic potential for survival at stressed niches.

  10. Discovery of Influenza A Virus Sequence Pairs and Their Combinations for Simultaneous Heterosubtypic Targeting that Hedge against Antiviral Resistance

    PubMed Central

    Lin, Jing; Pramono, Zacharias Aloysius Dwi; Maurer-Stroh, Sebastian

    2016-01-01

    The multiple circulating human influenza A virus subtypes coupled with the perpetual genomic mutations and segment reassortment events challenge the development of effective therapeutics. The capacity to drug most RNAs motivates the investigation on viral RNA targets. 123,060 segment sequences from 35,938 strains of the most prevalent subtypes also infecting humans–H1N1, 2009 pandemic H1N1, H3N2, H5N1 and H7N9, were used to identify 1,183 conserved RNA target sequences (≥15-mer) in the internal segments. 100% theoretical coverage in simultaneous heterosubtypic targeting is achieved by pairing specific sequences from the same segment (“Duals”) or from two segments (“Doubles”); 1,662 Duals and 28,463 Doubles identified. By combining specific Duals and/or Doubles to form a target graph wherein an edge connecting two vertices (target sequences) represents a Dual or Double, it is possible to hedge against antiviral resistance besides maintaining 100% heterosubtypic coverage. To evaluate the hedging potential, we define the hedge-factor as the minimum number of resistant target sequences that will render the graph to become resistant i.e. eliminate all the edges therein; a target sequence or a graph is considered resistant when it cannot achieve 100% heterosubtypic coverage. In an n-vertices graph (n ≥ 3), the hedge-factor is maximal (= n– 1) when it is a complete graph i.e. every distinct pair in a graph is either a Dual or Double. Computational analyses uncover an extensive number of complete graphs of different sizes. Monte Carlo simulations show that the mutation counts and time elapsed for a target graph to become resistant increase with the hedge-factor. Incidentally, target sequences which were reported to reduce virus titre in experiments are included in our target graphs. The identity of target sequence pairs for heterosubtypic targeting and their combinations for hedging antiviral resistance are useful toolkits to construct target graphs for
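
    The hedge-factor defined in this abstract can be read as the size of a minimum vertex cover of the target graph, under the assumption that an edge (a Dual or Double) is eliminated as soon as either of its two target sequences becomes resistant. The brute-force sketch below follows that reading; the function name and the toy graphs are hypothetical, and the exhaustive search is only practical for small graphs.

```python
from itertools import combinations


def hedge_factor(vertices, edges):
    """Smallest number of resistant vertices that eliminates every edge.

    Assumes an edge is lost when at least one endpoint is resistant,
    which makes this the minimum vertex cover size. Exhaustive search,
    so only suitable for small target graphs. Every edge endpoint is
    expected to be listed in vertices.
    """
    vertices = list(vertices)
    edges = [frozenset(edge) for edge in edges]
    for k in range(len(vertices) + 1):
        for resistant in combinations(vertices, k):
            lost = set(resistant)
            if all(edge & lost for edge in edges):
                return k
    return None  # only reached if an edge uses a vertex not in vertices


# Complete graph on four hypothetical target sequences.
targets = ["t1", "t2", "t3", "t4"]
complete = list(combinations(targets, 2))
print(hedge_factor(targets, complete))

# A path t1 - t2 - t3 is far easier to break: t2 alone covers both edges.
path = [("t1", "t2"), ("t2", "t3")]
print(hedge_factor(["t1", "t2", "t3"], path))
```

    For the complete graph the result agrees with the maximal value of n − 1 stated in the abstract, whereas the path graph is defeated by a single resistant sequence.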

  11. Robust Small Target Co-Detection from Airborne Infrared Image Sequences.

    PubMed

    Gao, Jingli; Wen, Chenglin; Liu, Meiqin

    2017-09-29

    In this paper, a novel infrared target co-detection model combining the self-correlation features of backgrounds and the commonality features of targets in the spatio-temporal domain is proposed to detect small targets in a sequence of infrared images with complex backgrounds. Firstly, a dense target extraction model based on nonlinear weights is proposed, which can suppress image backgrounds and enhance small targets better than weights of singular values. Secondly, a sparse target extraction model based on entry-wise weighted robust principal component analysis is proposed. The entry-wise weight adaptively incorporates a structural prior in terms of local weighted entropy; thus, it can extract real targets accurately and suppress background clutter efficiently. Finally, the commonality of targets in the spatio-temporal domain is used to construct a target refinement model for false alarm suppression and target confirmation. Since real targets could appear in both the dense and sparse reconstruction maps of a single frame, and form trajectories after tracklet association of consecutive frames, the location correlation of the dense and sparse reconstruction maps for a single frame and tracklet association of the location correlation maps for successive frames have a strong ability to discriminate between small targets and background clutter. Experimental results demonstrate that the proposed small target co-detection method can not only suppress background clutter effectively, but also detect targets accurately even in the presence of target-like interference.

  12. The minimal amount of starting DNA for Agilent’s hybrid capture-based targeted massively parallel sequencing

    PubMed Central

    Chung, Jongsuk; Son, Dae-Soon; Jeon, Hyo-Jeong; Kim, Kyoung-Mee; Park, Gahee; Ryu, Gyu Ha; Park, Woong-Yang; Park, Donghyun

    2016-01-01

    Targeted capture massively parallel sequencing is increasingly being used in clinical settings, and as costs continue to decline, use of this technology may become routine in health care. However, a limited amount of tissue has often been a challenge in meeting quality requirements. To offer a practical guideline for the minimum amount of input DNA for targeted sequencing, we optimized and evaluated the performance of targeted sequencing depending on the input DNA amount. First, using various amounts of input DNA, we compared commercially available library construction kits and selected Agilent’s SureSelect-XT and KAPA Biosystems’ Hyper Prep kits as the kits most compatible with targeted deep sequencing using Agilent’s SureSelect custom capture. Then, we optimized the adapter ligation conditions of the Hyper Prep kit to improve library construction efficiency and adapted multiplexed hybrid selection to reduce the cost of sequencing. In this study, we systematically evaluated the performance of the optimized protocol depending on the amount of input DNA, ranging from 6.25 to 200 ng, suggesting the minimal input DNA amounts based on coverage depths required for specific applications. PMID:27220682

  13. Classification of G-protein coupled receptors based on a rich generation of convolutional neural network, N-gram transformation and multiple sequence alignments.

    PubMed

    Li, Man; Ling, Cheng; Xu, Qi; Gao, Jingyang

    2018-02-01

    Sequence classification is crucial in predicting the function of newly discovered sequences. In recent years, the prediction of the incremental large-scale and diversity of sequences has heavily relied on the involvement of machine-learning algorithms. To improve prediction accuracy, these algorithms must confront the key challenge of extracting valuable features. In this work, we propose a feature-enhanced protein classification approach, considering the rich generation of multiple sequence alignment algorithms, N-gram probabilistic language model and the deep learning technique. The essence behind the proposed method is that if each group of sequences can be represented by one feature sequence, composed of homologous sites, there should be less loss when the sequence is rebuilt, when a more relevant sequence is added to the group. On the basis of this consideration, the prediction becomes whether a query sequence belonging to a group of sequences can be transferred to calculate the probability that the new feature sequence evolves from the original one. The proposed work focuses on the hierarchical classification of G-protein Coupled Receptors (GPCRs), which begins by extracting the feature sequences from the multiple sequence alignment results of the GPCRs sub-subfamilies. The N-gram model is then applied to construct the input vectors. Finally, these vectors are imported into a convolutional neural network to make a prediction. The experimental results elucidate that the proposed method provides significant performance improvements. The classification error rate of the proposed method is reduced by at least 4.67% (family level I) and 5.75% (family Level II), in comparison with the current state-of-the-art methods. The implementation program of the proposed work is freely available at: https://github.com/alanFchina/CNN .
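
    The N-gram transformation mentioned above can be illustrated with a very small sketch that turns a protein sequence into overlapping N-gram counts and relative frequencies. This is only a generic illustration of N-gram feature extraction, not the authors' pipeline (their code is at the URL given in the abstract); the function names and the toy sequence are hypothetical.

```python
from collections import Counter


def ngram_counts(sequence, n=2):
    """Count overlapping n-grams (windows of n residues) in a sequence."""
    return Counter(sequence[i:i + n] for i in range(len(sequence) - n + 1))


def ngram_frequencies(sequence, n=2):
    """Relative frequency of each n-gram; empty dict if sequence is shorter than n."""
    counts = ngram_counts(sequence, n)
    total = sum(counts.values())
    return {gram: count / total for gram, count in counts.items()}


# Toy amino-acid string, not a real GPCR sequence.
toy = "MKTMKT"
print(ngram_counts(toy, 2))
print(ngram_frequencies(toy, 2))
```

    A fixed-length input vector for a classifier could then be built by looking up these frequencies for every possible N-gram over the 20-letter amino-acid alphabet, with zeros for N-grams that do not occur.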

  14. Validation of the RAGE Hydrocode for Impacts into Volatile-Rich Targets

    NASA Astrophysics Data System (ADS)

    Plesko, C. S.; Asphaug, E.; Coker, R. F.; Wohletz, K. H.; Korycansky, D. G.; Gisler, G. R.

    2007-12-01

    In preparation for a detailed study of large-scale impacts into the Martian surface, we have validated the RAGE hydrocode (Gittings et al., in press, CSD) against a suite of experiments and statistical models. We present comparisons of hydrocode models to centimeter-scale gas gun impacts (Nakazawa et al. 2002), an underground nuclear test (Perret, 1971), and crater scaling laws (Holsapple 1993, O'Keefe and Ahrens 1993). We have also conducted model convergence and uncertainty analyses which will be presented. Results to date are encouraging for our current model goals, and indicate areas where the hydrocode may be extended in the future. This validation work is focused on questions related to the specific problem of large impacts into volatile-rich targets. The overall goal of this effort is to be able to realistically model large-scale Noachian, and possibly post- Noachian, impacts on Mars not so much to model the crater morphology as to understand the evolution of target volatiles in the post-impact regime, to explore how large craters might set the stage for post-impact hydro- geologic evolution both locally (in the crater subsurface) and globally, due to the redistribution of volatiles from the surface and subsurface into the atmosphere. This work is performed under the auspices of IGPP and the DOE at LANL under contracts W-7405-ENG-36 and DE-AC52-06NA25396. Effort by DK and EA is sponsored by NASA's Mars Fundamental Research Program.

  15. Bamgineer: Introduction of simulated allele-specific copy number variants into exome and targeted sequence data sets.

    PubMed

    Samadian, Soroush; Bruce, Jeff P; Pugh, Trevor J

    2018-03-01

    Somatic copy number variations (CNVs) play a crucial role in development of many human cancers. The broad availability of next-generation sequencing data has enabled the development of algorithms to computationally infer CNV profiles from a variety of data types including exome and targeted sequence data; currently the most prevalent types of cancer genomics data. However, systemic evaluation and comparison of these tools remains challenging due to a lack of ground truth reference sets. To address this need, we have developed Bamgineer, a tool written in Python to introduce user-defined haplotype-phased allele-specific copy number events into an existing Binary Alignment Mapping (BAM) file, with a focus on targeted and exome sequencing experiments. As input, this tool requires a read alignment file (BAM format), lists of non-overlapping genome coordinates for introduction of gains and losses (bed file), and an optional file defining known haplotypes (vcf format). To improve runtime performance, Bamgineer introduces the desired CNVs in parallel using queuing and parallel processing on a local machine or on a high-performance computing cluster. As proof-of-principle, we applied Bamgineer to a single high-coverage (mean: 220X) exome sequence file from a blood sample to simulate copy number profiles of 3 exemplar tumors from each of 10 tumor types at 5 tumor cellularity levels (20-100%, 150 BAM files in total). To demonstrate feasibility beyond exome data, we introduced read alignments to a targeted 5-gene cell-free DNA sequencing library to simulate EGFR amplifications at frequencies consistent with circulating tumor DNA (10, 1, 0.1 and 0.01%) while retaining the multimodal insert size distribution of the original data. We expect Bamgineer to be of use for development and systematic benchmarking of CNV calling algorithms by users using locally-generated data for a variety of applications. The source code is freely available at http://github.com/pughlab/bamgineer.

  16. A proline-rich sequence unique to MEK1 and MEK2 is required for raf binding and regulates MEK function.

    PubMed

    Catling, A D; Schaeffer, H J; Reuter, C W; Reddy, G R; Weber, M J

    1995-10-01

    Mammalian MEK1 and MEK2 contain a proline-rich (PR) sequence that is absent both from the yeast homologs Ste7 and Byr1 and from a recently cloned activator of the JNK/stress-activated protein kinases, SEK1/MKK4. Since this PR sequence occurs in MEKs that are regulated by Raf family enzymes but is missing from MEKs and SEKs activated independently of Raf, we sought to investigate the role of this sequence in MEK1 and MEK2 regulation and function. Deletion of the PR sequence from MEK1 blocked the ability of MEK1 to associate with members of the Raf family and markedly attenuated activation of the protein in vivo following growth factor stimulation. In addition, this sequence was necessary for efficient activation of MEK1 in vitro by B-Raf but dispensable for activation by a novel MEK1 activator which we have previously detected in fractionated fibroblast extracts. Furthermore, we found that a phosphorylation site within the PR sequence of MEK1 was required for sustained MEK1 activity in response to serum stimulation of quiescent fibroblasts. Consistent with this observation, we observed that MEK2, which lacks a phosphorylation site at the corresponding position, was activated only transiently following serum stimulation. Finally, we found that deletion of the PR sequence from a constitutively activated MEK1 mutant rendered the protein nontransforming in Rat1 fibroblasts. These observations indicate a critical role for the PR sequence in directing specific protein-protein interactions important for the activation, inactivation, and downstream functioning of the MEKs.

  17. A proline-rich sequence unique to MEK1 and MEK2 is required for raf binding and regulates MEK function.

    PubMed Central

    Catling, A D; Schaeffer, H J; Reuter, C W; Reddy, G R; Weber, M J

    1995-01-01

    Mammalian MEK1 and MEK2 contain a proline-rich (PR) sequence that is absent both from the yeast homologs Ste7 and Byr1 and from a recently cloned activator of the JNK/stress-activated protein kinases, SEK1/MKK4. Since this PR sequence occurs in MEKs that are regulated by Raf family enzymes but is missing from MEKs and SEKs activated independently of Raf, we sought to investigate the role of this sequence in MEK1 and MEK2 regulation and function. Deletion of the PR sequence from MEK1 blocked the ability of MEK1 to associate with members of the Raf family and markedly attenuated activation of the protein in vivo following growth factor stimulation. In addition, this sequence was necessary for efficient activation of MEK1 in vitro by B-Raf but dispensable for activation by a novel MEK1 activator which we have previously detected in fractionated fibroblast extracts. Furthermore, we found that a phosphorylation site within the PR sequence of MEK1 was required for sustained MEK1 activity in response to serum stimulation of quiescent fibroblasts. Consistent with this observation, we observed that MEK2, which lacks a phosphorylation site at the corresponding position, was activated only transiently following serum stimulation. Finally, we found that deletion of the PR sequence from a constitutively activated MEK1 mutant rendered the protein nontransforming in Rat1 fibroblasts. These observations indicate a critical role for the PR sequence in directing specific protein-protein interactions important for the activation, inactivation, and downstream functioning of the MEKs. PMID:7565670

  18. Extraordinary Sequence Divergence at Tsga8, an X-linked Gene Involved in Mouse Spermiogenesis

    PubMed Central

    Good, Jeffrey M.; Vanderpool, Dan; Smith, Kimberly L.; Nachman, Michael W.

    2011-01-01

    The X chromosome plays an important role in both adaptive evolution and speciation. We used a molecular evolutionary screen of X-linked genes potentially involved in reproductive isolation in mice to identify putative targets of recurrent positive selection. We then sequenced five very rapidly evolving genes within and between several closely related species of mice in the genus Mus. All five genes were involved in male reproduction and four of the genes showed evidence of recurrent positive selection. The most remarkable evolutionary patterns were found at Testis-specific gene a8 (Tsga8), a spermatogenesis-specific gene expressed during postmeiotic chromatin condensation and nuclear transformation. Tsga8 was characterized by extremely high levels of insertion–deletion variation of an alanine-rich repetitive motif in natural populations of Mus domesticus and M. musculus, differing in length from the reference mouse genome by up to 89 amino acids (27% of the total protein length). This population-level variation was coupled with striking divergence in protein sequence and length between closely related mouse species. Although no clear orthologs had previously been described for Tsga8 in other mammalian species, we have identified a highly divergent hypothetical gene on the rat X chromosome that shares clear orthology with the 5′ and 3′ ends of Tsga8. Further inspection of this ortholog verified that it is expressed in rat testis and shares remarkable similarity with mouse Tsga8 across several general features of the protein sequence despite no conservation of nucleotide sequence across over 60% of the rat-coding domain. Overall, Tsga8 appears to be one of the most rapidly evolving genes to have been described in rodents. We discuss the potential evolutionary causes and functional implications of this extraordinary divergence and the possible contribution of Tsga8 and the other four genes we examined to reproductive isolation in mice. PMID:21186189

  19. External Guide Sequences Targeting the aac(6′)-Ib mRNA Induce Inhibition of Amikacin Resistance

    PubMed Central

    Bistué, Alfonso J. C. Soler; Ha, Hongphuc; Sarno, Renee; Don, Michelle; Zorreguieta, Angeles; Tolmasky, Marcelo E.

    2007-01-01

    The dissemination of AAC(6′)-I-type acetyltransferases has rendered amikacin and other aminoglycosides all but useless in some parts of the world. Antisense technologies could be an alternative to extend the life of these antibiotics. External guide sequences are short antisense oligoribonucleotides that induce RNase P-mediated cleavage of a target RNA by forming a precursor tRNA-like complex. Thirteen-nucleotide external guide sequences complementary to locations within five regions accessible for interaction with antisense oligonucleotides in the mRNA that encodes AAC(6′)-Ib were analyzed. While small variations in the location targeted by different external guide sequences resulted in big changes in efficiency of binding to native aac(6′)-Ib mRNA, most of them induced high levels of RNase P-mediated cleavage in vitro. Recombinant plasmids coding for selected external guide sequences were introduced into Escherichia coli harboring aac(6′)-Ib, and the transformant strains were tested to determine their resistance to amikacin. The two external guide sequences that showed the strongest binding efficiency to the mRNA in vitro, EGSC3 and EGSA2, interfered with expression of the resistance phenotype to different degrees. Growth curve experiments showed that E. coli cells harboring a plasmid coding for EGSC3, the external guide sequence with the highest mRNA binding affinity in vitro, did not grow for at least 300 min in the presence of 15 μg of amikacin/ml. EGSA2, which had a lower mRNA-binding affinity in vitro than EGSC3, inhibited the expression of amikacin resistance to a lesser extent; growth of E. coli harboring a plasmid coding for EGSA2 in the presence of 15 μg of amikacin/ml was undetectable for 200 min but reached an optical density at 600 nm of 0.5 after 5 h of incubation. Our results indicate that the use of external guide sequences could be a viable strategy to preserve the efficacy of amikacin. PMID:17387154

  20. Organic- and carbonate-rich soil formation ~2.6 billion years ago at Schagen, East Transvaal district, South Africa

    NASA Astrophysics Data System (ADS)

    Watanabe, Yumiko; Stewart, Brian W.; Ohmoto, Hiroshi

    2004-05-01

    A ~17-m paleosol sequence at Schagen, South Africa, which developed on a serpentinized dunite intrusion in a granite-gneiss terrain ~2.6 Ga ago, is characterized by an alternating succession of thick (~1-3 m) carbonate-rich (dolomite and calcite) zones and silicate-rich (serpentines, talc, and quartz) zones; the upper ~8 m section is especially rich in organic C (up to ~1.4 wt.%). Petrologic and geochemical data suggest the upper ~8 m section is composed of at least three soil profiles that developed on: (i) silicate-rich rock fragments (and minerals) that were transported from local sources (serpentinite and granite) by fluvial and/or eolian processes; and (ii) dolomite and calcite zones that formed by locally discharged groundwater. The Mg and Fe in the paleosol sequence were largely supplied from local sources (mostly serpentinite), but the Ca, Sr, and HCO3- were supplied by groundwater that originated from a surrounding granite-gneiss terrain. In the uppermost soil profile, the Fe is retained, the Fe3+/Fe2+ ratio increases, and ferri-stilpnomelane is abundant. These data suggest the atmospheric pO2 was much greater than ~10^-3.7 atm (>0.1% present atmospheric level [PAL]). The carbonaceous matter in the soils is intimately associated with clays (talc, chlorite, and ferri-stilpnomelane) and occurs mostly as seams (20 μm to 1 mm thick) that parallel the soil horizons. These occurrences, crystallographic structures, H/C ratios, and δ13Corg values (-17.4 to -14.4‰ PDB) suggest that the carbonaceous matter is a remnant of in situ microbial mats, originally ~1 to ~20 mm thick. The microbial mats developed: (a) mostly on soil surfaces during the formation of silicate-rich soils, and (b) at the bottom of an evaporating, anoxic, alkaline pond during the precipitation of the Fe-rich dolomite. These δ13Corg values are difficult to explain by a current popular idea of a methane- and organic haze-rich Archean atmosphere (Pavlov et al., 2001

  1. Effects of metal-rich particulate matter exposure on exogenous and endogenous viral sequence methylation in healthy steel-workers.

    PubMed

    Mercorio, Roberta; Bonzini, Matteo; Angelici, Laura; Iodice, Simona; Delbue, Serena; Mariani, Jacopo; Apostoli, Pietro; Pesatori, Angela Cecilia; Bollati, Valentina

    2017-11-01

    Inhaled particles have been shown to produce systemic changes in DNA methylation. Global hypomethylation has been associated to viral sequence reactivation, possibly linked to the activation of pro-inflammatory pathways occurring after exposure. This observation provides a rationale to investigate viral sequence (both exogenous and endogenous) methylation in association to metal-rich particulate matter exposure. To verify this hypothesis, we chose the Wp promoter of the Epstein-Barr Virus (EBV-Wp) and the promoter of the human-endogenous-retrovirus w (HERV-w), respectively as a paradigm of an exogenous and an endogenous retroviral sequence, to be investigated by bisulfite PCR Pyrosequencing. We enrolled 63 male workers in an electric furnace steel plant, exposed to high level of metal-rich particulate matter. Comparing samples obtained in the first day of a work week (time 0-baseline, after 2 days off work) and the samples obtained after 3 days of work (time 1-post exposure), the mean methylation of EBV-Wp was significantly higher at baseline compared to post-exposure (mean baseline = 56.7%5mC; mean post-exposure = 47.9%5mC; p-value = 0.009), whereas the mean methylation of HERV-w did not significantly differ. Individual exposure to inhalable particles and metals was estimated based on measures in all working areas and time spent by the study subjects in each area. In a regression model adjusted for age, body mass index and smoking, PM and metal components had a positive association with EBV-Wp methylation (i.e. PM10: β = 5.99, p-value < 0.038; nickel: β = 17.82, p-value = 0.02; arsenic: β = 13.59, p-value < 0.015). The difference observed comparing baseline and post-exposure samples may be suggestive of a rapid change in EBV methylation induced by air particles, while correlation between EBV methylation and PM/metal exposure may represent a more stable adaptive mechanism. Future studies investigating a larger panel of viral sequences could better elucidate

  2. '2A-Like' Signal Sequences Mediating Translational Recoding: A Novel Form of Dual Protein Targeting.

    PubMed

    Roulston, Claire; Luke, Garry A; de Felipe, Pablo; Ruan, Lin; Cope, Jonathan; Nicholson, John; Sukhodub, Andriy; Tilsner, Jens; Ryan, Martin D

    2016-08-01

    We report the initial characterization of an N-terminal oligopeptide '2A-like' sequence that is able to function both as a signal sequence and as a translational recoding element. Owing to this translational recoding activity, two forms of nascent polypeptide are synthesized: (i) when 2A-mediated translational recoding has not occurred: the nascent polypeptide is fused to the 2A-like N-terminal signal sequence and the fusion translation product is targeted to the exocytic pathway, and, (ii) a translation product where 2A-mediated translational recoding has occurred: the 2A-like signal sequence is synthesized as a separate translation product and, therefore, the nascent (downstream) polypeptide lacks the 2A-like signal sequence and is localized to the cytoplasm. This type of dual-functional signal sequence results, therefore, in the partitioning of the translation products between the two sub-cellular sites and represents a newly described form of dual protein targeting. © 2016 The Authors. Traffic published by John Wiley & Sons Ltd.

  3. Rapid molecular diagnostics of severe primary immunodeficiency determined by using targeted next-generation sequencing.

    PubMed

    Yu, Hui; Zhang, Victor Wei; Stray-Pedersen, Asbjørg; Hanson, Imelda Celine; Forbes, Lisa R; de la Morena, M Teresa; Chinn, Ivan K; Gorman, Elizabeth; Mendelsohn, Nancy J; Pozos, Tamara; Wiszniewski, Wojciech; Nicholas, Sarah K; Yates, Anne B; Moore, Lindsey E; Berge, Knut Erik; Sorte, Hanne; Bayer, Diana K; ALZahrani, Daifulah; Geha, Raif S; Feng, Yanming; Wang, Guoli; Orange, Jordan S; Lupski, James R; Wang, Jing; Wong, Lee-Jun

    2016-10-01

    Primary immunodeficiency diseases (PIDDs) are inherited disorders of the immune system. The most severe form, severe combined immunodeficiency (SCID), presents with profound deficiencies of T cells, B cells, or both at birth. If not treated promptly, affected patients usually do not live beyond infancy because of infections. Genetic heterogeneity of SCID frequently delays the diagnosis; a specific diagnosis is crucial for life-saving treatment and optimal management. We developed a next-generation sequencing (NGS)-based multigene-targeted panel for SCID and other severe PIDDs requiring rapid therapeutic actions in a clinical laboratory setting. The target gene capture/NGS assay provides an average read depth of approximately 1000×. The deep coverage facilitates simultaneous detection of single nucleotide variants and exonic copy number variants in one comprehensive assessment. Exons with insufficient coverage (<20× read depth) or high sequence homology (pseudogenes) are complemented by amplicon-based sequencing with specific primers to ensure 100% coverage of all targeted regions. Analysis of 20 patient samples with low T-cell receptor excision circle numbers on newborn screening or a positive family history or clinical suspicion of SCID or other severe PIDD identified deleterious mutations in 14 of them. Identified pathogenic variants included both single nucleotide variants and exonic copy number variants, such as hemizygous nonsense, frameshift, and missense changes in IL2RG; compound heterozygous changes in ATM, RAG1, and CIITA; homozygous changes in DCLRE1C and IL7R; and a heterozygous nonsense mutation in CHD7. High-throughput deep sequencing analysis with complete clinical validation greatly increases the diagnostic yield of severe primary immunodeficiency. Establishing a molecular diagnosis enables early immune reconstitution through prompt therapeutic intervention and guides management for improved long-term quality of life. Copyright © 2016 American

  4. Targeted sequencing of plant genomes

    Treesearch

    Mark D. Huynh

    2014-01-01

    Next-generation sequencing (NGS) has revolutionized the field of genetics by providing a means for fast and relatively affordable sequencing. With the advancement of NGS, whole-genome sequencing (WGS) has become more commonplace. However, sequencing an entire genome is still not cost effective or even beneficial in all cases. In studies that do not require a whole-...

  5. Non-Adjacent Consonant Sequence Patterns in English Target Words during the First-Word Period

    ERIC Educational Resources Information Center

    Aoyama, Katsura; Davis, Barbara L.

    2017-01-01

    The goal of this study was to investigate non-adjacent consonant sequence patterns in target words during the first-word period in infants learning American English. In the spontaneous speech of eighteen participants, target words with a Consonant-Vowel-Consonant (C[subscript 1]VC[subscript 2]) shape were analyzed. Target words were grouped into…

  6. Experimental and statistical post-validation of positive example EST sequences carrying peroxisome targeting signals type 1 (PTS1).

    PubMed

    Lingner, Thomas; Kataya, Amr R A; Reumann, Sigrun

    2012-02-01

    We recently developed the first algorithms specifically for plants to predict proteins carrying peroxisome targeting signals type 1 (PTS1) from genome sequences. As validated experimentally, the prediction methods are able to correctly predict unknown peroxisomal Arabidopsis proteins and to infer novel PTS1 tripeptides. The high prediction performance is primarily determined by the large number and sequence diversity of the underlying positive example sequences, which were mainly derived from EST databases. However, a few constructs remained cytosolic in experimental validation studies, indicating sequencing errors in some ESTs. To identify erroneous sequences, we validated subcellular targeting of additional positive example sequences in the present study. Moreover, we analyzed the distribution of prediction scores separately for each orthologous group of PTS1 proteins, which generally resembled normal distributions with group-specific mean values. The cytosolic sequences commonly represented outliers of low prediction scores and were located at the very tail of a fitted normal distribution. Three statistical methods for identifying outliers were compared in terms of sensitivity and specificity. Their combined application allows elimination of erroneous ESTs from positive example data sets. This new post-validation method will further improve the prediction accuracy of both PTS1 and PTS2 protein prediction models for plants, fungi, and mammals.
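
    The outlier idea in the abstract above (cytosolic sequences sitting in the low tail of a fitted normal distribution of prediction scores) can be illustrated with a minimal sketch. The abstract does not name the three statistical methods that were compared, so the z-score rule below is a stand-in chosen for illustration, not the authors' method. The function name, the cutoff of -2.0, and the score values are all hypothetical.

```python
from statistics import mean, stdev


def low_score_outliers(scores, z_cutoff=-2.0):
    """Flag sequences whose prediction score lies far in the lower tail.

    scores   -- dict mapping a sequence name to its prediction score,
                for ONE orthologous group (hypothetical input format)
    z_cutoff -- assumed threshold; scores with z below it are flagged
    """
    values = list(scores.values())
    mu = mean(values)          # group-specific mean
    sigma = stdev(values)      # sample standard deviation
    if sigma == 0:
        return []              # all scores identical: nothing to flag
    flagged = []
    for name, score in scores.items():
        z = (score - mu) / sigma
        if z < z_cutoff:
            flagged.append(name)
    return flagged


# Hypothetical scores for one orthologous group; est_j is deliberately low.
group_scores = {
    "est_a": 0.78, "est_b": 0.82, "est_c": 0.80, "est_d": 0.79,
    "est_e": 0.81, "est_f": 0.83, "est_g": 0.77, "est_h": 0.80,
    "est_i": 0.80, "est_j": 0.10,
}
suspect = low_score_outliers(group_scores)
```

    Because a single extreme value inflates the sample standard deviation, a plain z-score rule loses sensitivity in small groups; this may be one reason to combine several outlier tests, as the abstract reports doing.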

  7. Experimental and statistical post-validation of positive example EST sequences carrying peroxisome targeting signals type 1 (PTS1)

    PubMed Central

    Lingner, Thomas; Kataya, Amr R. A.; Reumann, Sigrun

    2012-01-01

    We recently developed the first algorithms specifically for plants to predict proteins carrying peroxisome targeting signals type 1 (PTS1) from genome sequences. As validated experimentally, the prediction methods are able to correctly predict unknown peroxisomal Arabidopsis proteins and to infer novel PTS1 tripeptides. The high prediction performance is primarily determined by the large number and sequence diversity of the underlying positive example sequences, which were mainly derived from EST databases. However, a few constructs remained cytosolic in experimental validation studies, indicating sequencing errors in some ESTs. To identify erroneous sequences, we validated subcellular targeting of additional positive example sequences in the present study. Moreover, we analyzed the distribution of prediction scores separately for each orthologous group of PTS1 proteins, which generally resembled normal distributions with group-specific mean values. The cytosolic sequences commonly represented outliers of low prediction scores and were located at the very tail of a fitted normal distribution. Three statistical methods for identifying outliers were compared in terms of sensitivity and specificity. Their combined application allows elimination of erroneous ESTs from positive example data sets. This new post-validation method will further improve the prediction accuracy of both PTS1 and PTS2 protein prediction models for plants, fungi, and mammals. PMID:22415050

  8. Influence of quasi-specific sites on kinetics of target DNA search by a sequence-specific DNA-binding protein.

    PubMed

    Kemme, Catherine A; Esadze, Alexandre; Iwahara, Junji

    2015-11-10

    Functions of transcription factors require formation of specific complexes at particular sites in cis-regulatory elements of genes. However, chromosomal DNA contains numerous sites that are similar to the target sequences recognized by transcription factors. The influence of such "quasi-specific" sites on functions of the transcription factors is not well understood at present by experimental means. In this work, using fluorescence methods, we have investigated the influence of quasi-specific DNA sites on the efficiency of target location by the zinc finger DNA-binding domain of the inducible transcription factor Egr-1, which recognizes a 9 bp sequence. By stopped-flow assays, we measured the kinetics of Egr-1's association with a target site on 143 bp DNA in the presence of various competitor DNAs, including nonspecific and quasi-specific sites. The presence of quasi-specific sites on competitor DNA significantly decelerated the target association by the Egr-1 protein. The impact of the quasi-specific sites depended strongly on their affinity, their concentration, and the degree of their binding to the protein. To quantitatively describe the kinetic impact of the quasi-specific sites, we derived an analytical form of the apparent kinetic rate constant for the target association and used it for fitting to the experimental data. Our kinetic data with calf thymus DNA as a competitor suggested that there are millions of high-affinity quasi-specific sites for Egr-1 among the 3 billion bp of genomic DNA. This study quantitatively demonstrates that naturally abundant quasi-specific sites on DNA can considerably impede the target search processes of sequence-specific DNA-binding proteins.

  9. Influence of Quasi-Specific Sites on Kinetics of Target DNA Search by a Sequence-Specific DNA-Binding Protein

    PubMed Central

    2015-01-01

    Functions of transcription factors require formation of specific complexes at particular sites in cis-regulatory elements of genes. However, chromosomal DNA contains numerous sites that are similar to the target sequences recognized by transcription factors. The influence of such “quasi-specific” sites on functions of the transcription factors is not well understood at present by experimental means. In this work, using fluorescence methods, we have investigated the influence of quasi-specific DNA sites on the efficiency of target location by the zinc finger DNA-binding domain of the inducible transcription factor Egr-1, which recognizes a 9 bp sequence. By stopped-flow assays, we measured the kinetics of Egr-1’s association with a target site on 143 bp DNA in the presence of various competitor DNAs, including nonspecific and quasi-specific sites. The presence of quasi-specific sites on competitor DNA significantly decelerated the target association by the Egr-1 protein. The impact of the quasi-specific sites depended strongly on their affinity, their concentration, and the degree of their binding to the protein. To quantitatively describe the kinetic impact of the quasi-specific sites, we derived an analytical form of the apparent kinetic rate constant for the target association and used it for fitting to the experimental data. Our kinetic data with calf thymus DNA as a competitor suggested that there are millions of high-affinity quasi-specific sites for Egr-1 among the 3 billion bp of genomic DNA. This study quantitatively demonstrates that naturally abundant quasi-specific sites on DNA can considerably impede the target search processes of sequence-specific DNA-binding proteins. PMID:26502071

  10. High flux, beamed neutron sources employing deuteron-rich ion beams from D2O-ice layered targets

    NASA Astrophysics Data System (ADS)

    Alejo, A.; Krygier, A. G.; Ahmed, H.; Morrison, J. T.; Clarke, R. J.; Fuchs, J.; Green, A.; Green, J. S.; Jung, D.; Kleinschmidt, A.; Najmudin, Z.; Nakamura, H.; Norreys, P.; Notley, M.; Oliver, M.; Roth, M.; Vassura, L.; Zepf, M.; Borghesi, M.; Freeman, R. R.; Kar, S.

    2017-06-01

    A forwardly-peaked bright neutron source was produced using a laser-driven, deuteron-rich ion beam in a pitcher-catcher scenario. A proton-free ion source was produced via target normal sheath acceleration from Au foils having a thin layer of D2O ice at the rear side, irradiated by sub-petawatt laser pulses (~200 J, ~750 fs) at peak intensity ~2 × 10^20 W cm^-2. The neutrons were preferentially produced in a beam of ~70° FWHM cone along the ion beam forward direction, with maximum energy up to ~40 MeV and a peak flux along the axis ~2 × 10^9 n sr^-1 for neutron energy above 2.5 MeV. The experimental data is in good agreement with the simulations carried out for the d(d,n)3He reaction using the deuteron beam produced by the ice-layered target.

  11. In silico Analysis of 3′-End-Processing Signals in Aspergillus oryzae Using Expressed Sequence Tags and Genomic Sequencing Data

    PubMed Central

    Tanaka, Mizuki; Sakai, Yoshifumi; Yamada, Osamu; Shintani, Takahiro; Gomi, Katsuya

    2011-01-01

    To investigate 3′-end-processing signals in Aspergillus oryzae, we created a nucleotide sequence data set of the 3′-untranslated region (3′ UTR) plus 100 nucleotides (nt) sequence downstream of the poly(A) site using A. oryzae expressed sequence tags and genomic sequencing data. This data set comprised 1065 sequences derived from 1042 unique genes. The average 3′ UTR length in A. oryzae was 241 nt, which is greater than that in yeast but similar to that in plants. The 3′ UTR and 100 nt sequence downstream of the poly(A) site is notably U-rich, while the region located 15–30 nt upstream of the poly(A) site is markedly A-rich. The most frequently found hexanucleotide in this A-rich region is AAUGAA, although this sequence accounts for only 6% of all transcripts. These data suggested that A. oryzae has no highly conserved sequence element equivalent to AAUAAA, a mammalian polyadenylation signal. We identified that putative 3′-end-processing signals in A. oryzae, while less well conserved than those in mammals, comprised four sequence elements: the furthest upstream U-rich element, A-rich sequence, cleavage site, and downstream U-rich element flanking the cleavage site. Although these putative 3′-end-processing signals are similar to those in yeast and plants, some notable differences exist between them. PMID:21586533
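
    The hexanucleotide survey described above (finding the most frequent hexamer 15–30 nt upstream of the poly(A) site and the share of transcripts carrying it) can be sketched as follows. This is an illustrative reconstruction under stated assumptions, not the authors' pipeline: the function name is hypothetical, each input sequence is assumed to end exactly at the poly(A) site (so the last base is position 1 upstream), and a hexamer is counted once per transcript.

```python
from collections import Counter


def hexamer_presence(sequences, nearest=15, farthest=30, k=6):
    """Count, per k-mer, how many sequences contain it in the upstream window.

    sequences -- iterable of 3' UTR strings ending at the poly(A) site
                 (assumed convention: last base = position 1 upstream)
    nearest, farthest -- window bounds in nt upstream of the poly(A) site
    Returns (Counter of k-mer -> number of sequences, sequences used).
    """
    counts = Counter()
    n_used = 0
    for seq in sequences:
        rna = seq.upper().replace("T", "U")   # report in RNA alphabet
        if len(rna) < farthest:
            continue                          # too short to cover the window
        n_used += 1
        start = len(rna) - farthest           # index of position `farthest`
        end = len(rna) - nearest + 1          # one past position `nearest`
        window = rna[start:end]
        # Use a set so each k-mer counts at most once per transcript.
        seen = {window[i:i + k] for i in range(len(window) - k + 1)}
        counts.update(seen)
    return counts, n_used


if __name__ == "__main__":
    # Hypothetical toy input, not data from the study.
    toy = ["G" * 20 + "CCCCCAATGAACCCCC" + "C" * 14, "G" * 50]
    counts, n_used = hexamer_presence(toy)
    for hexamer, n in counts.most_common(3):
        print(hexamer, n, f"{100 * n / n_used:.1f}% of transcripts")
```

    Dividing each count by the number of usable sequences gives a per-transcript percentage comparable in kind to the 6% figure quoted for AAUGAA.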

  12. Contribution of the first K-homology domain of poly(C)-binding protein 1 to its affinity and specificity for C-rich oligonucleotides.

    PubMed

    Yoga, Yano M K; Traore, Daouda A K; Sidiqi, Mahjooba; Szeto, Chris; Pendini, Nicole R; Barker, Andrew; Leedman, Peter J; Wilce, Jacqueline A; Wilce, Matthew C J

    2012-06-01

    Poly-C-binding proteins are triple KH (hnRNP K homology) domain proteins with specificity for single stranded C-rich RNA and DNA. They play diverse roles in the regulation of protein expression at both transcriptional and translational levels. Here, we analyse the contributions of individual αCP1 KH domains to binding C-rich oligonucleotides using biophysical and structural methods. Using surface plasmon resonance (SPR), we demonstrate that KH1 makes the most stable interactions with both RNA and DNA, KH3 binds with intermediate affinity and KH2 only interacts detectably with DNA. The crystal structure of KH1 bound to a 5'-CCCTCCCT-3' DNA sequence shows a 2:1 protein:DNA stoichiometry and demonstrates a molecular arrangement of KH domains bound to immediately adjacent oligonucleotide target sites. SPR experiments with a series of poly-C sequences reveal that cytosine is preferred at all four positions in the oligonucleotide binding cleft and that a C-tetrad binds KH1 with 10 times higher affinity than a C-triplet. The basis for this high affinity interaction is finally detailed with the structure determination of a KH1.W.C54S mutant bound to a 5'-ACCCCA-3' DNA sequence. Together, these data establish the lead role of KH1 in oligonucleotide binding by αCP1 and reveal the molecular basis of its specificity for a C-rich tetrad.

  13. Targeted next generation sequencing for molecular diagnosis of Usher syndrome.

    PubMed

    Aparisi, María J; Aller, Elena; Fuster-García, Carla; García-García, Gema; Rodrigo, Regina; Vázquez-Manrique, Rafael P; Blanco-Kelly, Fiona; Ayuso, Carmen; Roux, Anne-Françoise; Jaijo, Teresa; Millán, José M

    2014-11-18

    Usher syndrome is an autosomal recessive disease that associates sensorineural hearing loss, retinitis pigmentosa and, in some cases, vestibular dysfunction. It is clinically and genetically heterogeneous. To date, 10 genes have been associated with the disease, making molecular diagnosis based on Sanger sequencing expensive and time-consuming. Consequently, the aim of the present study was to develop a molecular diagnostics method for Usher syndrome, based on targeted next generation sequencing. A custom HaloPlex panel for Illumina platforms was designed to capture all exons of the 10 known causative Usher syndrome genes (MYO7A, USH1C, CDH23, PCDH15, USH1G, CIB2, USH2A, GPR98, DFNB31 and CLRN1), the two Usher syndrome-related genes (HARS and PDZD7) and the two candidate genes VEZT and MYO15A. A cohort of 44 patients suffering from Usher syndrome was selected for this study. This cohort was divided into two groups: a test group of 11 patients with known mutations and another group of 33 patients with unknown mutations. Forty USH patients were successfully sequenced, 8 USH patients from the test group and 32 patients from the group composed of USH patients without genetic diagnosis. We were able to detect biallelic mutations in one USH gene in 22 out of 32 USH patients (68.75%) and to identify 79.7% of the expected mutated alleles. Fifty-three different mutations were detected. These mutations included 21 missense, 8 nonsense, 9 frameshifts, 9 intronic mutations and 6 large rearrangements. Targeted next generation sequencing allowed us to detect both point mutations and large rearrangements in a single experiment, minimizing the economic cost of the study, increasing the detection ratio of the genetic cause of the disease and improving the genetic diagnosis of Usher syndrome patients.

  14. Species richness of arbuscular mycorrhizal fungi: associations with grassland plant richness and biomass.

    PubMed

    Hiiesalu, Inga; Pärtel, Meelis; Davison, John; Gerhold, Pille; Metsis, Madis; Moora, Mari; Öpik, Maarja; Vasar, Martti; Zobel, Martin; Wilson, Scott D

    2014-07-01

    Although experiments show a positive association between vascular plant and arbuscular mycorrhizal fungal (AMF) species richness, evidence from natural ecosystems is scarce. Furthermore, there is little knowledge about how AMF richness varies with belowground plant richness and biomass. We examined relationships among AMF richness, above- and belowground plant richness, and plant root and shoot biomass in a native North American grassland. Root-colonizing AMF richness and belowground plant richness were detected from the same bulk root samples by 454-sequencing of the AMF SSU rRNA and plant trnL genes. In total we detected 63 AMF taxa. Plant richness was 1.5 times greater belowground than aboveground. AMF richness was significantly positively correlated with plant species richness, and more strongly with below- than aboveground plant richness. Belowground plant richness was positively correlated with belowground plant biomass and total plant biomass, whereas aboveground plant richness was positively correlated only with belowground plant biomass. By contrast, AMF richness was negatively correlated with belowground and total plant biomass. Our results indicate that AMF richness and plant belowground richness are more strongly related with each other and with plant community biomass than with the plant aboveground richness measures that have been almost exclusively considered to date. © 2014 The Authors. New Phytologist © 2014 New Phytologist Trust.

  15. TP53, PIK3CA, FBXW7 and KRAS Mutations in Esophageal Cancer Identified by Targeted Sequencing.

    PubMed

    Zheng, Huili; Wang, Yan; Tang, Chuanning; Jones, Lindsey; Ye, Hua; Zhang, Guangchun; Cao, Weihai; Li, Jingwen; Liu, Lifeng; Liu, Zhencong; Zhang, Chao; Lou, Feng; Liu, Zhiyuan; Li, Yangyang; Shi, Zhenfen; Zhang, Jingbo; Zhang, Dandan; Sun, Hong; Dong, Haichao; Dong, Zhishou; Guo, Baishuai; Yan, H E; Lu, Qingyu; Huang, Xue; Chen, Si-Yi

    2016-01-01

    Esophageal cancer (EC) is a common malignancy with significant morbidity and mortality. As individual cancers exhibit unique mutation patterns, identifying and characterizing gene mutations in EC that may serve as biomarkers might help predict patient outcome and guide treatment. Traditionally, personalized cancer DNA sequencing was impractical and expensive. Recent technological advancements have made targeted DNA sequencing more cost- and time-effective with reliable results. This technology may be useful for clinicians to direct patient treatment. The Ion PGM and AmpliSeq Cancer Panel was used to identify mutations at 737 hotspot loci of 45 cancer-related genes in 64 EC samples from Chinese patients. Frequent mutations were found in TP53 and less frequent mutations in PIK3CA, FBXW7 and KRAS. These results demonstrate that targeted sequencing can reliably identify mutations in individual tumors that make this technology a possibility for clinical use. Copyright© 2016, International Institute of Anticancer Research (Dr. John G. Delinasios), All rights reserved.

  16. 40Ar/39Ar dating of a Langhian biotite-rich clay layer in the pelagic sequence of the Cònero Riviera, Ancona, Italy

    NASA Astrophysics Data System (ADS)

    Mader, Dieter; Montanari, Alessandro; Gattacceca, Jérôme; Koeberl, Christian; Handler, Robert; Coccioni, Rodolfo

    2001-12-01

    A nearly complete and undisturbed Miocene carbonate sequence is present in the easternmost part of the Umbria-Marche basin, Italy, which is ideal for detailed and integrated stratigraphic investigations of the Miocene Epoch. In this study, we sought evidence for the presence or absence of distal ejecta from the 15 Ma Ries impact structure in southern Germany, located about 600 km to the north-northwest of the Umbria-Marche basin. The first step is to find coeval strata in the Umbria-Marche sequence. At the La Vedova section, Cònero Riviera, we dated a volcaniclastic biotite-rich clay layer, the Aldo Level, which is situated within planktonic foraminiferal Zone N8, at 14.9±0.2 Ma, using the 40Ar/39Ar method. Together with detailed geologic and stratigraphic information about the Aldo Level, the resulting age can be used confidently to calibrate the Langhian stage. Besides providing new constraints on Miocene geochronology, this age can now be used for impact stratigraphic studies. To directly correlate the biotite ages of the La Vedova section with rocks from the Ries impact event, Ries impact glass was also analyzed and found to be coeval. Although unrelated to this impact event, the biotite-rich clay layer should help in the search for evidence of distal ejecta related to the Ries crater.

  17. A flexible and economical barcoding approach for highly multiplexed amplicon sequencing of diverse target genes

    PubMed Central

    Herbold, Craig W.; Pelikan, Claus; Kuzyk, Orest; Hausmann, Bela; Angel, Roey; Berry, David; Loy, Alexander

    2015-01-01

    High throughput sequencing of phylogenetic and functional gene amplicons provides tremendous insight into the structure and functional potential of complex microbial communities. Here, we introduce a highly adaptable and economical PCR approach to barcoding and pooling libraries of numerous target genes. In this approach, we replace gene- and sequencing platform-specific fusion primers with general, interchangeable barcoding primers, enabling nearly limitless customized barcode-primer combinations. Compared to barcoding with long fusion primers, our multiple-target gene approach is more economical because it requires fewer primers overall and is based on short primers with generally lower synthesis and purification costs. To highlight our approach, we pooled over 900 different small-subunit rRNA and functional gene amplicon libraries obtained from various environmental or host-associated microbial community samples into a single, paired-end Illumina MiSeq run. Although the amplicon regions ranged in size from approximately 290 to 720 bp, we found no significant systematic sequencing bias related to amplicon length or gene target. Our results indicate that this flexible multiplexing approach produces large, diverse, and high quality sets of amplicon sequence data for modern studies in microbial ecology. PMID:26236305
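
    The economy argument in the abstract above (interchangeable barcoding primers need fewer oligonucleotides than gene-specific fusion primers) comes down to a product versus a sum. The sketch below is only an illustration under simplifying assumptions that are not taken from the paper: one barcoded primer per library (single-end barcoding), one unbarcoded reverse primer per target in the fusion design, and one forward plus one reverse gene primer per target in the two-step design. The function name and the example numbers are hypothetical.

```python
def primer_counts(n_barcodes, n_targets):
    """Compare primer counts for two barcoding designs (illustrative model).

    Fusion design (assumed): every barcode is synthesized on every target's
    forward primer, plus one plain reverse primer per target.
    Two-step design (assumed): two gene-specific primers per target carrying
    a shared head sequence, plus one reusable barcoding primer per barcode.
    """
    fusion = n_barcodes * n_targets + n_targets
    two_step = 2 * n_targets + n_barcodes
    return fusion, two_step


# Hypothetical example: 96 barcodes shared across 10 target genes.
fusion, two_step = primer_counts(96, 10)
print(f"fusion primers: {fusion}, two-step primers: {two_step}")
```

    Under these assumptions the fusion count grows with the product of barcodes and targets, while the two-step count grows with their sum, so the gap widens as more target genes are added.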

  18. Rational Design of Small Molecules Targeting Oncogenic Noncoding RNAs from Sequence.

    PubMed

    Disney, Matthew D; Angelbello, Alicia J

    2016-12-20

    The discovery of RNA catalysis in the 1980s and the dissemination of the human genome sequence at the start of this century inspired investigations of the regulatory roles of noncoding RNAs in biology. In fact, the Encyclopedia of DNA Elements (ENCODE) project has shown that only 1-2% of the human genome encodes protein, yet 75% is transcribed into RNA. Functional studies both preceding and following the ENCODE project have shown that these noncoding RNAs have important roles in regulating gene expression, developmental timing, and other critical functions. RNA's diverse roles are often a consequence of the various folds that it adopts. The single-stranded nature of the biopolymer enables it to adopt intramolecular folds with noncanonical pairings to lower its free energy. These folds can be scaffolds to bind proteins or to form frameworks to interact with other RNAs. Not surprisingly, dysregulation of certain noncoding RNAs has been shown to be causative of disease. Given this as the background, it is easy to see why it would be useful to develop methods that target RNA and manipulate its biology in rational and predictable ways. The antisense approach has afforded strategies to target RNAs via Watson-Crick base pairing and has typically focused on targeting partially unstructured regions of RNA. Small molecule strategies to target RNA would be desirable not only because compounds could be lead optimized via medicinal chemistry but also because structured regions within an RNA of interest could be targeted to directly interfere with RNA folds that contribute to disease. Additionally, small molecules have historically been the most successful drug candidates. Until recently, the ability to design small molecules that target non-ribosomal RNAs has been elusive, creating the perception that they are "undruggable". In this Account, approaches to demystify targeting RNA with small molecules are described. Rather than bulk screening for compounds that bind to singular

  19. Method to amplify variable sequences without imposing primer sequences

    DOEpatents

    Bradbury, Andrew M.; Zeytun, Ahmet

    2006-11-14

    The present invention provides methods of amplifying target sequences without including regions flanking the target sequence in the amplified product or imposing amplification primer sequences on the amplified product. Also provided are methods of preparing a library from such amplified target sequences.

  20. MicroRNA-128 targets myostatin at coding domain sequence to regulate myoblasts in skeletal muscle development.

    PubMed

    Shi, Lei; Zhou, Bo; Li, Pinghua; Schinckel, Allan P; Liang, Tingting; Wang, Han; Li, Huizhi; Fu, Lingling; Chu, Qingpo; Huang, Ruihua

    2015-09-01

    MicroRNAs (miRNAs or miRs) play a critical role in skeletal muscle development. In a previous study we observed that miR-128 was highly expressed in skeletal muscle. However, its function in regulating skeletal muscle development is not clear. Our hypothesis was that miR-128 is involved in the regulation of the proliferation and differentiation of skeletal myoblasts. In this study, through bioinformatics analyses, we demonstrate that miR-128 specifically targeted the mRNA of myostatin (MSTN), a critical inhibitor of skeletal myogenesis, at the coding domain sequence (CDS) region, resulting in post-transcriptional down-regulation of myostatin. Overexpression of miR-128 inhibited proliferation of mouse C2C12 myoblast cells but promoted myotube formation, whereas knockdown of miR-128 had completely opposite effects. In addition, ectopic miR-128 regulated the expression of myogenic factor 5 (Myf5), myogenin (MyoG), paired box (Pax) 3 and 7. Furthermore, an inverse relationship was found between the expression of miR-128 and MSTN protein expression in vivo and in vitro. Taken together, these results reveal that there is a novel pathway in skeletal muscle development in which miR-128 regulates myostatin at the CDS region to inhibit proliferation but promote differentiation of myoblast cells. Copyright © 2015 Elsevier Inc. All rights reserved.

  1. Whole-exome sequencing and targeted gene sequencing provide insights into the role of PALB2 as a male breast cancer susceptibility gene.

    PubMed

    Silvestri, Valentina; Zelli, Veronica; Valentini, Virginia; Rizzolo, Piera; Navazio, Anna Sara; Coppa, Anna; Agata, Simona; Oliani, Cristina; Barana, Daniela; Castrignanò, Tiziana; Viel, Alessandra; Russo, Antonio; Tibiletti, Maria Grazia; Zanna, Ines; Masala, Giovanna; Cortesi, Laura; Manoukian, Siranoush; Azzollini, Jacopo; Peissel, Bernard; Bonanni, Bernardo; Peterlongo, Paolo; Radice, Paolo; Palli, Domenico; Giannini, Giuseppe; Chillemi, Giovanni; Montagna, Marco; Ottini, Laura

    2017-01-01

    Male breast cancer (MBC) is a rare disease whose etiology appears to be largely associated with genetic factors. BRCA1 and BRCA2 mutations account for about 10% of all MBC cases. Thus, a fraction of MBC cases are expected to be due to genetic factors not yet identified. To further explain the genetic susceptibility for MBC, whole-exome sequencing (WES) and targeted gene sequencing were applied to high-risk, BRCA1/2 mutation-negative MBC cases. Germ-line DNA of 1 male and 2 female BRCA1/2 mutation-negative breast cancer (BC) cases from a pedigree showing a first-degree family history of MBC was analyzed with WES. Targeted gene sequencing for the validation of WES results was performed for 48 high-risk, BRCA1/2 mutation-negative MBC cases from an Italian multicenter study of MBC. A case-control series of 433 BRCA1/2 mutation-negative MBC and female breast cancer (FBC) cases and 849 male and female controls was included in the study. WES in the family identified the partner and localizer of BRCA2 (PALB2) c.419delA truncating mutation carried by the proband, her father, and her paternal uncle (all affected with BC) and the N-acetyltransferase 1 (NAT1) c.97C>T nonsense mutation carried by the proband's maternal aunt. Targeted PALB2 sequencing detected the c.1984A>T nonsense mutation in 1 of the 48 BRCA1/2 mutation-negative MBC cases. NAT1 c.97C>T was not found in the case-control series. These results add strength to the evidence showing that PALB2 is involved in BC risk for both sexes and indicate that consideration should be given to clinical testing of PALB2 for BRCA1/2 mutation-negative families with multiple MBC and FBC cases. Cancer 2017;123:210-218. © 2016 American Cancer Society.

  2. Identification of peptide sequences that target to the brain using in vivo phage display.

    PubMed

    Li, Jingwei; Zhang, Qizhi; Pang, Zhiqing; Wang, Yuchen; Liu, Qingfeng; Guo, Liangran; Jiang, Xinguo

    2012-06-01

    Phage display technology could provide a rapid means for the discovery of novel peptides. To find peptide ligands specific for the brain vascular receptors, we performed a modified phage display method. Phages were recovered from mouse brain parenchyma after intravenous administration of a random 7-mer peptide library. A longer circulation time was arranged according to the biodistributive brain/blood ratios of phage particles. Following sequential rounds of isolation, a number of phages were sequenced and a peptide sequence (CTSTSAPYC, denoted as PepC7) was identified. Clone 7-1, which encodes PepC7, exhibited translocation efficiency about 41-fold higher than the random library phage. Immunofluorescence analysis revealed that Clone 7-1 was transported into the brain significantly more efficiently than native M13 phage. Clone 7-1 was inhibited from homing to the brain in a dose-dependent fashion when cyclic peptides of the same sequence were present in a competition assay. Interestingly, the linear peptide (ATSTSAPYA, Pep7) and a scrambled control peptide PepSC7 (CSPATSYTC) did not compete with the phage at the same tested concentration (0.2-200 pg). Labeled with Cy5.5, PepC7 exhibited significant brain-targeting capability in in vivo optical imaging analysis. The cyclic conformation of PepC7, formed by a disulfide bond, and the correct structure itself play a critical role in maintaining the selectivity and affinity for the brain. In conclusion, PepC7 is a promising brain-targeting motif that has not been reported before, and it could be applied to targeted drug delivery into the brain.
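
    The logic of the two control peptides can be checked directly from the sequences quoted in the abstract. The sketch below is only an illustration: the helper names are hypothetical, and treating "cysteine at both ends" as a proxy for the ability to form the cyclizing disulfide bond is a simplifying assumption.

```python
from collections import Counter

# Sequences as given in the abstract.
PEPTIDES = {
    "PepC7": "CTSTSAPYC",   # cyclic hit
    "Pep7": "ATSTSAPYA",    # linear control (terminal Cys replaced by Ala)
    "PepSC7": "CSPATSYTC",  # scrambled control
}


def same_composition(a: str, b: str) -> bool:
    """True when two peptides contain exactly the same residues in any order."""
    return Counter(a) == Counter(b)


def has_terminal_cysteines(seq: str) -> bool:
    """Assumed proxy for being able to cyclize through a disulfide bond."""
    return seq[0] == "C" and seq[-1] == "C"


if __name__ == "__main__":
    hit = PEPTIDES["PepC7"]
    for name, seq in PEPTIDES.items():
        print(name, seq,
              "same composition as PepC7:", same_composition(hit, seq),
              "terminal Cys:", has_terminal_cysteines(seq),
              "same inner 7-mer:", seq[1:-1] == hit[1:-1])
```

    The scrambled control keeps the residue composition and the terminal cysteines but changes the order, while the linear control keeps the inner 7-mer but loses the cysteines, so each control removes one of the two features the abstract identifies as critical.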

  3. LROC Targeted Observations for the Next Generation of Scientific Exploration

    NASA Astrophysics Data System (ADS)

    Jolliff, B. L.

    2015-12-01

    Imaging of the Moon at high spatial resolution (0.5 to 2 mpp) by the Lunar Reconnaissance Orbiter Camera (LROC) Narrow Angle Cameras (NAC) plus topographic data derived from LROC NAC and WAC (Wide Angle Camera) and LOLA (Lunar Orbiter Laser Altimeter), coupled with recently obtained hyperspectral NIR and thermal data, permit studies of composition, mineralogy, and geologic context at essentially an outcrop scale. Such studies pave the way for future landed and sample return missions for high science priority targets. Among such targets are (1) the youngest volcanic rocks on the Moon, including mare basalts formed as recently as ~1 Ga, and irregular mare patches (IMPs) that appear to be even younger [1]; (2) volcanic rocks and complexes with compositions more silica-rich than mare basalts [2-4]; (3) differentiated impact-melt deposits [5,6], ancient volcanics, and compositional anomalies within the South Pole-Aitken basin; (4) exposures of recently discovered key crustal rock types in uplifted structures such as essentially pure anorthosite [7] and spinel-rich rocks [8]; and (5) frozen volatile-element-rich deposits in polar areas [9]. Important data sets include feature sequences of paired NAC images obtained under similar illumination conditions, NAC geometric stereo, from which high-resolution DTMs can be made, and photometric sequences useful for assessing composition in areas of mature cover soils. Examples of each of these target types will be discussed in the context of potential future missions. References: [1] Braden et al. (2014) Nat. Geo. 7, 787-791. [2] Glotch et al. (2010) Science, 329, 1510-1513. [3] Greenhagen et al. (2010) Science, 329, 1507-1509. [4] Jolliff et al. (2011) Nat. Geo. 4, 566-571. [5] Vaughan et al. (2013) PSS 91, 101-106. [6] Hurwitz and Kring (2014) J. Geophys. Res. 119, 1110-1133. [7] Ohtake et al. (2009) Nature, 461, 236-241. [8] Pieters et al. (2014) Am. Min. 99, 1893-1910. [9] Colaprete et al. (2010) Science 330, 463-468.

  4. Identification of Five Novel Variants in Chinese Oculocutaneous Albinism by Targeted Next-Generation Sequencing.

    PubMed

    Qiu, Biyuan; Ma, Tao; Peng, Chunyan; Zheng, Xiaoqin; Yang, Jiyun

    2018-04-01

    The diagnosis of oculocutaneous albinism (OCA) is established using clinical signs and symptoms. OCA is, however, a highly genetically heterogeneous disease with mutations identified in at least nineteen unique genes, many of which produce overlapping phenotypic traits. Thus, differentiating genetic OCA subtypes for diagnoses and genetic counseling is challenging, based on clinical presentation alone, and would benefit from a comprehensive molecular diagnostic. The aim of this study was to develop and validate a more comprehensive, targeted, next-generation-sequencing-based diagnostic for the identification of OCA-causing variants. The genomic DNA samples from 28 OCA probands were analyzed by targeted next-generation sequencing (NGS), and the candidate variants were confirmed through Sanger sequencing. We observed mutations in the TYR, OCA2, and SLC45A2 genes in 25/28 (89%) patients with OCA. We identified 38 pathogenic variants among these three genes, including 5 novel variants: c.1970G>T (p.Gly657Val), c.1669A>C (p.Thr557Pro), c.2339-2A>C, and c.1349C>G (p.Thr450Arg) in OCA2; c.459_470delTTTTGCTGCCGA (p.Ala155_Phe158del) in SLC45A2. Our findings expand the mutational spectrum of OCA in the Chinese population, and the assay we developed should be broadly useful as a molecular diagnostic, and as an aid for genetic counseling for OCA patients.

  5. Hi-Plex for Simple, Accurate, and Cost-Effective Amplicon-based Targeted DNA Sequencing.

    PubMed

    Pope, Bernard J; Hammet, Fleur; Nguyen-Dumont, Tu; Park, Daniel J

    2018-01-01

    Hi-Plex is a suite of methods to enable simple, accurate, and cost-effective highly multiplex PCR-based targeted sequencing (Nguyen-Dumont et al., Biotechniques 58:33-36, 2015). At its core is the principle of using gene-specific primers (GSPs) to "seed" (or target) the reaction and universal primers to "drive" the majority of the reaction. In this manner, effects on amplification efficiencies across the target amplicons can, to a large extent, be restricted to early seeding cycles. Product sizes are defined within a relatively narrow range to enable high-specificity size selection, replication uniformity across target sites (including in the context of fragmented input DNA such as that derived from fixed tumor specimens (Nguyen-Dumont et al., Biotechniques 55:69-74, 2013; Nguyen-Dumont et al., Anal Biochem 470:48-51, 2015)), and application of high-specificity genetic variant calling algorithms (Pope et al., Source Code Biol Med 9:3, 2014; Park et al., BMC Bioinformatics 17:165, 2016). Hi-Plex offers a streamlined workflow that is suitable for testing large numbers of specimens without the need for automation.

  6. Modelling a Set of Carbon-Rich AGB Stars at High-Angular Resolution

    NASA Astrophysics Data System (ADS)

    Rau, Gioia; Hron, Josef; Paladini, Claudia; Aringer, Bernard; Eriksson, Kjell; Marigo, Paola; Nowotny, Walter; Grellmann, Rebekka

    2016-07-01

    We compared spectro-photometric and interferometric observations of six carbon-rich AGB stars with a grid of self-consistent model atmospheres. The targets are: R Lep, R Vol, Y Pav, AQ Sgr, U Hya and X TrA. Please refer to the publication Rau et al. 2016 (subm.) for further details on those findings.

  7. Computer program for the IBM personal computer which searches for approximate matches to short oligonucleotide sequences in long target DNA sequences.

    PubMed Central

    Myers, E W; Mount, D W

    1986-01-01

    We describe a program which may be used to find approximate matches to a short predefined DNA sequence in a larger target DNA sequence. The program predicts the usefulness of specific DNA probes and sequencing primers and finds nearly identical sequences that might represent the same regulatory signal. The program is written in the C programming language and will run on virtually any computer system with a C compiler, such as the IBM/PC and other computers running under the MS/DOS and UNIX operating systems. The program has been integrated into an existing software package for the IBM personal computer (see article by Mount and Conrad, this volume). Some examples of its use are given. PMID:3753785
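
    The kind of search described here can be sketched as a sliding-window comparison. The abstract does not say which algorithm the 1986 program uses, so the version below is an assumption: it counts substitutions only (Hamming distance), ignores insertions and deletions, and uses the hypothetical function name `approximate_matches`.

```python
def approximate_matches(probe: str, target: str, max_mismatches: int):
    """Return (start, mismatches) for every window of the target that differs
    from the probe at no more than max_mismatches positions.

    Substitution-only matching; gaps are not considered in this sketch.
    """
    probe = probe.upper()
    target = target.upper()
    hits = []
    width = len(probe)
    for start in range(len(target) - width + 1):
        window = target[start:start + width]
        mismatches = sum(1 for p, t in zip(probe, window) if p != t)
        if mismatches <= max_mismatches:
            hits.append((start, mismatches))
    return hits


if __name__ == "__main__":
    # Toy sequences chosen for illustration only.
    for start, mismatches in approximate_matches("ACGT", "TTACGTACCTAA", 1):
        print(f"position {start}: {mismatches} mismatch(es)")
```

    This brute-force scan costs time proportional to probe length times target length, which is adequate for short oligonucleotides; a tool that must also tolerate gaps would need an alignment-based method instead.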

  8. Systematic evaluation of a targeted gene capture sequencing panel for molecular diagnosis of retinitis pigmentosa.

    PubMed

    Huang, Hui; Chen, Yanhua; Chen, Huishuang; Ma, Yuanyuan; Chiang, Pei-Wen; Zhong, Jing; Liu, Xuyang; Asan; Wu, Jing; Su, Yan; Li, Xin; Deng, Jianlian; Huang, Yingping; Zhang, Xinxin; Li, Yang; Fan, Ning; Wang, Ying; Tang, Lihui; Shen, Jinting; Chen, Meiyan; Zhang, Xiuqing; Te, Deng; Banerjee, Santasree; Liu, Hui; Qi, Ming; Yi, Xin

    2018-01-01

    Inherited eye diseases are major causes of vision loss in both children and adults. Inherited eye diseases are characterized by clinical variability and pronounced genetic heterogeneity. Genetic testing may provide an accurate diagnosis for ophthalmic genetic disorders and allow gene therapy for specific diseases. A targeted gene capture panel was designed to capture exons of 283 inherited eye disease genes including 58 known causative retinitis pigmentosa (RP) genes. A total of 180 samples were tested with this panel, 68 of which had previously been tested by Sanger sequencing. Systematic evaluation of our method and comprehensive molecular diagnosis were carried out on 99 RP patients. 96.85% of targeted regions were covered at least 20-fold, and the accuracy of variant detection was 99.994%. In 4 of the 68 samples previously tested by Sanger sequencing, mutations for other diseases, inconsistent with the clinical diagnosis, were detected by next-generation sequencing (NGS) but not by Sanger sequencing. Among the 99 RP patients, pathogenic mutations were detected in 64 (64.6%), while in 3 patients the molecular diagnosis was inconsistent with the initial clinical diagnosis. After revisiting, one patient's clinical diagnosis was reclassified. In addition, 3 patients were found carrying large deletions. We have systematically evaluated our method and compared it with Sanger sequencing, and have identified a large number of novel mutations in a cohort of 99 RP patients. The results showed a sufficient accuracy of our method and suggested the importance of molecular diagnosis in clinical diagnosis.

  9. Systematic evaluation of a targeted gene capture sequencing panel for molecular diagnosis of retinitis pigmentosa

    PubMed Central

    Ma, Yuanyuan; Chiang, Pei-Wen; Zhong, Jing; Liu, Xuyang; Asan; Wu, Jing; Su, Yan; Li, Xin; Deng, Jianlian; Huang, Yingping; Zhang, Xinxin; Li, Yang; Fan, Ning; Wang, Ying; Tang, Lihui; Shen, Jinting; Chen, Meiyan; Zhang, Xiuqing; Te, Deng; Banerjee, Santasree; Liu, Hui; Qi, Ming; Yi, Xin

    2018-01-01

    Background: Inherited eye diseases are major causes of vision loss in both children and adults. Inherited eye diseases are characterized by clinical variability and pronounced genetic heterogeneity. Genetic testing may provide an accurate diagnosis for ophthalmic genetic disorders and allow gene therapy for specific diseases. Methods: A targeted gene capture panel was designed to capture exons of 283 inherited eye disease genes including 58 known causative retinitis pigmentosa (RP) genes. A total of 180 samples were tested with this panel, 68 of which had previously been tested by Sanger sequencing. Systematic evaluation of our method and comprehensive molecular diagnosis were carried out on 99 RP patients. Results: 96.85% of targeted regions were covered at least 20-fold, and the accuracy of variant detection was 99.994%. In 4 of the 68 samples previously tested by Sanger sequencing, mutations for other diseases, inconsistent with the clinical diagnosis, were detected by next-generation sequencing (NGS) but not by Sanger sequencing. Among the 99 RP patients, pathogenic mutations were detected in 64 (64.6%), while in 3 patients the molecular diagnosis was inconsistent with the initial clinical diagnosis. After revisiting, one patient’s clinical diagnosis was reclassified. In addition, 3 patients were found carrying large deletions. Conclusions: We have systematically evaluated our method and compared it with Sanger sequencing, and have identified a large number of novel mutations in a cohort of 99 RP patients. The results showed a sufficient accuracy of our method and suggested the importance of molecular diagnosis in clinical diagnosis. PMID:29641573

  10. Detecting very low allele fraction variants using targeted DNA sequencing and a novel molecular barcode-aware variant caller.

    PubMed

    Xu, Chang; Nezami Ranjbar, Mohammad R; Wu, Zhong; DiCarlo, John; Wang, Yexun

    2017-01-03

    Detection of DNA mutations at very low allele fractions with high accuracy will significantly improve the effectiveness of precision medicine for cancer patients. To achieve this goal through next generation sequencing, researchers need a detection method that 1) captures rare mutation-containing DNA fragments efficiently in the mix of abundant wild-type DNA; 2) sequences the DNA library extensively to deep coverage; and 3) distinguishes low level true variants from amplification and sequencing errors with high accuracy. Targeted enrichment using PCR primers provides researchers with a convenient way to achieve deep sequencing for a small, yet most relevant region using benchtop sequencers. Molecular barcoding (or indexing) provides a unique solution for reducing sequencing artifacts analytically. Although different molecular barcoding schemes have been reported in recent literature, most variant calling has been done on limited targets, using simple custom scripts. The analytical performance of barcode-aware variant calling can be significantly improved by incorporating advanced statistical models. We present here a highly efficient, simple and scalable enrichment protocol that integrates molecular barcodes in multiplex PCR amplification. In addition, we developed smCounter, an open source, generic, barcode-aware variant caller based on a Bayesian probabilistic model. smCounter was optimized and benchmarked on two independent read sets with SNVs and indels at 5 and 1% allele fractions. Variants were called with very good sensitivity and specificity within coding regions. We demonstrated that we can accurately detect somatic mutations with allele fractions as low as 1% in coding regions using our enrichment protocol and variant caller.
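
    The idea of using molecular barcodes to separate true variants from amplification and sequencing errors can be sketched with a simple majority vote per barcode. This is not smCounter's Bayesian model, which the abstract does not detail; the function names and toy reads are hypothetical, and the input is simplified to one observed base per read at a single position.

```python
from collections import Counter, defaultdict


def consensus_by_barcode(reads):
    """Collapse reads sharing a molecular barcode into one consensus base.

    reads: iterable of (barcode, base) pairs observed at one genomic position.
    A plain majority vote stands in for a proper statistical model.
    """
    groups = defaultdict(list)
    for barcode, base in reads:
        groups[barcode].append(base)
    return {bc: Counter(bases).most_common(1)[0][0] for bc, bases in groups.items()}


def allele_fraction(consensus, alt_base):
    """Fraction of original molecules (barcodes) whose consensus is alt_base."""
    return sum(1 for base in consensus.values() if base == alt_base) / len(consensus)


if __name__ == "__main__":
    # Toy data: barcode AAA carries one erroneous T read; GGG is a true T molecule.
    reads = [("AAA", "C"), ("AAA", "C"), ("AAA", "T"),
             ("GGG", "T"), ("GGG", "T"),
             ("CCC", "C"), ("CCC", "C"),
             ("TTT", "C")]
    consensus = consensus_by_barcode(reads)
    print(consensus)
    print("molecule-level T fraction:", allele_fraction(consensus, "T"))
```

    Counting molecules instead of reads removes the isolated error in barcode AAA and also stops heavily amplified molecules from being over-counted.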

  11. Identifying mRNA sequence elements for target recognition by human Argonaute proteins

    PubMed Central

    Li, Jingjing; Kim, TaeHyung; Nutiu, Razvan; Ray, Debashish; Hughes, Timothy R.; Zhang, Zhaolei

    2014-01-01

    It is commonly known that mammalian microRNAs (miRNAs) guide the RNA-induced silencing complex (RISC) to target mRNAs through the seed-pairing rule. However, recent experiments that coimmunoprecipitate the Argonaute proteins (AGOs), the central catalytic component of RISC, have consistently revealed extensive AGO-associated mRNAs that lack seed complementarity with miRNAs. We herein test the hypothesis that AGO has its own binding preference within target mRNAs, independent of guide miRNAs. By systematically analyzing the data from in vivo cross-linking experiments with human AGOs, we have identified a structurally accessible and evolutionarily conserved region (∼10 nucleotides in length) that alone can accurately predict AGO–mRNA associations, independent of the presence of miRNA binding sites. Within this region, we further identified an enriched motif that was replicable on independent AGO-immunoprecipitation data sets. We used RNAcompete to enumerate the RNA-binding preference of human AGO2 to all possible 7-mer RNA sequences and validated the AGO motif in vitro. These findings reveal a novel function of AGOs as sequence-specific RNA-binding proteins, which may aid miRNAs in recognizing their targets with high specificity. PMID:24663241
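
    To give a sense of the search space mentioned above, the following sketch enumerates every possible RNA 7-mer. It only illustrates the size of the sequence space and says nothing about how RNAcompete itself designs or measures its probes; the function name is hypothetical.

```python
from itertools import product


def all_kmers(k: int, alphabet: str = "ACGU"):
    """Return every sequence of length k over the given alphabet, in lexicographic order."""
    return ["".join(letters) for letters in product(alphabet, repeat=k)]


if __name__ == "__main__":
    heptamers = all_kmers(7)
    # 4 bases at each of 7 positions gives 4**7 sequences.
    print(len(heptamers), heptamers[0], heptamers[-1])
```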

  12. Controlling the prion propensity of glutamine/asparagine-rich proteins.

    PubMed

    Paul, Kacy R; Ross, Eric D

    2015-01-01

    The yeast Saccharomyces cerevisiae can harbor a number of distinct prions. Most of the yeast prion proteins contain a glutamine/asparagine (Q/N) rich region that drives prion formation. Prion-like domains, defined as regions with high compositional similarity to yeast prion domains, are common in eukaryotic proteomes, and mutations in various human proteins containing prion-like domains have been linked to degenerative diseases, including amyotrophic lateral sclerosis. Here, we discuss a recent study in which we utilized two strategies to generate prion activity in non-prion Q/N-rich domains. First, we made targeted mutations in four non-prion Q/N-rich domains, replacing predicted prion-inhibiting amino acids with prion-promoting amino acids. All four mutants formed foci when expressed in yeast, and two acquired bona fide prion activity. Prion activity could be generated with as few as two mutations, suggesting that many non-prion Q/N-rich proteins may be just a small number of mutations from acquiring aggregation or prion activity. Second, we created tandem repeats of short prion-prone segments, and observed length-dependent prion activity. These studies demonstrate the considerable progress that has been made in understanding the sequence basis for aggregation of prion and prion-like domains, and suggest possible mechanisms by which new prion domains could evolve.

  13. Clinical Validation of Copy Number Variant Detection from Targeted Next-Generation Sequencing Panels.

    PubMed

    Kerkhof, Jennifer; Schenkel, Laila C; Reilly, Jack; McRobbie, Sheri; Aref-Eshghi, Erfan; Stuart, Alan; Rupar, C Anthony; Adams, Paul; Hegele, Robert A; Lin, Hanxin; Rodenhiser, David; Knoll, Joan; Ainsworth, Peter J; Sadikovic, Bekim

    2017-11-01

    Next-generation sequencing (NGS) technology has rapidly replaced Sanger sequencing in the assessment of sequence variations in clinical genetics laboratories. One major limitation of current NGS approaches is their limited ability to detect copy number variations (CNVs) larger than approximately 50 bp. Because these represent a major mutational burden in many genetic disorders, parallel CNV assessment using alternate supplemental methods, along with the NGS analysis, is normally required, resulting in increased labor, costs, and turnaround times. The objective of this study was to clinically validate a novel CNV detection algorithm using targeted clinical NGS gene panel data. We have applied this approach in a retrospective cohort of 391 samples and a prospective cohort of 2375 samples and found a 100% sensitivity (95% CI, 89%-100%) for 37 unique events and a high degree of specificity to detect CNVs across nine distinct targeted NGS gene panels. This NGS CNV pipeline enables stand-alone first-tier assessment for CNV and sequence variants in a clinical laboratory setting, dispensing with the need for parallel CNV analysis using classic techniques, such as microarray, long-range PCR, or multiplex ligation-dependent probe amplification. This NGS CNV pipeline can also be applied to the assessment of complex genomic regions, including pseudogenic DNA sequences, such as the PMS2CL gene, and to mitochondrial genome heteroplasmy detection. Copyright © 2017 American Society for Investigative Pathology and the Association for Molecular Pathology. Published by Elsevier Inc. All rights reserved.

  14. Contribution of the first K-homology domain of poly(C)-binding protein 1 to its affinity and specificity for C-rich oligonucleotides

    PubMed Central

    Yoga, Yano M. K.; Traore, Daouda A. K.; Sidiqi, Mahjooba; Szeto, Chris; Pendini, Nicole R.; Barker, Andrew; Leedman, Peter J.; Wilce, Jacqueline A.; Wilce, Matthew C. J.

    2012-01-01

    Poly-C-binding proteins are triple KH (hnRNP K homology) domain proteins with specificity for single-stranded C-rich RNA and DNA. They play diverse roles in the regulation of protein expression at both transcriptional and translational levels. Here, we analyse the contributions of individual αCP1 KH domains to binding C-rich oligonucleotides using biophysical and structural methods. Using surface plasmon resonance (SPR), we demonstrate that KH1 makes the most stable interactions with both RNA and DNA, KH3 binds with intermediate affinity and KH2 only interacts detectably with DNA. The crystal structure of KH1 bound to a 5′-CCCTCCCT-3′ DNA sequence shows a 2:1 protein:DNA stoichiometry and demonstrates a molecular arrangement of KH domains bound to immediately adjacent oligonucleotide target sites. SPR experiments with a series of poly-C sequences reveal that cytosine is preferred at all four positions in the oligonucleotide binding cleft and that a C-tetrad binds KH1 with 10 times higher affinity than a C-triplet. The basis for this high affinity interaction is finally detailed with the structure determination of a KH1.W.C54S mutant bound to 5′-ACCCCA-3′ DNA sequence. Together, these data establish the lead role of KH1 in oligonucleotide binding by αCP1 and reveal the molecular basis of its specificity for a C-rich tetrad. PMID:22344691

  15. FrameD: A flexible program for quality check and gene prediction in prokaryotic genomes and noisy matured eukaryotic sequences.

    PubMed

    Schiex, Thomas; Gouzy, Jérôme; Moisan, Annick; de Oliveira, Yannick

    2003-07-01

    We describe FrameD, a program that predicts coding regions in prokaryotic and matured eukaryotic sequences. Initially targeted at gene prediction in bacterial GC-rich genomes, the gene model used in FrameD also allows genes to be predicted in the presence of frameshifts and partially undetermined sequences, which makes it very suitable for gene prediction and frameshift correction in unfinished sequences such as EST and EST cluster sequences. Like recent eukaryotic gene prediction programs, FrameD also includes the ability to take into account protein similarity information both in its prediction and its graphical output. Its performance is evaluated on different bacterial genomes. The web site (http://genopole.toulouse.inra.fr/bioinfo/FrameD/FD) allows direct prediction, sequence correction and translation, and offers the ability to learn new models for new organisms.

  16. Integration of targeted sequencing and NIPT into clinical practice in a Chinese family with maple syrup urine disease.

    PubMed

    You, Yanqin; Sun, Yan; Li, Xuchao; Li, Yali; Wei, Xiaoming; Chen, Fang; Ge, Huijuan; Lan, Zhangzhang; Zhu, Qian; Tang, Ying; Wang, Shujuan; Gao, Ya; Jiang, Fuman; Song, Jiaping; Shi, Quan; Zhu, Xuan; Mu, Feng; Dong, Wei; Gao, Vince; Jiang, Hui; Yi, Xin; Wang, Wei; Gao, Zhiying

    2014-08-01

    This article demonstrates a prominent noninvasive prenatal approach to assist the clinical diagnosis of a single-gene disorder, maple syrup urine disease, using targeted sequencing knowledge from the affected family. The method reported here combines novel mutant discovery in known genes by targeted massively parallel sequencing with noninvasive prenatal testing. By applying this new strategy, we successfully revealed novel mutations in the gene BCKDHA (Ex2_4dup and c.392A>G) in this Chinese family and developed a prenatal haplotype-assisted approach to noninvasively detect the genotype of the fetus (transmitted from both parents). This is the first report of integration of targeted sequencing and noninvasive prenatal testing into clinical practice. Our study has demonstrated that this massively parallel sequencing-based strategy can potentially be used for single-gene disorder diagnosis in the future.

  17. A collagen-binding EGFR antibody fragment targeting tumors with a collagen-rich extracellular matrix.

    PubMed

    Liang, Hui; Li, Xiaoran; Wang, Bin; Chen, Bing; Zhao, Yannan; Sun, Jie; Zhuang, Yan; Shi, Jiajia; Shen, He; Zhang, Zhijun; Dai, Jianwu

    2016-02-17

    Many tumors over-express collagen, which constitutes the physical scaffold of the tumor microenvironment. Collagen has been considered to be a target for cancer therapy. The collagen-binding domain (CBD) is a short peptide, which could bind to collagen and achieve the sustained release of CBD-fused proteins in a collagen scaffold. Here, a collagen-binding EGFR antibody fragment was designed and expressed for targeting the collagen-rich extracellular matrix in tumors. The antibody fragment (Fab) of cetuximab was fused with CBD (CBD-Fab) and expressed in Pichia pastoris. CBD-Fab maintained the antigen binding and anti-tumor activity of cetuximab and obtained a collagen-binding ability in vitro. The results also showed CBD-Fab was mainly enriched in tumors and had a longer retention time in tumors in A431 s.c. xenografts. Furthermore, CBD-Fab showed a similar therapeutic efficacy to cetuximab in A431 xenografts. Although CBD-Fab has not shown better therapeutic effects than cetuximab, its smaller molecular size and distinct target may make it suitable for antibody-drug conjugates (ADC) or immunotoxins.

  18. A collagen-binding EGFR antibody fragment targeting tumors with a collagen-rich extracellular matrix

    PubMed Central

    Liang, Hui; Li, Xiaoran; Wang, Bin; Chen, Bing; Zhao, Yannan; Sun, Jie; Zhuang, Yan; Shi, Jiajia; Shen, He; Zhang, Zhijun; Dai, Jianwu

    2016-01-01

    Many tumors over-express collagen, which constitutes the physical scaffold of the tumor microenvironment. Collagen has been considered to be a target for cancer therapy. The collagen-binding domain (CBD) is a short peptide that can bind to collagen and achieve the sustained release of CBD-fused proteins in a collagen scaffold. Here, a collagen-binding EGFR antibody fragment was designed and expressed for targeting the collagen-rich extracellular matrix in tumors. The antibody fragment (Fab) of cetuximab was fused with CBD (CBD-Fab) and expressed in Pichia pastoris. CBD-Fab maintained the antigen binding and anti-tumor activity of cetuximab and obtained a collagen-binding ability in vitro. The results also showed that CBD-Fab was mainly enriched in tumors and had a longer retention time in tumors in A431 s.c. xenografts. Furthermore, CBD-Fab showed a therapeutic efficacy similar to that of cetuximab in A431 xenografts. Although CBD-Fab has not shown better therapeutic effects than cetuximab, its smaller molecular size and special target may make it applicable as antibody–drug conjugates (ADC) or immunotoxins. PMID:26883295

  19. A tale of three next generation sequencing platforms: comparison of Ion Torrent, Pacific Biosciences and Illumina MiSeq sequencers.

    PubMed

    Quail, Michael A; Smith, Miriam; Coupland, Paul; Otto, Thomas D; Harris, Simon R; Connor, Thomas R; Bertoni, Anna; Swerdlow, Harold P; Gu, Yong

    2012-07-24

    Next generation sequencing (NGS) technology has revolutionized genomic and genetic research. The pace of change in this area is rapid with three major new sequencing platforms having been released in 2011: Ion Torrent's PGM, Pacific Biosciences' RS and the Illumina MiSeq. Here we compare the results obtained with those platforms to the performance of the Illumina HiSeq, the current market leader. In order to compare these platforms, and get sufficient coverage depth to allow meaningful analysis, we have sequenced a set of 4 microbial genomes with mean GC content ranging from 19.3 to 67.7%. Together, these represent a comprehensive range of genome content. Here we report our analysis of that sequence data in terms of coverage distribution, bias, GC distribution, variant detection and accuracy. Sequence generated by Ion Torrent, MiSeq and Pacific Biosciences technologies displays near perfect coverage behaviour on GC-rich, neutral and moderately AT-rich genomes, but a profound bias was observed upon sequencing the extremely AT-rich genome of Plasmodium falciparum on the PGM, resulting in no coverage for approximately 30% of the genome. We analysed the ability to call variants from each platform and found that we could call slightly more variants from Ion Torrent data compared to MiSeq data, but at the expense of a higher false positive rate. Variant calling from Pacific Biosciences data was possible but higher coverage depth was required. Context specific errors were observed in both PGM and MiSeq data, but not in that from the Pacific Biosciences platform. All three fast turnaround sequencers evaluated here were able to generate usable sequence. However there are key differences between the quality of that data and the applications it will support.
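
    The GC-content figures quoted in this abstract (19.3 to 67.7%) are simple base-composition percentages. A minimal sketch of that calculation follows; the function names and the 35%/65% cut-offs used to label a sequence are illustrative assumptions, not values taken from the paper.

```python
def gc_content(seq: str) -> float:
    """Return the percentage of G and C among the A/C/G/T bases in seq."""
    counted = [base for base in seq.upper() if base in "ACGT"]
    if not counted:
        raise ValueError("sequence contains no A/C/G/T bases")
    gc = sum(1 for base in counted if base in "GC")
    return 100.0 * gc / len(counted)


def classify(seq: str, at_rich_below: float = 35.0, gc_rich_above: float = 65.0) -> str:
    """Label a sequence using hypothetical thresholds (assumptions, not from the paper)."""
    gc = gc_content(seq)
    if gc < at_rich_below:
        return "AT-rich"
    if gc > gc_rich_above:
        return "GC-rich"
    return "neutral"


if __name__ == "__main__":
    for example in ("AAAATTTT", "ATGCATGC", "GGCCGGCA"):
        print(example, round(gc_content(example), 1), classify(example))
```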

  20. A Systematic Prediction of Drug-Target Interactions Using Molecular Fingerprints and Protein Sequences.

    PubMed

    Huang, Yu-An; You, Zhu-Hong; Chen, Xing

    2018-01-01

    Drug-Target Interactions (DTI) play a crucial role in discovering new drug candidates and finding new proteins to target for drug development. Although the number of detected DTI obtained by high-throughput techniques has been increasing, the number of known DTI is still limited. On the other hand, the experimental methods for detecting the interactions among drugs and proteins are costly and inefficient. Therefore, computational approaches for predicting DTI are drawing increasing attention in recent years. In this paper, we report a novel computational model for predicting the DTI using extremely randomized trees model and protein amino acids information. More specifically, the protein sequence is represented as a Pseudo Substitution Matrix Representation (Pseudo-SMR) descriptor in which the influence of biological evolutionary information is retained. For the representation of drug molecules, a novel fingerprint feature vector is utilized to describe its substructure information. Then the DTI pair is characterized by concatenating the two vector spaces of protein sequence and drug substructure. Finally, the proposed method is explored for predicting the DTI on four benchmark datasets: Enzyme, Ion Channel, GPCRs and Nuclear Receptor. The experimental results demonstrate that this method achieves promising prediction accuracies of 89.85%, 87.87%, 82.99% and 81.67%, respectively. For further evaluation, we compared the performance of Extremely Randomized Trees model with that of the state-of-the-art Support Vector Machine classifier. And we also compared the proposed model with existing computational models, and confirmed 15 potential drug-target interactions by looking for existing databases. The experiment results show that the proposed method is feasible and promising for predicting drug-target interactions for new drug candidate screening based on sizeable features. 
    Copyright© Bentham Science Publishers.

  1. Sequence-based design of bioactive small molecules that target precursor microRNAs

    PubMed Central

    Velagapudi, Sai Pradeep; Gallo, Steven M.; Disney, Matthew D.

    2014-01-01

    Oligonucleotides are designed to target RNA using base-pairing rules; however, they are hampered by poor cellular delivery and non-specific stimulation of the immune system. Small molecules are preferred as lead drugs or probes, but cannot be designed from sequence. Herein, we describe an approach termed Inforna that designs lead small molecules for RNA solely from sequence. Inforna was applied to all human microRNA precursors and identified bioactive small molecules that inhibit biogenesis by binding to nuclease processing sites (41% hit rate). Amongst 29 lead interactions, the most avid interaction is between a benzimidazole (1) and precursor microRNA-96. Compound 1 selectively inhibits biogenesis of microRNA-96, upregulating a protein target (FOXO1) and inducing apoptosis in cancer cells. Apoptosis is ablated when FOXO1 mRNA expression is knocked down by an siRNA, validating compound selectivity. Importantly, microRNA profiling shows that 1 significantly affects only microRNA-96 biogenesis and is more selective than an oligonucleotide. PMID:24509821

  2. The Cretaceous-Paleogene transition and Chicxulub impact ejecta in the northwestern Gulf of Mexico: Paleoenvironments, sequence stratigraphic setting and target lithologies

    NASA Astrophysics Data System (ADS)

    Schulte, Peter

    2003-07-01

    The Cretaceous-Paleogene (K-P) transition is characterized by a period of mass extinctions, the Chicxulub impact event, sea-level changes, and considerable climate changes (e.g., cooling). The Gulf of Mexico region is a key area for addressing these issues, specifically because of the proximity to the large Chicxulub impact structure in southern Mexico, and because of its shallow shelf areas throughout the Maastrichtian to Danian period. This study presents the results of a multidisciplinary investigation of Chicxulub impact ejecta and marine sediments from the K-P transition in the western Gulf of Mexico. Sedimentological, mineralogical, and geochemical aspects of K-P sections and cores from northeastern Mexico, Texas, and Alabama have been studied with focus on Chicxulub ejecta, long- or short-term facies change, and sequence stratigraphic setting. CHICXULUB EJECTA: The Chicxulub ejecta (or impact spherule) deposits from northeastern Mexico and Texas revealed an unexpectedly complex and localized ejecta composition. Fe-Mg-rich chlorite- as well as Si-Al-K-rich glass-spherules are the predominant silicic ejecta components in northeastern Mexico, whereas in Texas, spherules of Mg-rich smectite compositions were encountered. Spherules contain Fe-Ti-K-rich schlieren, Fe-Mg-rich globules, and rare µm-sized metallic and sulfidic Ni-Co-(Ir-?) rich inclusions. This composition provides evidence for a distinct range of target rocks of mafic to intermediate composition, presumably situated in the northwestern sector of the Chicxulub impact structure, in addition to the possibility of contamination by meteoritic material. The absence of spinels and the ubiquitous presence of hematite and goethite point to high oxygen fugacity during the impact process. Besides these silicic phases, the most prominent ejecta component is carbonate. Carbonate is found in ejecta deposits as unshocked clasts, accretionary lapilli-like grains, melt globules (often with quenching textures

  3. Two-Way Gold Nanoparticle Label-Free Sensing of Specific Sequence and Small Molecule Targets Using Switchable Concatemers.

    PubMed

    Zhu, Longjiao; Shao, Xiangli; Luo, Yunbo; Huang, Kunlung; Xu, Wentao

    2017-05-19

    A two-way colorimetric biosensor based on unmodified gold nanoparticles (GNPs) and a switchable double-stranded DNA (dsDNA) concatemer has been demonstrated. Two hairpin probes (H1 and H2) were first designed that provided the fuels to assemble the dsDNA concatemers via hybridization chain reaction (HCR). A functional hairpin (FH) was rationally designed to recognize the target sequences. All the hairpins contained a single-stranded DNA (ssDNA) loop and sticky end to prevent GNPs from salt-induced aggregation. In the presence of target sequence, the capture probe blocked in the FH recognizes the target to form a duplex DNA, which causes the release of the initiator probe by FH conformational change. This process then starts the alternate-opening of H1 and H2 through HCR, and dsDNA concatemers grow from the target sequence. As a result, unmodified GNPs undergo salt-induced aggregation because the formed dsDNA concatemers are stiffer and provide less stabilization. A light purple-to-blue color variation was observed in the bulk solution, termed the light-off sensing way. Furthermore, an aptamer sequence was inserted into H1 to generate dsDNA concatemers with multiple small molecule binding sites. In the presence of small molecule targets, concatemers can be disassembled into mixtures with ssDNA sticky ends. A blue-to-purple reverse color variation was observed due to the regeneration of the ssDNA, termed the light-on way. The two-way biosensor can detect both nucleic acids and small molecule targets with one sensing device. This switchable sensing element is label-free, enzyme-free, and sophisticated-instrumentation-free. The detection limits of both targets were below nanomolar.

  4. Kilo-sequencing: an ordered strategy for rapid DNA sequence data acquisition.

    PubMed Central

    Barnes, W M; Bevan, M

    1983-01-01

    A strategy for rapid DNA sequence acquisition in an ordered, nonrandom manner, while retaining all of the conveniences of the dideoxy method with M13 transducing phage DNA template, is described. Target DNA 3 to 14 kb in size can be stably carried by our M13 vectors. Suitable targets are stretches of DNA which lack an enzyme recognition site which is unique on our cloning vectors and adjacent to the sequencing primer; current sites that are so useful when lacking are Pst, Xba, HindIII, BglII, EcoRI. By an in vitro procedure, we cut RF DNA once randomly and once specifically, to create thousands of deletions which start at the unique restriction site adjacent to the dideoxy sequencing primer and extend various distances across the target DNA. Phage carrying a desired size of deletions, whose DNA as template will give rise to DNA sequence data in a desired location along the target DNA, may be purified by electrophoresis alive on agarose gels. Phage running in the same location on the agarose gel thus conveniently give rise to nucleotide sequence data from the same kilobase of target DNA. PMID:6298723

  5. Genome sequence of the button mushroom Agaricus bisporus reveals mechanisms governing adaptation to a humic-rich ecological niche

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Morin, Emmanuelle; Kohler, Annegret; Baker, Adam R.

    Agaricus bisporus is the model fungus for the adaptation, persistence, and growth in the humic-rich leaf-litter environment. Aside from its ecological role, A. bisporus has been an important component of the human diet for over 200 y and worldwide cultivation of the button mushroom forms a multibillion dollar industry. We present two A. bisporus genomes, their gene repertoires and transcript profiles on compost and during mushroom formation. The genomes encode a full repertoire of polysaccharide-degrading enzymes similar to that of wood-decayers. Comparative transcriptomics of mycelium grown on defined medium, casing-soil, and compost revealed genes encoding enzymes involved in xylan, cellulose, pectin, and protein degradation are more highly expressed in compost. The striking expansion of heme-thiolate peroxidases and β-etherases is distinctive from Agaricomycotina wood-decayers and suggests a broad attack on decaying lignin and related metabolites found in humic acid-rich environment. Similarly, up-regulation of these genes together with a lignolytic manganese peroxidase, multiple copper radical oxidases, and cytochrome P450s is consistent with challenges posed by complex humic-rich substrates. The gene repertoire and expression of hydrolytic enzymes in A. bisporus is substantially different from the taxonomically related ectomycorrhizal symbiont Laccaria bicolor. A common promoter motif was also identified in genes very highly expressed in humic-rich substrates. These observations reveal genetic and enzymatic mechanisms governing adaptation to the humic-rich ecological niche formed during plant degradation, further defining the critical role such fungi contribute to soil structure and carbon sequestration in terrestrial ecosystems. Genome sequence will expedite mushroom breeding for improved agronomic characteristics.

  6. Genome sequence of the button mushroom Agaricus bisporus reveals mechanisms governing adaptation to a humic-rich ecological niche

    PubMed Central

    Morin, Emmanuelle; Kohler, Annegret; Baker, Adam R.; Foulongne-Oriol, Marie; Lombard, Vincent; Nagye, Laszlo G.; Ohm, Robin A.; Patyshakuliyeva, Aleksandrina; Brun, Annick; Aerts, Andrea L.; Bailey, Andrew M.; Billette, Christophe; Coutinho, Pedro M.; Deakin, Greg; Doddapaneni, Harshavardhan; Floudas, Dimitrios; Grimwood, Jane; Hildén, Kristiina; Kües, Ursula; LaButti, Kurt M.; Lapidus, Alla; Lindquist, Erika A.; Lucas, Susan M.; Murat, Claude; Riley, Robert W.; Salamov, Asaf A.; Schmutz, Jeremy; Subramanian, Venkataramanan; Wösten, Han A. B.; Xu, Jianping; Eastwood, Daniel C.; Foster, Gary D.; Sonnenberg, Anton S. M.; Cullen, Dan; de Vries, Ronald P.; Lundell, Taina; Hibbett, David S.; Henrissat, Bernard; Burton, Kerry S.; Kerrigan, Richard W.; Challen, Michael P.; Grigoriev, Igor V.; Martin, Francis

    2012-01-01

    Agaricus bisporus is the model fungus for the adaptation, persistence, and growth in the humic-rich leaf-litter environment. Aside from its ecological role, A. bisporus has been an important component of the human diet for over 200 y and worldwide cultivation of the “button mushroom” forms a multibillion dollar industry. We present two A. bisporus genomes, their gene repertoires and transcript profiles on compost and during mushroom formation. The genomes encode a full repertoire of polysaccharide-degrading enzymes similar to that of wood-decayers. Comparative transcriptomics of mycelium grown on defined medium, casing-soil, and compost revealed genes encoding enzymes involved in xylan, cellulose, pectin, and protein degradation are more highly expressed in compost. The striking expansion of heme-thiolate peroxidases and β-etherases is distinctive from Agaricomycotina wood-decayers and suggests a broad attack on decaying lignin and related metabolites found in humic acid-rich environment. Similarly, up-regulation of these genes together with a lignolytic manganese peroxidase, multiple copper radical oxidases, and cytochrome P450s is consistent with challenges posed by complex humic-rich substrates. The gene repertoire and expression of hydrolytic enzymes in A. bisporus is substantially different from the taxonomically related ectomycorrhizal symbiont Laccaria bicolor. A common promoter motif was also identified in genes very highly expressed in humic-rich substrates. These observations reveal genetic and enzymatic mechanisms governing adaptation to the humic-rich ecological niche formed during plant degradation, further defining the critical role such fungi contribute to soil structure and carbon sequestration in terrestrial ecosystems. Genome sequence will expedite mushroom breeding for improved agronomic characteristics. PMID:23045686

  7. Microbial communities and arsenic biogeochemistry at the outflow of an alkaline sulfide-rich hot spring.

    PubMed

    Jiang, Zhou; Li, Ping; Van Nostrand, Joy D; Zhang, Ping; Zhou, Jizhong; Wang, Yanhong; Dai, Xinyue; Zhang, Rui; Jiang, Dawei; Wang, Yanxin

    2016-04-29

    Alkaline sulfide-rich hot springs provide a unique environment for microbial community and arsenic (As) biogeochemistry. In this study, a representative alkaline sulfide-rich hot spring, Zimeiquan in the Tengchong geothermal area, was chosen to study arsenic geochemistry and microbial community using Illumina MiSeq sequencing. Over 0.26 million 16S rRNA sequence reads were obtained from 5-paired parallel water and sediment samples along the hot spring's outflow channel. High ratios of As(V)/AsSum (total combined arsenate and arsenite concentrations) (0.59-0.78), coupled with high sulfide (up to 5.87 mg/L), were present in the hot spring's pools, which suggested As(III) oxidation occurred. Along the outflow channel, AsSum increased from 5.45 to 13.86 μmol/L, and the combined sulfide and sulfate concentrations increased from 292.02 to 364.28 μmol/L. These increases were primarily attributed to thioarsenic transformation. Temperature, sulfide, As and dissolved oxygen significantly shaped the microbial communities between not only the pools and downstream samples, but also water and sediment samples. Results implied that the upstream Thermocrinis was responsible for the transformation of thioarsenic to As(III) and the downstream Thermus contributed to derived As(III) oxidation. This study improves our understanding of microbially-mediated As transformation in alkaline sulfide-rich hot springs.

  8. Microbial communities and arsenic biogeochemistry at the outflow of an alkaline sulfide-rich hot spring

    PubMed Central

    Jiang, Zhou; Li, Ping; Van Nostrand, Joy D.; Zhang, Ping; Zhou, Jizhong; Wang, Yanhong; Dai, Xinyue; Zhang, Rui; Jiang, Dawei; Wang, Yanxin

    2016-01-01

    Alkaline sulfide-rich hot springs provide a unique environment for microbial community and arsenic (As) biogeochemistry. In this study, a representative alkaline sulfide-rich hot spring, Zimeiquan in the Tengchong geothermal area, was chosen to study arsenic geochemistry and microbial community using Illumina MiSeq sequencing. Over 0.26 million 16S rRNA sequence reads were obtained from 5-paired parallel water and sediment samples along the hot spring’s outflow channel. High ratios of As(V)/AsSum (total combined arsenate and arsenite concentrations) (0.59–0.78), coupled with high sulfide (up to 5.87 mg/L), were present in the hot spring’s pools, which suggested As(III) oxidation occurred. Along the outflow channel, AsSum increased from 5.45 to 13.86 μmol/L, and the combined sulfide and sulfate concentrations increased from 292.02 to 364.28 μmol/L. These increases were primarily attributed to thioarsenic transformation. Temperature, sulfide, As and dissolved oxygen significantly shaped the microbial communities between not only the pools and downstream samples, but also water and sediment samples. Results implied that the upstream Thermocrinis was responsible for the transformation of thioarsenic to As(III) and the downstream Thermus contributed to derived As(III) oxidation. This study improves our understanding of microbially-mediated As transformation in alkaline sulfide-rich hot springs. PMID:27126380

  9. Composition for nucleic acid sequencing

    DOEpatents

    Korlach, Jonas [Ithaca, NY; Webb, Watt W [Ithaca, NY; Levene, Michael [Ithaca, NY; Turner, Stephen [Ithaca, NY; Craighead, Harold G [Ithaca, NY; Foquet, Mathieu [Ithaca, NY

    2008-08-26

    The present invention is directed to a method of sequencing a target nucleic acid molecule having a plurality of bases. In its principle, the temporal order of base additions during the polymerization reaction is measured on a molecule of nucleic acid, i.e. the activity of a nucleic acid polymerizing enzyme on the template nucleic acid molecule to be sequenced is followed in real time. The sequence is deduced by identifying which base is being incorporated into the growing complementary strand of the target nucleic acid by the catalytic activity of the nucleic acid polymerizing enzyme at each step in the sequence of base additions. A polymerase on the target nucleic acid molecule complex is provided in a position suitable to move along the target nucleic acid molecule and extend the oligonucleotide primer at an active site. A plurality of labelled types of nucleotide analogs are provided proximate to the active site, with each distinguishable type of nucleotide analog being complementary to a different nucleotide in the target nucleic acid sequence. The growing nucleic acid strand is extended by using the polymerase to add a nucleotide analog to the nucleic acid strand at the active site, where the nucleotide analog being added is complementary to the nucleotide of the target nucleic acid at the active site. The nucleotide analog added to the oligonucleotide primer as a result of the polymerizing step is identified. The steps of providing labelled nucleotide analogs, polymerizing the growing nucleic acid strand, and identifying the added nucleotide analog are repeated so that the nucleic acid strand is further extended and the sequence of the target nucleic acid is determined.
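
    The deduction step described above, reading the target from the order in which complementary bases are incorporated, can be sketched in a few lines. This is only an illustration of the base-pairing logic under the assumption of error-free detection; the function name is hypothetical and nothing here models the labelled analogs or the optics.

```python
# Watson-Crick pairing for DNA bases.
COMPLEMENT = {"A": "T", "T": "A", "C": "G", "G": "C"}


def template_from_incorporations(incorporated: str) -> str:
    """Infer the template strand (written 5'->3') from the incorporated bases.

    `incorporated` lists the bases in the order they were added to the growing
    strand, i.e. the new strand written 5'->3'. The template is its reverse
    complement.
    """
    return "".join(COMPLEMENT[base] for base in reversed(incorporated.upper()))


if __name__ == "__main__":
    # A polymerase copying the template 5'-ATGC-3' adds G, C, A, T in that order.
    print(template_from_incorporations("GCAT"))
```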

  10. Targeting of Repeated Sequences Unique to a Gene Results in Significant Increases in Antisense Oligonucleotide Potency

    PubMed Central

    Vickers, Timothy A.; Freier, Susan M.; Bui, Huynh-Hoa; Watt, Andrew; Crooke, Stanley T.

    2014-01-01

    A new strategy for identifying potent RNase H-dependent antisense oligonucleotides (ASOs) is presented. Our analysis of the human transcriptome revealed that a significant proportion of genes contain unique repeated sequences of 16 or more nucleotides in length. Activities of ASOs targeting these repeated sites in several representative genes were compared to those of ASOs targeting unique single sites in the same transcript. Antisense activity at repeated sites was also evaluated in a highly controlled minigene system. Targeting both native and minigene repeat sites resulted in significant increases in potency as compared to targeting of non-repeated sites. The increased potency at these sites is a result of increased frequency of ASO/RNA interactions which, in turn, increases the probability of a productive interaction between the ASO/RNA heteroduplex and human RNase H1 in the cell. These results suggest a new, highly efficient strategy for rapid identification of highly potent ASOs. PMID:25334092
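
    The search for repeated sites of 16 or more nucleotides within a transcript can be illustrated by indexing every 16-mer and keeping those that occur more than once. This is a minimal sketch, not the authors' pipeline: the function names are hypothetical, and the check that a repeat is unique to one gene across the whole transcriptome is omitted.

```python
from collections import defaultdict

# DNA-style complement table used to write the antisense oligonucleotide.
_COMPLEMENT = str.maketrans("ACGT", "TGCA")


def repeated_sites(transcript: str, k: int = 16) -> dict:
    """Map each k-mer that occurs at least twice to its 0-based start positions."""
    positions = defaultdict(list)
    for start in range(len(transcript) - k + 1):
        positions[transcript[start:start + k]].append(start)
    return {kmer: hits for kmer, hits in positions.items() if len(hits) > 1}


def antisense(site: str) -> str:
    """Return the reverse complement of a target site (DNA alphabet)."""
    return site.translate(_COMPLEMENT)[::-1]


if __name__ == "__main__":
    unit = "ACGTTGCAAGCTTAGC"          # made-up 16-nt site
    transcript = unit + "GGGG" + unit  # the site appears twice
    for site, hits in repeated_sites(transcript).items():
        print(site, hits, antisense(site))
```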

  11. Targeted or whole genome sequencing of formalin fixed tissue samples: potential applications in cancer genomics.

    PubMed

    Munchel, Sarah; Hoang, Yen; Zhao, Yue; Cottrell, Joseph; Klotzle, Brandy; Godwin, Andrew K; Koestler, Devin; Beyerlein, Peter; Fan, Jian-Bing; Bibikova, Marina; Chien, Jeremy

    2015-09-22

    Current genomic studies are limited by the poor availability of fresh-frozen tissue samples. Although formalin-fixed diagnostic samples are in abundance, they are seldom used in current genomic studies because of the concern of formalin-fixation artifacts. Better characterization of these artifacts will allow the use of archived clinical specimens in translational and clinical research studies. To provide a systematic analysis of formalin-fixation artifacts on Illumina sequencing, we generated 26 DNA sequencing data sets from 13 pairs of matched formalin-fixed paraffin-embedded (FFPE) and fresh-frozen (FF) tissue samples. The results indicate high rate of concordant calls between matched FF/FFPE pairs at reference and variant positions in three commonly used sequencing approaches (whole genome, whole exome, and targeted exon sequencing). Global mismatch rates and C · G > T · A substitutions were comparable between matched FF/FFPE samples, and discordant rates were low (<0.26%) in all samples. Finally, low-pass whole genome sequencing produces similar pattern of copy number alterations between FF/FFPE pairs. The results from our studies suggest the potential use of diagnostic FFPE samples for cancer genomic studies to characterize and catalog variations in cancer genomes.

  12. PACCMIT/PACCMIT-CDS: identifying microRNA targets in 3' UTRs and coding sequences.

    PubMed

    Šulc, Miroslav; Marín, Ray M; Robins, Harlan S; Vaníček, Jiří

    2015-07-01

    The purpose of the proposed web server, publicly available at http://paccmit.epfl.ch, is to provide a user-friendly interface to two algorithms for predicting messenger RNA (mRNA) molecules regulated by microRNAs: (i) PACCMIT (Prediction of ACcessible and/or Conserved MIcroRNA Targets), which identifies primarily mRNA transcripts targeted in their 3' untranslated regions (3' UTRs), and (ii) PACCMIT-CDS, designed to find mRNAs targeted within their coding sequences (CDSs). While PACCMIT belongs among the accurate algorithms for predicting conserved microRNA targets in the 3' UTRs, the main contribution of the web server is 2-fold: PACCMIT provides an accurate tool for predicting targets also of weakly conserved or non-conserved microRNAs, whereas PACCMIT-CDS addresses the lack of similar portals adapted specifically for targets in CDS. The web server asks the user for microRNAs and mRNAs to be analyzed, accesses the precomputed P-values for all microRNA-mRNA pairs from a database for all mRNAs and microRNAs in a given species, ranks the predicted microRNA-mRNA pairs, evaluates their significance according to the false discovery rate and finally displays the predictions in a tabular form. The results are also available for download in several standard formats. © The Author(s) 2015. Published by Oxford University Press on behalf of Nucleic Acids Research.
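
    The ranking step described above (P-values per microRNA-mRNA pair, then significance by false discovery rate) can be sketched as follows. The abstract does not say which FDR procedure the server uses; the Benjamini-Hochberg adjustment below is an assumption chosen for illustration, and the function names and pair labels are hypothetical.

```python
def benjamini_hochberg(pvalues):
    """Return Benjamini-Hochberg adjusted P-values (q-values) in input order."""
    m = len(pvalues)
    order = sorted(range(m), key=lambda i: pvalues[i])
    qvalues = [0.0] * m
    running_min = 1.0
    # Walk from the largest P-value down so each q-value is monotone in rank.
    for rank in range(m, 0, -1):
        index = order[rank - 1]
        running_min = min(running_min, pvalues[index] * m / rank)
        qvalues[index] = running_min
    return qvalues


def rank_pairs(pair_pvalues):
    """Return (pair, q-value) tuples sorted from most to least significant."""
    pairs = list(pair_pvalues)
    qvalues = benjamini_hochberg([pair_pvalues[pair] for pair in pairs])
    return sorted(zip(pairs, qvalues), key=lambda item: item[1])


if __name__ == "__main__":
    # Made-up P-values for illustration only.
    example = {("miR-a", "mRNA-1"): 0.01, ("miR-a", "mRNA-2"): 0.04,
               ("miR-b", "mRNA-1"): 0.03, ("miR-b", "mRNA-3"): 0.005}
    for pair, q in rank_pairs(example):
        print(pair, round(q, 3))
```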

  13. Targeted Sequencing of Venom Genes from Cone Snail Genomes Improves Understanding of Conotoxin Molecular Evolution

    PubMed Central

    Mahardika, Gusti N

    2018-01-01

    To expand our capacity to discover venom sequences from the genomes of venomous organisms, we applied targeted sequencing techniques to selectively recover venom gene superfamilies and nontoxin loci from the genomes of 32 cone snail species (family, Conidae), a diverse group of marine gastropods that capture their prey using a cocktail of neurotoxic peptides (conotoxins). We were able to successfully recover conotoxin gene superfamilies across all species with high confidence (> 100× coverage) and used these data to provide new insights into conotoxin evolution. First, we found that conotoxin gene superfamilies are composed of one to six exons and are typically short in length (mean = ∼85 bp). Second, we expanded our understanding of the following genetic features of conotoxin evolution: 1) positive selection, where exons coding the mature toxin region were often three times more divergent than their adjacent noncoding regions, 2) expression regulation, with comparisons to transcriptome data showing that cone snails only express a fraction of the genes available in their genome (24–63%), and 3) extensive gene turnover, where Conidae species varied from 120 to 859 conotoxin gene copies. Finally, using comparative phylogenetic methods, we found that while diet specificity did not predict patterns of conotoxin evolution, dietary breadth was positively correlated with total conotoxin gene diversity. Overall, the targeted sequencing technique demonstrated here has the potential to radically increase the pace at which venom gene families are sequenced and studied, reshaping our ability to understand the impact of genetic changes on ecologically relevant phenotypes and subsequent diversification. PMID:29514313

  14. Targeted next-generation sequencing in chronic lymphocytic leukemia: a high-throughput yet tailored approach will facilitate implementation in a clinical setting.

    PubMed

    Sutton, Lesley-Ann; Ljungström, Viktor; Mansouri, Larry; Young, Emma; Cortese, Diego; Navrkalova, Veronika; Malcikova, Jitka; Muggen, Alice F; Trbusek, Martin; Panagiotidis, Panagiotis; Davi, Frederic; Belessi, Chrysoula; Langerak, Anton W; Ghia, Paolo; Pospisilova, Sarka; Stamatopoulos, Kostas; Rosenquist, Richard

    2015-03-01

    Next-generation sequencing has revealed novel recurrent mutations in chronic lymphocytic leukemia, particularly in patients with aggressive disease. Here, we explored targeted re-sequencing as a novel strategy to assess the mutation status of genes with prognostic potential. To this end, we utilized HaloPlex targeted enrichment technology and designed a panel including nine genes: ATM, BIRC3, MYD88, NOTCH1, SF3B1 and TP53, which have been linked to the prognosis of chronic lymphocytic leukemia, and KLHL6, POT1 and XPO1, which are less characterized but were found to be recurrently mutated in various sequencing studies. A total of 188 chronic lymphocytic leukemia patients with poor prognostic features (unmutated IGHV, n=137; IGHV3-21 subset #2, n=51) were sequenced on the HiSeq 2000 and data were analyzed using well-established bioinformatics tools. Using a conservative cutoff of 10% for the mutant allele, we found that 114/180 (63%) patients carried at least one mutation, with mutations in ATM, BIRC3, NOTCH1, SF3B1 and TP53 accounting for 149/177 (84%) of all mutations. We selected 155 mutations for Sanger validation (variant allele frequency, 10-99%) and 93% (144/155) of mutations were confirmed; notably, all 11 discordant variants had a variant allele frequency between 11-27%, hence at the detection limit of conventional Sanger sequencing. Technical precision was assessed by repeating the entire HaloPlex procedure for 63 patients; concordance was found for 77/82 (94%) mutations. In summary, this study demonstrates that targeted next-generation sequencing is an accurate and reproducible technique potentially suitable for routine screening, eventually as a stand-alone test without the need for confirmation by Sanger sequencing. Copyright© Ferrata Storti Foundation.

  15. Unrelated sequences at the 5' end of mouse LINE-1 repeated elements define two distinct subfamilies.

    PubMed Central

    Wincker, P; Jubier-Maurin, V; Roizès, G

    1987-01-01

    Some full length members of the mouse long interspersed repeated DNA family L1Md have been shown to be associated at their 5' end with a variable number of tandem repetitions, the A repeats, that have been suggested to be transcription controlling elements. We report that the other type of repeat, named F, found at the 5' end of a few L1 elements is also an integral part of full length L1 copies. Sequencing shows that the F repeats are GC rich, and organized in tandem. The L1 copies associated with either A or F repeats can be correlated with two different subsets of L1 sequences distinguished by a series of variant nucleotides specific to each and by unassociated but frequent restriction sites. These findings suggest that sequence replacement has occurred at least once in 5' of L1Md, and is related to the generation of specific subfamilies. PMID:3684566

  16. Clinical Validation of Targeted Next Generation Sequencing for Colon and Lung Cancers

    PubMed Central

    D’Haene, Nicky; Le Mercier, Marie; De Nève, Nancy; Blanchard, Oriane; Delaunoy, Mélanie; El Housni, Hakim; Dessars, Barbara; Heimann, Pierre; Remmelink, Myriam; Demetter, Pieter; Tejpar, Sabine; Salmon, Isabelle

    2015-01-01

    Objective: Recently, Next Generation Sequencing (NGS) has begun to supplant other technologies for gene mutation testing that is now required for targeted therapies. However, transfer of NGS technology to clinical daily practice requires validation. Methods: We validated the Ion Torrent AmpliSeq Colon and Lung cancer panel interrogating 1850 hotspots in 22 genes using the Ion Torrent Personal Genome Machine. First, we used commercial reference standards that carry mutations at defined allelic frequency (AF). Then, 51 colorectal adenocarcinomas (CRC) and 39 non small cell lung carcinomas (NSCLC) were retrospectively analyzed. Results: Sensitivity and accuracy for detecting variants at an AF >4% was 100% for commercial reference standards. Among the 90 cases, 89 (98.9%) were successfully sequenced. Among the 86 samples for which NGS and the reference test were both informative, 83 showed concordant results between NGS and the reference test; i.e. KRAS and BRAF for CRC and EGFR for NSCLC, with the 3 discordant cases each characterized by an AF <10%. Conclusions: Overall, the AmpliSeq colon/lung cancer panel was specific and sensitive for mutation analysis of gene panels and can be incorporated into clinical daily practice. PMID:26366557

  17. In and out of the minor groove: interaction of an AT-rich DNA with the drug CD27

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Acosta-Reyes, Francisco J.; Dardonville, Christophe; Koning, Harry P. de

    New features of an antiprotozoal DNA minor-groove binding drug, which acts as a cross-linking agent, are presented. It also fills the minor groove of DNA completely and prevents the access of proteins. These features are also expected for other minor-groove binding drugs when associated with suitable DNA targets. The DNA of several pathogens is very rich in AT base pairs. Typical examples include the malaria parasite Plasmodium falciparum and the causative agents of trichomoniasis and trypanosomiases. This fact has prompted studies of drugs which interact with the minor groove of DNA, some of which are used in medical practice. Previous studies have been performed almost exclusively with the AATT sequence. New features should be uncovered through the study of different DNA sequences. In this paper, the crystal structure of the complex of the DNA duplex d(AAAATTTT)2 with the dicationic drug 4,4′-bis(imidazolinylamino)diphenylamine (CD27) is presented. The drug binds to the minor groove of DNA as expected, but it shows two new features that have not previously been described: (i) the drugs protrude from the DNA and interact with neighbouring molecules, so that they may act as cross-linking agents, and (ii) the drugs completely cover the whole minor groove of DNA and displace bound water. Thus, they may prevent the access to DNA of proteins such as AT-hook proteins. These features are also expected for other minor-groove binding drugs when associated with all-AT DNA. These findings allow a better understanding of this family of compounds and will help in the development of new, more effective drugs. New data on the biological interaction of CD27 with the causative agent of trichomoniasis, Trichomonas vaginalis, are also reported.

  18. Cryptosporidium in fish: alternative sequencing approaches and analyses at multiple loci to resolve mixed infections.

    PubMed

    Paparini, Andrea; Yang, Rongchang; Chen, Linda; Tong, Kaising; Gibson-Kueh, Susan; Lymbery, Alan; Ryan, Una M

    2017-11-01

    Currently, the systematics, biology and epidemiology of piscine Cryptosporidium species are poorly understood. Here, we compared Sanger and next-generation sequencing (NGS) of piscine Cryptosporidium at the 18S rRNA and actin genes. The hosts comprised 11 ornamental fish species, spanning four orders and eight families. The objectives were to: (i) confirm the rich genetic diversity of the parasite and the high frequency of mixed infections; and (ii) explore the potential of NGS in the presence of complex genetic mixtures. By Sanger sequencing, four main genotypes were obtained at the actin locus, while for the 18S locus, seven genotypes were identified. At both loci, NGS revealed frequent mixed infections, consisting of one highly dominant variant plus substantially rarer genotypes. Both sequencing methods detected novel Cryptosporidium genotypes at both loci, including a novel and highly abundant actin genotype that was identified by both Sanger sequencing and NGS. Importantly, this genotype accounted for 68.9% of all NGS reads from all samples (249 585/362 372). The present study confirms that aquarium fish can harbour a large and unexplored Cryptosporidium genetic diversity. Although commonly used in molecular parasitology studies, nested PCR prevents quantitative comparisons and thwarts the advantages of NGS, when this latter approach is used to investigate multiple infections.

  19. Functional Gene Analysis of Freshwater Iron-Rich Flocs at Circumneutral pH and Isolation of a Stalk-Forming Microaerophilic Iron-Oxidizing Bacterium

    PubMed Central

    Chan, Clara; Itoh, Takashi; Ohkuma, Moriya

    2013-01-01

    Iron-rich flocs often occur where anoxic water containing ferrous iron encounters oxygenated environments. Culture-independent molecular analyses have revealed the presence of 16S rRNA gene sequences related to diverse bacteria, including autotrophic iron oxidizers and methanotrophs in iron-rich flocs; however, the metabolic functions of the microbial communities remain poorly characterized, particularly regarding carbon cycling. In the present study, we cultivated iron-oxidizing bacteria (FeOB) and performed clone library analyses of functional genes related to carbon fixation and methane oxidization (cbbM and pmoA, respectively), in addition to bacterial and archaeal 16S rRNA genes, in freshwater iron-rich flocs at groundwater discharge points. The analyses of 16S rRNA, cbbM, and pmoA genes strongly suggested the coexistence of autotrophic iron oxidizers and methanotrophs in the flocs. Furthermore, a novel stalk-forming microaerophilic FeOB, strain OYT1, was isolated and characterized phylogenetically and physiologically. The 16S rRNA and cbbM gene sequences of OYT1 are related to those of other microaerophilic FeOB in the family Gallionellaceae, of the Betaproteobacteria, isolated from freshwater environments at circumneutral pH. The physiological characteristics of OYT1 will help elucidate the ecophysiology of microaerophilic FeOB. Overall, this study demonstrates functional roles of microorganisms in iron flocs, suggesting several possible linkages between Fe and C cycling. PMID:23811518

  20. Retrotransposon insertion targeting: a mechanism for homogenization of centromere sequences on nonhomologous chromosomes.

    PubMed

    Birchler, James A; Presting, Gernot G

    2012-04-01

    The centromeres of most eukaryotic organisms consist of highly repetitive arrays that are similar across nonhomologous chromosomes. These sequences evolve rapidly, thus posing a mystery as to how such arrays can be homogenized. Recent work in species in which centromere-enriched retrotransposons occur indicates that these elements preferentially insert into the centromeric regions. In two different Arabidopsis species, a related element was recognized in which the specificity for such targeting was altered. These observations provide a partial explanation for how homogenization of centromere DNA sequences occurs.

  1. GC-Rich DNA Elements Enable Replication Origin Activity in the Methylotrophic Yeast Pichia pastoris

    PubMed Central

    Liachko, Ivan; Youngblood, Rachel A.; Tsui, Kyle; Bubb, Kerry L.; Queitsch, Christine; Raghuraman, M. K.; Nislow, Corey; Brewer, Bonita J.; Dunham, Maitreya J.

    2014-01-01

    The well-studied DNA replication origins of the model budding and fission yeasts are A/T-rich elements. However, unlike their yeast counterparts, both plant and metazoan origins are G/C-rich and are associated with transcription start sites. Here we show that an industrially important methylotrophic budding yeast, Pichia pastoris, simultaneously employs at least two types of replication origins—a G/C-rich type associated with transcription start sites and an A/T-rich type more reminiscent of typical budding and fission yeast origins. We used a suite of massively parallel sequencing tools to map and dissect P. pastoris origins comprehensively, to measure their replication dynamics, and to assay the global positioning of nucleosomes across the genome. Our results suggest that some functional overlap exists between promoter sequences and G/C-rich replication origins in P. pastoris and imply an evolutionary bifurcation of the modes of replication initiation. PMID:24603708

  2. GC-rich DNA elements enable replication origin activity in the methylotrophic yeast Pichia pastoris.

    PubMed

    Liachko, Ivan; Youngblood, Rachel A; Tsui, Kyle; Bubb, Kerry L; Queitsch, Christine; Raghuraman, M K; Nislow, Corey; Brewer, Bonita J; Dunham, Maitreya J

    2014-03-01

    The well-studied DNA replication origins of the model budding and fission yeasts are A/T-rich elements. However, unlike their yeast counterparts, both plant and metazoan origins are G/C-rich and are associated with transcription start sites. Here we show that an industrially important methylotrophic budding yeast, Pichia pastoris, simultaneously employs at least two types of replication origins--a G/C-rich type associated with transcription start sites and an A/T-rich type more reminiscent of typical budding and fission yeast origins. We used a suite of massively parallel sequencing tools to map and dissect P. pastoris origins comprehensively, to measure their replication dynamics, and to assay the global positioning of nucleosomes across the genome. Our results suggest that some functional overlap exists between promoter sequences and G/C-rich replication origins in P. pastoris and imply an evolutionary bifurcation of the modes of replication initiation.

  3. Site-Specific Targeting of Platelet-Rich Plasma via Superparamagnetic Nanoparticles

    PubMed Central

    Talaie, Tara; Pratt, Stephen J.P.; Vanegas, Camilo; Xu, Su; Henn, R. Frank; Yarowsky, Paul; Lovering, Richard M.

    2015-01-01

    Background: Muscle strains are one of the most common injuries treated by physicians. Standard conservative therapy for acute muscle strains usually involves short-term rest, ice, and nonsteroidal anti-inflammatory medications, but there is no clear consensus regarding treatments to accelerate recovery. Recently, clinical use of platelet-rich plasma (PRP) has gained momentum as an option for therapy and is appealing for many reasons, most notably because it provides growth factors in physiological proportions and it is autologous, safe, easily accessible, and potentially beneficial. Local delivery of PRP to injured muscles can hasten recovery of function. However, specific targeting of PRP to sites of tissue damage in vivo is a major challenge that can limit its efficacy. Hypothesis: Location of PRP delivery can be monitored and controlled in vivo with noninvasive tools. Study Design: Controlled laboratory study. Methods: Superparamagnetic iron oxide nanoparticles (SPIONs) can be visualized by both magnetic resonance imaging (MRI) (in vivo) and fluorescence microscopy (after tissue harvesting). PRP was labeled with SPIONs and administered by intramuscular injections of SPION-containing platelets. MRI was used to monitor the ability to manipulate and retain the location of PRP in vivo by placement of an external magnet. Platelets were isolated from whole blood and incubated with SPIONs. Following SPION incubation with PRP, a magnetic field was used to manipulate platelet location in culture dishes. In vivo, the tibialis anterior (TA) muscles of anesthetized Sprague-Dawley rats were injected with SPION-containing platelets, and MRI was used to track platelet position with and without a magnet worn over the TA muscles for 4 days. Results: The method used to isolate PRP yielded a high concentration (almost 4-fold increase) of platelets. In vitro experiments showed that the platelets successfully took up SPIONs and then rapidly responded to an applied magnetic field

  4. Mg-Spinel-rich lithology at crater Endymion in the lunar nearside

    NASA Astrophysics Data System (ADS)

    Bhattacharya, Satadru; Chauhan, Prakash; Ajai, A.

    2012-07-01

    The recent discovery of a Mg-Spinel-rich lithology at the inner ring of Mare Moscoviense (a farside mare) by [1, 2] based on the analysis of high-resolution Moon Mineralogy Mapper (M3) data from Chandrayaan-1, has stimulated interest in studying and identifying more and more such rock types across the lunar surface as spinel-rich lithologies and OOS (Orthopyroxene-Olivine-Spinel) suites of rocks hold the key to understand the deeper crustal composition and processes of the Moon. The genesis of this spinel-rich rare and unusual lithology on the lunar surface is yet to be understood by the lunar scientists. [3-6] has reported the occurrence of Mg-Spinel-rich lithology at the central peaks of crater Theophilus. The Mg-spinel-rich lithology at Theophilus is found to occur in association with mafic-free plagioclase and associated with lesser exposures of pyroxene and olivine-bearing materials. In a very recent work, [7] has identified Mg-spinel rich lithology at the floor of crater Copernicus. Very recently [8] has reported presence of Mg-spinel-rich lithology at the central peak of crater Tycho in association with olivine, crystalline plagioclase and high-Ca pyroxenes. All these detections are restricted within very small areal extents. Here, we report a new identification of this Mg-spinel-rich lithology at the rim of crater Endymion situated near the northeast limb of the Moon at the nearside using high-resolution M3 data. In Endymion, Mg-spinel-rich lithology occurs in close association with orthopyroxene-olivine assemblages and therefore represent OOS lithological suite of rocks. Spectral signature of Mg-spinel-rich lithology at the rim of crater Endymion: Spectra of Mg-spinel lacks 1000-nm absorption feature and is characterised by a strong absorption near 2000 nm arising due to the small amounts of Fe2+ in the tetrahedral crystallographic site of the mineral. Spectral signature of Mg-spinel-rich lithology, as obtained from the southern rim of crater Endymion

  5. Quantifying Genome Editing Outcomes at Endogenous Loci using SMRT Sequencing

    PubMed Central

    Clark, Joseph; Punjya, Niraj; Sebastiano, Vittorio; Bao, Gang; Porteus, Matthew H

    2014-01-01

    Targeted genome editing with engineered nucleases has transformed the ability to introduce precise sequence modifications at almost any site within the genome. A major obstacle to probing the efficiency and consequences of genome editing is that no existing method enables the frequency of different editing events to be simultaneously measured across a cell population at any endogenous genomic locus. We have developed a novel method for quantifying individual genome editing outcomes at any site of interest using single molecule real time (SMRT) DNA sequencing. We show that this approach can be applied at various loci, using multiple engineered nuclease platforms including TALENs, RNA guided endonucleases (CRISPR/Cas9), and ZFNs, and in different cell lines to identify conditions and strategies in which the desired engineering outcome has occurred. This approach facilitates the evaluation of new gene editing technologies and permits sensitive quantification of editing outcomes in almost every experimental system used. PMID:24685129

  6. Controlling the prion propensity of glutamine/asparagine-rich proteins

    PubMed Central

    Paul, Kacy R; Ross, Eric D

    2015-01-01

    The yeast Saccharomyces cerevisiae can harbor a number of distinct prions. Most of the yeast prion proteins contain a glutamine/asparagine (Q/N) rich region that drives prion formation. Prion-like domains, defined as regions with high compositional similarity to yeast prion domains, are common in eukaryotic proteomes, and mutations in various human proteins containing prion-like domains have been linked to degenerative diseases, including amyotrophic lateral sclerosis. Here, we discuss a recent study in which we utilized two strategies to generate prion activity in non-prion Q/N-rich domains. First, we made targeted mutations in four non-prion Q/N-rich domains, replacing predicted prion-inhibiting amino acids with prion-promoting amino acids. All four mutants formed foci when expressed in yeast, and two acquired bona fide prion activity. Prion activity could be generated with as few as two mutations, suggesting that many non-prion Q/N-rich proteins may be just a small number of mutations from acquiring aggregation or prion activity. Second, we created tandem repeats of short prion-prone segments, and observed length-dependent prion activity. These studies demonstrate the considerable progress that has been made in understanding the sequence basis for aggregation of prion and prion-like domains, and suggest possible mechanisms by which new prion domains could evolve. PMID:26555096

  7. Targeted next generation sequencing for the detection of ciprofloxacin resistance markers using molecular inversion probes

    DTIC Science & Technology

    2016-07-06

    Targeted next-generation sequencing for the detection of ciprofloxacin resistance markers using molecular inversion probes Christopher P... development and evaluation of a panel of 44 single-stranded molecular inversion probes (MIPs) coupled to next-generation sequencing (NGS) for the... padlock and molecular inversion probes as upfront enrichment steps for use with NGS showed the specificity and multiplexability of these techniques

  8. DNA sequence analysis of ARS elements from chromosome III of Saccharomyces cerevisiae: identification of a new conserved sequence.

    PubMed Central

    Palzkill, T G; Oliver, S G; Newlon, C S

    1986-01-01

    Four fragments of Saccharomyces cerevisiae chromosome III DNA which carry ARS elements have been sequenced. Each fragment contains multiple copies of sequences that have at least 10 out of 11 bases of homology to a previously reported 11 bp core consensus sequence. A survey of these new ARS sequences and previously reported sequences revealed the presence of an additional 11 bp conserved element located on the 3' side of the T-rich strand of the core consensus. Subcloning analysis as well as deletion and transposon insertion mutagenesis of ARS fragments support a role for the 3' conserved sequence in promoting ARS activity. PMID:3529036
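
    The "at least 10 out of 11 bases" criterion can be illustrated with a short scan for near matches to an 11 bp consensus. This is only a sketch under stated assumptions: the abstract does not spell out the core consensus, so the commonly cited ARS consensus WTTTAYRTTTW (W = A/T, Y = C/T, R = A/G) is assumed here, and the test fragment is invented for illustration. Only the given strand is scanned.

```python
# IUPAC codes needed for the assumed consensus.
IUPAC = {"A": "A", "C": "C", "G": "G", "T": "T", "W": "AT", "Y": "CT", "R": "AG"}

# Assumption: the abstract does not state the consensus; this is the commonly cited one.
CORE_CONSENSUS = "WTTTAYRTTTW"

def near_matches(seq, consensus=CORE_CONSENSUS, min_matches=10):
    """Yield (position, window, matches) for every window with >= min_matches matching bases."""
    seq = seq.upper()
    k = len(consensus)
    for i in range(len(seq) - k + 1):
        window = seq[i:i + k]
        # A base matches if it is one of the bases allowed by the IUPAC code at that position.
        matches = sum(base in IUPAC[code] for base, code in zip(window, consensus))
        if matches >= min_matches:
            yield i, window, matches

# Hypothetical fragment (not from the paper): one perfect match flanked by GC-rich bases.
fragment = "GGCCATTTATATTTAGGCC"
for pos, window, matches in near_matches(fragment):
    print(pos, window, f"{matches}/11")
```

    To cover both strands, the same function could be run on the reverse complement of the fragment.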

  9. Alanine rich peptide from Populus trichocarpa inhibit growth of Staphylococcus aureus via targetting its extracellular domain of Sensor Histidine Kinase YycGex protein.

    PubMed

    Al Akeel, Raid; Mateen, Ayesha; Syed, Rabbani; Alqahtani, Mohammed S; Alqahtani, Ali S

    2018-05-22

    Owing to growing concern about microbial resistance, the search for novel bioactive compounds such as peptides is on the rise. The aim of this study was to evaluate the antimicrobial effect of Populus trichocarpa extract, chemically identify the active peptide fraction and find its target in Staphylococcus aureus. The active fraction of the P. trichocarpa crude extract was purified and characterized using MS/MS. The antimicrobial activity of this peptide, PT13, was confirmed by an in vitro agar-based disk diffusion assay and an in vivo G. mellonella infection model. Proteomic expression in S. aureus under the influence of PT13 was analyzed using LTQ-Orbitrap-MS with in-solution digestion, and the target protein was identified and its expression quantified using the label-free approach of Progenesis QI software. A docking study of peptide PT13 with its target YycG protein was performed using CABS-dock. The sequence of the active fraction PT13 was identified as KVPVAAAAAAAAAVVASSMVVAAAK, comprising 25 amino acids, including 13 alanines, with an m/z of 2194.2469. PT13 uniformly inhibited the growth of S. aureus SA91, with an MIC of 16 μg/mL for this strain. Sensor histidine kinase (YycG) was the most significant target found to be differentially expressed under the influence of PT13. G. mellonella larvae were killed rapidly by S. aureus infection, whereas deaths in the protected group were insignificant compared with the control. Docking yielded ten models, with an RMSD value of 1.89 for cluster 1 and an RMSD value of 3.95 for cluster 2, which is predicted to be a high-quality model. The alanine-rich peptide could be useful as an antimicrobial peptide targeting the extracellular domain of the sensor histidine kinase YycG from the S. aureus strain used in the study. Copyright © 2018 Elsevier Ltd. All rights reserved.
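
    The residue counts stated for PT13 can be checked directly from the sequence given in the abstract. The snippet below is a minimal sketch that only tallies composition; the variable names are illustrative, and it does not attempt to reproduce the reported m/z value.

```python
from collections import Counter

# Peptide sequence exactly as given in the abstract (one-letter amino acid codes).
PT13 = "KVPVAAAAAAAAAVVASSMVVAAAK"

composition = Counter(PT13)
length = len(PT13)
alanines = composition["A"]

print("length:", length)
print("alanine residues:", alanines)
print(f"alanine fraction: {alanines / length:.0%}")

# Full composition, most frequent residue first.
for residue, count in composition.most_common():
    print(residue, count)
```

    Run as written, the tally agrees with the abstract: 25 residues, 13 of them alanine.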

  10. Plastoglobule-Targeting Competence of a Putative Transit Peptide Sequence from Rice Phytoene Synthase 2 in Plastids.

    PubMed

    You, Min Kyoung; Kim, Jin Hwa; Lee, Yeo Jin; Jeong, Ye Sol; Ha, Sun-Hwa

    2016-12-22

    Plastoglobules (PGs) are thylakoid membrane microdomains within plastids that are known as specialized locations of carotenogenesis. Three rice phytoene synthase proteins (OsPSYs) involved in carotenoid biosynthesis have been identified. Here, the N-terminal 80-amino-acid portion of OsPSY2 (PTp) was demonstrated to be a chloroplast-targeting peptide by displaying cytosolic localization of OsPSY2(ΔPTp):mCherry in rice protoplast, in contrast to chloroplast localization of OsPSY2:mCherry in a punctate pattern. The peptide sequence of PTp was predicted to harbor two transmembrane domains eligible for a putative PG-targeting signal. To assess and enhance the PG-targeting ability of PTp, the original PTp DNA sequence (PTp) was modified to a synthetic DNA sequence (stPTp), which had 84.4% similarity to the original sequence. The motivation of this modification was to reduce the GC ratio from 75% to 65% and to disentangle the hairpin loop structures of PTp. These two DNA sequences were fused to the sequence of the synthetic green fluorescent protein (sGFP) and drove GFP expression with different efficiencies. In particular, the RNA and protein levels of stPTp-sGFP were slightly improved to 1.4-fold and 1.3-fold more than those of sGFP, respectively. The green fluorescent signals of their mature proteins were all observed as speckle-like patterns with slightly blurred stromal signals in chloroplasts. These discrete green speckles of PTp-sGFP and stPTp-sGFP corresponded exactly to the red fluorescent signal displayed by OsPSY2:mCherry in both etiolated and greening protoplasts and it is presumed to correspond to distinct PGs. In conclusion, we identified PTp as a transit peptide sequence facilitating preferential translocation of foreign proteins to PGs, and developed an improved PTp sequence, stPTp, which is expected to be very useful for applications in plant biotechnologies requiring precise micro-compartmental localization in plastids.
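
    The "GC ratio" targeted by the redesign is the fraction of G and C bases in the DNA sequence. A minimal sketch of that calculation follows; the function name and both 20-nt sequences are invented for illustration (they are not the real PTp or stPTp sequences) and were chosen only so that the toy values land on 75% and 65%.

```python
def gc_fraction(dna):
    """Return the fraction of G and C among the A/C/G/T bases of a DNA string."""
    dna = dna.upper()
    acgt = sum(dna.count(base) for base in "ACGT")
    if acgt == 0:
        raise ValueError("sequence contains no A/C/G/T bases")
    return (dna.count("G") + dna.count("C")) / acgt

# Hypothetical toy sequences, NOT the published PTp / stPTp sequences.
original_like = "GCGCCGGCGCATGCCGGATA"
recoded_like = "GCGCAGGAGCATGCCGGATA"

print(f"original-like GC: {gc_fraction(original_like):.0%}")
print(f"recoded-like GC:  {gc_fraction(recoded_like):.0%}")
```

    A real redesign would also need to keep the encoded amino acids unchanged (synonymous codon substitution), which this sketch does not check.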

  11. Molecular analysis of the anaerobic rumen fungus Orpinomyces - insights into an AT-rich genome.

    PubMed

    Nicholson, Matthew J; Theodorou, Michael K; Brookman, Jayne L

    2005-01-01

    The anaerobic gut fungi occupy a unique niche in the intestinal tract of large herbivorous animals and are thought to act as primary colonizers of plant material during digestion. They are the only known obligately anaerobic fungi but molecular analysis of this group has been hampered by difficulties in their culture and manipulation, and by their extremely high A+T nucleotide content. This study begins to answer some of the fundamental questions about the structure and organization of the anaerobic gut fungal genome. Directed plasmid libraries using genomic DNA digested with highly or moderately rich AT-specific restriction enzymes (VspI and EcoRI) were prepared from a polycentric Orpinomyces isolate. Clones were sequenced from these libraries and the breadth of genomic inserts, both genic and intergenic, was characterized. Genes encoding numerous functions not previously characterized for these fungi were identified, including cytoskeletal, secretory pathway and transporter genes. A peptidase gene with no introns and having sequence similarity to a gene encoding a bacterial peptidase was also identified, extending the range of metabolic enzymes resulting from apparent trans-kingdom transfer from bacteria to fungi, as previously characterized largely for genes encoding plant-degrading enzymes. This paper presents the first thorough analysis of the genic, intergenic and rDNA regions of a variety of genomic segments from an anaerobic gut fungus and provides observations on rules governing intron boundaries, the codon biases observed with different types of genes, and the sequence of only the second anaerobic gut fungal promoter reported. Large numbers of retrotransposon sequences of different types were found and the authors speculate on the possible consequences of any such transposon activity in the genome. The coding sequences identified included several orphan gene sequences, including one with regions strongly suggestive of structural proteins such as collagens

  12. UniDrug-target: a computational tool to identify unique drug targets in pathogenic bacteria.

    PubMed

    Chanumolu, Sree Krishna; Rout, Chittaranjan; Chauhan, Rajinder S

    2012-01-01

    Targeting conserved proteins of bacteria through antibacterial medications has resulted in both the development of resistant strains and changes to human health by destroying beneficial microbes which eventually become breeding grounds for the evolution of resistances. Despite the availability of more than 800 genome sequences, 430 pathways, 4743 enzymes, 9257 metabolic reactions and three-dimensional (3D) protein structures in bacteria, no pathogen-specific computational drug target identification tool has been developed. A web server, UniDrug-Target, which combines bacterial biological information and computational methods to stringently identify pathogen-specific proteins as drug targets, has been designed. Besides predicting pathogen-specific proteins' essentiality, chokepoint property, etc., three new algorithms were developed and implemented by using protein sequences, domains, structures, and metabolic reactions for construction of partial metabolic networks (PMNs), determination of conservation in critical residues, and variation analysis of residues forming similar cavities in protein sequences. First, PMNs are constructed to determine the extent of disturbances in metabolite production caused by targeting a protein as a drug target. Second, conservation of the critical residues of pathogen-specific proteins involved in cavity formation and biological function is determined at the domain level with low-matching sequences. Last, variation analysis of residues forming similar cavities in protein sequences from pathogenic versus non-pathogenic bacteria and humans is performed. The server is capable of predicting drug targets for any sequenced pathogenic bacteria having FASTA sequences and annotated information. The utility of the UniDrug-Target server was demonstrated for Mycobacterium tuberculosis (H37Rv). UniDrug-Target identified 265 mycobacteria pathogen-specific proteins, including 17 essential proteins which can be potential drug targets. UniDrug-Target is expected to accelerate

  13. Olivine-rich asteroids in the near-Earth space

    NASA Astrophysics Data System (ADS)

    Popescu, Marcel; Perna, D.; Barucci, M. A.; Fornasier, S.; Doressoundiram, A.; Lantz, C.; Merlin, F.; Belskaya, I. N.; Fulchignoni, M.

    2018-06-01

    In the framework of a 30-night spectroscopic survey of small near-Earth asteroids (NEAs), we present new results regarding the identification of olivine-rich objects. The following NEAs were classified as A-type using visible spectra obtained with the 3.6-m New Technology Telescope: (293726) 2007 RQ17, (444584) 2006 UK, 2012 NP, 2014 YS34, 2015 HB117, 2015 LH, 2015 TB179, 2015 TW144. We determined a relative abundance of 5.4 per cent (8 out of 147 observed targets) A-types at a 100-m size range of NEA population. The ratio is at least five times larger compared with the previously known A-types, which represent less than ~1 per cent of NEAs taxonomically classified. By taking into account that part of our targets may not be confirmed as olivine-rich asteroids by their near-infrared spectra, or they can have a nebular origin, our result provides an upper-limit estimation of mantle fragments at size ranges below 300 m. Our findings are compared with the 'battered-to-bits' scenario, claiming that at small sizes the olivine-rich objects should be more abundant when compared with basaltic and iron ones.

  14. High-throughput sequencing of retrotransposon integration provides a saturated profile of target activity in Schizosaccharomyces pombe.

    PubMed

    Guo, Yabin; Levin, Henry L

    2010-02-01

    The biological impact of transposons on the physiology of the host depends greatly on the frequency and position of integration. Previous studies of Tf1, a long terminal repeat retrotransposon in Schizosaccharomyces pombe, showed that integration occurs at the promoters of RNA polymerase II (Pol II) transcribed genes. To determine whether specific promoters are preferred targets of integration, we sequenced large numbers of insertions using high-throughput pyrosequencing. In four independent experiments we identified a total of 73,125 independent integration events. These data provided strong support for the conclusion that Pol II promoters are the targets of Tf1 integration. The size and number of the integration experiments resulted in reproducible measures of integration for each intergenic region and ORF in the S. pombe genome. The reproducibility of the integration activity from experiment to experiment demonstrates that we have saturated the full set of insertion sites that are actively targeted by Tf1. We found Tf1 integration was highly biased in favor of a specific set of Pol II promoters. The overwhelming majority (76%) of the insertions were distributed in intergenic sequences that contained 31% of the promoters of S. pombe. Interestingly, there was no correlation between the amount of integration at these promoters and their level of transcription. Instead, we found Tf1 had a strong preference for promoters that are induced by conditions of stress. This targeting of stress response genes coupled with the ability of Tf1 to regulate the expression of adjacent genes suggests Tf1 may improve the survival of S. pombe when cells are exposed to environmental stress.

  15. High-throughput sequencing of retrotransposon integration provides a saturated profile of target activity in Schizosaccharomyces pombe

    PubMed Central

    Guo, Yabin; Levin, Henry L.

    2010-01-01

    The biological impact of transposons on the physiology of the host depends greatly on the frequency and position of integration. Previous studies of Tf1, a long terminal repeat retrotransposon in Schizosaccharomyces pombe, showed that integration occurs at the promoters of RNA polymerase II (Pol II) transcribed genes. To determine whether specific promoters are preferred targets of integration, we sequenced large numbers of insertions using high-throughput pyrosequencing. In four independent experiments we identified a total of 73,125 independent integration events. These data provided strong support for the conclusion that Pol II promoters are the targets of Tf1 integration. The size and number of the integration experiments resulted in reproducible measures of integration for each intergenic region and ORF in the S. pombe genome. The reproducibility of the integration activity from experiment to experiment demonstrates that we have saturated the full set of insertion sites that are actively targeted by Tf1. We found Tf1 integration was highly biased in favor of a specific set of Pol II promoters. The overwhelming majority (76%) of the insertions were distributed in intergenic sequences that contained 31% of the promoters of S. pombe. Interestingly, there was no correlation between the amount of integration at these promoters and their level of transcription. Instead, we found Tf1 had a strong preference for promoters that are induced by conditions of stress. This targeting of stress response genes coupled with the ability of Tf1 to regulate the expression of adjacent genes suggests Tf1 may improve the survival of S. pombe when cells are exposed to environmental stress. PMID:20040583

  16. Method for sequencing nucleic acid molecules

    DOEpatents

    Korlach, Jonas; Webb, Watt W.; Levene, Michael; Turner, Stephen; Craighead, Harold G.; Foquet, Mathieu

    2006-06-06

    The present invention is directed to a method of sequencing a target nucleic acid molecule having a plurality of bases. In its principle, the temporal order of base additions during the polymerization reaction is measured on a molecule of nucleic acid, i.e. the activity of a nucleic acid polymerizing enzyme on the template nucleic acid molecule to be sequenced is followed in real time. The sequence is deduced by identifying which base is being incorporated into the growing complementary strand of the target nucleic acid by the catalytic activity of the nucleic acid polymerizing enzyme at each step in the sequence of base additions. A polymerase on the target nucleic acid molecule complex is provided in a position suitable to move along the target nucleic acid molecule and extend the oligonucleotide primer at an active site. A plurality of labelled types of nucleotide analogs are provided proximate to the active site, with each distinguishable type of nucleotide analog being complementary to a different nucleotide in the target nucleic acid sequence. The growing nucleic acid strand is extended by using the polymerase to add a nucleotide analog to the nucleic acid strand at the active site, where the nucleotide analog being added is complementary to the nucleotide of the target nucleic acid at the active site. The nucleotide analog added to the oligonucleotide primer as a result of the polymerizing step is identified. The steps of providing labelled nucleotide analogs, polymerizing the growing nucleic acid strand, and identifying the added nucleotide analog are repeated so that the nucleic acid strand is further extended and the sequence of the target nucleic acid is determined.

  17. Method for sequencing nucleic acid molecules

    DOEpatents

    Korlach, Jonas; Webb, Watt W.; Levene, Michael; Turner, Stephen; Craighead, Harold G.; Foquet, Mathieu

    2006-05-30

    The present invention is directed to a method of sequencing a target nucleic acid molecule having a plurality of bases. In its principle, the temporal order of base additions during the polymerization reaction is measured on a molecule of nucleic acid, i.e. the activity of a nucleic acid polymerizing enzyme on the template nucleic acid molecule to be sequenced is followed in real time. The sequence is deduced by identifying which base is being incorporated into the growing complementary strand of the target nucleic acid by the catalytic activity of the nucleic acid polymerizing enzyme at each step in the sequence of base additions. A polymerase on the target nucleic acid molecule complex is provided in a position suitable to move along the target nucleic acid molecule and extend the oligonucleotide primer at an active site. A plurality of labelled types of nucleotide analogs are provided proximate to the active site, with each distinguishable type of nucleotide analog being complementary to a different nucleotide in the target nucleic acid sequence. The growing nucleic acid strand is extended by using the polymerase to add a nucleotide analog to the nucleic acid strand at the active site, where the nucleotide analog being added is complementary to the nucleotide of the target nucleic acid at the active site. The nucleotide analog added to the oligonucleotide primer as a result of the polymerizing step is identified. The steps of providing labelled nucleotide analogs, polymerizing the growing nucleic acid strand, and identifying the added nucleotide analog are repeated so that the nucleic acid strand is further extended and the sequence of the target nucleic acid is determined.

  18. Kleptoparasitic behavior and species richness at Mt. Graham red squirrel middens

    Treesearch

    Andrew J. Edelman; John L. Koprowski; Jennifer L. Edelman

    2005-01-01

    We used remote photography to assess the frequency of inter- and intra-specific kleptoparasitism and species richness at Mt. Graham red squirrel (Tamiasciurus hudsonicus grahamensis) middens. Remote cameras and conifer cones were placed at occupied and unoccupied middens, and random sites. Species richness of small mammals was higher at red squirrel...

  19. Selection of Optimal Polypurine Tract Region Sequences during Moloney Murine Leukemia Virus Replication

    PubMed Central

    Robson, Nicole D.; Telesnitsky, Alice

    2000-01-01

    Retrovirus plus-strand synthesis is primed by a cleavage remnant of the polypurine tract (PPT) region of viral RNA. In this study, we tested replication properties for Moloney murine leukemia viruses with targeted mutations in the PPT and in conserved sequences upstream, as well as for pools of mutants with randomized sequences in these regions. The importance of maintaining some purine residues within the PPT was indicated both by examining the evolution of random PPT pools and from the replication properties of targeted mutants. Although many different PPT sequences could support efficient replication and one mutant that contained two differences in the core PPT was found to replicate as well as the wild type, some sequences in the core PPT clearly conferred advantages over others. Contributions of sequences upstream of the core PPT were examined with deletion mutants. A conserved T-stretch within the upstream sequence was examined in detail and found to be unimportant to helper functions. Evolution of virus pools containing randomized T-stretch sequences demonstrated marked preference for the wild-type sequence in six of its eight positions. These findings demonstrate that maintenance of the T-rich element is more important to viral replication than is maintenance of the core PPT. PMID:11044073

  20. A Phylogenomic Approach Based on PCR Target Enrichment and High Throughput Sequencing: Resolving the Diversity within the South American Species of Bartsia L. (Orobanchaceae)

    PubMed Central

    Tank, David C.

    2016-01-01

    Advances in high-throughput sequencing (HTS) have allowed researchers to obtain large amounts of biological sequence information at speeds and costs unimaginable only a decade ago. Phylogenetics, and the study of evolution in general, is quickly migrating towards using HTS to generate larger and more complex molecular datasets. In this paper, we present a method that utilizes microfluidic PCR and HTS to generate large amounts of sequence data suitable for phylogenetic analyses. The approach uses the Fluidigm Access Array System (Fluidigm, San Francisco, CA, USA) and two sets of PCR primers to simultaneously amplify 48 target regions across 48 samples, incorporating sample-specific barcodes and HTS adapters (2,304 unique amplicons per Access Array). The final product is a pooled set of amplicons ready to be sequenced, and thus, there is no need to construct separate, costly genomic libraries for each sample. Further, we present a bioinformatics pipeline to process the raw HTS reads to either generate consensus sequences (with or without ambiguities) for every locus in every sample or, more importantly, recover the separate alleles from heterozygous target regions in each sample. This is important because it adds allelic information that is well suited for coalescent-based phylogenetic analyses that are becoming very common in conservation and evolutionary biology. To test our approach and bioinformatics pipeline, we sequenced 576 samples across 96 target regions belonging to the South American clade of the genus Bartsia L. in the plant family Orobanchaceae. After sequencing cleanup and alignment, the experiment resulted in ~25,300 bp across 486 samples for a set of 48 primer pairs targeting the plastome, and ~13,500 bp for 363 samples for a set of primers targeting regions in the nuclear genome. Finally, we constructed a combined concatenated matrix from all 96 primer combinations, resulting in a combined aligned length of ~40,500 bp for 349 samples. PMID:26828929

  1. Development of target ion source systems for radioactive beams at GANIL

    NASA Astrophysics Data System (ADS)

    Bajeat, O.; Delahaye, P.; Couratin, C.; Dubois, M.; Franberg-Delahaye, H.; Henares, J. L.; Huguet, Y.; Jardin, P.; Lecesne, N.; Lecomte, P.; Leroy, R.; Maunoury, L.; Osmond, B.; Sjodin, M.

    2013-12-01

    The GANIL facility (Caen, France) is dedicated to the acceleration of heavy ion beams, including radioactive beams produced by the Isotope Separation On-Line (ISOL) method at the SPIRAL1 facility. To extend the range of radioactive ion beams available at GANIL using the ISOL method, two projects are underway: the SPIRAL1 upgrade and the construction of SPIRAL2. For SPIRAL1, a new target ion source system (TISS) using the VADIS FEBIAD ion source coupled to the SPIRAL1 carbon target will be tested on-line by the end of 2013 and installed in the cave of SPIRAL1 for operation in 2015. The SPIRAL2 project is under construction and is being designed to use different production methods, such as fission, fusion or spallation reactions, to cover a large area of the chart of nuclei. It will produce, among others, neutron-rich beams obtained by the fission of uranium induced by fast neutrons. The production target, made from uranium carbide and heated to 2000 °C, will be associated with several types of ion sources. Developments currently in progress at GANIL for each of these projects are presented.

  2. Sequencing Technologies Panel at SFAF

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Turner, Steve; Fiske, Haley; Knight, Jim

    2010-06-02

    From left to right: Steve Turner of Pacific Biosciences, Haley Fiske of Illumina, Jim Knight of Roche, Michael Rhodes of Life Technologies and Peter Vander Horn of Life Technologies' Single Molecule Sequencing group discuss new sequencing technologies and applications on June 2, 2010 at the "Sequencing, Finishing, Analysis in the Future" meeting in Santa Fe, NM

  3. Atypical case of Wolfram syndrome revealed through targeted exome sequencing in a patient with suspected mitochondrial disease

    PubMed Central

    2012-01-01

    Background: Mitochondrial diseases comprise a diverse set of clinical disorders that affect multiple organ systems with varying severity and age of onset. Due to their clinical and genetic heterogeneity, these diseases are difficult to diagnose. We have developed a targeted exome sequencing approach to improve our ability to properly diagnose mitochondrial diseases and apply it here to an individual patient. Our method targets mitochondrial DNA (mtDNA) and the exons of 1,600 nuclear genes involved in mitochondrial biology or Mendelian disorders with multi-system phenotypes, thereby allowing for simultaneous evaluation of multiple disease loci. Case Presentation: Targeted exome sequencing was performed on a patient initially suspected to have a mitochondrial disorder. The patient presented with diabetes mellitus, diffuse brain atrophy, autonomic neuropathy, optic nerve atrophy, and a severe amnestic syndrome. Further work-up revealed multiple heteroplasmic mtDNA deletions as well as profound thiamine deficiency without a clear nutritional cause. Targeted exome sequencing revealed a homozygous c.1672C > T (p.R558C) missense mutation in exon 8 of WFS1 that has previously been reported in a patient with Wolfram syndrome. Conclusion: This case demonstrates how clinical application of next-generation sequencing technology can enhance the diagnosis of patients suspected to have rare genetic disorders. Furthermore, the finding of unexplained thiamine deficiency in a patient with Wolfram syndrome suggests a potential link between WFS1 biology and thiamine metabolism that has implications for the clinical management of Wolfram syndrome patients. PMID:22226368

  4. PACCMIT/PACCMIT-CDS: identifying microRNA targets in 3′ UTRs and coding sequences

    PubMed Central

    Šulc, Miroslav; Marín, Ray M.; Robins, Harlan S.; Vaníček, Jiří

    2015-01-01

    The purpose of the proposed web server, publicly available at http://paccmit.epfl.ch, is to provide a user-friendly interface to two algorithms for predicting messenger RNA (mRNA) molecules regulated by microRNAs: (i) PACCMIT (Prediction of ACcessible and/or Conserved MIcroRNA Targets), which identifies primarily mRNA transcripts targeted in their 3′ untranslated regions (3′ UTRs), and (ii) PACCMIT-CDS, designed to find mRNAs targeted within their coding sequences (CDSs). While PACCMIT belongs among the accurate algorithms for predicting conserved microRNA targets in the 3′ UTRs, the main contribution of the web server is 2-fold: PACCMIT provides an accurate tool for predicting targets also of weakly conserved or non-conserved microRNAs, whereas PACCMIT-CDS addresses the lack of similar portals adapted specifically for targets in CDS. The web server asks the user for microRNAs and mRNAs to be analyzed, accesses the precomputed P-values for all microRNA–mRNA pairs from a database for all mRNAs and microRNAs in a given species, ranks the predicted microRNA–mRNA pairs, evaluates their significance according to the false discovery rate and finally displays the predictions in a tabular form. The results are also available for download in several standard formats. PMID:25948580
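
    The final steps described in this record (ranking microRNA–mRNA pairs by P-value and judging significance by false discovery rate) can be illustrated with a minimal sketch. It assumes the Benjamini-Hochberg procedure, which the abstract does not name, and it is not the PACCMIT code or API; the function names, pair labels and P-values below are hypothetical.

```python
def benjamini_hochberg(pvalues):
    """Return Benjamini-Hochberg adjusted q-values, in input order."""
    m = len(pvalues)
    # Indices of the P-values sorted from smallest to largest.
    order = sorted(range(m), key=lambda i: pvalues[i])
    qvalues = [0.0] * m
    running_min = 1.0
    # Walk from the largest P-value down so q-values stay monotone.
    for rank in range(m, 0, -1):
        i = order[rank - 1]
        running_min = min(running_min, pvalues[i] * m / rank)
        qvalues[i] = running_min
    return qvalues


def rank_pairs(pair_pvalues, fdr=0.05):
    """Rank pairs by P-value and flag those passing the FDR threshold.

    pair_pvalues: dict mapping a (hypothetical) pair label to its P-value.
    Returns a list of (pair, p, q, significant) sorted by ascending p.
    """
    pairs = list(pair_pvalues)
    pvals = [pair_pvalues[p] for p in pairs]
    qvals = benjamini_hochberg(pvals)
    ranked = sorted(zip(pairs, pvals, qvals), key=lambda t: t[1])
    return [(pair, p, q, q <= fdr) for pair, p, q in ranked]


# Made-up input, for illustration only.
example = {
    ("miR-A", "mRNA-1"): 0.01,
    ("miR-A", "mRNA-2"): 0.04,
    ("miR-B", "mRNA-1"): 0.03,
    ("miR-B", "mRNA-3"): 0.50,
}
for row in rank_pairs(example):
    print(row)
```

    Adjusting after sorting, rather than thresholding raw P-values, is what keeps the expected share of false positives among the reported pairs under the chosen rate.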

  5. Performance Comparison of Bench-Top Next Generation Sequencers Using Microdroplet PCR-Based Enrichment for Targeted Sequencing in Patients with Autism Spectrum Disorder

    PubMed Central

    Okamoto, Nobuhiko; Nakashima, Mitsuko; Tsurusaki, Yoshinori; Miyake, Noriko; Saitsu, Hirotomo; Matsumoto, Naomichi

    2013-01-01

    Next-generation sequencing (NGS) combined with enrichment of target genes enables highly efficient and low-cost sequencing of multiple genes for genetic diseases. The aim of this study was to validate the accuracy and sensitivity of our method for comprehensive mutation detection in autism spectrum disorder (ASD). We assessed the performance of the bench-top Ion Torrent PGM and Illumina MiSeq platforms as optimized solutions for mutation detection, using microdroplet PCR-based enrichment of 62 ASD associated genes. Ten patients with known mutations were sequenced using NGS to validate the sensitivity of our method. The overall read quality was better with MiSeq, largely because of the increased indel-related error associated with PGM. The sensitivity of SNV detection was similar between the two platforms, suggesting they are both suitable for SNV detection in the human genome. Next, we used these methods to analyze 28 patients with ASD, and identified 22 novel variants in genes associated with ASD, with one mutation detected by MiSeq only. Thus, our results support the combination of target gene enrichment and NGS as a valuable molecular method for investigating rare variants in ASD. PMID:24066114

  6. Regulation of tumor cell migration by protein tyrosine phosphatase (PTP)-proline-, glutamate-, serine-, and threonine-rich sequence (PEST)

    PubMed Central

    Zheng, Yanhua; Lu, Zhimin

    2013-01-01

    Protein tyrosine phosphatase (PTP)–proline-, glutamate-, serine-, and threonine-rich sequence (PEST) is ubiquitously expressed and is a critical regulator of cell adhesion and migration. PTP-PEST activity can be regulated transcriptionally via gene deletion or mutation in several types of human cancers or via post-translational modifications, including phosphorylation, oxidation, and caspase-dependent cleavage. PTP-PEST interacts with and dephosphorylates cytoskeletal and focal adhesion-associated proteins. Dephosphorylation of PTP-PEST substrates regulates their enzymatic activities and/or their interaction with other proteins and plays an essential role in the tumor cell migration process. PMID:23237212

  7. HIV and Drug Resistance: Hitting a Moving Target | Center for Cancer Research

    Cancer.gov

    Prior research revealed how HIV-1 makes its destructive entry into the target cell by fusing together the cholesterol-rich lipid bilayer of the viral envelope—made with key glycoproteins gp120 and gp41—and the host cell’s plasma membrane. Cell-viral interactions begin with the binding of gp120 to the CD4 receptor molecule on the target cell, followed by gp120 binding to coreceptors. These coreceptors likely reside in structures called lipid rafts—areas in the cell plasma membrane that are rich in cholesterol, saturated fatty acids, and certain proteins that facilitate the entry of viruses into host cells. Finally, sequences in gp41 trigger the fusion of the viral and cellular lipid bilayers. The lipid rafts are then involved in the production of new viral particles.

  8. Sample limited characterization of a novel disulfide-rich venom peptide toxin from terebrid marine snail Terebra variegata.

    PubMed

    Anand, Prachi; Grigoryan, Alexandre; Bhuiyan, Mohammed H; Ueberheide, Beatrix; Russell, Victoria; Quinoñez, Jose; Moy, Patrick; Chait, Brian T; Poget, Sébastien F; Holford, Mandë

    2014-01-01

    Disulfide-rich peptide toxins found in the secretions of venomous organisms such as snakes, spiders, scorpions, leeches, and marine snails are highly efficient and effective tools for novel therapeutic drug development. Venom peptide toxins have been used extensively to characterize ion channels in the nervous system and platelet aggregation in haemostatic systems. A significant hurdle in characterizing disulfide-rich peptide toxins from venomous animals is obtaining significant quantities needed for sequence and structural analyses. Presented here is a strategy for the structural characterization of venom peptide toxins from sample limited (4 ng) specimens via direct mass spectrometry sequencing, chemical synthesis and NMR structure elucidation. Using this integrated approach, venom peptide Tv1 from Terebra variegata was discovered. Tv1 displays a unique fold not witnessed in prior snail neuropeptides. The novel structural features found for Tv1 suggest that the terebrid pool of peptide toxins may target different neuronal agents with varying specificities compared to previously characterized snail neuropeptides.

  9. Evaluation of Nine Somatic Variant Callers for Detection of Somatic Mutations in Exome and Targeted Deep Sequencing Data.

    PubMed

    Krøigård, Anne Bruun; Thomassen, Mads; Lænkholm, Anne-Vibeke; Kruse, Torben A; Larsen, Martin Jakob

    2016-01-01

    Next generation sequencing is extensively applied to catalogue somatic mutations in cancer, in research settings and increasingly in clinical settings for molecular diagnostics, guiding therapy decisions. Somatic variant callers perform paired comparisons of sequencing data from cancer tissue and matched normal tissue in order to detect somatic mutations. The advent of many new somatic variant callers creates a need for comparison and validation of the tools, as no de facto standard for detection of somatic mutations exists and only limited comparisons have been reported. We have performed a comprehensive evaluation using exome sequencing and targeted deep sequencing data of paired tumor-normal samples from five breast cancer patients to evaluate the performance of nine publicly available somatic variant callers: EBCall, Mutect, Seurat, Shimmer, Indelocator, Somatic Sniper, Strelka, VarScan 2 and Virmid for the detection of single nucleotide mutations and small deletions and insertions. We report a large variation in the number of calls from the nine somatic variant callers on the same sequencing data and highly variable agreement. Sequencing depth had markedly diverse impact on individual callers, as for some callers, increased sequencing depth highly improved sensitivity. For SNV calling, we report EBCall, Mutect, Virmid and Strelka to be the most reliable somatic variant callers for both exome sequencing and targeted deep sequencing. For indel calling, EBCall is superior due to high sensitivity and robustness to changes in sequencing depths.

  10. Silent genetic alterations identified by targeted next-generation sequencing in pheochromocytoma/paraganglioma: A clinicopathological correlations.

    PubMed

    Pillai, Suja; Gopalan, Vinod; Lo, Chung Y; Liew, Victor; Smith, Robert A; Lam, Alfred King Y

    2017-02-01

    The goal of this pilot study was to develop a customized, cost-effective amplicon panel (Ampliseq) for target sequencing in a cohort of patients with sporadic phaeochromocytoma/paraganglioma. Phaeochromocytoma/paragangliomas from 25 patients were analysed by a targeted next-generation sequencing approach using an Ion Torrent PGM instrument. Primers for 15 target genes (NF1, RET, VHL, SDHA, SDHB, SDHC, SDHD, SDHAF2, TMEM127, MAX, MEN1, KIF1Bβ, EPAS1, CDKN2 & PHD2) were designed using ion ampliseq designer. Ion Reporter software and Ingenuity® Variant Analysis™ software (www.ingenuity.com/variants) from Ingenuity Systems were used to analyse these results. Overall, 713 variants were identified. The variants identified from the Ion Reporter ranged from 64 to 161 per patient. Single nucleotide variants (SNV) were the most common. Further annotation with the help of Ingenuity variant analysis revealed that 29 of these 713 variants were deletions. Of these, six variants were non-pathogenic and four were likely to be pathogenic. The remaining 19 variants were of uncertain significance. The most frequently altered gene in the cohort was KIF1B followed by NF1. A novel KIF1B pathogenic variant, c.3375+1G>A, was identified. The mutation was noted in a patient with clinically confirmed neurofibromatosis. Chromosome 1 carried the largest number of variants. Targeted next-generation sequencing is a sensitive method for detecting genetic changes in patients with phaeochromocytoma/paraganglioma. The precise detection of these genetic changes helps in understanding the pathogenesis of these tumours. Copyright © 2016 Elsevier Inc. All rights reserved.

  11. Widespread platinum anomaly documented at the Younger Dryas onset in North American sedimentary sequences

    PubMed Central

    Moore, Christopher R.; West, Allen; LeCompte, Malcolm A.; Brooks, Mark J.; Daniel, I. Randolph; Goodyear, Albert C.; Ferguson, Terry A.; Ivester, Andrew H.; Feathers, James K.; Kennett, James P.; Tankersley, Kenneth B.; Adedeji, A. Victor; Bunch, Ted E.

    2017-01-01

    Previously, a large platinum (Pt) anomaly was reported in the Greenland ice sheet at the Younger Dryas boundary (YDB) (12,800 Cal B.P.). In order to evaluate its geographic extent, fire-assay and inductively coupled plasma mass spectrometry (FA and ICP-MS) elemental analyses were performed on 11 widely separated archaeological bulk sedimentary sequences. We document discovery of a distinct Pt anomaly spread widely across North America and dating to the Younger Dryas (YD) onset. The apparent synchroneity of this widespread YDB Pt anomaly is consistent with Greenland Ice Sheet Project 2 (GISP2) data that indicated atmospheric input of platinum-rich dust. We expect the Pt anomaly to serve as a widely-distributed time marker horizon (datum) for identification and correlation of the onset of the YD climatic episode at 12,800 Cal B.P. This Pt datum will facilitate the dating and correlating of archaeological, paleontological, and paleoenvironmental data between sequences, especially those with limited age control. PMID:28276513

  12. Widespread platinum anomaly documented at the Younger Dryas onset in North American sedimentary sequences

    NASA Astrophysics Data System (ADS)

    Moore, Christopher R.; West, Allen; Lecompte, Malcolm A.; Brooks, Mark J.; Daniel, I. Randolph; Goodyear, Albert C.; Ferguson, Terry A.; Ivester, Andrew H.; Feathers, James K.; Kennett, James P.; Tankersley, Kenneth B.; Adedeji, A. Victor; Bunch, Ted E.

    2017-03-01

    Previously, a large platinum (Pt) anomaly was reported in the Greenland ice sheet at the Younger Dryas boundary (YDB) (12,800 Cal B.P.). In order to evaluate its geographic extent, fire-assay and inductively coupled plasma mass spectrometry (FA and ICP-MS) elemental analyses were performed on 11 widely separated archaeological bulk sedimentary sequences. We document discovery of a distinct Pt anomaly spread widely across North America and dating to the Younger Dryas (YD) onset. The apparent synchroneity of this widespread YDB Pt anomaly is consistent with Greenland Ice Sheet Project 2 (GISP2) data that indicated atmospheric input of platinum-rich dust. We expect the Pt anomaly to serve as a widely-distributed time marker horizon (datum) for identification and correlation of the onset of the YD climatic episode at 12,800 Cal B.P. This Pt datum will facilitate the dating and correlating of archaeological, paleontological, and paleoenvironmental data between sequences, especially those with limited age control.

  13. Classification of video sequences into chosen generalized use classes of target size and lighting level.

    PubMed

    Leszczuk, Mikołaj; Dudek, Łukasz; Witkowski, Marcin

    The VQiPS (Video Quality in Public Safety) Working Group, supported by the U.S. Department of Homeland Security, has been developing a user guide for public safety video applications. According to VQiPS, five parameters have particular importance influencing the ability to achieve a recognition task. They are: usage time-frame, discrimination level, target size, lighting level, and level of motion. These parameters form what are referred to as Generalized Use Classes (GUCs). The aim of our research was to develop algorithms that would automatically assist classification of input sequences into one of the GUCs. Target size and lighting level parameters were approached. The experiment described reveals the experts' ambiguity and hesitation during the manual target size determination process. However, the automatic methods developed for target size classification make it possible to determine GUC parameters with 70 % compliance to the end-users' opinion. Lighting levels of the entire sequence can be classified with an efficiency reaching 93 %. To make the algorithms available for use, a test application has been developed. It is able to process video files and display classification results, the user interface being very simple and requiring only minimal user interaction.

  14. Contribution of AT-, GC-, and methylated cytidine-rich DNA to chromatin composition in Malpighian tubule cell nuclei of Panstrongylus megistus (Hemiptera, Reduviidae).

    PubMed

    Alvarenga, Elenice M; Mondin, Mateus; Rodrigues, Vera L C C; Andrade, Larissa M; Vidal, Benedicto de Campos; Mello, Maria Luiza S

    2012-11-01

    The Malpighian tubule cell nuclei of male Panstrongylus megistus, a vector of Chagas disease, contain one chromocenter, which is composed solely of the Y chromosome. Considering that different chromosomes contribute to the composition of chromocenters in different triatomini species, the aim of this study was to determine the contribution of AT-, GC-, and methylated cytidine-rich DNA in the chromocenter as well as in euchromatin of Malpighian tubule cell nuclei of P. megistus in comparison with published data for Triatoma infestans. Staining with 4',6-diamidino-2-phenylindole/actinomycin D and chromomycin A(3)/distamycin, immunodetection of 5-methylcytidine and AgNOR test were used. The results revealed AT-rich/GC-poor DNA in the male chromocenter, but equally distributed AT and GC DNA sequences in male and female euchromatin, like in T. infestans. Accumulation of argyrophilic proteins encircling the chromocenter did not always correlate with that of GC-rich DNA. Methylated DNA identified by immunodetection was found sparsely distributed in the euchromatin of both sexes and at some points around the chromocenter edge, but it could not be considered responsible for chromatin condensation in the chromocenter, like in T. infestans. However, unlike in T. infestans, no correlation between the chromocenter AT-rich DNA and nucleolus organizing region (NOR) DNA was found in P. megistus. Copyright © 2011 Elsevier GmbH. All rights reserved.

  15. ampliMethProfiler: a pipeline for the analysis of CpG methylation profiles of targeted deep bisulfite sequenced amplicons.

    PubMed

    Scala, Giovanni; Affinito, Ornella; Palumbo, Domenico; Florio, Ermanno; Monticelli, Antonella; Miele, Gennaro; Chiariotti, Lorenzo; Cocozza, Sergio

    2016-11-25

    CpG sites in an individual molecule may exist in a binary state (methylated or unmethylated) and each individual DNA molecule, containing a certain number of CpGs, is a combination of these states defining an epihaplotype. Classic quantification-based approaches to study DNA methylation are intrinsically unable to fully represent the complexity of the underlying methylation substrate. Epihaplotype-based approaches, on the other hand, allow methylation profiles of cell populations to be studied at the single molecule level. For such investigations, next-generation sequencing techniques can be used, both for quantitative and for epihaplotype analysis. Currently available tools for methylation analysis lack output formats that explicitly report CpG methylation profiles at the single molecule level and lack suitable statistical tools for their interpretation. Here we present ampliMethProfiler, a Python-based pipeline for the extraction and statistical epihaplotype analysis of amplicons from targeted deep bisulfite sequencing of multiple DNA regions. The ampliMethProfiler tool provides an easy and user-friendly way to extract and analyze the epihaplotype composition of reads from targeted bisulfite sequencing experiments. ampliMethProfiler is written in Python and requires a local installation of BLAST and (optionally) QIIME tools. It can be run on Linux and OS X platforms. The software is open source and freely available at http://amplimethprofiler.sourceforge.net.
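
    The distinction this record draws between per-site quantification and epihaplotype analysis can be shown with a short sketch. It is not ampliMethProfiler code and does not use its API; the function names and the toy reads are hypothetical, and each read is assumed to be already reduced to a string of 0/1 CpG states (1 = methylated).

```python
from collections import Counter


def epihaplotype_counts(reads):
    """Count how many reads carry each methylation pattern (epihaplotype)."""
    return Counter(reads)


def per_site_methylation(reads):
    """Fraction of reads methylated at each CpG position."""
    if not reads:
        raise ValueError("no reads given")
    n_sites = len(reads[0])
    if any(len(r) != n_sites for r in reads):
        raise ValueError("all reads must cover the same CpG sites")
    return [
        sum(r[i] == "1" for r in reads) / len(reads)
        for i in range(n_sites)
    ]


# Toy reads over four CpG sites, for illustration only.
reads = ["1010", "1010", "0000", "1111"]
print(epihaplotype_counts(reads))   # pattern-level view
print(per_site_methylation(reads))  # site-level averages
```

    The per-site averages alone cannot tell whether methylation at sites 1 and 3 occurs on the same molecules; the pattern counts keep that single-molecule information.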

  16. Evaluating allopolyploid origins in strawberries (Fragaria) using haplotypes generated from target capture sequencing.

    PubMed

    Kamneva, Olga K; Syring, John; Liston, Aaron; Rosenberg, Noah A

    2017-08-04

    Hybridization is observed in many eukaryotic lineages and can lead to the formation of polyploid species. The study of hybridization and polyploidization faces challenges both in data generation and in accounting for population-level phenomena such as coalescence processes in phylogenetic analysis. Genus Fragaria is one example of a set of plant taxa in which a range of ploidy levels is observed across species, but phylogenetic origins are unknown. Here, using 20 diploid and polyploid Fragaria species, we combine approaches from NGS data analysis and phylogenetics to infer evolutionary origins of polyploid strawberries, taking into account coalescence processes. We generate haplotype sequences for 257 low-copy nuclear markers assembled from Illumina target capture sequence data. We then identify putative hybridization events by analyzing gene tree topologies, and further test predicted hybridizations in a coalescence framework. This approach confirms the allopolyploid ancestry of F. chiloensis and F. virginiana, and provides new allopolyploid ancestry hypotheses for F. iturupensis, F. moschata, and F. orientalis. Evidence of gene flow between diploids F. bucharica and F. vesca is also detected, suggesting that it might be appropriate to consider these groups as conspecifics. This study is one of the first in which target capture sequencing followed by computational deconvolution of individual haplotypes is used for tracing origins of polyploid taxa. The study also provides new perspectives on the evolutionary history of Fragaria.

  17. Spectroscopy of neutron rich nuclei using cold neutron induced fission of actinide targets at the ILL: the EXILL campaign

    NASA Astrophysics Data System (ADS)

    de France, G.; Blanc, A.; Drouet, F.; Jentschel, M.; Köster, U.; Mutti, P.; Régis, J. M.; Simpson, G.; Soldner, T.; Stezowski, O.; Ur, C. A.; Urban, W.; Vancrayenest, A.

    2014-03-01

    A combination of germanium detectors has been installed at the PF1B neutron guide of the ILL to perform the prompt spectroscopy of neutron-rich nuclei produced in the neutron-capture induced-fission of 235U and 241Pu. In addition, LaBr3 detectors from the FATIMA collaboration have been installed alongside the EXOGAM clovers to measure lifetimes of low-lying excited states. The measured characteristics and online spectra indicate very good performance of the overall setup.

  18. Evaluation of Nine Somatic Variant Callers for Detection of Somatic Mutations in Exome and Targeted Deep Sequencing Data

    PubMed Central

    Krøigård, Anne Bruun; Thomassen, Mads; Lænkholm, Anne-Vibeke; Kruse, Torben A.; Larsen, Martin Jakob

    2016-01-01

    Next generation sequencing is extensively applied to catalogue somatic mutations in cancer, in research settings and increasingly in clinical settings for molecular diagnostics, guiding therapy decisions. Somatic variant callers perform paired comparisons of sequencing data from cancer tissue and matched normal tissue in order to detect somatic mutations. The advent of many new somatic variant callers creates a need for comparison and validation of the tools, as no de facto standard for detection of somatic mutations exists and only limited comparisons have been reported. We have performed a comprehensive evaluation using exome sequencing and targeted deep sequencing data of paired tumor-normal samples from five breast cancer patients to evaluate the performance of nine publicly available somatic variant callers: EBCall, Mutect, Seurat, Shimmer, Indelocator, Somatic Sniper, Strelka, VarScan 2 and Virmid for the detection of single nucleotide mutations and small deletions and insertions. We report a large variation in the number of calls from the nine somatic variant callers on the same sequencing data and highly variable agreement. Sequencing depth had markedly diverse impact on individual callers, as for some callers, increased sequencing depth highly improved sensitivity. For SNV calling, we report EBCall, Mutect, Virmid and Strelka to be the most reliable somatic variant callers for both exome sequencing and targeted deep sequencing. For indel calling, EBCall is superior due to high sensitivity and robustness to changes in sequencing depths. PMID:27002637

  19. High-fidelity target sequencing of individual molecules identified using barcode sequences: de novo detection and absolute quantitation of mutations in plasma cell-free DNA from cancer patients.

    PubMed

    Kukita, Yoji; Matoba, Ryo; Uchida, Junji; Hamakawa, Takuya; Doki, Yuichiro; Imamura, Fumio; Kato, Kikuya

    2015-08-01

    Circulating tumour DNA (ctDNA) is an emerging field of cancer research. However, current ctDNA analysis is usually restricted to one or a few mutation sites due to technical limitations. In the case of massively parallel DNA sequencers, the number of false positives caused by a high read error rate is a major problem. In addition, the final sequence reads do not represent the original DNA population due to the global amplification step during the template preparation. We established a high-fidelity target sequencing system of individual molecules identified in plasma cell-free DNA using barcode sequences; this system consists of the following two steps. (i) A novel target sequencing method that adds barcode sequences by adaptor ligation. This method uses linear amplification to eliminate the errors introduced during the early cycles of polymerase chain reaction. (ii) The monitoring and removal of erroneous barcode tags. This process involves the identification of individual molecules that have been sequenced and for which the number of mutations has been absolutely quantitated. Using plasma cell-free DNA from patients with gastric or lung cancer, we demonstrated that the system achieved near complete elimination of false positives and enabled de novo detection and absolute quantitation of mutations in plasma cell-free DNA. © The Author 2015. Published by Oxford University Press on behalf of Kazusa DNA Research Institute.

  20. More on the Possible Composition of the Meridiani Hematite-Rich Concretions

    NASA Technical Reports Server (NTRS)

    Jolliff, B. L.; Gellert, R.; Mittlefehldt, D. W.

    2007-01-01

    Elsewhere in these proceedings, Schneider et al. discuss compositional constraints on hematite-rich spherule (blueberry) formation at Meridiani Planum. Schneider et al. provide the background for work done to date to understand the composition and mineralogy of the spherules and devise a test of possible concretion growth processes. They also report the results of area analyses of spherules in targets analyzed with the Alpha Particle X-ray Spectrometer (APXS) and test several possible models for included components other than hematite. In this abstract, we use the compositional trends for spherule-rich targets to compute possible elemental compositions of the spherules. This approach differs from that of, which also used a determination of the area of spherules in APXS targets, coupled with a correction for the radial acceptance function, to try to un-mix the compositions directly, using 2 and 3-component models and mass balance. That approach contained a fair amount of uncertainty owing to problems associated with irregular and heterogeneous target geometry, unknown composition of non-spherule lithic components, and variable dust coatings on spherules. Since then, Opportunity has analyzed additional spherule-rich targets, and the compositional trends so obtained permit a more direct assessment of the data.

  1. Generic detection of poleroviruses using an RT-PCR assay targeting the RdRp coding sequence.

    PubMed

    Lotos, Leonidas; Efthimiou, Konstantinos; Maliogka, Varvara I; Katis, Nikolaos I

    2014-03-01

    In this study a two-step RT-PCR assay was developed for the generic detection of poleroviruses. The RdRp coding region was selected as the primers' target, since it differs significantly from that of other members in the family Luteoviridae and its sequence can be more informative than other regions in the viral genome. Species specific RT-PCR assays targeting the same region were also developed for the detection of the six most widespread poleroviral species (Beet mild yellowing virus, Beet western yellows virus, Cucurbit aphid-borne virus, Carrot red leaf virus, Potato leafroll virus and Turnip yellows virus) in Greece and the collection of isolates. These isolates along with other characterized ones were used for the evaluation of the generic PCR's detection range. The developed assay efficiently amplified a 593bp RdRp fragment from 46 isolates of 10 different Polerovirus species. Phylogenetic analysis using the generic PCR's amplicon sequence showed that although it cannot accurately infer evolutionary relationships within the genus it can differentiate poleroviruses at the species level. Overall, the described generic assay could be applied for the reliable detection of Polerovirus infections and, in combination with the specific PCRs, for the identification of new and uncharacterized species in the genus. Copyright © 2013 Elsevier B.V. All rights reserved.

  2. Aptamer/Au nanoparticles/cobalt sulfide nanosheets biosensor for 17β-estradiol detection using a guanine-rich complementary DNA sequence for signal amplification.

    PubMed

    Huang, Ke-Jing; Liu, Yu-Jie; Zhang, Ji-Zong; Cao, Jun-Tao; Liu, Yan-Ming

    2015-05-15

    We have developed a sensitive sensing platform for 17β-estradiol by combining the aptamer probe and hybridization reaction. In this assay, 2-dimensional cobalt sulfide nanosheet (CoS) was synthesized by a simple hydrothermal method with L-cysteine as sulfur donor. An electrochemical aptamer biosensor was constructed by assembling a thiol group tagged 17β-estradiol aptamer on CoS and gold nanoparticles (AuNPs) modified electrode. Methylene blue was applied as a tracer and a guanine-rich complementary DNA sequence was designed to bind with the unbound 17β-estradiol aptamer for signal amplification. The binding of guanine-rich DNA to the aptamer was inhibited when the aptamer captured 17β-estradiol. Using guanine-rich DNA in the assay greatly amplified the redox signal of methylene blue bound to the detection probe. The CoS/AuNPs film formed on the biosensor surface appeared to be a good conductor for accelerating the electron transfer. The method demonstrated a high sensitivity of detection with the dynamic concentration range spanning from 1.0×10⁻⁹ to 1.0×10⁻¹² M and a detection limit of 7.0×10⁻¹³ M. Besides, the fabricated biosensor exhibited good selectivity toward 17β-estradiol even when interferents were present at 100-fold concentrations. Our attempt will extend the application of the CoS nanosheet and this signal amplification assay to biosensing areas. Copyright © 2014 Elsevier B.V. All rights reserved.

  3. A scalable, fully automated process for construction of sequence-ready human exome targeted capture libraries

    PubMed Central

    2011-01-01

    Genome targeting methods enable cost-effective capture of specific subsets of the genome for sequencing. We present here an automated, highly scalable method for carrying out the Solution Hybrid Selection capture approach that provides a dramatic increase in scale and throughput of sequence-ready libraries produced. Significant process improvements and a series of in-process quality control checkpoints are also added. These process improvements can also be used in a manual version of the protocol. PMID:21205303

  4. mCAL: A New Approach for Versatile Multiplex Action of Cas9 Using One sgRNA and Loci Flanked by a Programmed Target Sequence.

    PubMed

    Finnigan, Gregory C; Thorner, Jeremy

    2016-07-07

    Genome editing exploiting CRISPR/Cas9 has been adopted widely in academia and in the biotechnology industry to manipulate DNA sequences in diverse organisms. Molecular engineering of Cas9 itself and its guide RNA, and the strategies for using them, have increased efficiency, optimized specificity, reduced inappropriate off-target effects, and introduced modifications for performing other functions (transcriptional regulation, high-resolution imaging, protein recruitment, and high-throughput screening). Moreover, Cas9 has the ability to multiplex, i.e., to act at different genomic targets within the same nucleus. Currently, however, introducing concurrent changes at multiple loci involves: (i) identification of appropriate genomic sites, especially the availability of suitable PAM sequences; (ii) the design, construction, and expression of multiple sgRNA directed against those sites; (iii) potential difficulties in altering essential genes; and (iv) lingering concerns about "off-target" effects. We have devised a new approach that circumvents these drawbacks, as we demonstrate here using the yeast Saccharomyces cerevisiae. First, any gene(s) of interest are flanked upstream and downstream with a single unique target sequence that does not normally exist in the genome. Thereafter, expression of one sgRNA and cotransformation with appropriate PCR fragments permits concomitant Cas9-mediated alteration of multiple genes (both essential and nonessential). The system we developed also allows for maintenance of the integrated, inducible Cas9-expression cassette or its simultaneous scarless excision. Our scheme, dubbed mCAL for "Multiplexing of Cas9 at Artificial Loci", can be applied to any organism in which the CRISPR/Cas9 methodology is currently being utilized. In principle, it can be applied to install synthetic sequences into the genome, to generate genomic libraries, and to program strains or cell lines so that they can be conveniently (and repeatedly

  5. Leucine-rich-repeat-containing variable lymphocyte receptors as modules to target plant-expressed proteins

    DOE PAGES

    Velásquez, André C.; Nomura, Kinya; Cooper, Max D.; ...

    2017-04-19

    The ability to target and manipulate protein-based cellular processes would accelerate plant research; yet, the technology to specifically and selectively target plant-expressed proteins is still in its infancy. Leucine-rich repeats (LRRs) are ubiquitously present protein domains involved in mediating protein–protein interactions. LRRs confer the binding specificity to the highly diverse variable lymphocyte receptor (VLR) antibodies (including VLRA, VLRB and VLRC types) that jawless vertebrates make as the functional equivalents of jawed vertebrate immunoglobulin-based antibodies. Here, VLRBs targeting an effector protein from a plant pathogen, HopM1, were developed by immunizing lampreys and using yeast surface display to select for high-affinity VLRBs. HopM1-specific VLRBs (VLRM1) were expressed in planta in the cytosol, the trans-Golgi network, and the apoplast. Expression of VLRM1 was higher when the protein localized to an oxidizing environment that would favor disulfide bridge formation (when VLRM1 was not localized to the cytoplasm), as disulfide bonds are necessary for proper VLR folding. VLRM1 specifically interacted in planta with HopM1 but not with an unrelated bacterial effector protein while HopM1 failed to interact with a non-specific VLRB. Later, VLRs may be used as flexible modules to bind proteins or carbohydrates of interest in planta, with broad possibilities for their use by binding directly to their targets and inhibiting their action, or by creating chimeric proteins with new specificities in which endogenous LRR domains are replaced by those present in VLRs.

  6. Leucine-rich-repeat-containing variable lymphocyte receptors as modules to target plant-expressed proteins

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Velásquez, André C.; Nomura, Kinya; Cooper, Max D.

    The ability to target and manipulate protein-based cellular processes would accelerate plant research; yet, the technology to specifically and selectively target plant-expressed proteins is still in its infancy. Leucine-rich repeats (LRRs) are ubiquitously present protein domains involved in mediating protein–protein interactions. LRRs confer the binding specificity to the highly diverse variable lymphocyte receptor (VLR) antibodies (including VLRA, VLRB and VLRC types) that jawless vertebrates make as the functional equivalents of jawed vertebrate immunoglobulin-based antibodies. Here, VLRBs targeting an effector protein from a plant pathogen, HopM1, were developed by immunizing lampreys and using yeast surface display to select for high-affinity VLRBs. HopM1-specific VLRBs (VLRM1) were expressed in planta in the cytosol, the trans-Golgi network, and the apoplast. Expression of VLRM1 was higher when the protein localized to an oxidizing environment that would favor disulfide bridge formation (when VLRM1 was not localized to the cytoplasm), as disulfide bonds are necessary for proper VLR folding. VLRM1 specifically interacted in planta with HopM1 but not with an unrelated bacterial effector protein while HopM1 failed to interact with a non-specific VLRB. Later, VLRs may be used as flexible modules to bind proteins or carbohydrates of interest in planta, with broad possibilities for their use by binding directly to their targets and inhibiting their action, or by creating chimeric proteins with new specificities in which endogenous LRR domains are replaced by those present in VLRs.

  7. Individual sequences in large sets of gene sequences may be distinguished efficiently by combinations of shared sub-sequences

    PubMed Central

    Gibbs, Mark J; Armstrong, John S; Gibbs, Adrian J

    2005-01-01

    Background: Most current DNA diagnostic tests for identifying organisms use specific oligonucleotide probes that are complementary in sequence to, and hence only hybridise with, the DNA of one target species. By contrast, in traditional taxonomy, specimens are usually identified by 'dichotomous keys' that use combinations of characters shared by different members of the target set. Using one specific character for each target is the least efficient strategy for identification. Using combinations of shared bisectionally-distributed characters is much more efficient, and this strategy is most efficient when they separate the targets in a progressively binary way. Results: We have developed a practical method for finding minimal sets of sub-sequences that identify individual sequences, and could be targeted by combinations of probes, so that the efficient strategy of traditional taxonomic identification could be used in DNA diagnosis. The sizes of minimal sub-sequence sets depended mostly on sequence diversity and sub-sequence length and interactions between these parameters. We found that 201 distinct cytochrome oxidase subunit-1 (CO1) genes from moths (Lepidoptera) were distinguished using only 15 sub-sequences 20 nucleotides long, whereas only 8–10 sub-sequences 6–10 nucleotides long were required to distinguish the CO1 genes of 92 species from the 9 largest orders of insects. Conclusion: The presence/absence of sub-sequences in a set of gene sequences can be used like the questions in a traditional dichotomous taxonomic key; hybridisation probes complementary to such sub-sequences should provide a very efficient means for identifying individual species, subtypes or genotypes. Sequence diversity and sub-sequence length are the major factors that determine the numbers of distinguishing sub-sequences in any set of sequences. PMID:15817134
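
    The presence/absence idea in this record can be illustrated with a small sketch. The code below is not the authors' method: it is a simple greedy heuristic, and the function names (`kmers`, `greedy_distinguishing_kmers`, `signature`) are hypothetical. It picks sub-sequences of a fixed length until every pair of input sequences differs in at least one presence/absence answer. It does not guarantee a minimal set.

```python
from itertools import combinations


def kmers(seq, k):
    """Return the set of all length-k sub-sequences present in seq."""
    return {seq[i:i + k] for i in range(len(seq) - k + 1)}


def greedy_distinguishing_kmers(seqs, k):
    """Greedily choose k-mers whose presence/absence separates all sequences.

    Illustrative heuristic only (an assumption, not the published algorithm);
    the result is a valid key but not necessarily the smallest one.
    """
    present = [kmers(s, k) for s in seqs]
    candidates = sorted(set().union(*present))  # sorted for deterministic ties
    # Pairs of sequences that no chosen k-mer tells apart yet.
    unresolved = set(combinations(range(len(seqs)), 2))
    chosen = []
    while unresolved:
        def split_count(kmer):
            # Number of unresolved pairs this k-mer would separate.
            return sum((kmer in present[a]) != (kmer in present[b])
                       for a, b in unresolved)

        best = max(candidates, key=split_count, default=None)
        if best is None or split_count(best) == 0:
            raise ValueError("sequences cannot be separated with k-mers of this length")
        chosen.append(best)
        unresolved = {(a, b) for a, b in unresolved
                      if (best in present[a]) == (best in present[b])}
    return chosen


def signature(seq, chosen):
    """Presence/absence answers for one sequence, like a dichotomous key."""
    return tuple(kmer in seq for kmer in chosen)
```

    Each chosen sub-sequence splits at least one group of still-identical sequences, so the heuristic never needs more than n - 1 sub-sequences for n distinguishable inputs. An ideal, progressively binary key would need only about log2(n).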

  8. Sequence-Specific Targeting of Dosage Compensation in Drosophila Favors an Active Chromatin Context

    PubMed Central

    Gelbart, Marnie; Tolstorukov, Michael Y.; Plachetka, Annette; Kharchenko, Peter V.; Jung, Youngsook L.; Gorchakov, Andrey A.; Larschan, Erica; Gu, Tingting; Minoda, Aki; Riddle, Nicole C.; Schwartz, Yuri B.; Elgin, Sarah C. R.; Karpen, Gary H.; Pirrotta, Vincenzo; Kuroda, Mitzi I.; Park, Peter J.

    2012-01-01

    The Drosophila MSL complex mediates dosage compensation by increasing transcription of the single X chromosome in males approximately two-fold. This is accomplished through recognition of the X chromosome and subsequent acetylation of histone H4K16 on X-linked genes. Initial binding to the X is thought to occur at “entry sites” that contain a consensus sequence motif (“MSL recognition element” or MRE). However, this motif is only ∼2 fold enriched on X, and only a fraction of the motifs on X are initially targeted. Here we ask whether chromatin context could distinguish between utilized and non-utilized copies of the motif, by comparing their relative enrichment for histone modifications and chromosomal proteins mapped in the modENCODE project. Through a comparative analysis of the chromatin features in male S2 cells (which contain MSL complex) and female Kc cells (which lack the complex), we find that the presence of active chromatin modifications, together with an elevated local GC content in the surrounding sequences, has strong predictive value for functional MSL entry sites, independent of MSL binding. We tested these sites for function in Kc cells by RNAi knockdown of Sxl, resulting in induction of MSL complex. We show that ectopic MSL expression in Kc cells leads to H4K16 acetylation around these sites and a relative increase in X chromosome transcription. Collectively, our results support a model in which a pre-existing active chromatin environment, coincident with H3K36me3, contributes to MSL entry site selection. The consequences of MSL targeting of the male X chromosome include increase in nucleosome lability, enrichment for H4K16 acetylation and JIL-1 kinase, and depletion of linker histone H1 on active X-linked genes. Our analysis can serve as a model for identifying chromatin and local sequence features that may contribute to selection of functional protein binding sites in the genome. PMID:22570616

  9. Evolutionary diversity and potential recombinogenic role of integration targets of non-LTR retrotransposons

    PubMed Central

    Gentles, Andrew J.; Kohany, Oleksiy; Jurka, Jerzy

    2005-01-01

    Short interspersed elements (SINEs) make up a significant fraction of total DNA in mammalian genomes, providing a rich substrate for chromosomal rearrangements by SINE-SINE recombinations. Proliferation of mammalian SINEs is mediated primarily by LINE1 (L1) non-LTR retrotransposons that preferentially integrate at DNA sequence targets with average length ~15 bp and containing conserved endonucleolytic nicking signals at both ends. We report that sequence variations in the first of the two nicking signals, represented by a 5′TT-AAAA consensus sequence, affect the position of the second signal thus leading to target site duplications (TSDs) of different lengths. The length distribution of TSDs appears to be affected also by L1-encoded enzyme variants, since targets with the same 5′ nicking site can be of different average length in different mammalian species. Taking this into account, we re-analyzed the second nicking site and found that it is larger and includes more conserved sites than previously appreciated, with a consensus of 5′ANTNTN-AA. We also studied potential involvement of the nicking sites in stimulating recombinations between SINE elements. We determined that SINE elements retaining TSDs with perfect 5′TT-AAAA nicking sites appear to be lost relatively rapidly from the human and rat genomes, and less rapidly from dog. We speculate that the introduction of single-strand DNA breaks induced by recurring endonucleolytic attacks at these sites, combined with the ubiquitousness of SINEs, may significantly promote recombination between repetitive elements, leading to the observed losses. At the same time new L1 subfamilies may be selected for “incompatibility” with pre-existing targets. This provides a possible driving force for the continual emergence of new L1 subfamilies which, in turn, may affect selection of L1-dependent SINE subfamilies. PMID:15944437
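
    As a rough illustration of how the two consensus nicking signals quoted above could be located in a sequence, the sketch below turns a consensus string into a regular expression. It assumes that the hyphen marks the nick position and that N stands for any base. The function names are hypothetical, only the strand given is scanned, and this is not the authors' analysis pipeline.

```python
import re


def consensus_to_regex(consensus):
    """Convert a consensus such as 'TT-AAAA' into (compiled regex, nick offset).

    Assumptions: '-' marks the nick position and 'N' matches any of A, C, G, T.
    A lookahead is used so that overlapping sites are all reported.
    """
    left, right = consensus.upper().split("-")
    body = (left + right).replace("N", "[ACGT]")
    return re.compile(f"(?={body})"), len(left)


def find_nick_positions(seq, consensus="TT-AAAA"):
    """Return 0-based indices of the base immediately 3' of each putative nick."""
    pattern, offset = consensus_to_regex(consensus)
    return [m.start() + offset for m in pattern.finditer(seq.upper())]
```

    For example, `find_nick_positions("GGTTAAAACC")` reports one site with the nick falling between index 3 and index 4. Passing `consensus="ANTNTN-AA"` scans for the second signal instead.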

  10. Production cross sections of neutron-rich 261-263No isotopes

    NASA Astrophysics Data System (ADS)

    Li, Jingjing; Li, Cheng; Zhang, Gen; Zhu, Long; Liu, Zhong; Zhang, Feng-Shou

    2017-05-01

    The fusion excitation functions of 249-263No are studied by using various reaction systems based on the dinuclear system model. The neutron-rich radioactive beam 22O is used to produce neutron-rich nobelium isotopes, and the new neutron-rich isotopes 261-263No are synthesized by 242Pu(22O,3n)261No, 244Pu(22O,4n)262No, and 244Pu(22O,3n)263No reactions, respectively. The corresponding maximum evaporation residue cross sections are 0.628, 4.649, and 1.638 μb, respectively. The effects of the three processes (capture, fusion, and survival) in the complete fusion reaction are also analyzed. From investigation, a neutron-rich radioactive beam as the projectile and neutron-rich actinide as the target could be a new selection of the projectile-target combination to produce a neutron-rich heavy nuclide.

  11. Sensitive detection of mercury and copper ions by fluorescent DNA/Ag nanoclusters in guanine-rich DNA hybridization

    NASA Astrophysics Data System (ADS)

    Peng, Jun; Ling, Jian; Zhang, Xiu-Qing; Bai, Hui-Ping; Zheng, Liyan; Cao, Qiu-E.; Ding, Zhong-Tao

    2015-02-01

    In this work, we designed a new fluorescent oligonucleotide-stabilized silver nanocluster (DNA/AgNCs) probe for sensitive detection of mercury and copper ions. This probe contains two tailored DNA sequences. One is a signal probe containing a cytosine-rich sequence template for AgNCs synthesis and link sequences at both ends. The other is a guanine-rich sequence for signal enhancement, with a link sequence complementary to the link sequence of the signal probe. After hybridization, the fluorescence of the hybridized double-strand DNA/AgNCs is enhanced 200-fold, based on the fluorescence enhancement effect of DNA/AgNCs in proximity to a guanine-rich DNA sequence. The double-strand DNA/AgNCs probe is brighter and more stable than single-strand DNA/AgNCs and, more importantly, can be used as a novel fluorescent probe for detecting mercury and copper ions. Mercury and copper ions can be detected linearly in the ranges of 6.0-160.0 and 6-240 nM, with detection limits of 2.1 and 3.4 nM, respectively. Our results indicate that the analytical parameters of the method for mercury and copper ion detection are much better than those obtained using single-strand DNA/AgNCs.

  12. Development of Genetic Markers in Eucalyptus Species by Target Enrichment and Exome Sequencing

    PubMed Central

    Dasgupta, Modhumita Ghosh; Dharanishanthi, Veeramuthu; Agarwal, Ishangi; Krutovsky, Konstantin V.

    2015-01-01

    The advent of next-generation sequencing has facilitated large-scale discovery, validation and assessment of genetic markers for high density genotyping. The present study was undertaken to identify markers in genes supposedly related to wood property traits in three Eucalyptus species. Ninety-four genes involved in xylogenesis were selected for hybridization probe based nuclear genomic DNA target enrichment and exome sequencing. Genomic DNA was isolated from the leaf tissues and used for on-array probe hybridization followed by Illumina sequencing. The raw sequence reads were trimmed, high-quality reads were mapped to the E. grandis reference sequence, and single nucleotide variants (SNVs) and insertions/deletions (InDels) were identified across the three species. The average read coverage was 216X and a total of 2294 SNVs and 479 InDels were discovered in E. camaldulensis, 2383 SNVs and 518 InDels in E. tereticornis, and 1228 SNVs and 409 InDels in E. grandis. Additionally, SNV calling and InDel detection were conducted in pair-wise comparisons of E. tereticornis vs. E. grandis, E. camaldulensis vs. E. tereticornis and E. camaldulensis vs. E. grandis. This study presents an efficient and high throughput method for development of genetic markers for family-based QTL and association analysis in Eucalyptus. PMID:25602379

  13. Fluorescence turn-on detection of target sequence DNA based on silicon nanodot-mediated quenching.

    PubMed

    Zhang, Yanan; Ning, Xinping; Mao, Guobin; Ji, Xinghu; He, Zhike

    2018-05-01

    We have developed a new enzyme-free method for target sequence DNA detection based on the dynamic quenching of fluorescent silicon nanodots (SiNDs) toward Cy5-tagged DNA probe. Fascinatingly, the water-soluble SiNDs can quench the fluorescence of cyanine (Cy5) in Cy5-tagged DNA probe in homogeneous solution, and the fluorescence of Cy5-tagged DNA probe can be restored in the presence of target sequence DNA (the synthetic target miRNA-27a). Based on this phenomenon, a SiND-featured fluorescent sensor has been constructed for "turn-on" detection of the synthetic target miRNA-27a for the first time. This newly developed approach possesses the merits of low cost, simple design, and convenient operation since no enzymatic reaction, toxic reagents, or separation procedures are involved. The established method achieves a detection limit of 0.16 nM, and the relative standard deviation of this method is 9% (1 nM, n = 5). The linear range is 0.5-20 nM, and the recoveries in spiked human fluids are in the range of 90-122%. This protocol provides a new tactic in the development of the nonenzymic miRNA biosensors and opens a promising avenue for early diagnosis of miRNA-associated disease. [Graphical abstract: the SiND-based fluorescent sensor for detection of S-miR-27a.]

  14. Targeted DNA sequencing and in situ mutation analysis using mobile phone microscopy

    NASA Astrophysics Data System (ADS)

    Kühnemund, Malte; Wei, Qingshan; Darai, Evangelia; Wang, Yingjie; Hernández-Neuta, Iván; Yang, Zhao; Tseng, Derek; Ahlford, Annika; Mathot, Lucy; Sjöblom, Tobias; Ozcan, Aydogan; Nilsson, Mats

    2017-01-01

    Molecular diagnostics is typically outsourced to well-equipped centralized laboratories, often far from the patient. We developed molecular assays and portable optical imaging designs that permit on-site diagnostics with a cost-effective mobile-phone-based multimodal microscope. We demonstrate that targeted next-generation DNA sequencing reactions and in situ point mutation detection assays in preserved tumour samples can be imaged and analysed using mobile phone microscopy, achieving a new milestone for tele-medicine technologies.

  15. Detection of canonical A-to-G editing events at 3' UTRs and microRNA target sites in human lungs using next-generation sequencing.

    PubMed

    Soundararajan, Ramani; Stearns, Timothy M; Griswold, Anthony L; Mehta, Arpit; Czachor, Alexander; Fukumoto, Jutaro; Lockey, Richard F; King, Benjamin L; Kolliputi, Narasaiah

    2015-11-03

    RNA editing is a post-transcriptional modification of RNA. The majority of these changes result from adenosine deaminase acting on RNA (ADARs) catalyzing the conversion of adenosine residues to inosine in double-stranded RNAs (dsRNAs). Massively parallel sequencing has enabled the identification of RNA editing sites in human transcriptomes. In this study, we sequenced DNA and RNA from human lungs and identified RNA editing sites with high confidence via a computational pipeline utilizing stringent analysis thresholds. We identified a total of 3,447 editing sites that overlapped in three human lung samples, with 50% of these sites having canonical A-to-G base changes. Approximately 27% of the edited sites overlapped with Alu repeats, and showed A-to-G clustering (>3 clusters in 100 bp). The majority of edited sites mapped to either 3' untranslated regions (UTRs) or introns close to splice sites, whereas only a few sites were in exons resulting in non-synonymous amino acid changes. Interestingly, we identified 652 A-to-G editing events in the 3' UTR of 205 target genes that mapped to 932 potential miRNA target binding sites. Several of these miRNA edited sites were validated in silico. Additionally, we validated several A-to-G edited sites by Sanger sequencing. Altogether, our study suggests a role for RNA editing in miRNA-mediated gene regulation and splicing in human lungs. In this study, we have generated an RNA editome of human lung tissue that can be compared with other RNA editomes across different lung tissues to delineate a role for RNA editing in normal and diseased states.

  16. Deep Sequencing Insights in Therapeutic shRNA Processing and siRNA Target Cleavage Precision.

    PubMed

    Denise, Hubert; Moschos, Sterghios A; Sidders, Benjamin; Burden, Frances; Perkins, Hannah; Carter, Nikki; Stroud, Tim; Kennedy, Michael; Fancy, Sally-Ann; Lapthorn, Cris; Lavender, Helen; Kinloch, Ross; Suhy, David; Corbau, Romu

    2014-02-04

    TT-034 (PF-05095808) is a recombinant adeno-associated virus serotype 8 (AAV8) agent expressing three short hairpin RNA (shRNA) pro-drugs that target the hepatitis C virus (HCV) RNA genome. The cytosolic enzyme Dicer cleaves each shRNA into multiple, potentially active small interfering RNA (siRNA) drugs. Using next-generation sequencing (NGS) to identify and characterize active shRNA maturation products, we observed that each TT-034-encoded shRNA could be processed into as many as 95 separate siRNA strands. Few of these appeared active as determined by Sanger 5' RNA Ligase-Mediated Rapid Amplification of cDNA Ends (5-RACE) and through synthetic shRNA and siRNA analogue studies. Moreover, NGS scrutiny applied on 5-RACE products (RACE-seq) suggested that synthetic siRNAs could direct cleavage in not one, but up to five separate positions on targeted RNA, in a sequence-dependent manner. These data support an on-target mechanism of action for TT-034 without cytotoxicity and question the accepted precision of substrate processing by the key RNA interference (RNAi) enzymes Dicer and siRNA-induced silencing complex (siRISC). Molecular Therapy-Nucleic Acids (2014) 3, e145; doi:10.1038/mtna.2013.73; published online 4 February 2014.

  17. Implementing targeted region capture sequencing for the clinical detection of Alagille syndrome: An efficient and cost‑effective method.

    PubMed

    Huang, Tianhong; Yang, Guilin; Dang, Xiao; Ao, Feijian; Li, Jiankang; He, Yizhou; Tang, Qiyuan; He, Qing

    2017-11-01

    Alagille syndrome (AGS) is a highly variable, autosomal dominant disease that affects multiple structures including the liver, heart, eyes, bones and face. Targeted region capture sequencing focuses on a panel of known pathogenic genes and provides a rapid, cost‑effective and accurate method for molecular diagnosis. In a Chinese family, this method was used on the proband and Sanger sequencing was applied to validate the candidate mutation. A de novo heterozygous mutation (c.3254_3255insT p.Leu1085PhefsX24) of the jagged 1 gene was identified as the potential disease‑causing gene mutation. In conclusion, the present study suggested that target region capture sequencing is an efficient, reliable and accurate approach for the clinical diagnosis of AGS. Furthermore, these results expand on the understanding of the pathogenesis of AGS.

  18. The N-terminus of survivin is a mitochondrial-targeting sequence and Src regulator

    PubMed Central

    Dunajová, Lucia; Cash, Emily; Markus, Robert; Rochette, Sophie; Townley, Amelia R.

    2016-01-01

    ABSTRACT Survivin (also known as BIRC5) is a cancer-associated protein that exists in several locations in the cell. Its cytoplasmic residence in interphase cells is governed by CRM1 (also known as XPO1)-mediated nuclear exportation, and its localisation during mitosis to the centromeres and midzone microtubules is that of a canonical chromosomal passenger protein. In addition to these well-established locations, survivin is also a mitochondrial protein, but how it gets there and its function therein is presently unclear. Here, we show that the first ten amino acids at the N-terminus of survivin are sufficient to target GFP to the mitochondria in vivo, and ectopic expression of this decapeptide decreases cell adhesion and accelerates proliferation. The data support a signalling mechanism in which this decapeptide regulates the tyrosine kinase Src, leading to reduced focal adhesion plaques and disruption of F-actin organisation. This strongly suggests that the N-terminus of survivin is a mitochondrial-targeting sequence that regulates Src, and that survivin acts in concert with Src to promote tumorigenesis. PMID:27246243

  19. Mitochondrial targeting sequence variants of the CHCHD2 gene are a risk for Lewy body disorders

    PubMed Central

    Ogaki, Kotaro; Koga, Shunsuke; Heckman, Michael G.; Fiesel, Fabienne C.; Ando, Maya; Labbé, Catherine; Lorenzo-Betancor, Oswaldo; Moussaud-Lamodière, Elisabeth L.; Soto-Ortolaza, Alexandra I.; Walton, Ronald L.; Strongosky, Audrey J.; Uitti, Ryan J.; McCarthy, Allan; Lynch, Timothy; Siuda, Joanna; Opala, Grzegorz; Rudzinska, Monika; Krygowska-Wajs, Anna; Barcikowska, Maria; Czyzewski, Krzysztof; Puschmann, Andreas; Nishioka, Kenya; Funayama, Manabu; Hattori, Nobutaka; Parisi, Joseph E.; Petersen, Ronald C.; Graff-Radford, Neill R.; Boeve, Bradley F.; Springer, Wolfdieter; Wszolek, Zbigniew K.; Dickson, Dennis W.

    2015-01-01

    Objective: To assess the role of CHCHD2 variants in patients with Parkinson disease (PD) and Lewy body disease (LBD) in Caucasian populations. Methods: All exons of the CHCHD2 gene were sequenced in a US Caucasian patient-control series (878 PD, 610 LBD, and 717 controls). Subsequently, exons 1 and 2 were sequenced in an Irish series (355 PD and 365 controls) and a Polish series (394 PD and 350 controls). Immunohistochemistry and immunofluorescence studies were performed on pathologic LBD cases with rare CHCHD2 variants. Results: We identified 9 rare exonic variants of unknown significance. These variants were more frequent in the combined group of PD and LBD patients compared to controls (0.6% vs 0.1%, p = 0.013). In addition, the presence of any rare variant was more common in patients with LBD (2.5% vs 1.0%, p = 0.050) compared to controls. Eight of these 9 variants were located within the gene's mitochondrial targeting sequence. Conclusions: Although the role of variants of the CHCHD2 gene in PD and LBD remains to be further elucidated, the rare variants in the mitochondrial targeting sequence may be a risk factor for Lewy body disorders, which may link CHCHD2 to other genetic forms of parkinsonism with mitochondrial dysfunction. PMID:26561290

  20. Targeted exome sequencing reveals novel USH2A mutations in Chinese patients with simplex Usher syndrome.

    PubMed

    Shu, Hai-Rong; Bi, Huai; Pan, Yang-Chun; Xu, Hang-Yu; Song, Jian-Xin; Hu, Jie

    2015-09-16

    Usher syndrome (USH) is an autosomal recessive disorder characterized by hearing impairment and vision dysfunction due to retinitis pigmentosa. Phenotypic and genetic heterogeneities of this disease make it impractical to obtain a genetic diagnosis by conventional Sanger sequencing. In this study, we applied a next-generation sequencing approach to detect genetic abnormalities in patients with USH. Two unrelated Chinese families were recruited, consisting of two USH afflicted patients and four unaffected relatives. We selected 199 genes related to inherited retinal diseases as targets for deep exome sequencing. Through systematic data analysis using an established bioinformatics pipeline, all variants that passed filter criteria were validated by Sanger sequencing and co-segregation analysis. A homozygous frameshift mutation (c.4382delA, p.T1462Lfs*2) was revealed in exon20 of gene USH2A in the F1 family. Two compound heterozygous mutations, IVS47 + 1G > A and c.13156A > T (p.I4386F), located in intron 48 and exon 63 respectively, of USH2A, were identified as causative mutations for the F2 family. Of note, the missense mutation c.13156A > T has not been reported so far. In conclusion, targeted exome sequencing precisely and rapidly identified the genetic defects in two Chinese USH families and this technique can be applied as a routine examination for these disorders with significant clinical and genetic heterogeneity.

  1. Megabase sequencing of human genome by ordered-shotgun-sequencing (OSS) strategy

    NASA Astrophysics Data System (ADS)

    Chen, Ellson Y.

    1997-05-01

    So far we have used OSS strategy to sequence over 2 megabases DNA in large-insert clones from regions of human X chromosomes with different characteristic levels of GC content. The method starts by randomly fragmenting a BAC, YAC or PAC to 8-12 kb pieces and subcloning those into lambda phage. Insert-ends of these clones are sequenced and overlapped to create a partial map. Complete sequencing is then done on a minimal tiling path of selected subclones, recursively focusing on those at the edges of contigs to facilitate mergers of clones across the entire target. To reduce manual labor, PCR processes have been adapted to prepare sequencing templates throughout the entire operation. The streamlined process can thus lend itself to further automation. The OSS approach is suitable for large- scale genomic sequencing, providing considerable flexibility in the choice of subclones or regions for more or less intensive sequencing. For example, subclones containing contaminating host cell DNA or cloning vector can be recognized and ignored with minimal sequencing effort; regions overlapping a neighboring clone already sequenced need not be redone; and segments containing tandem repeats or long repetitive sequences can be spotted early on and targeted for additional attention.
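
    The "minimal tiling path" step in the abstract above can be illustrated with a small sketch. It assumes an idealized case in which every subclone already has known start and end coordinates on the target; the abstract instead builds this map incrementally from insert-end sequences. The function name, the tuple layout and the greedy selection rule are illustrative assumptions, not the authors' procedure.

```python
def minimal_tiling_path(clones, target_start, target_end):
    """Greedy interval cover (hypothetical helper, not from the abstract).

    clones: list of (name, start, end) with half-open coordinates.
    Returns the names of a smallest set of clones covering
    [target_start, target_end), or raises ValueError on a gap.
    """
    ordered = sorted(clones, key=lambda c: c[1])  # sort by start coordinate
    path = []
    covered = target_start
    i = 0
    while covered < target_end:
        best = None
        # Among clones that start at or before the covered position,
        # keep the one that reaches furthest to the right.
        while i < len(ordered) and ordered[i][1] <= covered:
            if best is None or ordered[i][2] > best[2]:
                best = ordered[i]
            i += 1
        if best is None or best[2] <= covered:
            raise ValueError("gap in coverage at position %d" % covered)
        path.append(best[0])
        covered = best[2]
    return path


if __name__ == "__main__":
    subclones = [("a", 0, 10), ("b", 2, 8), ("c", 5, 20), ("d", 9, 18), ("e", 18, 30)]
    print(minimal_tiling_path(subclones, 0, 30))
```

    Subclones that the greedy pass skips (here "b" and "d") correspond to the regions the abstract says "need not be redone", since they lie entirely within clones already selected.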

  2. Targeted DNA sequencing and in situ mutation analysis using mobile phone microscopy

    PubMed Central

    Kühnemund, Malte; Wei, Qingshan; Darai, Evangelia; Wang, Yingjie; Hernández-Neuta, Iván; Yang, Zhao; Tseng, Derek; Ahlford, Annika; Mathot, Lucy; Sjöblom, Tobias; Ozcan, Aydogan; Nilsson, Mats

    2017-01-01

    Molecular diagnostics is typically outsourced to well-equipped centralized laboratories, often far from the patient. We developed molecular assays and portable optical imaging designs that permit on-site diagnostics with a cost-effective mobile-phone-based multimodal microscope. We demonstrate that targeted next-generation DNA sequencing reactions and in situ point mutation detection assays in preserved tumour samples can be imaged and analysed using mobile phone microscopy, achieving a new milestone for tele-medicine technologies. PMID:28094784

  3. Single-Center Experience with a Targeted Next Generation Sequencing Assay for Assessment of Relevant Somatic Alterations in Solid Tumors.

    PubMed

    Paasinen-Sohns, Aino; Koelzer, Viktor H; Frank, Angela; Schafroth, Julian; Gisler, Aline; Sachs, Melanie; Graber, Anne; Rothschild, Sacha I; Wicki, Andreas; Cathomas, Gieri; Mertz, Kirsten D

    2017-03-01

    Companion diagnostics rely on genomic testing of molecular alterations to enable effective cancer treatment. Here we report the clinical application and validation of the Oncomine Focus Assay (OFA), an integrated, commercially available next-generation sequencing (NGS) assay for the rapid and simultaneous detection of single nucleotide variants, short insertions and deletions, copy number variations, and gene rearrangements in 52 cancer genes with therapeutic relevance. Two independent patient cohorts were investigated to define the workflow, turnaround times, feasibility, and reliability of OFA targeted sequencing in clinical application and using archival material. Cohort I consisted of 59 diagnostic clinical samples from the daily routine submitted for molecular testing over a 4-month time period. Cohort II consisted of 39 archival melanoma samples that were up to 15 years old. Libraries were prepared from isolated nucleic acids and sequenced on the Ion Torrent PGM sequencer. Sequencing datasets were analyzed using the Ion Reporter software. Genomic alterations were identified and validated by orthogonal conventional assays including pyrosequencing and immunohistochemistry. Sequencing results of both cohorts, including archival formalin-fixed, paraffin-embedded material stored up to 15 years, were consistent with published variant frequencies. A concordance of 100% between established assays and OFA targeted NGS was observed. The OFA workflow enabled a turnaround of 3½ days. Taken together, OFA was found to be a convenient tool for fast, reliable, broadly applicable and cost-effective targeted NGS of tumor samples in routine diagnostics. Thus, OFA has strong potential to become an important asset for precision oncology. Copyright © 2017 The Authors. Published by Elsevier Inc. All rights reserved.

  4. Geoseq: a tool for dissecting deep-sequencing datasets.

    PubMed

    Gurtowski, James; Cancio, Anthony; Shah, Hardik; Levovitz, Chaya; George, Ajish; Homann, Robert; Sachidanandam, Ravi

    2010-10-12

    Datasets generated on deep-sequencing platforms have been deposited in various public repositories such as the Gene Expression Omnibus (GEO), Sequence Read Archive (SRA) hosted by the NCBI, or the DNA Data Bank of Japan (DDBJ). Despite being rich data sources, they have not been used much due to the difficulty in locating and analyzing datasets of interest. Geoseq (http://geoseq.mssm.edu) provides a new method of analyzing short reads from deep sequencing experiments. Instead of mapping the reads to reference genomes or sequences, Geoseq maps a reference sequence against the sequencing data. It is web-based, and holds pre-computed data from public libraries. The analysis reduces the input sequence to tiles and measures the coverage of each tile in a sequence library through the use of suffix arrays. The user can upload custom target sequences or use gene/miRNA names for the search and get back results as plots and spreadsheet files. Geoseq organizes the public sequencing data using a controlled vocabulary, allowing identification of relevant libraries by organism, tissue and type of experiment. Analysis of small sets of sequences against deep-sequencing datasets, as well as identification of public datasets of interest, is simplified by Geoseq. We applied Geoseq to a) identify differential isoform expression in mRNA-seq datasets, b) identify miRNAs (microRNAs) in libraries and identify mature and star sequences in miRNAs, and c) identify potentially mis-annotated miRNAs. The ease of using Geoseq for these analyses suggests its utility and uniqueness as an analysis tool.

  5. High throughput deep degradome sequencing reveals microRNAs and their targets in response to drought stress in mulberry (Morus alba).

    PubMed

    Li, Ruixue; Chen, Dandan; Wang, Taichu; Wan, Yizhen; Li, Rongfang; Fang, Rongjun; Wang, Yuting; Hu, Fei; Zhou, Hong; Li, Long; Zhao, Weiguo

    2017-01-01

    MicroRNAs (miRNAs) play important regulatory roles by targeting mRNAs for cleavage or translational repression. Identification of miRNA targets is essential to better understanding the roles of miRNAs. miRNA targets have not been well characterized in mulberry (Morus alba). To anatomize miRNA guided gene regulation under drought stress, transcriptome-wide high throughput degradome sequencing was used in this study to directly detect drought stress responsive miRNA targets in mulberry. A drought library (DL) and a contrast library (CL) were constructed to capture the cleaved mRNAs for sequencing. In CL, 409 target genes of 30 conserved miRNA families and 990 target genes of 199 novel miRNAs were identified. In DL, 373 target genes of 30 conserved miRNA families and 950 target genes of 195 novel miRNAs were identified. Of the conserved miRNA families in DL, mno-miR156, mno-miR172, and mno-miR396 had the highest number of targets with 54, 52 and 41 transcripts, respectively, indicating that these three miRNA families and their target genes might play important functions in response to drought stress in mulberry. Additionally, we found that many of the target genes were transcription factors. By analyzing the miRNA-target molecular network, we found that the DL independent networks consisted of 838 miRNA-mRNA pairs (63.34%). The expression patterns of 11 target genes and 12 corresponding miRNAs were detected using qRT-PCR. Six miRNA targets were further verified by RNA ligase-mediated 5' rapid amplification of cDNA ends (RLM-5' RACE). Gene Ontology (GO) annotations and Kyoto Encyclopedia of Genes and Genomes (KEGG) pathway analysis revealed that these target transcripts were implicated in a broad range of biological processes and various metabolic pathways. This is the first study to comprehensively characterize target genes and their associated miRNAs in response to drought stress by degradome sequencing in mulberry. This study provides a framework for understanding

  6. IGF-1 receptor targeted nanoparticles for image-guided therapy of stroma-rich and drug resistant human cancer

    NASA Astrophysics Data System (ADS)

    Zhou, Hongyu; Qian, Weiping; Uckun, Fatih M.; Zhou, Zhiyang; Wang, Liya; Wang, Andrew; Mao, Hui; Yang, Lily

    2016-05-01

    Low drug delivery efficiency and drug resistance from highly heterogeneous cancer cells and tumor microenvironment represent major challenges in clinical oncology. Growth factor receptor, IGF-1R, is overexpressed in both human tumor cells and tumor associated stromal cells. The level of IGF-1R expression is further up-regulated in drug resistant tumor cells. We have developed IGF-1R targeted magnetic iron oxide nanoparticles (IONPs) carrying multiple anticancer drugs into human tumors. This IGF-1R targeted theranostic nanoparticle delivery system has an iron core for non-invasive MR imaging, amphiphilic polymer coating to ensure the biocompatibility as well as for drug loading and conjugation of recombinant human IGF-1 as targeting molecules. The chemotherapy drug doxorubicin (Dox) was encapsulated into the polymer coating and/or conjugated to the IONP surface by coupling with the carboxyl groups. The ability of IGF-1R targeted theranostic nanoparticles to penetrate tumor stromal barrier and enhance tumor cell killing has been demonstrated in human pancreatic cancer patient tissue derived xenograft (PDX) models. Repeated systemic administrations of those IGF-1R targeted theranostic IONP carrying Dox led to breaking the tumor stromal barrier and improved therapeutic effect. Near infrared (NIR) optical and MR imaging enabled noninvasive monitoring of nanoparticle-drug delivery and therapeutic responses. Our results demonstrated that IGF-1R targeted nanoparticles carrying multiple drugs are promising combination therapy approaches for image-guided therapy of stroma-rich and drug resistant human cancer, such as pancreatic cancer.

  7. Measuring the diversity of the human microbiota with targeted next-generation sequencing.

    PubMed

    Finotello, Francesca; Mastrorilli, Eleonora; Di Camillo, Barbara

    2016-12-26

    The human microbiota is a complex ecological community of commensal, symbiotic and pathogenic microorganisms harboured by the human body. Next-generation sequencing (NGS) technologies, in particular targeted amplicon sequencing of the 16S ribosomal RNA gene (16S-seq), are enabling the identification and quantification of human-resident microorganisms at unprecedented resolution, providing novel insights into the role of the microbiota in health and disease. Once microbial abundances are quantified through NGS data analysis, diversity indices provide valuable mathematical tools to describe the ecological complexity of a single sample or to detect species differences between samples. However, diversity is not a determined physical quantity for which a consensus definition and unit of measure have been established, and several diversity indices are currently available. Furthermore, they were originally developed for macroecology and their robustness to the possible bias introduced by sequencing has not been characterized so far. To assist the reader with the selection and interpretation of diversity measures, we review a panel of broadly used indices, describing their mathematical formulations, purposes and properties, and characterize their behaviour and criticalities in dependence of the data features using simulated data as ground truth. In addition, we make available an R package, DiversitySeq, which implements in a unified framework the full panel of diversity indices and a simulator of 16S-seq data, and thus represents a valuable resource for the analysis of diversity from NGS count data and for the benchmarking of computational methods for 16S-seq. © The Author 2016. Published by Oxford University Press. All rights reserved. For Permissions, please email: journals.permissions@oup.com.
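
    As a concrete illustration of what a diversity index computes from count data, the sketch below implements three widely used measures (observed richness, the Shannon index, and the Gini-Simpson index) from a vector of per-taxon read counts. It is written in Python for illustration only and does not reproduce the API of the DiversitySeq R package; the abstract does not list which indices the package includes, so the selection and the function names here are assumptions.

```python
import math


def relative_abundances(counts):
    """Convert raw per-taxon counts to proportions, dropping empty taxa."""
    total = sum(counts)
    if total <= 0:
        raise ValueError("counts must contain at least one positive value")
    return [c / total for c in counts if c > 0]


def richness(counts):
    """Observed richness: number of taxa with at least one read."""
    return sum(1 for c in counts if c > 0)


def shannon(counts):
    """Shannon index H = -sum(p_i * ln p_i)."""
    return -sum(p * math.log(p) for p in relative_abundances(counts))


def gini_simpson(counts):
    """Gini-Simpson index 1 - sum(p_i^2)."""
    return 1.0 - sum(p * p for p in relative_abundances(counts))


if __name__ == "__main__":
    even_sample = [10, 10, 10, 10]    # four equally abundant taxa
    skewed_sample = [37, 1, 1, 1]     # same richness, one dominant taxon
    for name, sample in (("even", even_sample), ("skewed", skewed_sample)):
        print(name, richness(sample), shannon(sample), gini_simpson(sample))
```

    The two samples have identical richness but different Shannon and Gini-Simpson values, which is one reason no single index serves as a consensus definition of diversity.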

  8. Method and apparatus for biological sequence comparison

    DOEpatents

    Marr, T.G.; Chang, W.I.

    1997-12-23

    A method and apparatus are disclosed for comparing biological sequences from a known source of sequences, with a subject (query) sequence. The apparatus takes as input a set of target similarity levels (such as evolutionary distances in units of PAM), and finds all fragments of known sequences that are similar to the subject sequence at each target similarity level, and are long enough to be statistically significant. The invention device filters out fragments from the known sequences that are too short, or have a lower average similarity to the subject sequence than is required by each target similarity level. The subject sequence is then compared only to the remaining known sequences to find the best matches. The filtering member divides the subject sequence into overlapping blocks, each block being sufficiently large to contain a minimum-length alignment from a known sequence. For each block, the filter member compares the block with every possible short fragment in the known sequences and determines a best match for each comparison. The determined set of short fragment best matches for the block provide an upper threshold on alignment values. Regions of a certain length from the known sequences that have a mean alignment value upper threshold greater than a target unit score are concatenated to form a union. The current block is compared to the union and provides an indication of best local alignment with the subject sequence. 5 figs.

  9. Method and apparatus for biological sequence comparison

    DOEpatents

    Marr, Thomas G.; Chang, William I-Wei

    1997-01-01

    A method and apparatus for comparing biological sequences from a known source of sequences, with a subject (query) sequence. The apparatus takes as input a set of target similarity levels (such as evolutionary distances in units of PAM), and finds all fragments of known sequences that are similar to the subject sequence at each target similarity level, and are long enough to be statistically significant. The invention device filters out fragments from the known sequences that are too short, or have a lower average similarity to the subject sequence than is required by each target similarity level. The subject sequence is then compared only to the remaining known sequences to find the best matches. The filtering member divides the subject sequence into overlapping blocks, each block being sufficiently large to contain a minimum-length alignment from a known sequence. For each block, the filter member compares the block with every possible short fragment in the known sequences and determines a best match for each comparison. The determined set of short fragment best matches for the block provide an upper threshold on alignment values. Regions of a certain length from the known sequences that have a mean alignment value upper threshold greater than a target unit score are concatenated to form a union. The current block is compared to the union and provides an indication of best local alignment with the subject sequence.

  10. Sequencing Needs for Viral Diagnostics

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Gardner, S N; Lam, M; Mulakken, N J

    2004-01-26

    We built a system to guide decisions regarding the amount of genomic sequencing required to develop diagnostic DNA signatures, which are short sequences that are sufficient to uniquely identify a viral species. We used our existing DNA diagnostic signature prediction pipeline, which selects regions of a target species genome that are conserved among strains of the target (for reliability, to prevent false negatives) and unique relative to other species (for specificity, to avoid false positives). We performed simulations, based on existing sequence data, to assess the number of genome sequences of a target species and of close phylogenetic relatives ("near neighbors") that are required to predict diagnostic signature regions that are conserved among strains of the target species and unique relative to other bacterial and viral species. For DNA viruses such as variola (smallpox), three target genomes provide sufficient guidance for selecting species-wide signatures. Three near neighbor genomes are critical for species specificity. In contrast, most RNA viruses require four target genomes and no near neighbor genomes, since lack of conservation among strains is more limiting than uniqueness. SARS and Ebola Zaire are exceptional, as additional target genomes currently do not improve predictions, but near neighbor sequences are urgently needed. Our results also indicate that double stranded DNA viruses are more conserved among strains than are RNA viruses, since in most cases there was at least one conserved signature candidate for the DNA viruses and zero conserved signature candidates for the RNA viruses.
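
    The two selection criteria named above (conserved among target strains, unique relative to other species) can be sketched with exact k-mer sets. This is a simplified illustration, not the authors' signature prediction pipeline: exact k-mer matching, the function names and the toy sequences are assumptions, and the sketch ignores reverse complements and near-matches.

```python
def kmers(sequence, k):
    """Set of all length-k substrings of a sequence."""
    return {sequence[i:i + k] for i in range(len(sequence) - k + 1)}


def signature_candidates(target_genomes, near_neighbor_genomes, k):
    """k-mers present in every target genome and absent from every near neighbor."""
    if not target_genomes:
        raise ValueError("at least one target genome is required")
    # Conservation: keep only k-mers shared by all target strains.
    conserved = set.intersection(*(kmers(g, k) for g in target_genomes))
    # Uniqueness: discard anything also found in a near-neighbor genome.
    for genome in near_neighbor_genomes:
        conserved -= kmers(genome, k)
    return sorted(conserved)


if __name__ == "__main__":
    targets = ["AACGTT", "GACGTA"]   # toy "strains" of the target species
    neighbors = ["TTACGA"]           # toy near-neighbor genome
    print(signature_candidates(targets, neighbors, k=4))
```

    Adding target genomes can only shrink the conserved set, and adding near-neighbor genomes can only shrink the unique set, which matches the abstract's framing of how many genomes of each kind are needed before the candidate list stabilizes.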

  11. Cargo crowding at actin-rich regions along axons causes local traffic jams.

    PubMed

    Sood, Parul; Murthy, Kausalya; Kumar, Vinod; Nonet, Michael L; Menon, Gautam I; Koushika, Sandhya P

    2018-03-01

    Steady axonal cargo flow is central to the functioning of healthy neurons. However, a substantial fraction of cargo in axons remains stationary for up to several minutes. We examine the transport of precursors of synaptic vesicles (pre-SVs), endosomes and mitochondria in Caenorhabditis elegans touch receptor neurons, showing that stationary cargo are predominantly present at actin-rich regions along the neuronal process. Stationary vesicles at actin-rich regions increase the propensity of moving vesicles to stall at the same location, resulting in traffic jams arising from physical crowding. Such local traffic jams at actin-rich regions are likely to be a general feature of axonal transport since they also occur in Drosophila neurons. Repeated touch stimulation of C. elegans reduces the density of stationary pre-SVs, indicating that these traffic jams can act as both sources and sinks of vesicles. This suggests that vesicles trapped in actin-rich regions are functional reservoirs that may contribute to maintaining robust cargo flow in the neuron. © 2017 John Wiley & Sons A/S. Published by John Wiley & Sons Ltd.

  12. Plasmodium falciparum Nucleosomes Exhibit Reduced Stability and Lost Sequence Dependent Nucleosome Positioning

    PubMed Central

    Silberhorn, Elisabeth; Schwartz, Uwe; Symelka, Anne; de Koning-Ward, Tania; Längst, Gernot

    2016-01-01

    The packaging and organization of genomic DNA into chromatin represents an additional regulatory layer of gene expression, with specific nucleosome positions that restrict the accessibility of regulatory DNA elements. The mechanisms that position nucleosomes in vivo are thought to depend on the biophysical properties of the histones, sequence patterns, like phased di-nucleotide repeats and the architecture of the histone octamer that folds DNA in 1.65 tight turns. Comparative studies of human and P. falciparum histones reveal that the latter have a strongly reduced ability to recognize internal sequence dependent nucleosome positioning signals. In contrast, the nucleosomes are positioned by AT-repeat sequences flanking nucleosomes in vivo and in vitro. Further, the strong sequence variations in the plasmodium histones, compared to other mammalian histones, do not present adaptations to its AT-rich genome. Human and parasite histones bind with higher affinity to GC-rich DNA and with lower affinity to AT-rich DNA. However, the plasmodium nucleosomes are overall less stable, with increased temperature induced mobility, decreased salt stability of the histones H2A and H2B and considerably reduced binding affinity to GC-rich DNA, as compared with the human nucleosomes. In addition, we show that plasmodium histone octamers form the shortest known nucleosome repeat length (155 bp) in vitro and in vivo. Our data suggest that the biochemical properties of the parasite histones are distinct from the typical characteristics of other eukaryotic histones and these properties reflect the increased accessibility of the P. falciparum genome. PMID:28033404

  13. The Evolution of Bony Vertebrate Enhancers at Odds with Their Coding Sequence Landscape.

    PubMed

    Yousaf, Aisha; Sohail Raza, Muhammad; Ali Abbasi, Amir

    2015-08-06

    Enhancers lie at the heart of transcriptional and developmental gene regulation. Therefore, changes in enhancer sequences usually disrupt the target gene expression and result in disease phenotypes. Despite the well-established role of enhancers in development and disease, evolutionary sequence studies are lacking. The current study attempts to unravel the puzzle of bony vertebrates' conserved noncoding elements (CNE) enhancer evolution. Bayesian phylogenetics of enhancer sequences spotlights promising interordinal relationships among placental mammals, proposing a closer relationship between humans and laurasiatherians while placing rodents at the basal position. Clock-based estimates of enhancer evolution provided a dynamic picture of interspecific rate changes across the bony vertebrate lineage. Moreover, coelacanth in the study augmented our appreciation of the vertebrate cis-regulatory evolution during water-land transition. Intriguingly, we observed a pronounced upsurge in enhancer evolution in land-dwelling vertebrates. These novel findings triggered us to further investigate the evolutionary trend of coding as well as CNE nonenhancer repertoires, to highlight the relative evolutionary dynamics of diverse genomic landscapes. Surprisingly, the evolutionary rates of enhancer sequences were clearly at odds with those of the coding and the CNE nonenhancer sequences during vertebrate adaptation to land, with land vertebrates exhibiting significantly reduced rates of coding sequence evolution in comparison to their fast evolving regulatory landscape. The observed variation in tetrapod cis-regulatory elements caused the fine-tuning of associated gene regulatory networks. Therefore, the increased evolutionary rate of tetrapods' enhancer sequences might be responsible for the variation in developmental regulatory circuits during the process of vertebrate adaptation to land. © The Author(s) 2015. Published by Oxford University Press on behalf of the Society for

  14. "Looking-at-nothing" during sequential sensorimotor actions: Long-term memory-based eye scanning of remembered target locations.

    PubMed

    Foerster, Rebecca M

    2018-03-01

    Before acting, humans saccade to a target object to extract relevant visual information. Even when acting on remembered objects, locations previously occupied by relevant objects are fixated during imagery and memory tasks - a phenomenon called "looking-at-nothing". While looking-at-nothing was robustly found in tasks encouraging declarative memory build-up, results are mixed in the case of procedural sensorimotor tasks. Eye-guidance to manual targets in complete darkness was observed in a task practiced for days beforehand, while investigations using only a single session did not find fixations to remembered action targets. Here, it is asked whether looking-at-nothing can be found in a single sensorimotor session and thus independent from sleep consolidation, and how it progresses when visual information is repeatedly unavailable. Eye movements were investigated in a computerized version of the trail making test. Participants clicked on numbered circles in ascending sequence. Fifty trials were performed with the same spatial arrangement of 9 visual targets to enable long-term memory consolidation. During 50 consecutive trials, participants had to click the remembered target sequence on an empty screen. Participants scanned the visual targets and also the empty target locations sequentially with their eyes, although the latter less precisely than the former. Over the course of the memory trials, manual and oculomotor sequential target scanning became more similar to the visual trials. Results argue for robust looking-at-nothing during procedural sensorimotor tasks provided that long-term memory information is sufficient. Copyright © 2018 Elsevier Ltd. All rights reserved.

  15. CRISPR/Cas9-mediated gene knockout screens and target identification via whole-genome sequencing uncover host genes required for picornavirus infection.

    PubMed

    Kim, Heon Seok; Lee, Kyungjin; Bae, Sangsu; Park, Jeongbin; Lee, Chong-Kyo; Kim, Meehyein; Kim, Eunji; Kim, Minju; Kim, Seokjoong; Kim, Chonsaeng; Kim, Jin-Soo

    2017-06-23

    Several groups have used genome-wide libraries of lentiviruses encoding small guide RNAs (sgRNAs) for genetic screens. In most cases, sgRNA expression cassettes are integrated into cells by using lentiviruses, and target genes are statistically estimated by the readout of sgRNA sequences after targeted sequencing. We present a new virus-free method for human gene knockout screens using a genome-wide library of CRISPR/Cas9 sgRNAs based on plasmids and target gene identification via whole-genome sequencing (WGS) confirmation of authentic mutations rather than statistical estimation through targeted amplicon sequencing. We used 30,840 pairs of individually synthesized oligonucleotides to construct the genome-scale sgRNA library, collectively targeting 10,280 human genes (i.e., three sgRNAs per gene). These plasmid libraries were co-transfected with a Cas9-expression plasmid into human cells, which were then treated with cytotoxic drugs or viruses. Only cells lacking key factors essential for cytotoxic drug metabolism or viral infection were able to survive. Genomic DNA isolated from cells that survived these challenges was subjected to WGS to directly identify CRISPR/Cas9-mediated causal mutations essential for cell survival. With this approach, we were able to identify known and novel genes essential for viral infection in human cells. We propose that genome-wide sgRNA screens based on plasmids coupled with WGS are powerful tools for forward genetics studies and drug target discovery. © 2017 by The American Society for Biochemistry and Molecular Biology, Inc.

  16. Detection of canonical A-to-G editing events at 3′ UTRs and microRNA target sites in human lungs using next-generation sequencing

    PubMed Central

    Soundararajan, Ramani; Stearns, Timothy M.; Griswold, Anthony J.; Mehta, Arpit; Czachor, Alexander; Fukumoto, Jutaro; Lockey, Richard F.; King, Benjamin L.; Kolliputi, Narasaiah

    2015-01-01

    RNA editing is a post-transcriptional modification of RNA. The majority of these changes result from adenosine deaminase acting on RNA (ADARs) catalyzing the conversion of adenosine residues to inosine in double-stranded RNAs (dsRNAs). Massively parallel sequencing has enabled the identification of RNA editing sites in human transcriptomes. In this study, we sequenced DNA and RNA from human lungs and identified RNA editing sites with high confidence via a computational pipeline utilizing stringent analysis thresholds. We identified a total of 3,447 editing sites that overlapped in three human lung samples, with 50% of these sites having canonical A-to-G base changes. Approximately 27% of the edited sites overlapped with Alu repeats and showed A-to-G clustering (>3 clusters in 100 bp). The majority of edited sites mapped to either 3′ untranslated regions (UTRs) or introns close to splice sites, whereas only a few sites were in exons resulting in non-synonymous amino acid changes. Interestingly, we identified 652 A-to-G editing events in the 3′ UTR of 205 target genes that mapped to 932 potential miRNA target binding sites. Several of these miRNA edited sites were validated in silico. Additionally, we validated several A-to-G edited sites by Sanger sequencing. Altogether, our study suggests a role for RNA editing in miRNA-mediated gene regulation and splicing in human lungs. In this study, we have generated an RNA editome of human lung tissue that can be compared with other RNA editomes across different lung tissues to delineate a role for RNA editing in normal and diseased states. PMID:26486088
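
    The comparison of DNA and RNA reads described above can be illustrated with a minimal sketch. It is not the authors' pipeline: the function name, the toy genome string and the per-position RNA base calls are all hypothetical, and no read-depth or quality thresholds are applied. It only shows how an RNA/DNA mismatch is labelled and how the share of canonical A-to-G changes could be computed.

```python
def classify_mismatches(genome, rna_calls):
    """Label each RNA/DNA mismatch as 'X>Y' (genomic base > RNA base).

    genome    -- reference DNA string (hypothetical toy input)
    rna_calls -- dict mapping 0-based position to the base called from RNA
    """
    changes = {}
    for pos, rna_base in rna_calls.items():
        dna_base = genome[pos]
        if rna_base != dna_base:
            changes[pos] = f"{dna_base}>{rna_base}"
    return changes


# Toy data, invented for illustration only.
genome = "TTACGATAGC"
rna_calls = {2: "G", 5: "G", 7: "A", 9: "T"}

changes = classify_mismatches(genome, rna_calls)
# Inosine is read as G by the sequencer, so A>G is the canonical signature.
canonical = [pos for pos, change in changes.items() if change == "A>G"]
canonical_fraction = len(canonical) / len(changes)
```

    A real analysis would additionally filter on coverage, strand and known SNPs before calling a site edited.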

  17. Evaluation of targeted exome sequencing for 28 protein-based blood group systems, including the homologous gene systems, for blood group genotyping.

    PubMed

    Schoeman, Elizna M; Lopez, Genghis H; McGowan, Eunike C; Millard, Glenda M; O'Brien, Helen; Roulis, Eileen V; Liew, Yew-Wah; Martin, Jacqueline R; McGrath, Kelli A; Powley, Tanya; Flower, Robert L; Hyland, Catherine A

    2017-04-01

    Blood group single nucleotide polymorphism genotyping probes for a limited range of polymorphisms. This study investigated whether massively parallel sequencing (also known as next-generation sequencing), with a targeted exome strategy, provides an extended blood group genotype and the extent to which massively parallel sequencing correctly genotypes in homologous gene systems, such as RH and MNS. Donor samples (n = 28) that were extensively phenotyped and genotyped using single nucleotide polymorphism typing, were analyzed using the TruSight One Sequencing Panel and MiSeq platform. Genes for 28 protein-based blood group systems, GATA1, and KLF1 were analyzed. Copy number variation analysis was used to characterize complex structural variants in the GYPC and RH systems. The average sequencing depth per target region was 66.2 ± 39.8. Each sample harbored on average 43 ± 9 variants, of which 10 ± 3 were used for genotyping. For the 28 samples, massively parallel sequencing variant sequences correctly matched expected sequences based on single nucleotide polymorphism genotyping data. Copy number variation analysis defined the Rh C/c alleles and complex RHD hybrids. Hybrid RHD*D-CE-D variants were correctly identified, but copy number variation analysis did not confidently distinguish between D and CE exon deletion versus rearrangement. The targeted exome sequencing strategy employed extended the range of blood group genotypes detected compared with single nucleotide polymorphism typing. This single-test format included detection of complex MNS hybrid cases and, with copy number variation analysis, defined RH hybrid genes along with the RHCE*C allele hitherto difficult to resolve by variant detection. The approach is economical compared with whole-genome sequencing and is suitable for a red blood cell reference laboratory setting. © 2017 AABB.

  18. Secondary neutrons as the main source of neutron-rich fission products in the bombardment of a thick U target by 1 GeV protons

    NASA Astrophysics Data System (ADS)

    Barzakh, A. E.; Lhersonneau, G.; Batist, L. Kh.; Fedorov, D. V.; Ivanov, V. S.; Mezilev, K. A.; Molkanov, P. L.; Moroz, F. V.; Orlov, S. Yu.; Panteleev, V. N.; Volkov, Yu. M.; Alyakrinskiy, O.; Barbui, M.; Stroe, L.; Tecchio, L. B.

    2011-05-01

    The diffusion-effusion model has been used to analyse the release and yields of Fr and Cs isotopes from uranium carbide targets of very different thicknesses (6.3 and 148 g/cm2) bombarded by a 1 GeV proton beam. Release curves of several isotopes of the same element and production efficiency versus decay half-life are well fitted with the same set of parameters. Comparison of efficiencies for neutron-rich and neutron-deficient Cs isotopes enables separation of the contributions from the primary (p + 238U) and secondary (n + 238U) reactions to the production of neutron-rich Cs isotopes. A rather simple calculation of the neutron contribution describes these data fairly well. The FLUKA code describes the primary and secondary-reaction contributions to the Cs isotope production efficiencies for different targets quite well.

  19. Comparative Analysis of Predicted Plastid-Targeted Proteomes of Sequenced Higher Plant Genomes

    PubMed Central

    Schaeffer, Scott; Harper, Artemus; Raja, Rajani; Jaiswal, Pankaj; Dhingra, Amit

    2014-01-01

    Plastids are actively involved in numerous plant processes critical to growth, development and adaptation. They play a primary role in photosynthesis, pigment and monoterpene synthesis, gravity sensing, starch and fatty acid synthesis, as well as oil and protein storage. We applied two complementary methods to analyze the recently published apple genome (Malus × domestica) to identify putative plastid-targeted proteins, the first using TargetP and the second using a custom workflow utilizing a set of predictive programs. Apple shares roughly 40% of its 10,492 putative plastid-targeted proteins with the Arabidopsis (Arabidopsis thaliana) plastid-targeted proteome as identified by the Chloroplast 2010 project and ∼57% of its entire proteome with Arabidopsis. This suggests that the plastid-targeted proteomes of apple and Arabidopsis are different, and interestingly alludes to the presence of differential targeting of homologs between the two species. Co-expression analysis of 2,224 genes encoding putative plastid-targeted apple proteins suggests that they play a role in plant developmental and intermediary metabolism. Further, an inter-specific comparison of Arabidopsis, Prunus persica (Peach), Malus × domestica (Apple), Populus trichocarpa (Black cottonwood), Fragaria vesca (Woodland Strawberry), Solanum lycopersicum (Tomato) and Vitis vinifera (Grapevine) also identified a large number of novel species-specific plastid-targeted proteins. This analysis also revealed the presence of alternatively targeted homologs across species. Two separate analyses revealed that a small subset of proteins, one representing 289 protein clusters and the other 737 unique protein sequences, are conserved between seven plastid-targeted angiosperm proteomes. The majority of the novel proteins were annotated to play roles in stress response, transport, catabolic processes, and cellular component organization. Our results suggest that the current state of knowledge regarding

  20. Characterisation of IS153, an IS3-family insertion sequence isolated from Lactobacillus sanfranciscensis and its use for strain differentiation.

    PubMed

    Ehrmann, M A; Vogel, R E

    2001-11-01

    An insertion sequence has been identified in the genome of Lactobacillus sanfranciscensis DSM 20451T as a segment of 1351 nucleotides containing 37-bp imperfect terminal inverted repeats. The sequence of this element encodes two out-of-phase, overlapping open reading frames, orfA and orfB, from which three putative proteins are produced. OrfAB is a transframe protein produced by -1 translational frameshifting between orfA and orfB that is presumed to be the transposase. The large orfAB of this element encodes a 342 amino acid protein that displays similarities with transposases encoded by bacterial insertion sequences belonging to the IS3 family. In L. sanfranciscensis type strain DSM 20451T multiple truncated IS elements were identified. Inverse PCR was used to analyze target sites of four of these elements, but apart from their highly AT-rich character no sequence specificity has been identified so far. Moreover, no flanking direct repeats were identified. Multiple copies of IS153 were detected by hybridization in other strains of L. sanfranciscensis. The resulting hybridization patterns were shown to differentiate between organisms at strain level, in contrast to a probe targeted against the 16S rDNA. With a PCR-based approach, IS153 or highly similar sequences were detected in L. acidophilus, L. casei, L. malefermentans, L. plantarum, L. hilgardii, L. collinoides, L. farciminis, L. sakei, L. salivarius and L. reuteri as well as in Enterococcus faecium, Pediococcus acidilactici and P. pentosaceus.
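
    The AT-rich character of the target sites noted above is straightforward to quantify. The following is a minimal sketch, not taken from the paper: the function name and the example flank sequence are hypothetical, and the abstract does not state which threshold, if any, the authors used.

```python
def at_fraction(seq):
    """Return the fraction of A and T bases in a DNA sequence."""
    seq = seq.upper()
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "AT" for base in seq) / len(seq)


# Hypothetical flanking sequence of an insertion site (invented example).
flank = "ATTTAGCA"
flank_at = at_fraction(flank)  # 6 of 8 bases are A or T
```

    Comparing such values against the genome-wide AT fraction of the host would indicate whether the insertion sites are unusually AT-rich or simply reflect the background composition.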

  1. Sequence- and Interactome-Based Prediction of Viral Protein Hotspots Targeting Host Proteins: A Case Study for HIV Nef

    PubMed Central

    Sarmady, Mahdi; Dampier, William; Tozeren, Aydin

    2011-01-01

    Virus proteins alter protein pathways of the host toward the synthesis of viral particles by breaking and making edges via binding to host proteins. In this study, we developed a computational approach to predict viral sequence hotspots for binding to host proteins based on sequences of viral and host proteins and literature-curated virus-host protein interactome data. We use a motif discovery algorithm repeatedly on collections of sequences of viral proteins and immediate binding partners of their host targets and choose only those motifs that are conserved on viral sequences and highly statistically enriched among binding partners of virus protein targeted host proteins. Our results match experimental data on binding sites of Nef to host proteins such as MAPK1, VAV1, LCK, HCK, HLA-A, CD4, FYN, and GNB2L1 with high statistical significance, but the approach is a poor predictor of Nef binding sites on highly flexible, hoop-like regions. Predicted hotspots recapture CD8 cell epitopes of HIV Nef, highlighting their importance in modulating virus-host interactions. Host proteins potentially targeted or outcompeted by Nef appear to crowd the T cell receptor, natural killer cell mediated cytotoxicity, and neurotrophin signaling pathways. Scanning of HIV Nef motifs on multiple alignments of hepatitis C protein NS5A produces results consistent with the literature, indicating the potential value of the hotspot discovery in advancing our understanding of virus-host crosstalk. PMID:21738584

  2. Exome sequencing of hepatocellular carcinomas identifies new mutational signatures and potential therapeutic targets

    DOE PAGES

    Schulze, Kornelius; Imbeaud, Sandrine; Letouzé, Eric; ...

    2015-03-30

    Our genomic analyses promise to improve tumor characterization to optimize personalized treatment for patients with hepatocellular carcinoma (HCC). Exome sequencing analysis of 243 liver tumors identified mutational signatures associated with specific risk factors, mainly combined alcohol and tobacco consumption and exposure to aflatoxin B1. We identified 161 putative driver genes associated with 11 recurrently altered pathways. Associations of mutations defined 3 groups of genes related to risk factors and centered on CTNNB1 (alcohol), TP53 (hepatitis B virus, HBV) and AXIN1. These analyses according to tumor stage progression identified TERT promoter mutation as an early event, whereas FGF3, FGF4, FGF19 or CCND1 amplification and TP53 and CDKN2A alterations appeared at more advanced stages in aggressive tumors. In 28% of the tumors, we identified genetic alterations potentially targetable by US Food and Drug Administration (FDA)–approved drugs. Finally, we identified risk factor–specific mutational signatures and defined the extensive landscape of altered genes and pathways in HCC, which will be useful to design clinical trials for targeted therapy.

  3. Exome sequencing of hepatocellular carcinomas identifies new mutational signatures and potential therapeutic targets

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Schulze, Kornelius; Imbeaud, Sandrine; Letouzé, Eric

    Our genomic analyses promise to improve tumor characterization to optimize personalized treatment for patients with hepatocellular carcinoma (HCC). Exome sequencing analysis of 243 liver tumors identified mutational signatures associated with specific risk factors, mainly combined alcohol and tobacco consumption and exposure to aflatoxin B1. We identified 161 putative driver genes associated with 11 recurrently altered pathways. Associations of mutations defined 3 groups of genes related to risk factors and centered on CTNNB1 (alcohol), TP53 (hepatitis B virus, HBV) and AXIN1. These analyses according to tumor stage progression identified TERT promoter mutation as an early event, whereas FGF3, FGF4, FGF19 or CCND1 amplification and TP53 and CDKN2A alterations appeared at more advanced stages in aggressive tumors. In 28% of the tumors, we identified genetic alterations potentially targetable by US Food and Drug Administration (FDA)–approved drugs. Finally, we identified risk factor–specific mutational signatures and defined the extensive landscape of altered genes and pathways in HCC, which will be useful to design clinical trials for targeted therapy.

  4. Evaluation of Targeted Next-Generation Sequencing for Detection of Bovine Pathogens in Clinical Samples.

    PubMed

    Anis, Eman; Hawkins, Ian K; Ilha, Marcia R S; Woldemeskel, Moges W; Saliki, Jeremiah T; Wilkes, Rebecca P

    2018-07-01

    The laboratory diagnosis of infectious diseases, especially those caused by mixed infections, is challenging. Routinely, it requires submission of multiple samples to separate laboratories. Advances in next-generation sequencing (NGS) have provided the opportunity for development of a comprehensive method to identify infectious agents. This study describes the use of target-specific primers for PCR-mediated amplification with the NGS technology in which pathogen genomic regions of interest are enriched and selectively sequenced from clinical samples. In the study, 198 primers were designed to target 43 common bovine and small-ruminant bacterial, fungal, viral, and parasitic pathogens, and a bioinformatics tool was specifically constructed for the detection of targeted pathogens. The primers were confirmed to detect the intended pathogens by testing reference strains and isolates. The method was then validated using 60 clinical samples (including tissues, feces, and milk) that were also tested with other routine diagnostic techniques. The detection limits of the targeted NGS method were evaluated using 10 representative pathogens that were also tested by quantitative PCR (qPCR), and the NGS method was able to detect the organisms from samples with qPCR threshold cycle (CT) values in the 30s. The method was successful for the detection of multiple pathogens in the clinical samples, including some additional pathogens missed by the routine techniques because the specific tests needed for the particular organisms were not performed. The results demonstrate the feasibility of the approach and indicate that it is possible to incorporate NGS as a diagnostic tool in a cost-effective manner into a veterinary diagnostic laboratory. Copyright © 2018 Anis et al.

  5. Comparative Analysis of Fruit Ripening-Related miRNAs and Their Targets in Blueberry Using Small RNA and Degradome Sequencing

    PubMed Central

    Hou, Yanming; Zhai, Lulu; Li, Xuyan; Xue, Yu; Wang, Jingjing; Yang, Pengjie; Cao, Chunmei; Li, Hongxue; Cui, Yuhai; Bian, Shaomin

    2017-01-01

    MicroRNAs (miRNAs) play vital roles in the regulation of fruit development and ripening. Blueberry is an important small berry fruit crop with economical and nutritional value. However, nothing is known about the miRNAs and their targets involved in blueberry fruit ripening. In this study, using high-throughput sequencing of small RNAs, 84 known miRNAs belonging to 28 families and 16 novel miRNAs were identified in white fruit (WF) and blue fruit (BF) libraries, which represent fruit ripening onset and in progress, respectively. Among them, 41 miRNAs were shown to be differentially expressed during fruit maturation, and 16 miRNAs representing 16 families were further chosen to validate the sRNA sequencing data by stem-loop qRT-PCR. Meanwhile, 178 targets were identified for 41 known and 7 novel miRNAs in WF and BF libraries using degradome sequencing, and targets of miR160 were validated using RLM-RACE (RNA Ligase-Mediated (RLM)-Rapid Amplification of cDNA Ends) approach. Moreover, the expression patterns of 6 miRNAs and their targets were examined during fruit development and ripening. Finally, integrative analysis of miRNAs and their targets revealed a complex miRNA-mRNA regulatory network involving a wide variety of biological processes. The findings will facilitate future investigations of the miRNA-mediated mechanisms that regulate fruit development and ripening in blueberry. PMID:29257112

  6. Insights into Deep-Sea Sediment Fungal Communities from the East Indian Ocean Using Targeted Environmental Sequencing Combined with Traditional Cultivation

    PubMed Central

    Zhang, Xiao-yong; Tang, Gui-ling; Xu, Xin-ya; Nong, Xu-hua; Qi, Shu-Hua

    2014-01-01

    The fungal diversity in deep-sea environments has recently gained an increasing amount of attention. Our knowledge and understanding of the true fungal diversity and the role it plays in deep-sea environments, however, is still limited. We investigated the fungal community structure in five sediments from a depth of ∼4000 m in the East Indian Ocean using a combination of targeted environmental sequencing and traditional cultivation. This approach resulted in the recovery of a total of 45 fungal operational taxonomic units (OTUs) and 20 culturable fungal phylotypes. This finding indicates that there is a great amount of fungal diversity in the deep-sea sediments collected in the East Indian Ocean. Three fungal OTUs and one culturable phylotype demonstrated high divergence (89%–97%) from the existing sequences in GenBank. Moreover, 44.4% of the fungal OTUs and 30% of the culturable fungal phylotypes are new reports for deep-sea sediments. These results suggest that the deep-sea sediments from the East Indian Ocean can serve as habitats for new fungal communities compared with other deep-sea environments. In addition, different fungal communities could be detected when using targeted environmental sequencing compared with traditional cultivation in this study, which suggests that a combination of targeted environmental sequencing and traditional cultivation will generate a more diverse fungal community in deep-sea environments than using either targeted environmental sequencing or traditional cultivation alone. This study is the first to report new insights into the fungal communities in deep-sea sediments from the East Indian Ocean, which increases our knowledge and understanding of the fungal diversity in deep-sea environments. PMID:25272044

  7. Methods for sequencing GC-rich and CCT repeat DNA templates

    DOEpatents

    Robinson, Donna L.

    2007-02-20

    The present invention is directed to a PCR-based method of cycle sequencing DNA and other polynucleotide sequences having high CG content and regions of high GC content, and includes for example DNA strands with a high Cytosine and/or Guanosine content and repeated motifs such as CCT repeats.

  8. The Replication Focus Targeting Sequence (RFTS) Domain Is a DNA-competitive Inhibitor of Dnmt1

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Syeda, Farisa; Fagan, Rebecca L.; Wean, Matthew

    Dnmt1 (DNA methyltransferase 1) is the principal enzyme responsible for maintenance of cytosine methylation at CpG dinucleotides in the mammalian genome. The N-terminal replication focus targeting sequence (RFTS) domain of Dnmt1 has been implicated in subcellular localization, protein association, and catalytic function. However, progress in understanding its function has been limited by the lack of assays for and a structure of this domain. Here, we show that the naked DNA- and polynucleosome-binding activities of Dnmt1 are inhibited by the RFTS domain, which functions by virtue of binding the catalytic domain to the exclusion of DNA. Kinetic analysis with a fluorogenic DNA substrate established the RFTS domain as a 600-fold inhibitor of Dnmt1 enzymatic activity. The crystal structure of the RFTS domain reveals a novel fold and supports a mechanism in which an RFTS-targeted Dnmt1-binding protein, such as Uhrf1, may activate Dnmt1 for DNA binding.

  9. CRISPRdirect: software for designing CRISPR/Cas guide RNA with reduced off-target sites

    PubMed Central

    Naito, Yuki; Hino, Kimihiro; Bono, Hidemasa; Ui-Tei, Kumiko

    2015-01-01

    Summary: CRISPRdirect is a simple and functional web server for selecting rational CRISPR/Cas targets from an input sequence. The CRISPR/Cas system is a promising technique for genome engineering which allows target-specific cleavage of genomic DNA guided by Cas9 nuclease in complex with a guide RNA (gRNA), that complementarily binds to a ∼20 nt targeted sequence. The target sequence requirements are twofold. First, the 5′-NGG protospacer adjacent motif (PAM) sequence must be located adjacent to the target sequence. Second, the target sequence should be specific within the entire genome in order to avoid off-target editing. CRISPRdirect enables users to easily select rational target sequences with minimized off-target sites by performing exhaustive searches against genomic sequences. The server currently incorporates the genomic sequences of human, mouse, rat, marmoset, pig, chicken, frog, zebrafish, Ciona, fruit fly, silkworm, Caenorhabditis elegans, Arabidopsis, rice, Sorghum and budding yeast. Availability: Freely available at http://crispr.dbcls.jp/. Contact: y-naito@dbcls.rois.ac.jp Supplementary information: Supplementary data are available at Bioinformatics online. PMID:25414360
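
    The first target requirement above (a target sequence with an adjacent NGG PAM) can be sketched in a few lines. This is not CRISPRdirect's code: the function name and the example input are hypothetical, the sketch assumes a 20-nt target with the PAM immediately 3′ of it, scans the forward strand only, and does not perform the genome-wide off-target search that the second requirement calls for.

```python
import re


def find_pam_targets(seq, target_len=20):
    """Return (start, target, pam) for every target followed by an NGG PAM.

    Only the forward strand is scanned; a lookahead is used so that
    overlapping candidates are all reported.
    """
    seq = seq.upper()
    pattern = re.compile(r"(?=([ACGT]{%d})([ACGT]GG))" % target_len)
    return [(m.start(), m.group(1), m.group(2)) for m in pattern.finditer(seq)]


# Invented 24-nt input: a 20-nt candidate followed by TGG and one extra base.
example = "ACGTACGTACGTACGTACGTTGGA"
hits = find_pam_targets(example)
```

    Candidates on the reverse strand could be found by running the same function on the reverse complement of the input; ranking them by specificity would still require a search against the whole genome.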

  10. Selectivity sequences and sorption capacities of phosphatic clay and humus rich soil towards the heavy metals present in zinc mine tailing.

    PubMed

    Chaturvedi, Pranav Kumar; Seth, Chandra Shekhar; Misra, Virendra

    2007-08-25

    Sorption efficacy of phosphatic clay and humus rich soil, alone and in combination, was tested towards heavy metals present in zinc mine tailing (Zawar Zinc Mine), Udaipur (India). Characterization of the zinc mine tailing sample indicated the presence of Pb, Cu, Zn and Mn at concentrations of 637, 186, 720 and 577 µg g⁻¹, respectively. For sorption efficacy, the zinc mine tailing soil was amended with phosphatic clay and humus rich soil separately and in combination, and a leachability study was performed by batch experiment at pH values from 3 to 9. The data showed that the percent leachability of heavy metals in non-amended soil was 75-90%. After amendment with phosphatic clay, the percent leachability of heavy metals fell to 35-45%. Further, the addition of humus soil to phosphatic clay decreased the percent leachability to 5-15% at all tested pH values. A column leachability experiment was performed to evaluate the rate of leachability. The shape of the cumulative curves of Pb, Cu, Zn and Mn showed an increase in concavity in the following order: Pb < Cu < Zn < Mn. The selectivity sequence calculated on the basis of the distribution coefficient (Kd) from the batch experiment was Pb > Cu > Zn > Mn. Further, Langmuir isotherms applied for the sorption studies indicated that phosphatic clay in the presence of humus soil had high affinity for Pb followed by Cu, Zn and Mn, with sorption capacities (b) of 139.94, 97.02, 83.32 and 67.58 µg g⁻¹, respectively.
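
    The two quantities used above can be written out as a short sketch, assuming the standard Langmuir form q = b·K·C / (1 + K·C) and the usual definition of the distribution coefficient as sorbed concentration divided by solution concentration. The sorption capacities b are those quoted in the abstract; the affinity constant K and the equilibrium concentration are hypothetical values chosen only to make the example run, and the function names are not from the paper.

```python
def langmuir_sorbed(c_eq, b, k):
    """Sorbed amount q for equilibrium concentration c_eq (Langmuir isotherm).

    b -- maximum sorption capacity (q approaches b as c_eq grows)
    k -- affinity constant (hypothetical here; not reported in the abstract)
    """
    return b * k * c_eq / (1.0 + k * c_eq)


def distribution_coefficient(q_sorbed, c_eq):
    """Kd: ratio of sorbed concentration to solution concentration."""
    return q_sorbed / c_eq


# Sorption capacities b as quoted in the abstract.
capacities = {"Pb": 139.94, "Cu": 97.02, "Zn": 83.32, "Mn": 67.58}

# Rank metals by capacity; K and c_eq below are illustrative assumptions.
ranking = sorted(capacities, key=capacities.get, reverse=True)
q_pb = langmuir_sorbed(c_eq=1.0, b=capacities["Pb"], k=1.0)
kd_pb = distribution_coefficient(q_pb, c_eq=1.0)
```

    With equal K for all metals the ranking by b reproduces the reported order Pb, Cu, Zn, Mn; fitting b and K to measured batch data would be needed for any quantitative statement.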

  11. Hydrodynamic models for novae with ejecta rich in oxygen, neon and magnesium

    NASA Technical Reports Server (NTRS)

    Starrfield, S.; Sparks, W. M.; Truran, J. W.

    1985-01-01

    The characteristics of a new class of novae are identified and explained. This class consists of those objects that have been observed to eject material rich in oxygen, neon, magnesium, and aluminum at high velocities. We propose that for this class of novae the outburst is occurring not on a carbon-oxygen white dwarf but on an oxygen-neon-magnesium white dwarf which has evolved from a star which had a main sequence mass of approx. 8 solar masses to approx. 12 solar masses. An outburst was simulated by evolving 1.25 solar mass white dwarfs accreting hydrogen rich material at various rates. The effective enrichment of the envelope by ONeMg material from the core is simulated by enhancing oxygen in the accreted layers. The resulting evolutionary sequences can eject the entire accreted envelope plus core material at high velocities. They can also become super-Eddington at maximum bolometric luminosity. The expected frequency of such events (approx. 1/4) is in good agreement with the observed numbers of these novae.

  12. Effect of inherent location uncertainty on detection of stationary targets in noisy image sequences.

    PubMed

    Manjeshwar, R M; Wilson, D L

    2001-01-01

    The effect of inherent location uncertainty on the detection of stationary targets was determined in noisy image sequences. Targets were thick and thin projected cylinders mimicking arteries, catheters, and guide wires in medical imaging x-ray fluoroscopy. With the use of an adaptive forced-choice method, detection contrast sensitivity (the inverse of contrast) was measured both with and without marker cues that directed the attention of observers to the target location. With the probability correct clamped at 80%, contrast sensitivity increased an average of 77% when the marker was added to the thin-cylinder target. There was an insignificant effect on the thick cylinder. The large enhancement with the thin cylinder was obtained even though the target was located exactly in the center of a small panel, giving observers the impression that it was well localized. Psychometric functions consisting of d' plotted as a function of the square root of the signal-energy-to-noise ratio gave a positive x intercept for the case of the thin cylinder without a marker. This x intercept, characteristic of uncertainty in other types of detection experiments, disappeared when the marker was added or when the thick cylinder was used. Inherent location uncertainty was further characterized by using four different markers with varying proximity to the target. Visual detection by human observers increased monotonically as the markers better localized the target. Human performance was modeled as a matched-filter detector with an uncertainty in the placement of the template. The removal of a location cue was modeled by introducing a location uncertainty of approximately 0.4 mm on the display device or only 7 µm on the retina, a size on the order of a single photoreceptor field. We conclude that detection is affected by target location uncertainty on the order of cellular dimensions, an observation with important implications for detection mechanisms in humans. In medical

  13. Sedimentary environment and tectonic deformations of the Neoproterozoic Iron formation at the Wadi El-Dabbah greenstone sequence, Central Eastern Desert, Egypt

    NASA Astrophysics Data System (ADS)

    Kiyokawa, S.; Suzuki, T.; Ikehara, M.; Horie, K.; Takehara, M.; Abd-Elmonem, H.; Dawoud, A. D. M.; El-Hasan, M. M.

    2017-12-01

    The El-Dabbah area in the Central Eastern Desert of the Nubian Shield preserves a Neoproterozoic, lower greenschist facies volcaniclastic greenstone sequence and an overlying subaerial sedimentary sequence related to strike-slip deformation (Hammamat Group). The volcaniclastic greenstone sequence (El-Dabbah Formation) preserves a well-stratified succession containing several iron beds. Four tectonic deformation events were identified in this area: thrust deformation (D1); strike-slip deformation with transtensional normal faulting and strong left-lateral shear (D2); strike-slip deformation that formed a subaerial pull-apart sedimentary basin (D3); and extensional deformation after sedimentation of the Hammamat Group (D4). New age data from intrusions give about 638 Ma for a white granite and about 660 Ma for a quartz porphyry. Based on detailed mapping, we reconstruct a volcano-sedimentary succession more than 5000 m thick. At least 10 iron-rich sections were identified within a 3500 m thick volcano-sedimentary sequence. Fourteen iron formation sequences were identified in this greenstone sequence. Each iron sequence is interbedded with greenish-black shales within massive volcaniclastics and lava flows. The iron formation consists mostly of fine-grained magnetite deposited within volcanic mudstone and siltstone, with a gradational distribution. The timing of this iron sedimentation falls within the Sturtian glaciation (730-700 Ma). However, there is no direct geological evidence of the Snowball Earth event in this greenstone sequence. Volcanic activity in this ocean had already supplied abundant Fe2+ to the ocean water. Repeated iron precipitation occurred during quiescent periods between volcanic activity, when iron was oxidized and oxyhydroxides were deposited with mud-silt sediment on the ocean floor.

  14. Relationships between avian richness and landscape structure at multiple scales using multiple landscapes

    USGS Publications Warehouse

    Mitchell, M.S.; Rutzmoser, S.H.; Wigley, T.B.; Loehle, C.; Gerwin, J.A.; Keyser, P.D.; Lancia, R.A.; Perry, R.W.; Reynolds, C.J.; Thill, R.E.; Weih, R.; White, D.; Wood, P.B.

    2006-01-01

    Little is known about factors that structure biodiversity on landscape scales, yet current land management protocols, such as forest certification programs, place an increasing emphasis on managing for sustainable biodiversity at landscape scales. We used a replicated landscape study to evaluate relationships between forest structure and avian diversity at both stand and landscape levels. We used data on bird communities collected under comparable sampling protocols on four managed forests located across the Southeastern US to develop logistic regression models describing relationships between habitat factors and the distribution of overall richness and richness of selected guilds. Landscape models generated for eight of nine guilds showed a strong relationship between richness and both availability and configuration of landscape features. Diversity of topographic features and heterogeneity of forest structure were primary determinants of avian species richness. Forest heterogeneity, in both age and forest type, was strongly and positively associated with overall avian richness and richness for most guilds. Road density was associated positively but weakly with avian richness. Landscape variables dominated all models generated, but no consistent patterns in metrics or scale were evident. Model fit was strong for neotropical migrants and relatively weak for short-distance migrants and resident species. Our models provide a tool that will allow managers to evaluate and demonstrate quantitatively how management practices affect avian diversity on landscapes.

  15. Microstructures and formation history of melilite-rich calcium-aluminum-rich inclusions from the ALHA77307 CO3.0 chondrite

    NASA Astrophysics Data System (ADS)

    Han, Jangmi; Brearley, Adrian J.

    2017-03-01

    We have studied four melilite-rich calcium-aluminum-rich inclusions (CAIs) from the Allan Hills A77307 CO3.0 chondrite using transmission electron microscopy with the focused ion beam sample preparation technique. This type of CAI represents one of the dominant types of refractory inclusions in CO3 chondrites. Individual melilite-rich CAIs 04-07 record complex formational histories involving high-temperature gas-solid condensation that occurred under both equilibrium and disequilibrium conditions. CAI 04 contains two texturally- and compositionally-distinct occurrences of perovskite: fine-grained perovskite within a melilite-rich core and aggregates of perovskite grains that surround the core. The perovskite in the core was probably involved in a disequilibrium reaction with early equilibrium condensates (e.g., melilite and spinel) and a nebular gas to form Al-Ti-rich diopside, followed by a later condensation of the perovskite aggregates under equilibrium conditions. CAI 05 has a compact melilite-rich core surrounded by a porous mantle, and likely formed by at least two different condensation events under equilibrium and disequilibrium conditions. In CAI 06, complex intergrowth layers of spinel and diopside surrounding a melilite-rich core indicate disequilibrium reaction of spinel and melilite with a nebular gas to form Al-Ti-rich diopside following core formation by equilibrium condensation. CAI 07 is dominated by melilite with a narrow compositional range and equilibrated textures, suggesting its formation by equilibrium condensation over a limited temperature range. Collectively, we infer that the melilite-rich inclusions formed by a generalized sequence of high-temperature gas-solid condensation that involved: (1) formation of CAI cores by aggregation of primary equilibrium condensates (i.e., perovskite, spinel, and melilite), (2) back-reactions of the primary core minerals with a nebular gas under disequilibrium conditions, forming diopside that evolves in

  16. EMICORON: A multi-targeting G4 ligand with a promising preclinical profile.

    PubMed

    Porru, Manuela; Zizza, Pasquale; Franceschin, Marco; Leonetti, Carlo; Biroccio, Annamaria

    2017-05-01

    During the last decade, guanine-rich (G-rich) sequences folding into G-quadruplex (G4) structures have received a lot of attention and their biological role is now a matter of large debate. Rising amounts of experimental evidence have validated several G-rich motifs as molecular targets in cancer treatment. Although an increasing number of small molecules have been reported to possess excellent G4 stabilizing properties, none of them has progressed through the drug-development pipeline due to their poor drug-like properties. In this context, the identification of G4 ligands with more favorable pharmacological properties and with a well-defined target activity could be fruitful for anticancer therapy application. This manuscript outlines the current state of knowledge regarding EMICORON, a G4-interactive molecule structurally and biologically similar, on the one side, to coronene and, on the other side, to a bay-monosubstituted perylene. Overall this work shows that EMICORON, a new promising G4 ligand, possesses a marked antitumoral activity both standing alone and in combination with chemotherapeutics. Moreover, EMICORON represents a good example of a multimodal class of antitumoral drugs, able to simultaneously affect multiple targets participating in several distinct signaling pathways, thus simplifying the treatment modalities and improving the selectivity against cancer cells. Due to the importance of G4 forming sequences in crucial biological processes participating in tumor progression, their successful targeting with small molecules could represent a very important innovation in the development of effective therapeutic strategies against cancer. This article is part of a Special Issue entitled "G-quadruplex" Guest Editor: Dr. Concetta Giancola and Dr. Daniela Montesarchio. Copyright © 2016 Elsevier B.V. All rights reserved.

  17. Research and development on materials for the SPES target

    NASA Astrophysics Data System (ADS)

    Corradetti, Stefano; Andrighetto, Alberto; Manzolaro, Mattia; Scarpa, Daniele; Vasquez, Jesus; Rossignoli, Massimo; Monetti, Alberto; Calderolla, Michele; Prete, Gianfranco

    2014-03-01

    The SPES project at INFN-LNL (Istituto Nazionale di Fisica Nucleare - Laboratori Nazionali di Legnaro) is focused on the production of radioactive ion beams. The core of the SPES facility is constituted by the target, which will be irradiated with a 40 MeV, 200 µA proton beam in order to produce radioactive species. In order to efficiently produce and release isotopes, the material constituting the target should be able to work under extreme conditions (high vacuum and temperatures up to 2000 °C). Both neutron-rich and proton-rich isotopes will be produced; in the first case, carbon dispersed uranium carbide (UCx) will be used as a target, whereas to produce p-rich isotopes, several types of targets will have to be irradiated. The synthesis and characterization of different types of material will be reported. Moreover, the results of irradiation and isotopes release tests on different uranium carbide target prototypes will be discussed.

  18. A Phylogenomic Perspective on the Radiation of Ray-Finned Fishes Based upon Targeted Sequencing of Ultraconserved Elements (UCEs)

    PubMed Central

    Sorenson, Laurie; Santini, Francesco

    2013-01-01

    Ray-finned fishes constitute the dominant radiation of vertebrates with over 32,000 species. Although molecular phylogenetics has begun to disentangle major evolutionary relationships within this vast section of the Tree of Life, there is no widely available approach for efficiently collecting phylogenomic data within fishes, leaving much of the enormous potential of massively parallel sequencing technologies for resolving major radiations in ray-finned fishes unrealized. Here, we provide a genomic perspective on longstanding questions regarding the diversification of major groups of ray-finned fishes through targeted enrichment of ultraconserved nuclear DNA elements (UCEs) and their flanking sequence. Our workflow efficiently and economically generates data sets that are orders of magnitude larger than those produced by traditional approaches and is well-suited to working with museum specimens. Analysis of the UCE data set recovers a well-supported phylogeny at both shallow and deep time-scales that supports a monophyletic relationship between Amia and Lepisosteus (Holostei) and reveals elopomorphs and then osteoglossomorphs to be the earliest diverging teleost lineages. Our approach additionally reveals that sequence capture of UCE regions and their flanking sequence offers enormous potential for resolving phylogenetic relationships within ray-finned fishes. PMID:23824177

  19. Short Interspersed Nuclear Element (SINE) Sequences in the Genome of the Human Pathogenic Fungus Aspergillus fumigatus Af293.

    PubMed

    Kanhayuwa, Lakkhana; Coutts, Robert H A

    2016-01-01

    Novel families of short interspersed nuclear element (SINE) sequences in the human pathogenic fungus Aspergillus fumigatus, clinical isolate Af293, were identified and categorised into tRNA-related and 5S rRNA-related SINEs. Eight predicted tRNA-related SINE families originating from different tRNAs, and nominated as AfuSINE2 sequences, contained target site duplications of short direct repeat sequences (4-14 bp) flanking the elements, an extended tRNA-unrelated region and typical features of RNA polymerase III promoter sequences. The elements ranged in size from 140-493 bp and were present in low copy number in the genome and five out of eight were actively transcribed. One putative tRNAArg-derived sequence, AfuSINE2-1a, possessed a unique feature of repeated trinucleotide ACT residues at its 3'-terminus. This element was similar in sequence to the I-4_AO element found in A. oryzae and an I-1_AF long nuclear interspersed element-like sequence identified in A. fumigatus Af293. Families of 5S rRNA-related SINE sequences, nominated as AfuSINE3, were also identified and their 5'-5S rRNA-related regions show 50-65% and 60-75% similarity to A. fumigatus 5S rRNAs and to SINE3-1_AO found in A. oryzae, respectively. A. fumigatus Af293 contains five copies of AfuSINE3 sequences ranging in size from 259-343 bp and two out of five AfuSINE3 sequences were actively transcribed. Investigations on AfuSINE distribution in the fungal genome revealed that the elements are enriched in pericentromeric and subtelomeric regions and inserted within gene-rich regions. We also demonstrated that some, but not all, AfuSINE sequences are targeted by host RNA silencing mechanisms. Finally, we demonstrated that infection of the fungus with mycoviruses had no apparent effects on SINE activity.

  20. Improved diagnostic yield compared with targeted gene sequencing panels suggests a role for whole-genome sequencing as a first-tier genetic test

    PubMed Central

    Lionel, Anath C; Costain, Gregory; Monfared, Nasim; Walker, Susan; Reuter, Miriam S; Hosseini, S Mohsen; Thiruvahindrapuram, Bhooma; Merico, Daniele; Jobling, Rebekah; Nalpathamkalam, Thomas; Pellecchia, Giovanna; Sung, Wilson W L; Wang, Zhuozhi; Bikangaga, Peter; Boelman, Cyrus; Carter, Melissa T; Cordeiro, Dawn; Cytrynbaum, Cheryl; Dell, Sharon D; Dhir, Priya; Dowling, James J; Heon, Elise; Hewson, Stacy; Hiraki, Linda; Inbar-Feigenberg, Michal; Klatt, Regan; Kronick, Jonathan; Laxer, Ronald M; Licht, Christoph; MacDonald, Heather; Mercimek-Andrews, Saadet; Mendoza-Londono, Roberto; Piscione, Tino; Schneider, Rayfel; Schulze, Andreas; Silverman, Earl; Siriwardena, Komudi; Snead, O Carter; Sondheimer, Neal; Sutherland, Joanne; Vincent, Ajoy; Wasserman, Jonathan D; Weksberg, Rosanna; Shuman, Cheryl; Carew, Chris; Szego, Michael J; Hayeems, Robin Z; Basran, Raveen; Stavropoulos, Dimitri J; Ray, Peter N; Bowdin, Sarah; Meyn, M Stephen; Cohn, Ronald D; Scherer, Stephen W; Marshall, Christian R

    2018-01-01

    Purpose: Genetic testing is an integral diagnostic component of pediatric medicine. Standard of care is often a time-consuming stepwise approach involving chromosomal microarray analysis and targeted gene sequencing panels, which can be costly and inconclusive. Whole-genome sequencing (WGS) provides a comprehensive testing platform that has the potential to streamline genetic assessments, but there are limited comparative data to guide its clinical use. Methods: We prospectively recruited 103 patients from pediatric non-genetic subspecialty clinics, each with a clinical phenotype suggestive of an underlying genetic disorder, and compared the diagnostic yield and coverage of WGS with those of conventional genetic testing. Results: WGS identified diagnostic variants in 41% of individuals, representing a significant increase over conventional testing results (24%; P = 0.01). Genes clinically sequenced in the cohort (n = 1,226) were well covered by WGS, with a median exonic coverage of 40× ± 8× (mean ± SD). All the molecular diagnoses made by conventional methods were captured by WGS. The 18 new diagnoses made with WGS included structural and non-exonic sequence variants not detectable with whole-exome sequencing, and confirmed recent disease associations with the genes PIGG, RNU4ATAC, TRIO, and UNC13A. Conclusion: WGS as a primary clinical test provided a higher diagnostic yield than conventional genetic testing in a clinically heterogeneous cohort. PMID:28771251

  1. Improved diagnostic yield compared with targeted gene sequencing panels suggests a role for whole-genome sequencing as a first-tier genetic test.

    PubMed

    Lionel, Anath C; Costain, Gregory; Monfared, Nasim; Walker, Susan; Reuter, Miriam S; Hosseini, S Mohsen; Thiruvahindrapuram, Bhooma; Merico, Daniele; Jobling, Rebekah; Nalpathamkalam, Thomas; Pellecchia, Giovanna; Sung, Wilson W L; Wang, Zhuozhi; Bikangaga, Peter; Boelman, Cyrus; Carter, Melissa T; Cordeiro, Dawn; Cytrynbaum, Cheryl; Dell, Sharon D; Dhir, Priya; Dowling, James J; Heon, Elise; Hewson, Stacy; Hiraki, Linda; Inbar-Feigenberg, Michal; Klatt, Regan; Kronick, Jonathan; Laxer, Ronald M; Licht, Christoph; MacDonald, Heather; Mercimek-Andrews, Saadet; Mendoza-Londono, Roberto; Piscione, Tino; Schneider, Rayfel; Schulze, Andreas; Silverman, Earl; Siriwardena, Komudi; Snead, O Carter; Sondheimer, Neal; Sutherland, Joanne; Vincent, Ajoy; Wasserman, Jonathan D; Weksberg, Rosanna; Shuman, Cheryl; Carew, Chris; Szego, Michael J; Hayeems, Robin Z; Basran, Raveen; Stavropoulos, Dimitri J; Ray, Peter N; Bowdin, Sarah; Meyn, M Stephen; Cohn, Ronald D; Scherer, Stephen W; Marshall, Christian R

    2018-04-01

    Purpose: Genetic testing is an integral diagnostic component of pediatric medicine. Standard of care is often a time-consuming stepwise approach involving chromosomal microarray analysis and targeted gene sequencing panels, which can be costly and inconclusive. Whole-genome sequencing (WGS) provides a comprehensive testing platform that has the potential to streamline genetic assessments, but there are limited comparative data to guide its clinical use. Methods: We prospectively recruited 103 patients from pediatric non-genetic subspecialty clinics, each with a clinical phenotype suggestive of an underlying genetic disorder, and compared the diagnostic yield and coverage of WGS with those of conventional genetic testing. Results: WGS identified diagnostic variants in 41% of individuals, representing a significant increase over conventional testing results (24%; P = 0.01). Genes clinically sequenced in the cohort (n = 1,226) were well covered by WGS, with a median exonic coverage of 40× ± 8× (mean ± SD). All the molecular diagnoses made by conventional methods were captured by WGS. The 18 new diagnoses made with WGS included structural and non-exonic sequence variants not detectable with whole-exome sequencing, and confirmed recent disease associations with the genes PIGG, RNU4ATAC, TRIO, and UNC13A. Conclusion: WGS as a primary clinical test provided a higher diagnostic yield than conventional genetic testing in a clinically heterogeneous cohort.

  2. Targeted sequencing-based analyses of candidate gene variants in ulcerative colitis-associated colorectal neoplasia.

    PubMed

    Chakrabarty, Sanjiban; Varghese, Vinay Koshy; Sahu, Pranoy; Jayaram, Pradyumna; Shivakumar, Bhadravathi M; Pai, Cannanore Ganesh; Satyamoorthy, Kapaettu

    2017-06-27

    Long-standing ulcerative colitis (UC) leading to colorectal cancer (CRC) is one of the most serious and life-threatening consequences acknowledged globally. Ulcerative colitis-associated colorectal carcinogenesis showed distinct molecular alterations when compared with sporadic colorectal carcinoma. Targeted sequencing of 409 genes in tissue samples of 18 long-standing UC subjects at high risk of colorectal carcinoma (UCHR) was performed to identify somatic driver mutations, which may be involved in the molecular changes during the transformation of non-dysplastic mucosa to high-grade dysplasia. Findings from the study are also compared with previously published genome wide and exome sequencing data in inflammatory bowel disease-associated and sporadic colorectal carcinoma. Next-generation sequencing analysis identified 1107 mutations in 275 genes in UCHR subjects. In addition to TP53 (17%) and KRAS (22%) mutations, recurrent mutations in APC (33%), ACVR2A (61%), ARID1A (44%), RAF1 (39%) and MTOR (61%) were observed in UCHR subjects. In addition, APC, FGFR3, FGFR2 and PIK3CA driver mutations were identified in UCHR subjects. Recurrent mutations in ARID1A (44%), SMARCA4 (17%), MLL2 (44%), MLL3 (67%), SETD2 (17%) and TET2 (50%) genes involved in histone modification and chromatin remodelling were identified in UCHR subjects. Our study identifies new oncogenic driver mutations which may be involved in the transition of non-dysplastic cells to dysplastic phenotype in the subjects with long-standing UC with high risk of progression into colorectal neoplasia.

  3. Biogeographic affinity helps explain productivity-richness relationships at regional and local scales

    USGS Publications Warehouse

    Harrison, S.; Grace, J.B.

    2007-01-01

    The unresolved question of what causes the observed positive relationship between large-scale productivity and species richness has long interested ecologists and evolutionists. Here we examine a potential explanation that we call the biogeographic affinity hypothesis, which proposes that the productivity-richness relationship is a function of species' climatic tolerances that in turn are shaped by the earth's climatic history combined with evolutionary niche conservatism. Using botanical data from regions and sites across California, we find support for a key prediction of this hypothesis, namely, that the productivity-species richness relationship differs strongly and predictably among groups of higher taxa on the basis of their biogeographic affinities (i.e., between families or genera primarily associated with north-temperate, semiarid, or desert zones). We also show that a consideration of biogeographic affinity can yield new insights on how productivity-richness patterns at large geographic scales filter down to affect patterns of species richness and composition within local communities. © 2007 by The University of Chicago. All rights reserved.

  4. An atypical topoisomerase II sequence from the slime mold Physarum polycephalum.

    PubMed

    Hugodot, Yannick; Dutertre, Murielle; Duguet, Michel

    2004-01-21

    We have determined the complete nucleotide sequence of the cDNA encoding DNA topoisomerase II from Physarum polycephalum. Using degenerate primers, based on the conserved amino acid sequences of other eukaryotic enzymes, a 250-bp fragment was polymerase chain reaction (PCR) amplified. This fragment was used as a probe to screen a Physarum cDNA library. A partial cDNA clone was isolated that was truncated at the 3' end. Rapid amplification of cDNA ends (RACE)-PCR was employed to isolate the remaining portion of the gene. The complete sequence of 4613 bp contains an open reading frame of 4494 bp that codes for 1498 amino acid residues with a theoretical molecular weight of 167 kDa. The predicted amino acid sequence shares similarity with those of other eukaryotes and shows the highest degree of identity with the enzyme of Dictyostelium discoideum. However, the enzyme of P. polycephalum contains an atypical amino-terminal domain very rich in serine and proline, whose function is unknown. Remarkably, both a mitochondrial targeting sequence and a nuclear localization signal were predicted, in the amino and carboxy termini of the protein respectively, as in the case of human topoisomerase III alpha. At the Physarum genomic level, the topoisomerase II gene encompasses a region of about 16 kbp, suggesting a large proportion of intronic sequences, an unusual situation for a gene of a lower eukaryote, often free of introns. Finally, expression of topoisomerase II mRNA does not appear significantly dependent on the plasmodium cycle stage, possibly due to the lack of G1 phase and/or to a mitochondrial localization of the enzyme.
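
    The sequence figures quoted in this abstract can be cross-checked with simple arithmetic. The sketch below only reuses the three numbers stated above (4494 bp, 1498 residues, 167 kDa); the variable names are illustrative, and the reading that the 4494 bp count excludes the stop codon is an inference from the arithmetic, not something the abstract states.

```python
# Figures taken from the abstract above.
ORF_BP = 4494      # open reading frame length in base pairs
RESIDUES = 1498    # reported number of amino acid residues
MASS_KDA = 167     # reported theoretical molecular weight in kDa

# Each codon is three bases, so the ORF length should be divisible by 3.
codons = ORF_BP // 3
remainder = ORF_BP % 3

# Average mass per residue, in daltons, implied by the reported figures.
mean_residue_da = MASS_KDA * 1000 / RESIDUES

print(f"codons: {codons}, leftover bases: {remainder}")
print(f"mean residue mass: {mean_residue_da:.1f} Da")
```

    The codon count equals the residue count, which suggests the 4494 bp figure does not include the stop codon, and the implied mean residue mass is close to the roughly 110 Da commonly used as a rule of thumb for proteins.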

  5. Sequencing Centers Panel at SFAF

    ScienceCinema

    Schilkey, Faye; Ali, Johar; Grafham, Darren; Muzny, Donna; Fulton, Bob; Fitzgerald, Mike; Hostetler, Jessica; Daum, Chris

    2018-02-13

    From left to right: Faye Schilkey of NCGR, Johar Ali of OICR, Darren Grafham of Wellcome Trust Sanger Institute, Donna Muzny of the Baylor College of Medicine, Bob Fulton of Washington University, Mike Fitzgerald of the Broad Institute, Jessica Hostetler of the J. Craig Venter Institute and Chris Daum of the DOE Joint Genome Institute discuss sequencing technologies, applications and pipelines on June 2, 2010 at the "Sequencing, Finishing, Analysis in the Future" meeting in Santa Fe, NM.

  6. Sequencing Centers Panel at SFAF

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Schilkey, Faye; Ali, Johar; Grafham, Darren

    From left to right: Faye Schilkey of NCGR, Johar Ali of OICR, Darren Grafham of Wellcome Trust Sanger Institute, Donna Muzny of the Baylor College of Medicine, Bob Fulton of Washington University, Mike Fitzgerald of the Broad Institute, Jessica Hostetler of the J. Craig Venter Institute and Chris Daum of the DOE Joint Genome Institute discuss sequencing technologies, applications and pipelines on June 2, 2010 at the "Sequencing, Finishing, Analysis in the Future" meeting in Santa Fe, NM.

  7. A Multidimensional Strategy to Detect Polypharmacological Targets in the Absence of Structural and Sequence Homology

    PubMed Central

    Durrant, Jacob D.; Amaro, Rommie E.; Xie, Lei; Urbaniak, Michael D.; Ferguson, Michael A. J.; Haapalainen, Antti; Chen, Zhijun; Di Guilmi, Anne Marie; Wunder, Frank; Bourne, Philip E.; McCammon, J. Andrew

    2010-01-01

    Conventional drug design embraces the “one gene, one drug, one disease” philosophy. Polypharmacology, which focuses on multi-target drugs, has emerged as a new paradigm in drug discovery. The rational design of drugs that act via polypharmacological mechanisms can produce compounds that exhibit increased therapeutic potency and against which resistance is less likely to develop. Additionally, identifying multiple protein targets is also critical for side-effect prediction. One third of potential therapeutic compounds fail in clinical trials or are later removed from the market due to unacceptable side effects often caused by off-target binding. In the current work, we introduce a multidimensional strategy for the identification of secondary targets of known small-molecule inhibitors in the absence of global structural and sequence homology with the primary target protein. To demonstrate the utility of the strategy, we identify several targets of 4,5-dihydroxy-3-(1-naphthyldiazenyl)-2,7-naphthalenedisulfonic acid, a known micromolar inhibitor of Trypanosoma brucei RNA editing ligase 1. As it is capable of identifying potential secondary targets, the strategy described here may play a useful role in future efforts to reduce drug side effects and/or to increase polypharmacology. PMID:20098496

  8. A multidimensional strategy to detect polypharmacological targets in the absence of structural and sequence homology.

    PubMed

    Durrant, Jacob D; Amaro, Rommie E; Xie, Lei; Urbaniak, Michael D; Ferguson, Michael A J; Haapalainen, Antti; Chen, Zhijun; Di Guilmi, Anne Marie; Wunder, Frank; Bourne, Philip E; McCammon, J Andrew

    2010-01-22

    Conventional drug design embraces the "one gene, one drug, one disease" philosophy. Polypharmacology, which focuses on multi-target drugs, has emerged as a new paradigm in drug discovery. The rational design of drugs that act via polypharmacological mechanisms can produce compounds that exhibit increased therapeutic potency and against which resistance is less likely to develop. Additionally, identifying multiple protein targets is also critical for side-effect prediction. One third of potential therapeutic compounds fail in clinical trials or are later removed from the market due to unacceptable side effects often caused by off-target binding. In the current work, we introduce a multidimensional strategy for the identification of secondary targets of known small-molecule inhibitors in the absence of global structural and sequence homology with the primary target protein. To demonstrate the utility of the strategy, we identify several targets of 4,5-dihydroxy-3-(1-naphthyldiazenyl)-2,7-naphthalenedisulfonic acid, a known micromolar inhibitor of Trypanosoma brucei RNA editing ligase 1. As it is capable of identifying potential secondary targets, the strategy described here may play a useful role in future efforts to reduce drug side effects and/or to increase polypharmacology.

  9. Size and sequence polymorphisms in the glutamate-rich protein gene of the human malaria parasite Plasmodium falciparum in Thailand.

    PubMed

    Pattaradilokrat, Sittiporn; Trakoolsoontorn, Chawinya; Simpalipan, Phumin; Warrit, Natapot; Kaewthamasorn, Morakot; Harnyuttanakorn, Pongchai

    2018-01-22

    The glutamate-rich protein (GLURP) of the malaria parasite Plasmodium falciparum is a key surface antigen that serves as a component of a clinical vaccine. Moreover, the GLURP gene is also employed routinely as a genetic marker for malarial genotyping in epidemiological studies. While extensive size polymorphisms in GLURP are well recorded, the extent of the sequence diversity of this gene is rarely investigated. The present study aimed to explore the genetic diversity of GLURP in natural populations of P. falciparum. The polymorphic C-terminal repetitive R2 region of GLURP sequences from 65 P. falciparum isolates in Thailand were generated and combined with the data from 103 worldwide isolates to generate a GLURP database. The collection comprised 168 alleles, encoding 105 unique GLURP subtypes, characterized by 18 types of amino acid repeat units (AAU). Of these, 28 GLURP subtypes, formed by 10 AAU types, were detected in P. falciparum in Thailand. Among them, 19 GLURP subtypes and 2 AAU types are described for the first time in the Thai parasite population. The AAU sequences were highly conserved, which is likely due to negative selection. Standard Fst analysis revealed the shared distributions of GLURP types among the P. falciparum populations, providing evidence of gene flow among the different demographic populations. Sequence diversity causing size variations in GLURP in Thai P. falciparum populations was detected, and was caused by non-synonymous substitutions in repeat units and some insertion/deletion of aspartic acid or glutamic acid codons between repeat units. The P. falciparum population structure based on GLURP showed promising implications for the development of GLURP-based vaccines and for monitoring vaccine efficacy.

  10. The chaperonin-60 universal target is a barcode for bacteria that enables de novo assembly of metagenomic sequence data.

    PubMed

    Links, Matthew G; Dumonceaux, Tim J; Hemmingsen, Sean M; Hill, Janet E

    2012-01-01

    Barcoding with molecular sequences is widely used to catalogue eukaryotic biodiversity. Studies investigating the community dynamics of microbes have relied heavily on gene-centric metagenomic profiling using two genes (16S rRNA and cpn60) to identify and track Bacteria. While there have been criteria formalized for barcoding of eukaryotes, these criteria have not been used to evaluate gene targets for other domains of life. Using the framework of the International Barcode of Life we evaluated DNA barcodes for Bacteria. Candidates from the 16S rRNA gene and the protein coding cpn60 gene were evaluated. Within complete bacterial genomes in the public domain representing 983 species from 21 phyla, the largest difference between median pairwise inter- and intra-specific distances ("barcode gap") was found from cpn60. Distribution of sequence diversity along the ∼555 bp cpn60 target region was remarkably uniform. The barcode gap of the cpn60 universal target facilitated the faithful de novo assembly of full-length operational taxonomic units from pyrosequencing data from a synthetic microbial community. Analysis supported the recognition of both 16S rRNA and cpn60 as DNA barcodes for Bacteria. The cpn60 universal target was found to have a much larger barcode gap than 16S rRNA suggesting cpn60 as a preferred barcode for Bacteria. A large barcode gap for cpn60 provided a robust target for species-level characterization of data. The assembly of consensus sequences for barcodes was shown to be a reliable method for the identification and tracking of novel microbes in metagenomic studies.
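
    As a rough illustration of the "barcode gap" defined above (the difference between median pairwise inter- and intra-specific distances), the sketch below computes it for a toy set of aligned sequences. The sequences, the species labels, and the use of an uncorrected p-distance are all assumptions made for illustration; the abstract does not say which distance measure the authors used, and this is not their pipeline.

```python
from itertools import combinations
from statistics import median


def p_distance(a: str, b: str) -> float:
    """Proportion of differing sites between two aligned, equal-length sequences."""
    if len(a) != len(b):
        raise ValueError("sequences must be aligned to the same length")
    return sum(x != y for x, y in zip(a, b)) / len(a)


def barcode_gap(records):
    """Median inter-specific distance minus median intra-specific distance.

    records: list of (species_label, aligned_sequence) tuples.
    """
    intra, inter = [], []
    for (sp1, s1), (sp2, s2) in combinations(records, 2):
        # Pairs from the same species are intra-specific, all others inter-specific.
        (intra if sp1 == sp2 else inter).append(p_distance(s1, s2))
    if not intra or not inter:
        raise ValueError("need at least one intra- and one inter-specific pair")
    return median(inter) - median(intra)


# Hypothetical toy data: two species, two sequences each (not real cpn60 data).
toy = [
    ("species_A", "ACGTACGTAC"),
    ("species_A", "ACGTACGTAT"),
    ("species_B", "TCGAACGGAC"),
    ("species_B", "TCGAACGGAT"),
]
gap = barcode_gap(toy)
print(f"barcode gap: {gap:.2f}")
```

    A larger gap means that sequences from different species are separated more clearly than sequences within a species, which is the property the abstract reports as being stronger for cpn60 than for 16S rRNA.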

  11. Program Synthesizes UML Sequence Diagrams

    NASA Technical Reports Server (NTRS)

    Barry, Matthew R.; Osborne, Richard N.

    2006-01-01

    A computer program called "Rational Sequence" generates Unified Modeling Language (UML) sequence diagrams of a target Java program running on a Java virtual machine (JVM). Rational Sequence thereby performs a reverse engineering function that aids in the design documentation of the target Java program. Whereas previously, the construction of sequence diagrams was a tedious manual process, Rational Sequence generates UML sequence diagrams automatically from the running Java code.

  12. Target capture enrichment of nuclear SNP markers for massively parallel sequencing of degraded and mixed samples.

    PubMed

    Bose, Nikhil; Carlberg, Katie; Sensabaugh, George; Erlich, Henry; Calloway, Cassandra

    2018-05-01

    DNA from biological forensic samples can be highly fragmented and present in limited quantity. When DNA is highly fragmented, conventional PCR based Short Tandem Repeat (STR) analysis may fail as primer binding sites may not be present on a single template molecule. Single Nucleotide Polymorphisms (SNPs) can serve as an alternative type of genetic marker for analysis of degraded samples because the targeted variation is a single base. However, conventional PCR based SNP analysis methods still require intact primer binding sites for target amplification. Recently, probe capture methods for targeted enrichment have shown success in recovering degraded DNA as well as DNA from ancient bone samples using next-generation sequencing (NGS) technologies. The goal of this study was to design and test a probe capture assay targeting forensically relevant nuclear SNP markers for clonal and massively parallel sequencing (MPS) of degraded and limited DNA samples as well as mixtures. A set of 411 polymorphic markers totaling 451 nuclear SNPs (375 SNPs and 36 microhaplotype markers) was selected for the custom probe capture panel. The SNP markers were selected for a broad range of forensic applications including human individual identification, kinship, and lineage analysis as well as for mixture analysis. Performance of the custom SNP probe capture NGS assay was characterized by analyzing read depth and heterozygote allele balance across 15 samples at 25 ng input DNA. Performance thresholds were established based on read depth ≥500X and heterozygote allele balance within ±10% deviation from 50:50, which was observed for 426 out of 451 SNPs. These 426 SNPs were analyzed in size selected samples (at ≤75 bp, ≤100 bp, ≤150 bp, ≤200 bp, and ≤250 bp) as well as mock degraded samples fragmented to an average of 150 bp. Samples selected for ≤75 bp exhibited 99-100% reportable SNPs across varied DNA amounts and as low as 0.5 ng. Mock degraded samples at 1
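
    The performance thresholds described in this abstract (read depth ≥500X and heterozygote allele balance within ±10% of 50:50) can be sketched as a simple filter. The data class, the SNP identifiers, and the read counts below are hypothetical, and reading "±10% deviation from 50:50" as an allele fraction between 0.40 and 0.60 is an assumption rather than the authors' stated definition.

```python
from dataclasses import dataclass

MIN_DEPTH = 500          # read depth threshold stated in the abstract (>= 500X)
MAX_BALANCE_DEV = 0.10   # assumed meaning of "±10% deviation from 50:50"


@dataclass
class HetCall:
    snp_id: str      # hypothetical marker identifier
    ref_reads: int   # reads supporting the reference allele
    alt_reads: int   # reads supporting the alternate allele


def passes_thresholds(call: HetCall) -> bool:
    """Return True if a heterozygous call meets both depth and balance thresholds."""
    depth = call.ref_reads + call.alt_reads
    if depth < MIN_DEPTH:
        return False
    ref_fraction = call.ref_reads / depth
    return abs(ref_fraction - 0.5) <= MAX_BALANCE_DEV


# Hypothetical example calls, not data from the study.
calls = [
    HetCall("snp_1", 300, 280),   # depth 580, balanced
    HetCall("snp_2", 200, 250),   # depth 450, below the depth threshold
    HetCall("snp_3", 700, 300),   # depth 1000, but 70:30 allele balance
]
reportable = [c.snp_id for c in calls if passes_thresholds(c)]
print(reportable)
```

    In this toy set only the first call meets both criteria; the other two fail on depth and on allele balance respectively.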

  13. Germline TRAV5D-4 T-Cell Receptor Sequence Targets a Primary Insulin Peptide of NOD Mice

    PubMed Central

    Nakayama, Maki; Castoe, Todd; Sosinowski, Tomasz; He, XiangLing; Johnson, Kelly; Haskins, Kathryn; Vignali, Dario A.A.; Gapin, Laurent; Pollock, David; Eisenbarth, George S.

    2012-01-01

    There is accumulating evidence that autoimmunity to insulin B chain peptide, amino acids 9–23 (insulin B:9–23), is central to development of autoimmune diabetes of the NOD mouse model. We hypothesized that enhanced susceptibility to autoimmune diabetes is the result of targeting of insulin by a T-cell receptor (TCR) sequence commonly encoded in the germline. In this study, we aimed to demonstrate that a particular Vα gene TRAV5D-4 with multiple junction sequences is sufficient to induce anti-islet autoimmunity by studying retrogenic mouse lines expressing α-chains with different Vα TRAV genes. Retrogenic NOD strains expressing Vα TRAV5D-4 α-chains with many different complementarity determining region (CDR) 3 sequences, even those derived from TCRs recognizing islet-irrelevant molecules, developed anti-insulin autoimmunity. Induction of insulin autoantibodies by TRAV5D-4 α-chains was abrogated by the mutation of insulin peptide B:9–23 or that of two amino acid residues in CDR1 and 2 of the TRAV5D-4. TRAV13–1, the human ortholog of murine TRAV5D-4, was also capable of inducing in vivo anti-insulin autoimmunity when combined with different murine CDR3 sequences. Targeting primary autoantigenic peptides by simple germline-encoded TCR motifs may underlie enhanced susceptibility to the development of autoimmune diabetes. PMID:22315318

  14. Composition of Meridiani Hematite-rich Spherules: A Mass-Balance Mixing-Model Approach

    NASA Technical Reports Server (NTRS)

    Jolliff, B. L.

    2005-01-01

    One of the great surprises of the Mars Exploration Rovers (MER) mission is the discovery at Meridiani Planum that the surface hematite signature observed from orbit is attributable largely to a surface enrichment of hematite-rich spherules, thought to be concretions, that have weathered out of rocks similar to the underlying sulfate-rich rock formation [1]. A strong hematite signature has been observed by the Mini-TES [2] and by in-situ measurements of spherule-rich targets by the Mossbauer spectrometer (MB) [3] and the alpha-particle X-ray spectrometer (APXS) [4]. The Mini-TES derived spectrum of spherule-rich targets on the plains is consistent with nearly pure coarse-grained hematite, with perhaps as little as 5-10 areal % of other components [2]. The occurrence and abundance of the spherules as the bearer of the widespread hematite signature observed by MGS TES over much of Meridiani Planum is significant for global remote sensing, and their occurrence as concretions in the outcrop lithology is significant for the diagenetic history and role of water in the formation of the sedimentary rock formation [5].

  15. Targeted DNA sequencing of non-small cell lung cancer identifies mutations associated with brain metastases.

    PubMed

    Wilson, George D; Johnson, Matthew D; Ahmed, Samreen; Cardenas, Paola Yumpo; Grills, Inga S; Thibodeau, Bryan J

    2018-05-25

    This study explores the hypothesis that dominant molecular oncogenes in non-small cell lung cancer (NSCLC) are associated with metastatic spread to the brain. NSCLC patient groups with no evidence of metastasis, with metastatic disease to a non-CNS site, who developed brain metastasis after diagnosis, and patients with simultaneous diagnosis of NSCLC and metastatic brain lesions were studied using targeted sequencing. In patients with brain metastasis versus those without, only 2 variants (one each in BCL6 and NOTCH2) were identified that occurred in ≥ 4 NSCLC samples from patients with brain metastases but in ≤ 1 of the NSCLC samples without brain metastases. At the gene level, 20 genes were found to have unique variants in more than 33% of the patients with brain metastases. When analyzed at the patient level, these 20 genes formed the basis of a predictive test to discriminate those with brain metastasis. Further analysis showed that PI3K/AKT signaling is altered in both the primary tumors and metastases of NSCLC patients with brain lesions. While no single variant was associated with brain metastasis, this study describes a potential gene panel for the identification of patients at risk and implicates PI3K/AKT signaling as a therapeutic target.

  16. Targeted DNA sequencing of non-small cell lung cancer identifies mutations associated with brain metastases

    PubMed Central

    Wilson, George D.; Johnson, Matthew D.; Ahmed, Samreen; Cardenas, Paola Yumpo; Grills, Inga S.; Thibodeau, Bryan J.

    2018-01-01

    Introduction: This study explores the hypothesis that dominant molecular oncogenes in non-small cell lung cancer (NSCLC) are associated with metastatic spread to the brain. Methods: NSCLC patient groups with no evidence of metastasis, with metastatic disease to a non-CNS site, who developed brain metastasis after diagnosis, and patients with simultaneous diagnosis of NSCLC and metastatic brain lesions were studied using targeted sequencing. Results: In patients with brain metastasis versus those without, only 2 variants (one each in BCL6 and NOTCH2) were identified that occurred in ≥ 4 NSCLC samples from patients with brain metastases but in ≤ 1 of the NSCLC samples without brain metastases. At the gene level, 20 genes were found to have unique variants in more than 33% of the patients with brain metastases. When analyzed at the patient level, these 20 genes formed the basis of a predictive test to discriminate those with brain metastasis. Further analysis showed that PI3K/AKT signaling is altered in both the primary tumors and metastases of NSCLC patients with brain lesions. Conclusion: While no single variant was associated with brain metastasis, this study describes a potential gene panel for the identification of patients at risk and implicates PI3K/AKT signaling as a therapeutic target. PMID:29899834

  17. Formation of a spatter-rich pyroclastic density current deposit in a Neogene sequence of trachytic-mafic igneous rocks at Mason Spur, Erebus volcanic province, Antarctica

    NASA Astrophysics Data System (ADS)

    Martin, A. P.; Smellie, J. L.; Cooper, A. F.; Townsend, D. B.

    2018-01-01

    Erosion has revealed a remarkable section through the heart of a volcanic island, Mason Spur, in the southwestern Ross Sea, Antarctica, including an unusually well-exposed section of caldera fill. The near-continuous exposure, 10 km laterally and > 1 km vertically, cuts through Cenozoic alkalic volcanic rocks of the Erebus volcanic province (McMurdo Volcanic Group) and permits the study of an ancient volcanic succession that is rarely available due to subsequent burial or erosion. The caldera filling sequence includes an unusual trachytic spatter-rich lapilli tuff (ignimbrite) facies that is particularly striking because of the presence of abundant black fluidal, dense juvenile spatter clasts of trachytic obsidian up to 2 m long supported in a pale cream-coloured pumiceous lapilli tuff matrix. Field mapping indicates that the deposit is an ignimbrite and, together with petrological considerations, it is suggested that mixing of dense spatter and pumiceous lapilli tuff in the investigated deposit occurred during emplacement, not necessarily in the same vent, with the mixed fragmental material emplaced as a pyroclastic density current. Liquid water was not initially present but a steam phase was probably generated during transport and may represent water ingested during passage of the current as it passed over either wet ground, stream, shallow lake or (possibly) snow. Well-exposed caldera interiors are uncommon and that at Mason Spur is helping understand eruption dynamics associated with a complex large island volcano. The results of our study should help to elucidate interpretations of other, less well exposed, pyroclastic density current deposits elsewhere in Antarctica and globally.

  18. Solid phase sequencing of biopolymers

    DOEpatents

    Cantor, Charles; Koster, Hubert

    2010-09-28

    This invention relates to methods for detecting and sequencing target nucleic acid sequences, to mass modified nucleic acid probes and arrays of probes useful in these methods, and to kits and systems which contain these probes. Useful methods involve hybridizing the nucleic acids or nucleic acids which represent complementary or homologous sequences of the target to an array of nucleic acid probes. These probes comprise a single-stranded portion, an optional double-stranded portion and a variable sequence within the single-stranded portion. The molecular weights of the hybridized nucleic acids of the set can be determined by mass spectroscopy, and the sequence of the target determined from the molecular weights of the fragments. Nucleic acids whose sequences can be determined include DNA or RNA in biological samples such as patient biopsies and environmental samples. Probes may be fixed to a solid support such as a hybridization chip to facilitate automated molecular weight analysis and identification of the target sequence.

  19. TargetM6A: Identifying N6-Methyladenosine Sites From RNA Sequences via Position-Specific Nucleotide Propensities and a Support Vector Machine.

    PubMed

    Li, Guang-Qing; Liu, Zi; Shen, Hong-Bin; Yu, Dong-Jun

    2016-10-01

    As one of the most ubiquitous post-transcriptional modifications of RNA, N6-methyladenosine (m6A) plays an essential role in many vital biological processes. The identification of m6A sites in RNAs is important for both basic biomedical research and practical drug development. In this study, we designed a computational method, called TargetM6A, to rapidly and accurately target m6A sites solely from the primary RNA sequences. Two new features, i.e., position-specific nucleotide/dinucleotide propensities (PSNP/PSDP), are introduced and combined with the traditional nucleotide composition (NC) feature to formulate RNA sequences. The extracted features are further optimized to obtain a much more compact and discriminative feature subset by applying an incremental feature selection (IFS) procedure. Based on the optimized feature subset, we trained TargetM6A on the training dataset with a support vector machine (SVM) as the prediction engine. We compared the proposed TargetM6A method with existing methods for predicting m6A sites by performing stringent jackknife tests and independent validation tests on benchmark datasets. The experimental results show that the proposed TargetM6A method outperformed the existing methods for predicting m6A sites and remarkably improved the prediction performances, with MCC = 0.526 and AUC = 0.818. We also provided a user-friendly web server for TargetM6A, which is publicly accessible for academic use at http://csbio.njust.edu.cn/bioinf/TargetM6A.
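    The position-specific nucleotide propensity feature and the MCC metric named in this record can be sketched as follows. This is one common formulation (the per-position nucleotide frequency in positive samples minus that in negative samples), assumed here for illustration; the exact definition used by TargetM6A may differ, and all function names and example sequences are hypothetical.

```python
from collections import Counter
from math import sqrt

ALPHABET = "ACGU"


def position_frequencies(seqs):
    """Per-position nucleotide frequencies for equal-length sequences."""
    length = len(seqs[0])
    freqs = []
    for i in range(length):
        counts = Counter(s[i] for s in seqs)
        freqs.append({n: counts.get(n, 0) / len(seqs) for n in ALPHABET})
    return freqs


def psnp_table(positives, negatives):
    """Assumed PSNP: positive-set frequency minus negative-set frequency."""
    pos = position_frequencies(positives)
    neg = position_frequencies(negatives)
    return [{n: p[n] - q[n] for n in ALPHABET} for p, q in zip(pos, neg)]


def encode(seq, table):
    """Map a sequence to its vector of position-specific propensities."""
    return [table[i][n] for i, n in enumerate(seq)]


def mcc(tp, tn, fp, fn):
    """Matthews correlation coefficient from confusion-matrix counts."""
    denom = sqrt((tp + fp) * (tp + fn) * (tn + fp) * (tn + fn))
    return 0.0 if denom == 0 else (tp * tn - fp * fn) / denom


# Toy, made-up windows; real inputs would be windows centred on candidate sites.
table = psnp_table(["GAC", "GAC"], ["UAC", "GAU"])
features = encode("GAC", table)
```

    The resulting vector could then be passed to any classifier; the record itself reports using a support vector machine after incremental feature selection.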

  20. Application of Stochastic Labeling with Random-Sequence Barcodes for Simultaneous Quantification and Sequencing of Environmental 16S rRNA Genes.

    PubMed

    Hoshino, Tatsuhiko; Inagaki, Fumio

    2017-01-01

    Next-generation sequencing (NGS) is a powerful tool for analyzing environmental DNA and provides a comprehensive molecular view of microbial communities. For obtaining the copy number of particular sequences in the NGS library, however, additional quantitative analysis such as quantitative PCR (qPCR) or digital PCR (dPCR) is required. Furthermore, the number of sequences in a sequence library does not always reflect the original copy number of a target gene because of biases caused by PCR amplification, making it difficult to convert the proportion of particular sequences in the NGS library to the copy number using the mass of input DNA. To address this issue, we applied a stochastic labeling approach with random-tag sequences and developed an NGS-based quantification protocol, which enables simultaneous sequencing and quantification of the targeted DNA. This quantitative sequencing (qSeq) is initiated from single-primer extension (SPE) using a primer with a random tag adjacent to the 5' end of the target-specific sequence. During SPE, each DNA molecule is stochastically labeled with the random tag. Subsequently, first-round PCR is conducted, specifically targeting the SPE product, followed by second-round PCR to index for NGS. The number of random tags is only determined during the SPE step and is therefore not affected by the two rounds of PCR that may introduce amplification biases. In the case of 16S rRNA genes, after NGS sequencing and taxonomic classification, the absolute number of 16S rRNA genes of target phylotypes can be estimated by Poisson statistics by counting random tags incorporated at the end of the sequence. To test the feasibility of this approach, the 16S rRNA gene of Sulfolobus tokodaii was subjected to qSeq, which resulted in accurate quantification of 5.0 × 10^3 to 5.0 × 10^4 copies of the 16S rRNA gene. Furthermore, qSeq was applied to mock microbial communities and environmental samples, and the results were comparable to those obtained using digital PCR and
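    The Poisson-based estimate mentioned in this record can be illustrated with a short sketch. It assumes a standard model in which each of N possible tags is drawn uniformly at random, so the fraction of tags observed at least once is 1 - exp(-n/N) and the molecule count is n = -N ln(1 - k/N) for k distinct observed tags. The record does not give the estimator or the tag length, so the formula choice, the function name, and the 8-nucleotide tag are assumptions.

```python
from math import log1p


def estimate_molecules(unique_tags, tag_length, alphabet_size=4):
    """Estimate labeled molecules from the number of distinct random tags.

    Assumed Poisson model: n = -N * ln(1 - k / N), where N is the number of
    possible tags and k is the number of distinct tags observed.
    """
    pool = alphabet_size ** tag_length
    if not 0 <= unique_tags < pool:
        raise ValueError("unique_tags must be in [0, number of possible tags)")
    return -pool * log1p(-unique_tags / pool)


# Hypothetical example: 5000 distinct tags seen with an 8-nt random tag.
estimate = estimate_molecules(5000, tag_length=8)
```

    The correction matters only when the observed tag count is a sizeable fraction of the tag pool; for small counts the estimate is close to the raw number of distinct tags.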

  1. Thermal stability of G-rich anti-parallel DNA triplexes upon insertion of LNA and α-L-LNA.

    PubMed

    Kosbar, Tamer R; Sofan, Mamdouh A; Abou-Zeid, Laila; Pedersen, Erik B

    2015-05-14

    G-rich anti-parallel DNA triplexes were modified with LNA or α-L-LNA in their Watson-Crick and TFO strands. The triplexes were formed by targeting a pyrimidine strand to a putative hairpin formed by Hoogsteen base pairing in order to use the UV melting method to evaluate the stability of the triplexes. Their thermal stability was reduced when the TFO strand was modified with LNA or α-L-LNA. The same trend was observed when the TFO strand and the purine Watson-Crick strand both were modified with LNA. When all triad components were modified with α-L-LNA and LNA in the middle of the triplex, the thermal melting was increased. When the pyrimidine sequence was modified with a single insertion of LNA or α-L-LNA the ΔTm increased. Moreover, increasing the number of α-L-LNA in the pyrimidine target sequence to six insertions, leads to a high increase in the thermal stability. The conformational S-type structure of α-L-LNA in anti-parallel triplexes is preferable for triplex stability.

  2. Target-projectile interaction during impact melting at Kamil Crater, Egypt

    NASA Astrophysics Data System (ADS)

    Fazio, Agnese; D'Orazio, Massimo; Cordier, Carole; Folco, Luigi

    2016-05-01

    In small meteorite impacts, the projectile may survive through fragmentation; in addition, it may melt, and chemically and physically interact with both shocked and melted target rocks. However, the mixing/mingling between projectile and target melts is a process still not completely understood. Kamil Crater (45 m in diameter; Egypt), generated by the hypervelocity impact of the Gebel Kamil Ni-rich ataxite on a sandstone target, allows the study of target-projectile interaction in a simple and fresh geological setting. We conducted a petrographic and geochemical study of macroscopic impact melt lapilli and bombs ejected from the crater, which were collected during our geophysical campaign in February 2010. Two types of glasses constitute the impact melt lapilli and bombs: a white glass and a dark glass. The white glass is mostly made of SiO2 and it is devoid of inclusions. Its negligible Ni and Co contents suggest derivation from the target rocks without interaction with the projectile (<0.1 wt% of projectile contamination). The dark glass is a silicate melt with variable contents of Al2O3 (0.84-18.7 wt%), FeOT (1.83-61.5 wt%), and NiO (<0.01-10.2 wt%). The dark glass typically includes fragments (from a few μm to several mm in size) of shocked sandstone, diaplectic glass, lechatelierite, and Ni-Fe metal blebs. The metal blebs are enriched in Ni compared to the Gebel Kamil meteorite. The dark glass is thus a mixture of target and projectile melts (11-12 wt% of projectile contamination). Based on recently proposed models for target-projectile interaction and for impact glass formation, we suggest a scenario for the glass formation at Kamil. During the transition from the contact and compression stage to the excavation stage, projectile and target liquids formed at their interface and chemically interacted in a restricted zone. Projectile contamination affected only a shallow portion of the target rocks. The SiO2 melt that eventually solidified as white glass behaved as

  3. Rational Design of a Transferrin-Binding Peptide Sequence Tailored to Targeted Nanoparticle Internalization.

    PubMed

    Santi, Melissa; Maccari, Giuseppe; Mereghetti, Paolo; Voliani, Valerio; Rocchiccioli, Silvia; Ucciferri, Nadia; Luin, Stefano; Signore, Giovanni

    2017-02-15

    The transferrin receptor (TfR) is a promising target in cancer therapy owing to its overexpression in most solid tumors and on the blood-brain barrier. Nanostructures chemically derivatized with transferrin are employed in TfR targeting but often lose their functionality upon injection in the bloodstream. As an alternative strategy, we rationally designed a peptide coating able to bind transferrin on suitable pockets not involved in binding to TfR or iron by using an iterative multiscale-modeling approach coupled with quantitative structure-activity relationship (QSAR) analysis and evolutionary algorithms. We verified that selected sequences have low nonspecific protein adsorption and high binding energy toward transferrin, and one of them is efficiently internalized in cells with a transferrin-dependent pathway. Furthermore, it promotes transferrin-mediated endocytosis of gold nanoparticles by modifying their protein corona and promoting oriented adsorption of transferrin. This strategy leads to highly effective nanostructures, potentially useful in diagnostic and therapeutic applications, which exploit (and do not suffer) the protein solvation for achieving a better targeting.

  4. Discovery and Annotation of Plant Endogenous Target Mimicry Sequences from Public Transcriptome Libraries: A Case Study of Prunus persica.

    PubMed

    Karakülah, Gökhan

    2017-06-28

    Novel transcript discovery through RNA sequencing has substantially improved our understanding of the transcriptome dynamics of biological systems. Endogenous target mimicry (eTM) transcripts, a novel class of regulatory molecules, bind to their target microRNAs (miRNAs) by base pairing and block their biological activity. The objective of this study was to provide a computational analysis framework for the prediction of putative eTM sequences in plants, and as an example, to discover previously un-annotated eTMs in the Prunus persica (peach) transcriptome. Therefore, two public peach transcriptome libraries downloaded from the Sequence Read Archive (SRA) and a previously published set of long non-coding RNAs (lncRNAs) were investigated with a multi-step analysis pipeline, and 44 putative eTMs were found. Additionally, an eTM-miRNA-mRNA regulatory network module associated with peach fruit organ development was built via integration of the miRNA target information and predicted eTM-miRNA interactions. My findings suggest that one of the most widely expressed miRNA families among diverse plant species, miR156, might be potentially sponged by seven putative eTMs. Besides, the study indicates eTMs potentially play roles in the regulation of development processes in peach fruit via targeting specific miRNAs. In conclusion, by following the step-by-step instructions provided in this study, novel eTMs can be identified and annotated effectively in public plant transcriptome libraries.

  5. Evaluation of cysteine proteases of Plasmodium vivax as antimalarial drug targets: sequence analysis and sensitivity to cysteine protease inhibitors.

    PubMed

    Na, Byoung-Kuk; Kim, Tong-Soo; Rosenthal, Philip J; Lee, Jong-Koo; Kong, Yoon

    2004-10-01

    Cysteine proteases perform critical roles in the life cycles of malaria parasites. In Plasmodium falciparum, treatment with cysteine protease inhibitors inhibits hemoglobin hydrolysis and blocks parasite development in vitro and in vivo, suggesting that plasmodial cysteine proteases may be interesting targets for new chemotherapeutics. To determine whether sequence diversity may limit chemotherapy against Plasmodium vivax, we analyzed sequence variations in the genes encoding three cysteine proteases, vivapain-1, -2 and -3, in 22 wild isolates of P. vivax. The sequences were highly conserved among wild isolates. A small number of substitutions leading to amino acid changes were found, but they did not modify residues essential for the function or structure of the enzymes. The substrate specificities and sensitivities to synthetic cysteine protease inhibitors of vivapain-2 and -3 from wild isolates were also very similar. These results support the suggestion that cysteine proteases of P. vivax are promising antimalarial chemotherapeutic targets.

  6. Three-dimensional structure and cytokine distribution of platelet-rich fibrin.

    PubMed

    Bai, Meng-Yi; Wang, Ching-Wei; Wang, Jyun-Yi; Lin, Ming-Fang; Chan, Wing P

    2017-02-01

    Previous reports have revealed that several cytokines (including platelet-derived growth factor-BB, transforming growth factor-β1 and insulin-like growth factor-1) can enhance the rate of bone formation and synthesis of extracellular matrix in orthopaedics or periodontology. This study aimed to determine the concentration of cytokines within platelet-rich fibrin microstructures and investigate whether there are differences in the different portions of platelet-rich fibrin, which has implications for proper clinical use of platelet-rich fibrin gel. Whole blood was obtained from six New Zealand rabbits (male, 7 to 39 weeks old, weight 2.7-4 kg); it was then centrifuged for preparation of platelet-rich fibrin gels and harvest of plasma. The resultant platelet-rich fibrin gels were used for cytokine determination, histological analyses and scanning electron microscopy. All plasmas obtained were subject to the same cytokine determination assays for the purpose of comparison. Cytokines platelet-derived growth factor-BB and transforming growth factor-β1 formed concentration gradients from high at the red blood cell end of the platelet-rich fibrin gel (p=1.88×10^-5) to low at the plasma end (p=0.19). Insulin-like growth factor-1 concentrations were similar at the red blood cell and plasma ends. The porosities of the platelet-rich fibrin samples taken in sequence from the red blood cell end to the plasma end were 6.5% ± 4.9%, 24.8% ± 7.5%, 30.3% ± 8.5%, 41.4% ± 12.3%, and 40.3% ± 11.7%, respectively, showing a gradual decrease in the compactness of the platelet-rich fibrin network. Cytokine concentrations are positively associated with platelet-rich fibrin microstructure and portion in a rabbit model. As platelet-rich fibrin is the main entity currently used in regenerative medicine, assessing cytokine concentration and the most valuable portion of PRF gels is essential and recommended to all physicians.

  7. BAC sequencing using pooled methods.

    PubMed

    Saski, Christopher A; Feltus, F Alex; Parida, Laxmi; Haiminen, Niina

    2015-01-01

    Shotgun sequencing and assembly of a large, complex genome can be expensive, and accurately reconstructing the true genome sequence is challenging. Repetitive DNA arrays, paralogous sequences, polyploidy, and heterozygosity are the main factors that plague de novo genome sequencing projects, which typically result in highly fragmented assemblies from which it is difficult to extract biological meaning. Targeted, sub-genomic sequencing offers complexity reduction by removing distal segments of the genome and a systematic mechanism for exploring prioritized genomic content through BAC sequencing. If one isolates and sequences the genome fraction that encodes the relevant biological information, then it is possible to reduce overall sequencing costs and efforts that target a genomic segment. This chapter describes the sub-genome assembly protocol for an organism based upon a BAC tiling path derived from a genome-scale physical map or from fine mapping using BACs to target sub-genomic regions. Methods that are described include BAC isolation and mapping, DNA sequencing, and sequence assembly.

  8. Viral to metazoan marine plankton nucleotide sequences from the Tara Oceans expedition

    PubMed Central

    Alberti, Adriana; Poulain, Julie; Engelen, Stefan; Labadie, Karine; Romac, Sarah; Ferrera, Isabel; Albini, Guillaume; Aury, Jean-Marc; Belser, Caroline; Bertrand, Alexis; Cruaud, Corinne; Da Silva, Corinne; Dossat, Carole; Gavory, Frédérick; Gas, Shahinaz; Guy, Julie; Haquelle, Maud; Jacoby, E'krame; Jaillon, Olivier; Lemainque, Arnaud; Pelletier, Eric; Samson, Gaëlle; Wessner, Mark; Bazire, Pascal; Beluche, Odette; Bertrand, Laurie; Besnard-Gonnet, Marielle; Bordelais, Isabelle; Boutard, Magali; Dubois, Maria; Dumont, Corinne; Ettedgui, Evelyne; Fernandez, Patricia; Garcia, Espérance; Aiach, Nathalie Giordanenco; Guerin, Thomas; Hamon, Chadia; Brun, Elodie; Lebled, Sandrine; Lenoble, Patricia; Louesse, Claudine; Mahieu, Eric; Mairey, Barbara; Martins, Nathalie; Megret, Catherine; Milani, Claire; Muanga, Jacqueline; Orvain, Céline; Payen, Emilie; Perroud, Peggy; Petit, Emmanuelle; Robert, Dominique; Ronsin, Murielle; Vacherie, Benoit; Acinas, Silvia G.; Royo-Llonch, Marta; Cornejo-Castillo, Francisco M.; Logares, Ramiro; Fernández-Gómez, Beatriz; Bowler, Chris; Cochrane, Guy; Amid, Clara; Hoopen, Petra Ten; De Vargas, Colomban; Grimsley, Nigel; Desgranges, Elodie; Kandels-Lewis, Stefanie; Ogata, Hiroyuki; Poulton, Nicole; Sieracki, Michael E.; Stepanauskas, Ramunas; Sullivan, Matthew B.; Brum, Jennifer R.; Duhaime, Melissa B.; Poulos, Bonnie T.; Hurwitz, Bonnie L.; Acinas, Silvia G.; Bork, Peer; Boss, Emmanuel; Bowler, Chris; De Vargas, Colomban; Follows, Michael; Gorsky, Gabriel; Grimsley, Nigel; Hingamp, Pascal; Iudicone, Daniele; Jaillon, Olivier; Kandels-Lewis, Stefanie; Karp-Boss, Lee; Karsenti, Eric; Not, Fabrice; Ogata, Hiroyuki; Pesant, Stéphane; Raes, Jeroen; Sardet, Christian; Sieracki, Michael E.; Speich, Sabrina; Stemmann, Lars; Sullivan, Matthew B.; Sunagawa, Shinichi; Wincker, Patrick; Pesant, Stéphane; Karsenti, Eric; Wincker, Patrick

    2017-01-01

    A unique collection of oceanic samples was gathered by the Tara Oceans expeditions (2009–2013), targeting plankton organisms ranging from viruses to metazoans, and providing rich environmental context measurements. Thanks to recent advances in the field of genomics, extensive sequencing has been performed for a deep genomic analysis of this huge collection of samples. A strategy based on different approaches, such as metabarcoding, metagenomics, single-cell genomics and metatranscriptomics, has been chosen for analysis of size-fractionated plankton communities. Here, we provide detailed procedures applied for genomic data generation, from nucleic acids extraction to sequence production, and we describe registries of genomics datasets available at the European Nucleotide Archive (ENA, www.ebi.ac.uk/ena). The association of these metadata to the experimental procedures applied for their generation will help the scientific community to access these data and facilitate their analysis. This paper complements other efforts to provide a full description of experiments and open science resources generated from the Tara Oceans project, further extending their value for the study of the world’s planktonic ecosystems. PMID:28763055

  9. Viral to metazoan marine plankton nucleotide sequences from the Tara Oceans expedition.

    PubMed

    Alberti, Adriana; Poulain, Julie; Engelen, Stefan; Labadie, Karine; Romac, Sarah; Ferrera, Isabel; Albini, Guillaume; Aury, Jean-Marc; Belser, Caroline; Bertrand, Alexis; Cruaud, Corinne; Da Silva, Corinne; Dossat, Carole; Gavory, Frédérick; Gas, Shahinaz; Guy, Julie; Haquelle, Maud; Jacoby, E'krame; Jaillon, Olivier; Lemainque, Arnaud; Pelletier, Eric; Samson, Gaëlle; Wessner, Mark; Acinas, Silvia G; Royo-Llonch, Marta; Cornejo-Castillo, Francisco M; Logares, Ramiro; Fernández-Gómez, Beatriz; Bowler, Chris; Cochrane, Guy; Amid, Clara; Hoopen, Petra Ten; De Vargas, Colomban; Grimsley, Nigel; Desgranges, Elodie; Kandels-Lewis, Stefanie; Ogata, Hiroyuki; Poulton, Nicole; Sieracki, Michael E; Stepanauskas, Ramunas; Sullivan, Matthew B; Brum, Jennifer R; Duhaime, Melissa B; Poulos, Bonnie T; Hurwitz, Bonnie L; Pesant, Stéphane; Karsenti, Eric; Wincker, Patrick

    2017-08-01

    A unique collection of oceanic samples was gathered by the Tara Oceans expeditions (2009-2013), targeting plankton organisms ranging from viruses to metazoans, and providing rich environmental context measurements. Thanks to recent advances in the field of genomics, extensive sequencing has been performed for a deep genomic analysis of this huge collection of samples. A strategy based on different approaches, such as metabarcoding, metagenomics, single-cell genomics and metatranscriptomics, has been chosen for analysis of size-fractionated plankton communities. Here, we provide detailed procedures applied for genomic data generation, from nucleic acids extraction to sequence production, and we describe registries of genomics datasets available at the European Nucleotide Archive (ENA, www.ebi.ac.uk/ena). The association of these metadata to the experimental procedures applied for their generation will help the scientific community to access these data and facilitate their analysis. This paper complements other efforts to provide a full description of experiments and open science resources generated from the Tara Oceans project, further extending their value for the study of the world's planktonic ecosystems.

  10. Sequencing of a new target genome: the Pediculus humanus humanus (Phthiraptera: Pediculidae) genome project.

    PubMed

    Pittendrigh, B R; Clark, J M; Johnston, J S; Lee, S H; Romero-Severson, J; Dasch, G A

    2006-11-01

    The human body louse, Pediculus humanus humanus (L.), and the human head louse, Pediculus humanus capitis, belong to the hemimetabolous order Phthiraptera. The body louse is the primary vector that transmits the bacterial agents of louse-borne relapsing fever, trench fever, and epidemic typhus. The genomes of the bacterial causative agents of several of these aforementioned diseases have been sequenced. Thus, determining the body louse genome will enhance studies of host-vector-pathogen interactions. Although not important as a major disease vector, head lice are of major social concern. Resistance to traditional pesticides used to control head and body lice has developed. It is imperative that new molecular targets be discovered for the development of novel compounds to control these insects. No complete genome sequence exists for a hemimetabolous insect species primarily because hemimetabolous insects often have large (2000 Mb) to very large (up to 16,300 Mb) genomes. Fortuitously, we determined that the human body louse has one of the smallest genome sizes known in insects, suggesting it may be a suitable choice as a minimal hemimetabolous genome in which many genes have been eliminated during its adaptation to human parasitism. Because many louse species infest birds and mammals, the body louse genome-sequencing project will facilitate studies of their comparative genomics. A 6-8X coverage of the body louse genome, plus sequenced expressed sequence tags, should provide the entomological, evolutionary biology, medical, and public health communities with useful genetic information.

  11. Simultaneous human platelet antigen genotyping and detection of novel single nucleotide polymorphisms by targeted next-generation sequencing.

    PubMed

    Davey, Sue; Navarrete, Cristina; Brown, Colin

    2017-06-01

    Twenty-nine human platelet antigen systems have been described to date, but the majority of current genotyping methods are restricted to the identification of those most commonly associated with alloantibody production in a clinical context. This can result in a protracted investigation if causative human platelet antigens are rare or novel. A targeted next-generation sequencing approach was designed to detect all known human platelet antigens with the additional capability of identifying novel mutations in the encoding genes. A targeted enrichment, high-sensitivity HaloPlex assay was designed to sequence all exons and flanking regions of the six genes known to encode human platelet antigens. Indexed DNA libraries were prepared from 47 previously human platelet antigen-genotyped samples and subsequently combined into one of three pools for sequencing on an Illumina MiSeq platform. The generated FASTQ files were aligned and scrutinized for each human platelet antigen polymorphism using SureCall data analysis software. Forty-six samples were successfully genotyped for human platelet antigens 1 through 29bw, with an average per base coverage depth of 1144. Concordance with historical human platelet antigen genotypes was 100%. A putative novel mutation in Exon 10 of the integrin β-3 (ITGB3) gene from an unsolved case of fetal neonatal alloimmune thrombocytopenia was also detected. A next-generation sequencing-based method that can accurately define all known human platelet antigen polymorphisms was developed. With the ability to sequence up to 96 samples simultaneously, our HaloPlex design could be used for high-throughput human platelet antigen genotyping. This method is also applicable for investigating fetal neonatal alloimmune thrombocytopenia when rare or novel human platelet antigens are suspected. © 2017 AABB.

  12. Development and Validation of Targeted Next-Generation Sequencing Panels for Detection of Germline Variants in Inherited Diseases.

    PubMed

    Santani, Avni; Murrell, Jill; Funke, Birgit; Yu, Zhenming; Hegde, Madhuri; Mao, Rong; Ferreira-Gonzalez, Andrea; Voelkerding, Karl V; Weck, Karen E

    2017-06-01

    The number of targeted next-generation sequencing (NGS) panels for genetic diseases offered by clinical laboratories is rapidly increasing. Before an NGS-based test is implemented in a clinical laboratory, appropriate validation studies are needed to determine the performance characteristics of the test. The objective is to provide examples of assay design and validation of targeted NGS gene panels for the detection of germline variants associated with inherited disorders. The approaches used by 2 clinical laboratories for the development and validation of targeted NGS gene panels are described. Important design and validation considerations are examined. Clinical laboratories must validate performance specifications of each test prior to implementation. Test design specifications and validation data are provided, outlining important steps in validation of targeted NGS panels by clinical diagnostic laboratories.

  13. Variation and Evolution in the Glutamine-Rich Repeat Region of Drosophila Argonaute-2

    PubMed Central

    Palmer, William H.; Obbard, Darren J.

    2016-01-01

    RNA interference pathways mediate biological processes through Argonaute-family proteins, which bind small RNAs as guides to silence complementary target nucleic acids. In insects and crustaceans Argonaute-2 silences viral nucleic acids, and therefore acts as a primary effector of innate antiviral immunity. Although the functions of the major Argonaute-2 domains, which are conserved across most Argonaute-family proteins, are known, many invertebrate Argonaute-2 homologs contain a glutamine-rich repeat (GRR) region of unknown function at the N-terminus. Here we combine long-read amplicon sequencing of Drosophila Genetic Reference Panel (DGRP) lines with publicly available sequence data from many insect species to show that this region evolves extremely rapidly and is hyper-variable within species. We identify distinct GRR haplotype groups in Drosophila melanogaster, and suggest that one of these haplotype groups has recently risen to high frequency in a North American population. Finally, we use published data from genome-wide association studies of viral resistance in D. melanogaster to test whether GRR haplotypes are associated with survival after virus challenge. We find a marginally significant association with survival after challenge with Drosophila C Virus in the DGRP, but we were unable to replicate this finding using lines from the Drosophila Synthetic Population Resource panel. PMID:27317784

  14. Complete nucleotide and derived amino acid sequence of cDNA encoding the mitochondrial uncoupling protein of rat brown adipose tissue: lack of a mitochondrial targeting presequence.

    PubMed Central

    Ridley, R G; Patel, H V; Gerber, G E; Morton, R C; Freeman, K B

    1986-01-01

    A cDNA clone spanning the entire amino acid sequence of the nuclear-encoded uncoupling protein of rat brown adipose tissue mitochondria has been isolated and sequenced. With the exception of the N-terminal methionine the deduced N-terminus of the newly synthesized uncoupling protein is identical to the N-terminal 30 amino acids of the native uncoupling protein as determined by protein sequencing. This proves that the protein contains no N-terminal mitochondrial targeting prepiece and that a targeting region must reside within the amino acid sequence of the mature protein. PMID:3012461

  15. Computational identification of conserved microRNAs and their targets from expression sequence tags of blueberry (Vaccinium corybosum)

    PubMed Central

    Li, Xuyan; Hou, Yanming; Zhang, Li; Zhang, Wenhao; Quan, Chen; Cui, Yuhai; Bian, Shaomin

    2014-01-01

    MicroRNAs (miRNAs) are a class of endogenous, approximately 21nt in length, non-coding RNA, which mediate the expression of target genes primarily at post-transcriptional levels. miRNAs play critical roles in almost all plant cellular and metabolic processes. Although numerous miRNAs have been identified in the plant kingdom, the miRNAs in blueberry, which is an economically important small fruit crop, still remain totally unknown. In this study, we reported a computational identification of miRNAs and their targets in blueberry. By conducting an EST-based comparative genomics approach, 9 potential vco-miRNAs were discovered from 22,402 blueberry ESTs according to a series of filtering criteria, designated as vco-miR156–5p, vco-miR156–3p, vco-miR1436, vco-miR1522, vco-miR4495, vco-miR5120, vco-miR5658, vco-miR5783, and vco-miR5986. Based on sequence complementarity between miRNA and its target transcript, 34 target ESTs from blueberry and 70 targets from other species were identified for the vco-miRNAs. The targets were found to be involved in transcription, RNA splicing and binding, DNA duplication, signal transduction, transport and trafficking, stress response, as well as synthesis and metabolic process. These findings will greatly contribute to future research in regard to functions and regulatory mechanisms of blueberry miRNAs. PMID:25763692
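
    The complementarity-based target search described above can be illustrated with a minimal sketch. It is not the authors' pipeline: the function names, the toy sequences, and the mismatch threshold are all assumptions chosen for illustration, and real plant miRNA target prediction applies further position-specific rules that are not modeled here.

```python
# Minimal sketch: scan a transcript for sites near-complementary to a miRNA.
# All names, sequences and thresholds here are hypothetical illustrations.
COMPLEMENT = {"A": "U", "U": "A", "G": "C", "C": "G"}


def reverse_complement(rna):
    """Return the reverse complement of an RNA string (A, C, G, U only)."""
    return "".join(COMPLEMENT[base] for base in reversed(rna))


def find_target_sites(mirna, transcript, max_mismatches=3):
    """Return (position, mismatches) for each transcript window that differs
    from the perfect target site in at most max_mismatches positions."""
    mirna = mirna.upper().replace("T", "U")
    transcript = transcript.upper().replace("T", "U")
    # A perfectly complementary site, read 5'->3' on the transcript.
    site = reverse_complement(mirna)
    hits = []
    for start in range(len(transcript) - len(site) + 1):
        window = transcript[start:start + len(site)]
        mismatches = sum(1 for a, b in zip(window, site) if a != b)
        if mismatches <= max_mismatches:
            hits.append((start, mismatches))
    return hits


if __name__ == "__main__":
    toy_mirna = "UGACAGAAGA"               # toy sequence, not a real miRNA
    toy_transcript = "GGGUCUUCUGUCACCC"    # contains one perfect site
    print(find_target_sites(toy_mirna, toy_transcript, max_mismatches=0))
```

    Counting mismatches against the reverse complement keeps the sketch short; a scoring scheme that weights positions differently could replace the simple count.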

  16. Computational identification of conserved microRNAs and their targets from expression sequence tags of blueberry (Vaccinium corybosum).

    PubMed

    Li, Xuyan; Hou, Yanming; Zhang, Li; Zhang, Wenhao; Quan, Chen; Cui, Yuhai; Bian, Shaomin

    2014-01-01

    MicroRNAs (miRNAs) are a class of endogenous, approximately 21nt in length, non-coding RNA, which mediate the expression of target genes primarily at post-transcriptional levels. miRNAs play critical roles in almost all plant cellular and metabolic processes. Although numerous miRNAs have been identified in the plant kingdom, the miRNAs in blueberry, which is an economically important small fruit crop, still remain totally unknown. In this study, we reported a computational identification of miRNAs and their targets in blueberry. By conducting an EST-based comparative genomics approach, 9 potential vco-miRNAs were discovered from 22,402 blueberry ESTs according to a series of filtering criteria, designated as vco-miR156-5p, vco-miR156-3p, vco-miR1436, vco-miR1522, vco-miR4495, vco-miR5120, vco-miR5658, vco-miR5783, and vco-miR5986. Based on sequence complementarity between miRNA and its target transcript, 34 target ESTs from blueberry and 70 targets from other species were identified for the vco-miRNAs. The targets were found to be involved in transcription, RNA splicing and binding, DNA duplication, signal transduction, transport and trafficking, stress response, as well as synthesis and metabolic process. These findings will greatly contribute to future research in regard to functions and regulatory mechanisms of blueberry miRNAs.

  17. Agaricus bisporus genome sequence: a commentary.

    PubMed

    Kerrigan, Richard W; Challen, Michael P; Burton, Kerry S

    2013-06-01

    The genomes of two isolates of Agaricus bisporus have been sequenced recently. This soil-inhabiting fungus has a wide geographical distribution in nature and it is also cultivated in an industrialized indoor process ($4.7bn annual worldwide value) to produce edible mushrooms. Previously this lignocellulosic fungus has resisted precise econutritional classification, i.e. into white- or brown-rot decomposers. The generation of the genome sequence and transcriptomic analyses has revealed a new classification, 'humicolous', for species adapted to grow in humic-rich, partially decomposed leaf material. The Agaricus bisporus genomes contain a collection of polysaccharide and lignin-degrading genes and more interestingly an expanded number of genes (relative to other lignocellulosic fungi) that enhance degradation of lignin derivatives, i.e. heme-thiolate peroxidases and β-etherases. A motif that is hypothesized to be a promoter element in the humicolous adaptation suite is present in a large number of genes specifically up-regulated when the mycelium is grown on humic-rich substrate. The genome sequence of A. bisporus offers a platform to explore fungal biology in carbon-rich soil environments and terrestrial cycling of carbon, nitrogen, phosphorus and potassium. Copyright © 2013 Elsevier Inc. All rights reserved.

  18. Amino acid usage is asymmetrically biased in AT- and GC-rich microbial genomes.

    PubMed

    Bohlin, Jon; Brynildsrud, Ola; Vesth, Tammi; Skjerve, Eystein; Ussery, David W

    2013-01-01

    Genomic base composition ranges from less than 25% AT to more than 85% AT in prokaryotes. Since only a small fraction of prokaryotic genomes is not protein coding even a minor change in genomic base composition will induce profound protein changes. We examined how amino acid and codon frequencies were distributed in over 2000 microbial genomes and how these distributions were affected by base compositional changes. In addition, we wanted to know how genome-wide amino acid usage was biased in the different genomes and how changes to base composition and mutations affected this bias. To carry this out, we used a Generalized Additive Mixed-effects Model (GAMM) to explore non-linear associations and strong data dependences in closely related microbes; principal component analysis (PCA) was used to examine genomic amino acid- and codon frequencies, while the concept of relative entropy was used to analyze genomic mutation rates. We found that genomic amino acid frequencies carried a stronger phylogenetic signal than codon frequencies, but that this signal was weak compared to that of genomic %AT. Further, in contrast to codon usage bias (CUB), amino acid usage bias (AAUB) was differently distributed in AT- and GC-rich genomes in the sense that AT-rich genomes did not prefer specific amino acids over others to the same extent as GC-rich genomes. AAUB was also associated with relative entropy; genomes with low AAUB contained more random mutations as a consequence of relaxed purifying selection than genomes with higher AAUB. Genomic base composition has a substantial effect on both amino acid- and codon frequencies in bacterial genomes. While phylogeny influenced amino acid usage more in GC-rich genomes, AT-content was driving amino acid usage in AT-rich genomes. We found the GAMM model to be an excellent tool to analyze the genomic data used in this study.
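
    Two of the quantities named in this abstract, genomic %AT and relative entropy, can be sketched in a few lines. The abstract does not give the exact formulation the authors used, so the code below assumes the standard Kullback-Leibler definition of relative entropy over symbol frequencies; the function names and toy inputs are hypothetical.

```python
# Minimal sketch under stated assumptions: AT fraction of a DNA sequence and
# Kullback-Leibler relative entropy between two symbol-frequency profiles.
import math
from collections import Counter


def at_content(seq):
    """Fraction of A/T among the unambiguous bases (A, C, G, T) of seq."""
    bases = [b for b in seq.upper() if b in "ACGT"]
    return sum(b in "AT" for b in bases) / len(bases)


def relative_entropy(observed, reference):
    """D(P || Q) in bits, where P and Q are the symbol frequencies of
    `observed` and `reference` (e.g. amino acid or codon sequences)."""
    p_counts, q_counts = Counter(observed), Counter(reference)
    p_total, q_total = sum(p_counts.values()), sum(q_counts.values())
    divergence = 0.0
    for symbol, count in p_counts.items():
        p = count / p_total
        q = q_counts[symbol] / q_total
        if q == 0:
            raise ValueError(f"symbol {symbol!r} absent from reference")
        divergence += p * math.log2(p / q)
    return divergence


if __name__ == "__main__":
    print(at_content("AATTGC"))              # 4 of 6 bases are A or T
    print(relative_entropy("AAAA", "AACC"))  # 1 bit
```

    A relative entropy of zero means the two frequency profiles are identical; larger values indicate a stronger departure from the reference profile.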

  19. Amino Acid Usage Is Asymmetrically Biased in AT- and GC-Rich Microbial Genomes

    PubMed Central

    Bohlin, Jon; Brynildsrud, Ola; Vesth, Tammi; Skjerve, Eystein; Ussery, David W.

    2013-01-01

    Introduction: Genomic base composition ranges from less than 25% AT to more than 85% AT in prokaryotes. Since only a small fraction of prokaryotic genomes is not protein coding even a minor change in genomic base composition will induce profound protein changes. We examined how amino acid and codon frequencies were distributed in over 2000 microbial genomes and how these distributions were affected by base compositional changes. In addition, we wanted to know how genome-wide amino acid usage was biased in the different genomes and how changes to base composition and mutations affected this bias. To carry this out, we used a Generalized Additive Mixed-effects Model (GAMM) to explore non-linear associations and strong data dependences in closely related microbes; principal component analysis (PCA) was used to examine genomic amino acid- and codon frequencies, while the concept of relative entropy was used to analyze genomic mutation rates. Results: We found that genomic amino acid frequencies carried a stronger phylogenetic signal than codon frequencies, but that this signal was weak compared to that of genomic %AT. Further, in contrast to codon usage bias (CUB), amino acid usage bias (AAUB) was differently distributed in AT- and GC-rich genomes in the sense that AT-rich genomes did not prefer specific amino acids over others to the same extent as GC-rich genomes. AAUB was also associated with relative entropy; genomes with low AAUB contained more random mutations as a consequence of relaxed purifying selection than genomes with higher AAUB. Conclusion: Genomic base composition has a substantial effect on both amino acid- and codon frequencies in bacterial genomes. While phylogeny influenced amino acid usage more in GC-rich genomes, AT-content was driving amino acid usage in AT-rich genomes. We found the GAMM model to be an excellent tool to analyze the genomic data used in this study. PMID:23922837

  20. Cloning, sequence determination, and regulation of the ribonucleotide reductase subunits from Plasmodium falciparum: a target for antimalarial therapy.

    PubMed Central

    Rubin, H; Salem, J S; Li, L S; Yang, F D; Mama, S; Wang, Z M; Fisher, A; Hamann, C S; Cooperman, B S

    1993-01-01

    Malaria remains a leading cause of morbidity and mortality worldwide, accounting for more than one million deaths annually. We have focused on the reduction of ribonucleotides to 2'-deoxyribonucleotides, catalyzed by ribonucleotide reductase, which represents the rate-determining step in DNA replication as a target for antimalarial agents. We report the full-length DNA sequence corresponding to the large (PfR1) and small (PfR2) subunits of Plasmodium falciparum ribonucleotide reductase. The small subunit (PfR2) contains the major catalytic motif consisting of a tyrosyl radical and a dinuclear Fe site. Whereas PfR2 shares 59% amino acid identity with human R2, a striking sequence divergence between human R2 and PfR2 at the C terminus may provide a selective target for inhibition of the malarial enzyme. A synthetic oligopeptide corresponding to the C-terminal 7 residues of PfR2 inhibits mammalian ribonucleotide reductase at concentrations approximately 10-fold higher than that predicted to inhibit malarial R2. The gene encoding the large subunit (PfR1) contains a single intron. The cysteines thought to be involved in the reduction mechanism are conserved. In contrast to mammalian ribonucleotide reductase, the genes for PfR1 and PfR2 are located on the same chromosome and the accumulation of mRNAs for the two subunits follow different temporal patterns during the cell cycle. PMID:8415692

  1. Plant species richness at different scales in native and exotic grasslands in Southeastern Arizona

    USGS Publications Warehouse

    McLaughlin, S.P.; Bowers, Janice E.

    2006-01-01

    Species richness in Madrean mixed-grass prairies dominated by native or exotic species in southeastern Arizona was characterized at the community and point scales using ten 1-m² quadrats nested within each of eight 1000-m² plots. In the 1000-m² plots average richness was significantly higher in oak savanna (OS, 121.0 species) than in exotic grassland on mesa tops (EMT, 52.0 species), whereas native grassland on mesa slopes (NMS, 92.5 species) and native grassland on mesa tops (NMT, 77.0 species) did not differ significantly in richness from OS or EMT. When richness was partitioned by life form, EMT was notably poorer than other community types in species of perennial grasses, perennial herbs, and summer annuals. In the 1-m² quadrats, OS (21.2 species), NMS (20.9 species), and NMT (20.7 species) were significantly richer than EMT (5.9 species). Cover in 1-m² plots was significantly higher in EMT than in NMT, NMS, or OS. Species richness at the point scale showed a unimodal relation to canopy cover, with cover accounting for 30% of the variation in number of species in 1-m² quadrats. Competitive exclusion and allelopathy have perhaps limited species richness at the point scale in exotic grassland. There was no evidence of a species-pool effect between point and community scales, but such an effect between community and landscape scales was supported. Madrean mixed-grass prairies are landscapes with high species richness in comparison to other grassland types in North America, providing a large pool of potential colonizing species at the community scale. Beta-diversity (between communities) within the landscape of the Appleton-Whittell Research Ranch was consequently high despite a relative lack of habitat diversity.

  2. Effects of tryptophan-rich breakfast and light exposure during the daytime on melatonin secretion at night.

    PubMed

    Fukushige, Haruna; Fukuda, Yumi; Tanaka, Mizuho; Inami, Kaoru; Wada, Kai; Tsumura, Yuki; Kondo, Masayuki; Harada, Tetsuo; Wakamura, Tomoko; Morita, Takeshi

    2014-11-19

    The purpose of the present study is to investigate effects of tryptophan intake and light exposure on melatonin secretion and sleep by modifying tryptophan ingestion at breakfast and light exposure during the daytime, and measuring sleep quality (by using actigraphy and the OSA sleep inventory) and melatonin secretion at night. Thirty three male University students (mean ± SD age: 22 ± 3.1 years) completed the experiments lasting 5 days and 4 nights. The subjects were randomly divided into four groups: Poor*Dim (n = 10), meaning a tryptophan-poor breakfast (55 mg/meal) in the morning and dim light environment (<50 lx) during the daytime; Rich*Dim (n = 7), tryptophan-rich breakfast (476 mg/meal) and dim light environment; Poor*Bright (n = 9), tryptophan-poor breakfast and bright light environment (>5,000 lx); and Rich*Bright (n = 7), tryptophan-rich breakfast and bright light. Saliva melatonin concentrations on the fourth day were significantly lower than on the first day in the Poor*Dim group, whereas they were higher on the fourth day in the Rich*Bright group. Creatinine-adjusted melatonin in urine showed the same direction as saliva melatonin concentrations. These results indicate that the combination of a tryptophan-rich breakfast and bright light exposure during the daytime could promote melatonin secretion at night; further, the observations that the Rich*Bright group had higher melatonin concentrations than the Rich*Dim group, despite no significant differences being observed between the Poor*Dim and Rich*Dim groups nor the Poor*Bright and Rich*Bright groups, suggest that bright light exposure in the daytime is an important contributor to raised melatonin levels in the evening. This study is the first to report the quantitative effects of changed tryptophan intake at breakfast combined with daytime light exposure on melatonin secretion and sleep quality. Evening saliva melatonin secretion changed significantly and indicated that a tryptophan-rich

  3. Climate-induced lake drying causes heterogeneous reductions in waterfowl species richness

    USGS Publications Warehouse

    Roach, Jennifer K.; Griffith, Dennis B.

    2015-01-01

    Context: Lake size has declined on breeding grounds for international populations of waterfowl. Objectives: Our objectives were to (1) model the relationship between waterfowl species richness and lake size; (2) use the model and trends in lake size to project historical, contemporary, and future richness at 2500+ lakes; (3) evaluate mechanisms for the species–area relationship (SAR); and (4) identify species most vulnerable to shrinking lakes. Methods: Monte Carlo simulations of the richness model were used to generate projections. Correlations between richness and both lake size and habitat diversity were compared to identify mechanisms for the SAR. Patterns of nestedness were used to identify vulnerable species. Results: Species richness was greatest at lakes that were larger, closer to rivers, had more wetlands along their perimeters and were within 5 km of a large lake. Average richness per lake was projected to decline by 11% from 1986 to 2050 but was heterogeneous across sub-regions and lakes. Richness in sub-regions with species-rich lakes was projected to remain stable, while richness in the sub-region with species-poor lakes was projected to decline. Lake size had a greater effect on richness than did habitat diversity, suggesting that large lakes have more species because they provide more habitat but not more habitat types. The vulnerability of species to shrinking lakes was related to species rarity rather than foraging guild. Conclusions: Our maps of projected changes in species richness and rank-ordered list of species most vulnerable to shrinking lakes can be used to identify targets for conservation or monitoring.

  4. An RRM–ZnF RNA recognition module targets RBM10 to exonic sequences to promote exon exclusion

    PubMed Central

    Collins, Katherine M.; Kainov, Yaroslav A.; Christodolou, Evangelos; Ray, Debashish; Morris, Quaid; Hughes, Timothy; Taylor, Ian A.

    2017-01-01

    RBM10 is an RNA-binding protein that plays an essential role in development and is frequently mutated in the context of human disease. RBM10 recognizes a diverse set of RNA motifs in introns and exons and regulates alternative splicing. However, the molecular mechanisms underlying this seemingly relaxed sequence specificity are not understood and functional studies have focused on 3′ intronic sites only. Here, we dissect the RNA code recognized by RBM10 and relate it to the splicing regulatory function of this protein. We show that a two-domain RRM1–ZnF unit recognizes a GGA-centered motif enriched in RBM10 exonic sites with high affinity and specificity and test that the interaction with these exonic sequences promotes exon skipping. Importantly, a second RRM domain (RRM2) of RBM10 recognizes a C-rich sequence, which explains its known interaction with the intronic 3′ site of NUMB exon 9 contributing to regulation of the Notch pathway in cancer. Together, these findings explain RBM10's broad RNA specificity and suggest that RBM10 functions as a splicing regulator using two RNA-binding units with different specificities to promote exon skipping. PMID:28379442

  5. An RRM-ZnF RNA recognition module targets RBM10 to exonic sequences to promote exon exclusion.

    PubMed

    Collins, Katherine M; Kainov, Yaroslav A; Christodolou, Evangelos; Ray, Debashish; Morris, Quaid; Hughes, Timothy; Taylor, Ian A; Makeyev, Eugene V; Ramos, Andres

    2017-06-20

    RBM10 is an RNA-binding protein that plays an essential role in development and is frequently mutated in the context of human disease. RBM10 recognizes a diverse set of RNA motifs in introns and exons and regulates alternative splicing. However, the molecular mechanisms underlying this seemingly relaxed sequence specificity are not understood and functional studies have focused on 3′ intronic sites only. Here, we dissect the RNA code recognized by RBM10 and relate it to the splicing regulatory function of this protein. We show that a two-domain RRM1-ZnF unit recognizes a GGA-centered motif enriched in RBM10 exonic sites with high affinity and specificity and test that the interaction with these exonic sequences promotes exon skipping. Importantly, a second RRM domain (RRM2) of RBM10 recognizes a C-rich sequence, which explains its known interaction with the intronic 3′ site of NUMB exon 9 contributing to regulation of the Notch pathway in cancer. Together, these findings explain RBM10's broad RNA specificity and suggest that RBM10 functions as a splicing regulator using two RNA-binding units with different specificities to promote exon skipping. © The Author(s) 2017. Published by Oxford University Press on behalf of Nucleic Acids Research.

  6. Sequence Analysis and Domain Motifs in the Porcine Skin Decorin Glycosaminoglycan Chain

    PubMed Central

    Zhao, Xue; Yang, Bo; Solakylidirim, Kemal; Joo, Eun Ji; Toida, Toshihiko; Higashi, Kyohei; Linhardt, Robert J.; Li, Lingyun

    2013-01-01

    Decorin proteoglycan is composed of a core protein containing a single O-linked dermatan sulfate/chondroitin sulfate glycosaminoglycan (GAG) chain. Although the sequence of the decorin core protein is determined by the gene encoding its structure, the structure of its GAG chain is determined in the Golgi. The recent application of modern MS to bikunin, a far simpler chondroitin sulfate proteoglycan, suggests that it has a single or small number of defined sequences. On this basis, a similar approach was undertaken to sequence the much larger and more structurally complex dermatan sulfate/chondroitin sulfate GAG chain of porcine skin decorin. This approach resulted in information on the consistency/variability of its linkage region at the reducing end of the GAG chain, its iduronic acid-rich domain, glucuronic acid-rich domain, and non-reducing end. A general motif for the porcine skin decorin GAG chain was established. A single small decorin GAG chain was sequenced using MS/MS analysis. The data obtained in the study suggest that the decorin GAG chain has a small or a limited number of sequences. PMID:23423381

  7. Microbial Diversity in Deep-sea Methane Seep Sediments Presented by SSU rRNA Gene Tag Sequencing

    PubMed Central

    Nunoura, Takuro; Takaki, Yoshihiro; Kazama, Hiromi; Hirai, Miho; Ashi, Juichiro; Imachi, Hiroyuki; Takai, Ken

    2012-01-01

    Microbial community structures in methane seep sediments in the Nankai Trough were analyzed by tag-sequencing analysis for the small subunit (SSU) rRNA gene using a newly developed primer set. The dominant members of Archaea were Deep-sea Hydrothermal Vent Euryarchaeotic Group 6 (DHVEG 6), Marine Group I (MGI) and Deep Sea Archaeal Group (DSAG), and those in Bacteria were Alpha-, Gamma-, Delta- and Epsilonproteobacteria, Chloroflexi, Bacteroidetes, Planctomycetes and Acidobacteria. Diversity and richness were examined by 8,709 and 7,690 tag-sequences from sediments at 5 and 25 cm below the seafloor (cmbsf), respectively. The estimated diversity and richness in the methane seep sediment are as high as those in soil and deep-sea hydrothermal environments, although the tag-sequences obtained in this study were not sufficient to show whole microbial diversity in this analysis. We also compared the diversity and richness of each taxon/division between the sediments from the two depths, and found that the diversity and richness of some taxa/divisions varied significantly along with the depth. PMID:22510646
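
    The abstract reports diversity and richness estimates from tag counts without naming the estimators. As a hedged illustration only, the sketch below assumes two commonly used choices, the Shannon index for diversity and the bias-corrected Chao1 estimator for richness; the function names and the toy counts are hypothetical and are not data from the study.

```python
# Minimal sketch, assuming Shannon diversity and bias-corrected Chao1 richness
# computed from per-taxon tag counts. Inputs below are toy values.
import math


def shannon_index(counts):
    """Shannon diversity H' = -sum(p_i * ln p_i) over taxa with count > 0."""
    total = sum(counts)
    return -sum((c / total) * math.log(c / total) for c in counts if c > 0)


def chao1(counts):
    """Bias-corrected Chao1: S_obs + F1 * (F1 - 1) / (2 * (F2 + 1)),
    where F1 and F2 are the numbers of singleton and doubleton taxa."""
    observed = sum(1 for c in counts if c > 0)
    singletons = sum(1 for c in counts if c == 1)
    doubletons = sum(1 for c in counts if c == 2)
    return observed + singletons * (singletons - 1) / (2 * (doubletons + 1))


if __name__ == "__main__":
    toy_counts = [5, 1, 1, 2]  # tags per taxon, hypothetical
    print(shannon_index(toy_counts))
    print(chao1(toy_counts))
```

    Chao1 raises the observed taxon count according to how many taxa were seen only once or twice, which is one way to express the point that the tag-sequences sampled may not cover the whole community.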

  8. Development of a DNA Sensor Based on Nanoporous Pt-Rich Electrodes

    NASA Astrophysics Data System (ADS)

    Van Hao, Pham; Thanh, Pham Duc; Xuan, Chu Thi; Hai, Nguyen Hoang; Tuan, Mai Anh

    2017-06-01

    Nanoporous Pt-rich electrodes with 72 at.% Pt composition were fabricated by sputtering a Pt-Ag alloy, followed by an electrochemical dealloying process to selectively etch away Ag atoms. The surface properties of nanoporous membranes were investigated by energy-dispersive x-ray spectroscopy (EDS), scanning electron microscopy (SEM), atomic force microscopy (AFM), a documentation system, and a gel image system (Gel Doc Imager). A single strand of probe deoxyribonucleic acid (DNA) was immobilized onto the electrode surface by physical adsorption. The DNA probe and target hybridization were measured using a lock-in amplifier and an electrochemical impedance spectroscope (EIS). The nanoporous Pt-rich electrode-based DNA sensor offers a fast response time of 3.7 s, with a limit of detection (LOD) of 4.35 × 10⁻¹⁰ M of DNA target.

  9. Efficacy of Exome-Targeted Capture Sequencing to Detect Mutations in Known Cerebellar Ataxia Genes.

    PubMed

    Coutelier, Marie; Hammer, Monia B; Stevanin, Giovanni; Monin, Marie-Lorraine; Davoine, Claire-Sophie; Mochel, Fanny; Labauge, Pierre; Ewenczyk, Claire; Ding, Jinhui; Gibbs, J Raphael; Hannequin, Didier; Melki, Judith; Toutain, Annick; Laugel, Vincent; Forlani, Sylvie; Charles, Perrine; Broussolle, Emmanuel; Thobois, Stéphane; Afenjar, Alexandra; Anheim, Mathieu; Calvas, Patrick; Castelnovo, Giovanni; de Broucker, Thomas; Vidailhet, Marie; Moulignier, Antoine; Ghnassia, Robert T; Tallaksen, Chantal; Mignot, Cyril; Goizet, Cyril; Le Ber, Isabelle; Ollagnon-Roman, Elisabeth; Pouget, Jean; Brice, Alexis; Singleton, Andrew; Durr, Alexandra

    2018-05-01

    Molecular diagnosis is difficult to achieve in disease groups with a highly heterogeneous genetic background, such as cerebellar ataxia (CA). In many patients, candidate gene sequencing or focused resequencing arrays do not allow investigators to reach a genetic conclusion. To assess the efficacy of exome-targeted capture sequencing to detect mutations in genes broadly linked to CA in a large cohort of undiagnosed patients and to investigate their prevalence. Three hundred nineteen index patients with CA and without a history of dominant transmission were included in this cohort study by the Spastic Paraplegia and Ataxia Network. Centralized storage was in the DNA and cell bank of the Brain and Spine Institute, Salpetriere Hospital, Paris, France. Patients were classified into 6 clinical groups, with the largest being those with spastic ataxia (ie, CA with pyramidal signs [n = 100]). Sequencing was performed from January 1, 2014, through December 31, 2016. Detected variants were classified as very probably or definitely causative, possibly causative, or of unknown significance based on genetic evidence and genotype-phenotype considerations. Identification of variants in genes broadly linked to CA, classified in pathogenicity groups. The 319 included patients had equal sex distribution (160 female [50.2%] and 159 male patients [49.8%]; mean [SD] age at onset, 27.9 [18.6] years). The age at onset was younger than 25 years for 131 of 298 patients (44.0%) with complete clinical information. Consanguinity was present in 101 of 298 (33.9%). Very probable or definite diagnoses were achieved for 72 patients (22.6%), with an additional 19 (6.0%) harboring possibly pathogenic variants. The most frequently mutated genes were SPG7 (n = 14), SACS (n = 8), SETX (n = 7), SYNE1 (n = 6), and CACNA1A (n = 6). The highest diagnostic rate was obtained for patients with an autosomal recessive CA with oculomotor apraxia-like phenotype (6 of 17 [35.3%]) or

  10. Association of levels of fasting glucose and insulin with rare variants at the chromosome 11p11.2-MADD locus: Cohorts for Heart and Aging Research in Genomic Epidemiology (CHARGE) Consortium Targeted Sequencing Study.

    PubMed

    Cornes, Belinda K; Brody, Jennifer A; Nikpoor, Naghmeh; Morrison, Alanna C; Chu, Huan; Ahn, Byung Soo; Wang, Shuai; Dauriz, Marco; Barzilay, Joshua I; Dupuis, Josée; Florez, Jose C; Coresh, Josef; Gibbs, Richard A; Kao, W H Linda; Liu, Ching-Ti; McKnight, Barbara; Muzny, Donna; Pankow, James S; Reid, Jeffrey G; White, Charles C; Johnson, Andrew D; Wong, Tien Y; Psaty, Bruce M; Boerwinkle, Eric; Rotter, Jerome I; Siscovick, David S; Sladek, Robert; Meigs, James B

    2014-06-01

    Common variation at the 11p11.2 locus, encompassing MADD, ACP2, NR1H3, MYBPC3, and SPI1, has been associated in genome-wide association studies with fasting glucose and insulin (FI). In the Cohorts for Heart and Aging Research in Genomic Epidemiology Targeted Sequencing Study, we sequenced 5 gene regions at 11p11.2 to identify rare, potentially functional variants influencing fasting glucose or FI levels. Sequencing (mean depth, 38×) across 16.1 kb in 3566 individuals without diabetes mellitus identified 653 variants, 79.9% of which were rare (minor allele frequency <1%) and novel. We analyzed rare variants in 5 gene regions with FI or fasting glucose using the sequence kernel association test. At NR1H3, 53 rare variants were jointly associated with FI (P=2.73×10⁻³); of these, 7 were predicted to have regulatory function and showed association with FI (P=1.28×10⁻³). Conditioning on 2 previously associated variants at MADD (rs7944584, rs10838687) did not attenuate this association, suggesting that there are >2 independent signals at 11p11.2. One predicted regulatory variant, chr11:47227430 (hg18; minor allele frequency=0.00068), contributed 20.6% to the overall sequence kernel association test score at NR1H3, lies in intron 2 of NR1H3, and is a predicted binding site for forkhead box A1 (FOXA1), a transcription factor associated with insulin regulation. In human HepG2 hepatoma cells, the rare chr11:47227430 A allele disrupted FOXA1 binding and reduced FOXA1-dependent transcriptional activity. Sequencing at 11p11.2-NR1H3 identified rare variation associated with FI. One variant, chr11:47227430, seems to be functional, with the rare A allele reducing transcription factor FOXA1 binding and FOXA1-dependent transcriptional activity. © 2014 American Heart Association, Inc.

  11. DNA sequencing using polymerase substrate-binding kinetics

    PubMed Central

    Previte, Michael John Robert; Zhou, Chunhong; Kellinger, Matthew; Pantoja, Rigo; Chen, Cheng-Yao; Shi, Jin; Wang, BeiBei; Kia, Amirali; Etchin, Sergey; Vieceli, John; Nikoomanzar, Ali; Bomati, Erin; Gloeckner, Christian; Ronaghi, Mostafa; He, Molly Min

    2015-01-01

    Next-generation sequencing (NGS) has transformed genomic research by decreasing the cost of sequencing. However, whole-genome sequencing is still costly and complex for diagnostics purposes. In the clinical space, targeted sequencing has the advantage of allowing researchers to focus on specific genes of interest. Routine clinical use of targeted NGS mandates inexpensive instruments, fast turnaround time and an integrated and robust workflow. Here we demonstrate a version of the Sequencing by Synthesis (SBS) chemistry that potentially can become a preferred targeted sequencing method in the clinical space. This sequencing chemistry uses natural nucleotides and is based on real-time recording of the differential polymerase/DNA-binding kinetics in the presence of correct or mismatch nucleotides. This ensemble SBS chemistry has been implemented on an existing Illumina sequencing platform with integrated cluster amplification. We discuss the advantages of this sequencing chemistry for targeted sequencing as well as its limitations for other applications. PMID:25612848

  12. RNAblueprint: flexible multiple target nucleic acid sequence design.

    PubMed

    Hammer, Stefan; Tschiatschek, Birgit; Flamm, Christoph; Hofacker, Ivo L; Findeiß, Sven

    2017-09-15

    Realizing the value of synthetic biology in biotechnology and medicine requires the design of molecules with specialized functions. Due to its close structure to function relationship, and the availability of good structure prediction methods and energy models, RNA is perfectly suited to be synthetically engineered with predefined properties. However, currently available RNA design tools cannot be easily adapted to accommodate new design specifications. Furthermore, complicated sampling and optimization methods are often developed to suit a specific RNA design goal, adding to their inflexibility. We developed a C++ library implementing a graph coloring approach to stochastically sample sequences compatible with structural and sequence constraints from the typically very large solution space. The approach allows the solution space to be specified and explored in a well-defined way. Our library also guarantees uniform sampling, which makes optimization runs performant by not only avoiding re-evaluation of already found solutions, but also by raising the probability of finding better solutions for long optimization runs. We show that our software can be combined with any other software package to allow diverse RNA design applications. Scripting interfaces allow the easy adaptation of existing code to accommodate new scenarios, making the whole design process very flexible. We implemented example design approaches written in Python to demonstrate these advantages. RNAblueprint, Python implementations and benchmark datasets are available at GitHub: https://github.com/ViennaRNA. Contact: s.hammer@univie.ac.at, ivo@tbi.univie.ac.at or sven@tbi.univie.ac.at. Supplementary data are available at Bioinformatics online. © The Author(s) 2017. Published by Oxford University Press.
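
The idea of sampling sequences that are compatible with a structural constraint can be sketched for the simplest case: a single target structure in dot-bracket notation. This is not the RNAblueprint API and does not implement its graph coloring over multiple structures; it is an illustrative simplification in which every base pair is drawn uniformly from the six canonical and wobble pairs and every unpaired position uniformly from the four bases. All names below are hypothetical.

```python
import random

# Allowed pairings: Watson-Crick pairs plus G-U wobble pairs.
PAIRS = [("A", "U"), ("U", "A"), ("G", "C"), ("C", "G"), ("G", "U"), ("U", "G")]
BASES = "ACGU"


def parse_pairs(structure: str) -> dict:
    """Map each opening-bracket index to its matching closing-bracket index."""
    stack, pairs = [], {}
    for i, ch in enumerate(structure):
        if ch == "(":
            stack.append(i)
        elif ch == ")":
            if not stack:
                raise ValueError("unbalanced structure")
            pairs[stack.pop()] = i
        elif ch != ".":
            raise ValueError(f"unexpected character {ch!r}")
    if stack:
        raise ValueError("unbalanced structure")
    return pairs


def sample_sequence(structure: str, rng: random.Random) -> str:
    """Draw one sequence compatible with a single dot-bracket structure."""
    seq = [None] * len(structure)
    for i, j in parse_pairs(structure).items():
        seq[i], seq[j] = rng.choice(PAIRS)  # paired positions are sampled jointly
    for k, base in enumerate(seq):
        if base is None:
            seq[k] = rng.choice(BASES)      # unpaired positions are independent
    return "".join(seq)
```

Because the paired and unpaired choices are independent and each is uniform, every compatible sequence is equally likely in this single-structure case. With several target structures the positions become interdependent, which is the situation the library described above addresses.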

  13. Bias-Corrected Targeted Next-Generation Sequencing for Rapid, Multiplexed Detection of Actionable Alterations in Cell-Free DNA from Advanced Lung Cancer Patients.

    PubMed

    Paweletz, Cloud P; Sacher, Adrian G; Raymond, Chris K; Alden, Ryan S; O'Connell, Allison; Mach, Stacy L; Kuang, Yanan; Gandhi, Leena; Kirschmeier, Paul; English, Jessie M; Lim, Lee P; Jänne, Pasi A; Oxnard, Geoffrey R

    2016-02-15

    Tumor genotyping is a powerful tool for guiding non-small cell lung cancer (NSCLC) care; however, comprehensive tumor genotyping can be logistically cumbersome. To facilitate genotyping, we developed a next-generation sequencing (NGS) assay using a desktop sequencer to detect actionable mutations and rearrangements in cell-free plasma DNA (cfDNA). An NGS panel was developed targeting 11 driver oncogenes found in NSCLC. Targeted NGS was performed using a novel methodology that maximizes on-target reads, and minimizes artifact, and was validated on DNA dilutions derived from cell lines. Plasma NGS was then blindly performed on 48 patients with advanced, progressive NSCLC and a known tumor genotype, and explored in two patients with incomplete tumor genotyping. NGS could identify mutations present in DNA dilutions at ≥ 0.4% allelic frequency with 100% sensitivity/specificity. Plasma NGS detected a broad range of driver and resistance mutations, including ALK, ROS1, and RET rearrangements, HER2 insertions, and MET amplification, with 100% specificity. Sensitivity was 77% across 62 known driver and resistance mutations from the 48 cases; in 29 cases with common EGFR and KRAS mutations, sensitivity was similar to droplet digital PCR. In two cases with incomplete tumor genotyping, plasma NGS rapidly identified a novel EGFR exon 19 deletion and a missed case of MET amplification. Blinded to tumor genotype, this plasma NGS approach detected a broad range of targetable genomic alterations in NSCLC with no false positives including complex mutations like rearrangements and unexpected resistance mutations such as EGFR C797S. Through use of widely available vacutainers and a desktop sequencing platform, this assay has the potential to be implemented broadly for patient care and translational research. ©2015 American Association for Cancer Research.
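
The sensitivity and specificity figures reported above follow the standard definitions, sketched below. The abstract does not give the underlying counts, so the numbers in the usage note are hypothetical placeholders, and the function names are chosen only for illustration.

```python
def sensitivity(true_positives: int, false_negatives: int) -> float:
    """Fraction of known alterations that the assay detected."""
    total = true_positives + false_negatives
    if total == 0:
        raise ValueError("no positive cases")
    return true_positives / total


def specificity(true_negatives: int, false_positives: int) -> float:
    """Fraction of truly negative cases that the assay called negative."""
    total = true_negatives + false_positives
    if total == 0:
        raise ValueError("no negative cases")
    return true_negatives / total
```

With hypothetical counts, `sensitivity(3, 1)` gives 0.75, and an assay with no false positives has `specificity(10, 0)` equal to 1.0.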

  14. Bias-corrected targeted next-generation sequencing for rapid, multiplexed detection of actionable alterations in cell-free DNA from advanced lung cancer patients

    PubMed Central

    Paweletz, Cloud P.; Sacher, Adrian G.; Raymond, Chris K.; Alden, Ryan S.; O'Connell, Allison; Mach, Stacy L.; Kuang, Yanan; Gandhi, Leena; Kirschmeier, Paul; English, Jessie M.; Lim, Lee P.; Jänne, Pasi A.; Oxnard, Geoffrey R.

    2015-01-01

    Purpose: Tumor genotyping is a powerful tool for guiding non-small cell lung cancer (NSCLC) care, however comprehensive tumor genotyping can be logistically cumbersome. To facilitate genotyping, we developed a next-generation sequencing (NGS) assay using a desktop sequencer to detect actionable mutations and rearrangements in cell-free plasma DNA (cfDNA). Experimental Design: An NGS panel was developed targeting 11 driver oncogenes found in NSCLC. Targeted NGS was performed using a novel methodology that maximizes on-target reads, and minimizes artifact, and was validated on DNA dilutions derived from cell lines. Plasma NGS was then blindly performed on 48 patients with advanced, progressive NSCLC and a known tumor genotype, and explored in two patients with incomplete tumor genotyping. Results: NGS could identify mutations present in DNA dilutions at ≥0.4% allelic frequency with 100% sensitivity/specificity. Plasma NGS detected a broad range of driver and resistance mutations, including ALK, ROS1, and RET rearrangements, HER2 insertions, and MET amplification, with 100% specificity. Sensitivity was 77% across 62 known driver and resistance mutations from the 48 cases; in 29 cases with common EGFR and KRAS mutations, sensitivity was similar to droplet digital PCR. In two cases with incomplete tumor genotyping, plasma NGS rapidly identified a novel EGFR exon 19 deletion and a missed case of MET amplification. Conclusion: Blinded to tumor genotype, this plasma NGS approach detected a broad range of targetable genomic alterations in NSCLC with no false positives including complex mutations like rearrangements and unexpected resistance mutations such as EGFR C797S. Through use of widely available vacutainers and a desktop sequencing platform, this assay has the potential to be implemented broadly for patient care and translational research. PMID:26459174

  15. Identification of rare genetic variants in Italian patients with dementia by targeted gene sequencing.

    PubMed

    Bartoletti-Stella, Anna; Baiardi, Simone; Stanzani-Maserati, Michelangelo; Piras, Silvia; Caffarra, Paolo; Raggi, Alberto; Pantieri, Roberta; Baldassari, Sara; Caporali, Leonardo; Abu-Rumeileh, Samir; Linarello, Simona; Liguori, Rocco; Parchi, Piero; Capellari, Sabina

    2018-06-01

    Genetics is intricately involved in the etiology of neurodegenerative dementias. The incidence of monogenic dementia among all neurodegenerative forms is unknown due to the lack of systematic studies and of patient/clinician access to extensive diagnostic procedures. In this study, we conducted targeted sequencing in 246 clinically heterogeneous patients, mainly with early-onset and/or familial neurodegenerative dementia, using a custom-designed next-generation sequencing panel covering 27 genes known to harbor mutations that can cause different types of dementia, in addition to the detection of C9orf72 repeat expansions. Forty-nine patients (19.9%) carried known pathogenic or novel, likely pathogenic, variants, involving both common (presenilin 1, presenilin 2, C9orf72, and granulin) and rare (optineurin, serpin family I member 1 and protein kinase cyclic adenosine monophosphate (cAMP)-dependent type I regulatory subunit beta) dementia-associated genes. Our results support the use of an extended next-generation sequencing panels as a quick, accurate, and cost-effective method for diagnosis in clinical practice. This approach could have a significant impact on the proportion of tested patients, especially among those with an early disease onset. Copyright © 2018 Elsevier Inc. All rights reserved.
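
The reported proportion of carriers (49 of 246 patients, 19.9%) can be checked with a short calculation. The function name is hypothetical; the two counts are the ones stated in the abstract above.

```python
def diagnostic_yield(n_carriers: int, n_tested: int) -> float:
    """Percentage of tested patients carrying a reportable variant, to one decimal."""
    if n_tested <= 0:
        raise ValueError("n_tested must be positive")
    return round(100.0 * n_carriers / n_tested, 1)


print(diagnostic_yield(49, 246))  # 19.9
```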

  16. Prediction of Drug-Target Interaction Networks from the Integration of Protein Sequences and Drug Chemical Structures.

    PubMed

    Meng, Fan-Rong; You, Zhu-Hong; Chen, Xing; Zhou, Yong; An, Ji-Yong

    2017-07-05

    Knowledge of drug-target interaction (DTI) plays an important role in discovering new drug candidates. Unfortunately, experimental methods for determining DTI have unavoidable shortcomings, including being time-consuming and expensive. This motivates us to develop an effective computational method to predict DTI based on protein sequence. In this paper, we propose a novel computational approach based on protein sequence, namely PDTPS (Predicting Drug Targets with Protein Sequence), to predict DTI. The PDTPS method combines Bi-gram probabilities (BIGP), Position Specific Scoring Matrix (PSSM), and Principal Component Analysis (PCA) with Relevance Vector Machine (RVM). In order to evaluate the prediction capacity of the PDTPS, the experiment was carried out on enzyme, ion channel, GPCR, and nuclear receptor datasets by using five-fold cross-validation tests. The proposed PDTPS method achieved average accuracy of 97.73%, 93.12%, 86.78%, and 87.78% on enzyme, ion channel, GPCR and nuclear receptor datasets, respectively. The experimental results showed that our method has good prediction performance. Furthermore, to further evaluate the prediction performance of the proposed PDTPS method, we compared it with the state-of-the-art support vector machine (SVM) classifier on enzyme and ion channel datasets, and with other existing methods on four datasets. The promising comparison results further demonstrate the efficiency and robustness of the proposed PDTPS method. This makes it a useful tool, suitable for predicting DTI as well as other bioinformatics tasks.
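
The five-fold cross-validation protocol mentioned above can be sketched in plain Python. This shows only the generic data-splitting step, not the PDTPS feature extraction or the RVM classifier; the function name, the shuffling seed, and the interleaved fold assignment are assumptions made for illustration.

```python
import random


def k_fold_indices(n_samples: int, k: int = 5, seed: int = 0):
    """Yield (train_indices, test_indices) for each of k folds."""
    if not 1 < k <= n_samples:
        raise ValueError("k must be between 2 and n_samples")
    indices = list(range(n_samples))
    random.Random(seed).shuffle(indices)       # randomise sample order once
    folds = [indices[i::k] for i in range(k)]  # k roughly equal-sized folds
    for i in range(k):
        test = folds[i]
        # Training set is every sample that is not in the held-out fold.
        train = [idx for j, fold in enumerate(folds) if j != i for idx in fold]
        yield train, test
```

Each sample is held out exactly once, so the average accuracy over the k folds uses every sample for testing while never testing a model on data it was trained on.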

  17. Using genic sequence capture in combination with a syntenic pseudo genome to map a deletion mutant in a wheat species.

    PubMed

    Gardiner, Laura-Jayne; Gawroński, Piotr; Olohan, Lisa; Schnurbusch, Thorsten; Hall, Neil; Hall, Anthony

    2014-12-01

    Mapping-by-sequencing analyses have largely required a complete reference sequence and employed whole genome re-sequencing. In species such as wheat, no finished genome reference sequence is available. Additionally, because of its large genome size (17 Gb), re-sequencing at sufficient depth of coverage is not practical. Here, we extend the utility of mapping by sequencing, developing a bespoke pipeline and algorithm to map an early-flowering locus in einkorn wheat (Triticum monococcum L.) that is closely related to the bread wheat genome A progenitor. We have developed a genomic enrichment approach using the gene-rich regions of hexaploid bread wheat to design a 110-Mbp NimbleGen SeqCap EZ in solution capture probe set, representing the majority of genes in wheat. Here, we use the capture probe set to enrich and sequence an F2 mapping population of the mutant. The mutant locus was identified in T. monococcum, which lacks a complete genome reference sequence, by mapping the enriched data set onto pseudo-chromosomes derived from the capture probe target sequence, with a long-range order of genes based on synteny of wheat with Brachypodium distachyon. Using this approach we are able to map the region and identify a set of deleted genes within the interval. © 2014 The Authors. The Plant Journal published by Society for Experimental Biology and John Wiley & Sons Ltd.

  18. Accurate reconstruction of viral quasispecies spectra through improved estimation of strain richness

    PubMed Central

    2015-01-01

    Background: Estimating the number of different species (richness) in a mixed microbial population has been a main focus in metagenomic research. Existing methods of species richness estimation ride on the assumption that the reads in each assembled contig correspond to only one of the microbial genomes in the population. This assumption and the underlying probabilistic formulations of existing methods are not useful for quasispecies populations where the strains are highly genetically related. The lack of knowledge on the number of different strains in a quasispecies population is observed to hinder the precision of existing Viral Quasispecies Spectrum Reconstruction (QSR) methods due to the uncontrolled reconstruction of a large number of in silico false positives. In this work, we formulated a novel probabilistic method for strain richness estimation specifically targeting viral quasispecies. By using this approach we improved our recently proposed spectrum reconstruction pipeline ViQuaS to achieve higher levels of precision in reconstructed quasispecies spectra without compromising the recall rates. We also discuss how one other existing popular QSR method named ShoRAH can be improved using this new approach. Results: On benchmark data sets, our estimation method provided accurate richness estimates (< 0.2 median estimation error) and improved the precision of ViQuaS by 2%-13% and F-score by 1%-9% without compromising the recall rates. We also demonstrate that our estimation method can be used to improve the precision and F-score of ShoRAH by 0%-7% and 0%-5% respectively. Conclusions: The proposed probabilistic estimation method can be used to estimate the richness of viral populations with a quasispecies behavior and to improve the accuracy of the quasispecies spectra reconstructed by the existing methods ViQuaS and ShoRAH in the presence of a moderate level of technical sequencing errors. Availability: http://sourceforge.net/projects/viquas/ PMID:26678073
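
Precision, recall and F-score, the metrics used above to evaluate reconstructed quasispecies spectra, can be sketched as follows. The sketch assumes a reconstructed strain counts as correct only when it exactly matches a true strain; the benchmark in the paper may use a different matching rule, and the function name is hypothetical.

```python
def precision_recall_f1(reconstructed, truth):
    """Compare a set of reconstructed strains against the set of true strains."""
    reconstructed, truth = set(reconstructed), set(truth)
    true_positives = len(reconstructed & truth)
    # Precision falls when many spurious (false positive) strains are reported.
    precision = true_positives / len(reconstructed) if reconstructed else 0.0
    # Recall falls when true strains are missed.
    recall = true_positives / len(truth) if truth else 0.0
    denom = precision + recall
    f1 = 2 * precision * recall / denom if denom else 0.0
    return precision, recall, f1
```

For instance, reporting four strains when only two of them are real gives a precision of 0.5 with a recall of 1.0, which is the kind of over-reconstruction that an accurate richness estimate is meant to curb.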

  19. Is sequence awareness mandatory for perceptual sequence learning: An assessment using a pure perceptual sequence learning design.

    PubMed

    Deroost, Natacha; Coomans, Daphné

    2018-02-01

    We examined the role of sequence awareness in a pure perceptual sequence learning design. Participants had to react to the target's colour that changed according to a perceptual sequence. By varying the mapping of the target's colour onto the response keys, motor responses changed randomly. The effect of sequence awareness on perceptual sequence learning was determined by manipulating the learning instructions (explicit versus implicit) and assessing the amount of sequence awareness after the experiment. In the explicit instruction condition (n = 15), participants were instructed to intentionally search for the colour sequence, whereas in the implicit instruction condition (n = 15), they were left uninformed about the sequenced nature of the task. Sequence awareness after the sequence learning task was tested by means of a questionnaire and the process-dissociation-procedure. The results showed that the instruction manipulation had no effect on the amount of perceptual sequence learning. Based on their report to have actively applied their sequence knowledge during the experiment, participants were subsequently regrouped in a sequence strategy group (n = 14, of which 4 participants from the implicit instruction condition and 10 participants from the explicit instruction condition) and a no-sequence strategy group (n = 16, of which 11 participants from the implicit instruction condition and 5 participants from the explicit instruction condition). Only participants of the sequence strategy group showed reliable perceptual sequence learning and sequence awareness. These results indicate that perceptual sequence learning depends upon the continuous employment of strategic cognitive control processes on sequence knowledge. Sequence awareness is suggested to be a necessary but not sufficient condition for perceptual learning to take place. Copyright © 2018 Elsevier B.V. All rights reserved.

  20. Targeted genomic enrichment and sequencing of CyHV-3 from carp tissues confirms low nucleotide diversity and mixed genotype infections.

    PubMed

    Hammoumi, Saliha; Vallaeys, Tatiana; Santika, Ayi; Leleux, Philippe; Borzym, Ewa; Klopp, Christophe; Avarre, Jean-Christophe

    2016-01-01

    Koi herpesvirus disease (KHVD) is an emerging disease that causes mass mortality in koi and common carp, Cyprinus carpio L. Its causative agent is Cyprinid herpesvirus 3 (CyHV-3), also known as koi herpesvirus (KHV). Although data on the pathogenesis of this deadly virus is relatively abundant in the literature, still little is known about its genomic diversity and about the molecular mechanisms that lead to such a high virulence. In this context, we developed a new strategy for sequencing full-length CyHV-3 genomes directly from infected fish tissues. Total genomic DNA extracted from carp gill tissue was specifically enriched with CyHV-3 sequences through hybridization to a set of nearly 2 million overlapping probes designed to cover the entire genome length, using KHV-J sequence (GenBank accession number AP008984) as reference. Applied to 7 CyHV-3 specimens from Poland and Indonesia, this targeted genomic enrichment enabled recovery of the full genomes with >99.9% reference coverage. The enrichment rate was directly correlated to the estimated number of viral copies contained in the DNA extracts used for library preparation, which varied between ∼5000 and ∼2×10^7. The average sequencing depth was >200 for all samples, thus allowing the search for variants with high confidence. Sequence analyses highlighted a significant proportion of intra-specimen sequence heterogeneity, suggesting the presence of mixed infections in all investigated fish. They also showed that inter-specimen genetic diversity at the genome scale was very low (>99.95% of sequence identity). By enabling full genome comparisons directly from infected fish tissues, this new method will be valuable to trace outbreaks rapidly and at a reasonable cost, and in turn to understand the transmission routes of CyHV-3.
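
The two summary statistics quoted above, reference coverage and average sequencing depth, can be computed from per-position read depths as in the minimal sketch below. The input format (one integer depth per reference position) and the function name are assumptions; the values in the usage note are made up for illustration.

```python
def coverage_summary(depths, min_depth: int = 1):
    """Return (mean depth, fraction of positions covered at >= min_depth)."""
    if not depths:
        raise ValueError("depths must not be empty")
    n_positions = len(depths)
    mean_depth = sum(depths) / n_positions
    breadth = sum(1 for d in depths if d >= min_depth) / n_positions
    return mean_depth, breadth
```

For a toy reference of four positions with depths `[0, 200, 400, 200]`, the mean depth is 200.0 and the breadth of coverage is 0.75, since one position received no reads.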

  1. Targeted genomic enrichment and sequencing of CyHV-3 from carp tissues confirms low nucleotide diversity and mixed genotype infections

    PubMed Central

    Hammoumi, Saliha; Vallaeys, Tatiana; Santika, Ayi; Leleux, Philippe; Borzym, Ewa; Klopp, Christophe

    2016-01-01

    Koi herpesvirus disease (KHVD) is an emerging disease that causes mass mortality in koi and common carp, Cyprinus carpio L. Its causative agent is Cyprinid herpesvirus 3 (CyHV-3), also known as koi herpesvirus (KHV). Although data on the pathogenesis of this deadly virus is relatively abundant in the literature, still little is known about its genomic diversity and about the molecular mechanisms that lead to such a high virulence. In this context, we developed a new strategy for sequencing full-length CyHV-3 genomes directly from infected fish tissues. Total genomic DNA extracted from carp gill tissue was specifically enriched with CyHV-3 sequences through hybridization to a set of nearly 2 million overlapping probes designed to cover the entire genome length, using KHV-J sequence (GenBank accession number AP008984) as reference. Applied to 7 CyHV-3 specimens from Poland and Indonesia, this targeted genomic enrichment enabled recovery of the full genomes with >99.9% reference coverage. The enrichment rate was directly correlated to the estimated number of viral copies contained in the DNA extracts used for library preparation, which varied between ∼5000 and ∼2×10^7. The average sequencing depth was >200 for all samples, thus allowing the search for variants with high confidence. Sequence analyses highlighted a significant proportion of intra-specimen sequence heterogeneity, suggesting the presence of mixed infections in all investigated fish. They also showed that inter-specimen genetic diversity at the genome scale was very low (>99.95% of sequence identity). By enabling full genome comparisons directly from infected fish tissues, this new method will be valuable to trace outbreaks rapidly and at a reasonable cost, and in turn to understand the transmission routes of CyHV-3. PMID:27703859

  2. Short Interspersed Nuclear Element (SINE) Sequences in the Genome of the Human Pathogenic Fungus Aspergillus fumigatus Af293

    PubMed Central

    Kanhayuwa, Lakkhana; Coutts, Robert H. A.

    2016-01-01

    Novel families of short interspersed nuclear element (SINE) sequences in the human pathogenic fungus Aspergillus fumigatus, clinical isolate Af293, were identified and categorised into tRNA-related and 5S rRNA-related SINEs. Eight predicted tRNA-related SINE families originating from different tRNAs, and nominated as AfuSINE2 sequences, contained target site duplications of short direct repeat sequences (4–14 bp) flanking the elements, an extended tRNA-unrelated region and typical features of RNA polymerase III promoter sequences. The elements ranged in size from 140–493 bp and were present in low copy number in the genome and five out of eight were actively transcribed. One putative tRNAArg-derived sequence, AfuSINE2-1a possessed a unique feature of repeated trinucleotide ACT residues at its 3’-terminus. This element was similar in sequence to the I-4_AO element found in A. oryzae and an I-1_AF long nuclear interspersed element-like sequence identified in A. fumigatus Af293. Families of 5S rRNA-related SINE sequences, nominated as AfuSINE3, were also identified and their 5'-5S rRNA-related regions show 50–65% and 60–75% similarity to respectively A. fumigatus 5S rRNAs and SINE3-1_AO found in A. oryzae. A. fumigatus Af293 contains five copies of AfuSINE3 sequences ranging in size from 259–343 bp and two out of five AfuSINE3 sequences were actively transcribed. Investigations on AfuSINE distribution in the fungal genome revealed that the elements are enriched in pericentromeric and subtelomeric regions and inserted within gene-rich regions. We also demonstrated that some, but not all, AfuSINE sequences are targeted by host RNA silencing mechanisms. Finally, we demonstrated that infection of the fungus with mycoviruses had no apparent effects on SINE activity. PMID:27736869

  3. Interaction of Plasmodium vivax Tryptophan-rich Antigen PvTRAg38 with Band 3 on Human Erythrocyte Surface Facilitates Parasite Growth

    PubMed Central

    Alam, Mohd. Shoeb; Choudhary, Vandana; Zeeshan, Mohammad; Tyagi, Rupesh K.; Rathore, Sumit; Sharma, Yagya D.

    2015-01-01

    Plasmodium tryptophan-rich proteins are involved in host-parasite interaction and thus potential drug/vaccine targets. Recently, we have described several P. vivax tryptophan-rich antigens (PvTRAgs), including merozoite expressed PvTRAg38, from this noncultivable human malaria parasite. PvTRAg38 is highly immunogenic in humans and binds to host erythrocytes, and this binding is inhibited by the patient sera. This binding is also affected if host erythrocytes were pretreated with chymotrypsin. Here, Band 3 has been identified as the chymotrypsin-sensitive erythrocyte receptor for this parasite protein. Interaction of PvTRAg38 with Band 3 has been mapped to its three different ectodomains (loops 1, 3, and 6) exposed at the surface of the erythrocyte. The binding region of PvTRAg38 to Band3 has been mapped to its sequence, KWVQWKNDKIRSWLSSEW, present at amino acid positions 197–214. The recombinant PvTRAg38 was able to inhibit the parasite growth in in vitro Plasmodium falciparum culture probably by competing with the ligand(s) of this heterologous parasite for the erythrocyte Band 3 receptor. In conclusion, the host-parasite interaction at the molecular level is much more complicated than known so far and should be considered during the development of anti-malarial therapeutics. PMID:26149684

  4. Targeted next-generation sequencing helps to decipher the genetic and phenotypic heterogeneity of hypertrophic cardiomyopathy

    PubMed Central

    Cecconi, Massimiliano; Parodi, Maria I.; Formisano, Francesco; Spirito, Paolo; Autore, Camillo; Musumeci, Maria B.; Favale, Stefano; Forleo, Cinzia; Rapezzi, Claudio; Biagini, Elena; Davì, Sabrina; Canepa, Elisabetta; Pennese, Loredana; Castagnetta, Mauro; Degiorgio, Dario; Coviello, Domenico A.

    2016-01-01

    Hypertrophic cardiomyopathy (HCM) is mainly associated with myosin, heavy chain 7 (MYH7) and myosin binding protein C, cardiac (MYBPC3) mutations. In order to better explain the clinical and genetic heterogeneity in HCM patients, in this study, we implemented a target-next generation sequencing (NGS) assay. An Ion AmpliSeq™ Custom Panel was created for the enrichment of 19 genes, 9 of which did not encode thick/intermediate and thin myofilament (TTm) proteins and, among them, 3 responsible for HCM phenocopy. Ninety-two DNA samples were analyzed by the Ion Personal Genome Machine: 73 DNA samples (training set), previously genotyped in some of the genes by Sanger sequencing, were used to optimize the NGS strategy, whereas 19 DNA samples (discovery set) allowed the evaluation of NGS performance. In the training set, we identified 72 out of 73 expected mutations and 15 additional mutations: the molecular diagnosis was achieved in one patient with a previously wild-type status and the pre-excitation syndrome was explained in another. In the discovery set, we identified 20 mutations, 5 of which were in genes encoding non-TTm proteins, increasing the diagnostic yield by approximately 20%: a single mutation in genes encoding non-TTm proteins was identified in 2 out of 3 borderline HCM patients, whereas co-occurring mutations in genes encoding TTm and galactosidase alpha (GLA) altered proteins were characterized in a male with HCM and multiorgan dysfunction. Our combined targeted NGS-Sanger sequencing-based strategy allowed the molecular diagnosis of HCM with greater efficiency than using the conventional (Sanger) sequencing alone. Mutant alleles encoding non-TTm proteins may aid in the complete understanding of the genetic and phenotypic heterogeneity of HCM: co-occurring mutations of genes encoding TTm and non-TTm proteins could explain the wide variability of the HCM phenotype, whereas mutations in genes encoding only the non-TTm proteins are identifiable in

  5. Spectroscopy of neutron rich nuclei using cold neutron induced fission of actinide targets at the ILL: The EXILL campaign

    NASA Astrophysics Data System (ADS)

    Blanc, A.; de France, G.; Drouet, F.; Jentschel, M.; Köster, U.; Mancuso, C.; Mutti, P.; Régis, J. M.; Simpson, G.; Soldner, T.; Ur, C. A.; Urban, W.; Vancraeyenest, A.

    2013-12-01

    One way to explore exotic nuclei is to study their structure by performing γ-ray spectroscopy. At the ILL, we exploit a high neutron flux reactor to induce the cold fission of actinide targets. In this process, fission products that cannot be accessed using standard spontaneous fission sources are produced with a yield allowing their detailed study using high resolution γ-ray spectroscopy. This is what was pursued at the ILL with the EXILL (for EXOGAM at the ILL) campaign. In the present work, the EXILL setup and performance will be presented.

  6. cDNA sequences and organization of IgM heavy chain genes in two holostean fish.

    PubMed

    Wilson, M R; van Ravenstein, E; Miller, N W; Clem, L W; Middleton, D L; Warr, G W

    1995-01-01

    Immunoglobulin M heavy chain (mu) sequences of two holostean fish, the bowfin, Amia calva, and the longnose gar, Lepisosteus osseus, were amplified from spleen mRNA by RACE-PCR, cloned, and sequenced. Each mu chain showed the conserved four constant domain structure typical of a secreted mu chain. Southern blot analyses with specific heavy chain variable (VH) and constant (CH) region probes suggest that both fish possess an IgH locus that resembles that of the teleosts, amphibians, and mammals in its organization. The overall sequence similarity of gar and bowfin mu chains was 60% and 48% at the nucleotide and amino acid levels, respectively, while similarity to the mu chains of teleosts and elasmobranchs was lower. The bowfin mu chain possesses a distinctive proline-rich sequence at the C mu 1/C mu 2 boundary; a shorter proline-rich sequence is present at this position in the gar mu chain. Both gar and bowfin show, in their C mu 4 sequences, motifs that could serve as cryptic splice donor sites for the production of mRNA encoding the membrane-bound form of the mu chains, and the bowfin also shows a potential cryptic splice donor site in the C mu 3 exon.

  7. Isolation and characterization of the promoter sequence of a cassava gene coding for Pt2L4, a glutamic acid-rich protein differentially expressed in storage roots.

    PubMed

    de Souza, C R; Aragão, F J; Moreira, E C O; Costa, C N M; Nascimento, S B; Carvalho, L J

    2009-03-24

    Cassava is one of the most important tropical food crops for more than 600 million people worldwide. Transgenic technologies can be useful for increasing its nutritional value and its resistance to viral diseases and insect pests. However, tissue-specific promoters that guarantee correct expression of transgenes would be necessary. We used inverse polymerase chain reaction to isolate a promoter sequence of the Mec1 gene coding for Pt2L4, a glutamic acid-rich protein differentially expressed in cassava storage roots. In silico analysis revealed putative cis-acting regulatory elements within this promoter sequence, including root-specific elements that may be required for its expression in vascular tissues. Transient expression experiments showed that the Mec1 promoter is functional, since this sequence was able to drive GUS expression in bean embryonic axes. Results from our computational analysis can serve as a guide for functional experiments to identify regions with tissue-specific Mec1 promoter activity. The DNA sequence that we identified is a new promoter that could be a candidate for genetic engineering of cassava roots.

  8. Polymorphism discovery and allele frequency estimation using high-throughput DNA sequencing of target-enriched pooled DNA samples

    PubMed Central

    2012-01-01

    -natal growth and development and subsequent lactogenesis and fertility. We have identified a large number of variants segregating at significantly different frequencies between cattle groups divergent for calving interval plausibly harbouring causative variants contributing to heritable variation. To our knowledge, this is the first report describing sequencing of targeted genomic regions in any livestock species using groups with divergent phenotypes for an economically important trait. PMID:22235840

  9. DNA sequence analysis with droplet-based microfluidics

    PubMed Central

    Abate, Adam R.; Hung, Tony; Sperling, Ralph A.; Mary, Pascaline; Rotem, Assaf; Agresti, Jeremy J.; Weiner, Michael A.; Weitz, David A.

    2014-01-01

    Droplet-based microfluidic techniques can form and process micrometer scale droplets at thousands per second. Each droplet can house an individual biochemical reaction, allowing millions of reactions to be performed in minutes with small amounts of total reagent. This versatile approach has been used for engineering enzymes, quantifying concentrations of DNA in solution, and screening protein crystallization conditions. Here, we use it to read the sequences of DNA molecules with a FRET-based assay. Using probes of different sequences, we interrogate a target DNA molecule for polymorphisms. With a larger probe set, additional polymorphisms can be interrogated as well as targets of arbitrary sequence. PMID:24185402

  10. Combined mutation and copy-number variation detection by targeted next-generation sequencing in uveal melanoma.

    PubMed

    Smit, Kyra N; van Poppelen, Natasha M; Vaarwater, Jolanda; Verdijk, Robert; van Marion, Ronald; Kalirai, Helen; Coupland, Sarah E; Thornton, Sophie; Farquhar, Neil; Dubbink, Hendrikus-Jan; Paridaens, Dion; de Klein, Annelies; Kiliç, Emine

    2018-05-01

    Uveal melanoma is a highly aggressive cancer of the eye, in which nearly 50% of the patients die from metastasis. It is the most common type of primary eye cancer in adults. Chromosome and mutation status have been shown to correlate with the disease-free survival. Loss of chromosome 3 and inactivating mutations in BAP1, which is located on chromosome 3, are strongly associated with 'high-risk' tumors that metastasize early. Other genes often involved in uveal melanoma are SF3B1 and EIF1AX, which are found to be mutated in intermediate- and low-risk tumors, respectively. To obtain genetic information of all genes in one test, we developed a targeted sequencing method that can detect mutations in uveal melanoma genes and chromosomal anomalies in chromosome 1, 3, and 8. With as little as 10 ng DNA, we obtained enough coverage on all genes to detect mutations, such as substitutions, deletions, and insertions. These results were validated with Sanger sequencing in 28 samples. In >90% of the cases, the BAP1 mutation status corresponded to the BAP1 immunohistochemistry. The results obtained in the Ion Torrent single-nucleotide polymorphism assay were confirmed with several other techniques, such as fluorescence in situ hybridization, multiplex ligation-dependent probe amplification, and Illumina SNP array. By validating our assay in 27 formalin-fixed paraffin-embedded and 43 fresh uveal melanomas, we show that mutations and chromosome status can reliably be obtained using targeted next-generation sequencing. Implementing this technique as a diagnostic pathology application for uveal melanoma will allow prediction of the patients' metastatic risk and potentially assess eligibility for new therapies.

  11. Targeted next-generation sequencing analysis identifies novel mutations in families with severe familial exudative vitreoretinopathy.

    PubMed

    Huang, Xiao-Yan; Zhuang, Hong; Wu, Ji-Hong; Li, Jian-Kang; Hu, Fang-Yuan; Zheng, Yu; Tellier, Laurent Christian Asker M; Zhang, Sheng-Hai; Gao, Feng-Juan; Zhang, Jian-Guo; Xu, Ge-Zhi

    2017-01-01

    Familial exudative vitreoretinopathy (FEVR) is a genetically and clinically heterogeneous disease, characterized by failure of vascular development of the peripheral retina. The symptoms of FEVR vary widely among patients in the same family, and even between the two eyes of a given patient. This study was designed to identify the genetic defect in a patient cohort of ten Chinese families with a definitive diagnosis of FEVR. To identify the causative gene, next-generation sequencing (NGS)-based target capture sequencing was performed. Segregation analysis of the candidate variant was performed in additional family members by using Sanger sequencing and quantitative real-time PCR (QPCR). Of the cohort of ten FEVR families, six pathogenic variants were identified, including four novel and two known heterozygous mutations. Of the variants identified, four were missense variants, and two were novel heterozygous deletion mutations [LRP5, c.4053 DelC (p.Ile1351IlefsX88); TSPAN12, EX8Del]. The two novel heterozygous deletion mutations were not observed in the control subjects and could give rise to a relatively severe FEVR phenotype, which could be explained by the protein function prediction. We identified two novel heterozygous deletion mutations [LRP5, c.4053 DelC (p.Ile1351IlefsX88); TSPAN12, EX8Del] using targeted NGS as a causative mutation for FEVR. These genetic deletion variations exhibit a severe form of FEVR, with tractional retinal detachments compared with other known point mutations. The data further enrich the mutation spectrum of FEVR and enhance our understanding of genotype-phenotype correlations to provide useful information for disease diagnosis, prognosis, and effective genetic counseling.

  12. Targeted next-generation sequencing analysis identifies novel mutations in families with severe familial exudative vitreoretinopathy

    PubMed Central

    Huang, Xiao-Yan; Zhuang, Hong; Wu, Ji-Hong; Li, Jian-Kang; Hu, Fang-Yuan; Zheng, Yu; Tellier, Laurent Christian Asker M.; Zhang, Sheng-Hai; Gao, Feng-Juan; Zhang, Jian-Guo

    2017-01-01

    Purpose Familial exudative vitreoretinopathy (FEVR) is a genetically and clinically heterogeneous disease, characterized by failure of vascular development of the peripheral retina. The symptoms of FEVR vary widely among patients in the same family, and even between the two eyes of a given patient. This study was designed to identify the genetic defect in a patient cohort of ten Chinese families with a definitive diagnosis of FEVR. Methods To identify the causative gene, next-generation sequencing (NGS)-based target capture sequencing was performed. Segregation analysis of the candidate variant was performed in additional family members by using Sanger sequencing and quantitative real-time PCR (QPCR). Results Of the cohort of ten FEVR families, six pathogenic variants were identified, including four novel and two known heterozygous mutations. Of the variants identified, four were missense variants, and two were novel heterozygous deletion mutations [LRP5, c.4053 DelC (p.Ile1351IlefsX88); TSPAN12, EX8Del]. The two novel heterozygous deletion mutations were not observed in the control subjects and could give rise to a relatively severe FEVR phenotype, which could be explained by the protein function prediction. Conclusions We identified two novel heterozygous deletion mutations [LRP5, c.4053 DelC (p.Ile1351IlefsX88); TSPAN12, EX8Del] using targeted NGS as a causative mutation for FEVR. These genetic deletion variations exhibit a severe form of FEVR, with tractional retinal detachments compared with other known point mutations. The data further enrich the mutation spectrum of FEVR and enhance our understanding of genotype–phenotype correlations to provide useful information for disease diagnosis, prognosis, and effective genetic counseling. PMID:28867931

  13. A Rapid, High-Quality, Cost-Effective, Comprehensive and Expandable Targeted Next-Generation Sequencing Assay for Inherited Heart Diseases.

    PubMed

    Wilson, Kitchener D; Shen, Peidong; Fung, Eula; Karakikes, Ioannis; Zhang, Angela; InanlooRahatloo, Kolsoum; Odegaard, Justin; Sallam, Karim; Davis, Ronald W; Lui, George K; Ashley, Euan A; Scharfe, Curt; Wu, Joseph C

    2015-09-11

    Thousands of mutations across >50 genes have been implicated in inherited cardiomyopathies. However, options for sequencing this rapidly evolving gene set are limited because many sequencing services and off-the-shelf kits suffer from slow turnaround, inefficient capture of genomic DNA, and high cost. Furthermore, customization of these assays to cover emerging targets that suit individual needs is often expensive and time consuming. We sought to develop a custom high throughput, clinical-grade next-generation sequencing assay for detecting cardiac disease gene mutations with improved accuracy, flexibility, turnaround, and cost. We used double-stranded probes (complementary long padlock probes), an inexpensive and customizable capture technology, to efficiently capture and amplify the entire coding region and flanking intronic and regulatory sequences of 88 genes and 40 microRNAs associated with inherited cardiomyopathies, congenital heart disease, and cardiac development. Multiplexing 11 samples per sequencing run resulted in a mean base pair coverage of 420, of which 97% had >20× coverage and >99% were concordant with known heterozygous single nucleotide polymorphisms. The assay correctly detected germline variants in 24 individuals and revealed several polymorphic regions in miR-499. Total run time was 3 days at an approximate cost of $100 per sample. Accurate, high-throughput detection of mutations across numerous cardiac genes is achievable with complementary long padlock probe technology. Moreover, this format allows facile insertion of additional probes as more cardiomyopathy and congenital heart disease genes are discovered, giving researchers a powerful new tool for DNA mutation detection and discovery. © 2015 American Heart Association, Inc.

  14. Assessing biosynthetic potential of agricultural groundwater through metagenomic sequencing: A diverse anammox community dominates nitrate-rich groundwater

    PubMed Central

    Applegate, Olin; Li, Xunde; Kliegman, Joseph I.; Langelier, Charles; Atwill, Edward R.; Harter, Thomas; DeRisi, Joseph L.

    2017-01-01

    Background Climate change produces extremes in both temperature and precipitation causing increased drought severity and increased reliance on groundwater resources. Agricultural practices, which rely on groundwater, are sensitive to but also sources of contaminants, including nitrate. How agricultural contamination drives groundwater geochemistry through microbial metabolism is poorly understood. Methods On an active cow dairy in the Central Valley of California, we sampled groundwater from three wells at depths of 4.3 m (two wells) and 100 m (one well) below ground surface (bgs) as well as an effluent surface water lagoon that fertilizes surrounding corn fields. We analyzed the samples for concentrations of solutes, heavy metals, and USDA pathogenic bacteria of the Escherichia coli and Enterococcus groups as part of a long term groundwater monitoring study. Whole metagenome shotgun sequencing and assembly revealed taxonomic composition and metabolic potential of the community. Results Elevated nitrate and dissolved organic carbon occurred at 4.3 m but not at 100 m bgs. Metagenomics confirmed chemical observations and revealed several Planctomycete genomes, including a new Brocadiaceae lineage and a likely Planctomycetes OM190, as well as novel diversity and high abundance of nano-prokaryotes from the Candidate Phyla Radiation (CPR), the Diapherotrites, Parvarchaeota, Aenigmarchaeota, Nanoarchaeota, Nanohaloarchaea (DPANN) and the Thaumarchaeota, Aigarchaeota, Crenarchaeota, Korarchaeota (TACK) superphyla. Pathway analysis suggests community interactions based on complementary primary metabolic pathways and abundant secondary metabolite operons encoding antimicrobials and quorum sensing systems. Conclusions The metagenomes show strong resemblance to activated sludge communities from a nitrogen removal reactor at a wastewater treatment plant, suggesting that natural bioremediation occurs through microbial metabolism. Elevated nitrate and rich secondary metabolite

  15. Targeted gene panel sequencing in children with very early onset inflammatory bowel disease--evaluation and prospective analysis.

    PubMed

    Kammermeier, Jochen; Drury, Suzanne; James, Chela T; Dziubak, Robert; Ocaka, Louise; Elawad, Mamoun; Beales, Philip; Lench, Nicholas; Uhlig, Holm H; Bacchelli, Chiara; Shah, Neil

    2014-11-01

    Multiple monogenetic conditions with partially overlapping phenotypes can present with inflammatory bowel disease (IBD)-like intestinal inflammation. With novel genotype-specific therapies emerging, establishing a molecular diagnosis is becoming increasingly important. We have introduced targeted next-generation sequencing (NGS) technology as a prospective screening tool in children with very early onset IBD (VEOIBD). We evaluated the coverage of 40 VEOIBD genes in two separate cohorts undergoing targeted gene panel sequencing (TGPS) (n=25) and whole exome sequencing (WES) (n=20). TGPS revealed causative mutations in four genes (IL10RA, EPCAM, TTC37 and SKIV2L), discovered unexpected phenotypes, and directly influenced clinical decision making by supporting as well as avoiding haematopoietic stem cell transplantation. TGPS resulted in significantly higher median coverage when compared with WES, fewer coverage deficiencies and improved variant detection across established VEOIBD genes. Excluding or confirming known VEOIBD genotypes should be considered early in the disease course in all cases of therapy-refractory VEOIBD, as it can have a direct impact on patient management. Combining both described NGS technologies would compensate for the limitations of WES for disease-specific application while offering the opportunity for novel gene discovery in the research setting. Published by the BMJ Publishing Group Limited.

  16. Dipole response of neutron-rich Sn isotopes

    NASA Astrophysics Data System (ADS)

    Klimkiewicz, A.; Adrich, P.; Boretzky, K.; Fallot, M.; Aumann, T.; Cortina-Gil, D.; Datta Pramanik, U.; Elze, Th. W.; Emling, H.; Geissel, H.; Hellstroem, M.; Jones, K. L.; Kratz, J. V.; Kulessa, R.; Leifels, Y.; Nociforo, C.; Palit, R.; Simon, H.; Surowka, G.; Sümmerer, K.; Typel, S.; Walus, W.

    2007-05-01

    The neutron-rich isotopes 129-133Sn were studied in a Coulomb excitation experiment at about 500 AMeV using the FRS-LAND setup at GSI. From the exclusive measurement of all projectile-like particles following the excitation and decay of the projectile in a high-Z target, the energy differential cross section can be extracted. At these beam energies dipole transitions are dominating, and within the semi-classical approach the Coulomb excitation cross sections can be transformed into photoabsorption cross sections. In contrast to stable Sn nuclei, a substantial fraction of dipole strength is observed at energies below the giant dipole resonance (GDR). For 130Sn and 132Sn this strength is located in a peak-like structure around 10 MeV excitation energy and exhibits a few percent of the Thomas-Reiche-Kuhn (TRK) sum-rule strength. Several calculations predict the appearance of dipole strength at low excitation energies in neutron-rich nuclei. This low-lying strength is often referred to as pygmy dipole resonance (PDR) and, in a macroscopic picture, is discussed in terms of a collective oscillation of excess neutrons versus the core nucleons. Moreover, a sharp rise is observed at the neutron separation threshold around 5 MeV for the odd isotopes. A possible contribution of 'threshold strength', which can be described within the direct-breakup model, is discussed. The results for the neutron-rich Sn isotopes are confronted with results on stable nuclei investigated in experiments using real photons.

  17. The impact of targeting repetitive BamHI-W sequences on the sensitivity and precision of EBV DNA quantification.

    PubMed

    Sanosyan, Armen; Fayd'herbe de Maudave, Alexis; Bollore, Karine; Zimmermann, Valérie; Foulongne, Vincent; Van de Perre, Philippe; Tuaillon, Edouard

    2017-01-01

    Viral load monitoring and early Epstein-Barr virus (EBV) DNA detection are essential in routine laboratory testing, especially in preemptive management of Post-transplant Lymphoproliferative Disorder. Targeting the repetitive BamHI-W sequence was shown to increase the sensitivity of EBV DNA quantification, but the variability of BamHI-W reiterations was suggested to be a source of quantification bias. We aimed to assess the extent of variability associated with BamHI-W PCR and its impact on the sensitivity of EBV DNA quantification using the 1st WHO international standard, EBV strains and clinical samples. Repetitive BamHI-W- and LMP2 single- sequences were amplified by in-house qPCRs and BXLF-1 sequence by a commercial assay (EBV R-gene™, BioMerieux). Linearity and limits of detection of in-house methods were assessed. The impact of repeated versus single target sequences on EBV DNA quantification precision was tested on B95.8 and Raji cell lines, possessing 11 and 7 copies of the BamHI-W sequence, respectively, and on clinical samples. BamHI-W qPCR demonstrated a lower limit of detection compared to LMP2 qPCR (2.33 log10 versus 3.08 log10 IU/mL; P = 0.0002). BamHI-W qPCR underestimated the EBV DNA load on Raji strain which contained fewer BamHI-W copies than the WHO standard derived from the B95.8 EBV strain (mean bias: -0.21 log10; 95% CI, -0.54 to 0.12). Comparison of BamHI-W qPCR versus LMP2 and BXLF-1 qPCR showed an acceptable variability between EBV DNA levels in clinical samples with the mean bias being within 0.5 log10 IU/mL EBV DNA, whereas a better quantitative concordance was observed between LMP2 and BXLF-1 assays. Targeting BamHI-W resulted in a higher sensitivity compared to LMP2 but the variable reiterations of BamHI-W segment are associated with higher quantification variability. BamHI-W can be considered for clinical and therapeutic monitoring to detect EBV DNA early and to follow dynamic changes in viral load.

  18. The impact of targeting repetitive BamHI-W sequences on the sensitivity and precision of EBV DNA quantification

    PubMed Central

    Fayd’herbe de Maudave, Alexis; Bollore, Karine; Zimmermann, Valérie; Foulongne, Vincent; Van de Perre, Philippe; Tuaillon, Edouard

    2017-01-01

    Background Viral load monitoring and early Epstein-Barr virus (EBV) DNA detection are essential in routine laboratory testing, especially in preemptive management of Post-transplant Lymphoproliferative Disorder. Targeting the repetitive BamHI-W sequence was shown to increase the sensitivity of EBV DNA quantification, but the variability of BamHI-W reiterations was suggested to be a source of quantification bias. We aimed to assess the extent of variability associated with BamHI-W PCR and its impact on the sensitivity of EBV DNA quantification using the 1st WHO international standard, EBV strains and clinical samples. Methods Repetitive BamHI-W- and LMP2 single- sequences were amplified by in-house qPCRs and BXLF-1 sequence by a commercial assay (EBV R-gene™, BioMerieux). Linearity and limits of detection of in-house methods were assessed. The impact of repeated versus single target sequences on EBV DNA quantification precision was tested on B95.8 and Raji cell lines, possessing 11 and 7 copies of the BamHI-W sequence, respectively, and on clinical samples. Results BamHI-W qPCR demonstrated a lower limit of detection compared to LMP2 qPCR (2.33 log10 versus 3.08 log10 IU/mL; P = 0.0002). BamHI-W qPCR underestimated the EBV DNA load on Raji strain which contained fewer BamHI-W copies than the WHO standard derived from the B95.8 EBV strain (mean bias: -0.21 log10; 95% CI, -0.54 to 0.12). Comparison of BamHI-W qPCR versus LMP2 and BXLF-1 qPCR showed an acceptable variability between EBV DNA levels in clinical samples with the mean bias being within 0.5 log10 IU/mL EBV DNA, whereas a better quantitative concordance was observed between LMP2 and BXLF-1 assays. Conclusions Targeting BamHI-W resulted in a higher sensitivity compared to LMP2 but the variable reiterations of BamHI-W segment are associated with higher quantification variability. BamHI-W can be considered for clinical and therapeutic monitoring to detect EBV DNA early and to follow dynamic changes in viral load.

  19. Targeted delayed scanning at CT urography: a worthwhile use of radiation?

    PubMed

    Hack, Kalesha; Pinto, Patricia A; Gollub, Marc J

    2012-10-01

    . Estimated radiation dose from additional sequences was 4.3 mSv per patient. Targeted delayed scanning at CT urography yielded no additional ureteral tumors and resulted in additional radiation exposure. © RSNA, 2012.

  20. TARGETED CAPTURE IN EVOLUTIONARY AND ECOLOGICAL GENOMICS

    PubMed Central

    Jones, Matthew R.; Good, Jeffrey M.

    2016-01-01

    The rapid expansion of next-generation sequencing has yielded a powerful array of tools to address fundamental biological questions at a scale that was inconceivable just a few years ago. Various genome partitioning strategies to sequence select subsets of the genome have emerged as powerful alternatives to whole genome sequencing in ecological and evolutionary genomic studies. High throughput targeted capture is one such strategy that involves the parallel enrichment of pre-selected genomic regions of interest. The growing use of targeted capture demonstrates its potential power to address a range of research questions, yet these approaches have yet to expand broadly across labs focused on evolutionary and ecological genomics. In part, the use of targeted capture has been hindered by the logistics of capture design and implementation in species without established reference genomes. Here we aim to 1) increase the accessibility of targeted capture to researchers working in non-model taxa by discussing capture methods that circumvent the need of a reference genome, 2) highlight the evolutionary and ecological applications where this approach is emerging as a powerful sequencing strategy, and 3) discuss the future of targeted capture and other genome partitioning approaches in light of the increasing accessibility of whole genome sequencing. Given the practical advantages and increasing feasibility of high-throughput targeted capture, we anticipate an ongoing expansion of capture-based approaches in evolutionary and ecological research, synergistic with an expansion of whole genome sequencing. PMID:26137993

  1. Phylogenetically Structured Differences in rRNA Gene Sequence Variation among Species of Arbuscular Mycorrhizal Fungi and Their Implications for Sequence Clustering

    PubMed Central

    Ekanayake, Saliya; Ruan, Yang; Schütte, Ursel M. E.; Kaonongbua, Wittaya; Fox, Geoffrey; Ye, Yuzhen; Bever, James D.

    2016-01-01

    ABSTRACT Arbuscular mycorrhizal (AM) fungi form mutualisms with plant roots that increase plant growth and shape plant communities. Each AM fungal cell contains a large amount of genetic diversity, but it is unclear if this diversity varies across evolutionary lineages. We found that sequence variation in the nuclear large-subunit (LSU) rRNA gene from 29 isolates representing 21 AM fungal species generally assorted into genus- and species-level clades, with the exception of species of the genera Claroideoglomus and Entrophospora. However, there were significant differences in the levels of sequence variation across the phylogeny and between genera, indicating that it is an evolutionarily constrained trait in AM fungi. These consistent patterns of sequence variation across both phylogenetic and taxonomic groups pose challenges to interpreting operational taxonomic units (OTUs) as approximations of species-level groups of AM fungi. We demonstrate that the OTUs produced by five sequence clustering methods using 97% or equivalent sequence similarity thresholds failed to match the expected species of AM fungi, although OTUs from AbundantOTU, CD-HIT-OTU, and CROP corresponded better to species than did OTUs from mothur or UPARSE. This lack of OTU-to-species correspondence resulted both from sequences of one species being split into multiple OTUs and from sequences of multiple species being lumped into the same OTU. The OTU richness therefore will not reliably correspond to the AM fungal species richness in environmental samples. Conservatively, this error can overestimate species richness by 4-fold or underestimate richness by one-half, and the direction of this error will depend on the genera represented in the sample. IMPORTANCE Arbuscular mycorrhizal (AM) fungi form important mutualisms with the roots of most plant species. Individual AM fungi are genetically diverse, but it is unclear whether the level of this diversity differs among evolutionary lineages. We found

  2. Clinical Validation and Implementation of a Targeted Next-Generation Sequencing Assay to Detect Somatic Variants in Non-Small Cell Lung, Melanoma, and Gastrointestinal Malignancies

    PubMed Central

    Fisher, Kevin E.; Zhang, Linsheng; Wang, Jason; Smith, Geoffrey H.; Newman, Scott; Schneider, Thomas M.; Pillai, Rathi N.; Kudchadkar, Ragini R.; Owonikoko, Taofeek K.; Ramalingam, Suresh S.; Lawson, David H.; Delman, Keith A.; El-Rayes, Bassel F.; Wilson, Malania M.; Sullivan, H. Clifford; Morrison, Annie S.; Balci, Serdar; Adsay, N. Volkan; Gal, Anthony A.; Sica, Gabriel L.; Saxe, Debra F.; Mann, Karen P.; Hill, Charles E.; Khuri, Fadlo R.; Rossi, Michael R.

    2017-01-01

    We tested and clinically validated a targeted next-generation sequencing (NGS) mutation panel using 80 formalin-fixed, paraffin-embedded (FFPE) tumor samples. Forty non-small cell lung carcinoma (NSCLC), 30 melanoma, and 30 gastrointestinal (12 colonic, 10 gastric, and 8 pancreatic adenocarcinoma) FFPE samples were selected from laboratory archives. After appropriate specimen and nucleic acid quality control, 80 NGS libraries were prepared using the Illumina TruSight tumor (TST) kit and sequenced on the Illumina MiSeq. Sequence alignment, variant calling, and sequencing quality control were performed using vendor software and laboratory-developed analysis workflows. TST generated ≥500× coverage for 98.4% of the 13,952 targeted bases. Reproducible and accurate variant calling was achieved at ≥5% variant allele frequency with 8 to 12 multiplexed samples per MiSeq flow cell. TST detected 112 variants overall, and confirmed all known single-nucleotide variants (n = 27), deletions (n = 5), insertions (n = 3), and multinucleotide variants (n = 3). TST detected at least one variant in 85.0% (68/80), and two or more variants in 36.2% (29/80), of samples. TP53 was the most frequently mutated gene in NSCLC (13 variants; 13/32 samples), gastrointestinal malignancies (15 variants; 13/25 samples), and overall (30 variants; 28/80 samples). BRAF mutations were most common in melanoma (nine variants; 9/23 samples). Clinically relevant NGS data can be obtained from routine clinical FFPE solid tumor specimens using TST, benchtop instruments, and vendor-supplied bioinformatics pipelines. PMID:26801070

  3. Liquid Hydrogen Target Experience at SLAC

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Weisend, J.G.; Boyce, R.; Candia, A.

    2005-08-29

    Liquid hydrogen targets have played a vital role in the physics program at SLAC for the past 40 years. These targets have ranged from small "beer can" targets to the 1.5 m long E158 target that was capable of absorbing up to 800 W without any significant density changes. Successful use of these targets has required the development of thin wall designs, liquid hydrogen pumps, remote positioning and alignment systems, safety systems, control and data acquisition systems, cryogenic cooling circuits and heat exchangers. Detailed operating procedures have been created to ensure safety and operational reliability. This paper surveys the evolution of liquid hydrogen targets at SLAC and discusses advances in several of the enabling technologies that made these targets possible.

  4. Carbon stars with oxygen-rich circumstellar material

    NASA Technical Reports Server (NTRS)

    Jura, Michael; Hawkins, I.

    1991-01-01

    The IUE satellite was used to search for companions to two carbon-rich stars with oxygen-rich circumstellar envelopes, EU And and V778 Cyg. Depending upon the amount of interstellar extinction and the distances to these two stars (probably between 1 and 2 kpc from the Sun), upper limits of between approximately 1.5 and 6 solar masses were placed on the mass of any main-sequence companions. For the 'near' distance of 1 kpc, it seems unlikely that there are white dwarf companions, because ultraviolet emission from accretion of red giant wind material onto the white dwarf would be expected to be detected. A new model is proposed to explain the oxygen-rich envelopes. If these stars have a high nitrogen abundance, the carbon that is in excess of the oxygen may be carried in the circumstellar envelopes in HCN rather than C2H2, which is a likely key seed molecule for the formation of carbon grains. Consequently, carbon particles may not form; instead, oxygen-rich silicate dust may nucleate from the SiO present in the outflow.

  5. Different modes of interaction by TIAR and HuR with target RNA and DNA

    PubMed Central

    Kim, Henry S.; Wilce, Matthew C. J.; Yoga, Yano M. K.; Pendini, Nicole R.; Gunzburg, Menachem J.; Cowieson, Nathan P.; Wilson, Gerald M.; Williams, Bryan R. G.; Gorospe, Myriam; Wilce, Jacqueline A.

    2011-01-01

    TIAR and HuR are mRNA-binding proteins that play important roles in the regulation of translation. They both possess three RNA recognition motifs (RRMs) and bind to AU-rich elements (AREs), with seemingly overlapping specificity. Here we show using SPR that TIAR and HuR bind to both U-rich and AU-rich RNA in the nanomolar range, with higher overall affinity for U-rich RNA. However, the higher affinity for U-rich sequences is mainly due to faster association with U-rich RNA, which we propose is a reflection of the higher probability of association. Differences between TIAR and HuR are observed in their modes of binding to RNA. TIAR is able to bind deoxy-oligonucleotides with nanomolar affinity, whereas HuR affinity is reduced to a micromolar level. Studies with U-rich DNA reveal that TIAR binding depends less on the 2′-hydroxyl group of RNA than HuR binding. Finally we show that SAXS data, recorded for the first two domains of TIAR in complex with RNA, are more consistent with a flexible, elongated shape and not the compact shape that the first two domains of Hu proteins adopt upon binding to RNA. We thus propose that these triple-RRM proteins, which compete for the same binding sites in cells, interact with their targets in fundamentally different ways. PMID:21233170

  6. Different modes of interaction by TIAR and HuR with target RNA and DNA.

    PubMed

    Kim, Henry S; Wilce, Matthew C J; Yoga, Yano M K; Pendini, Nicole R; Gunzburg, Menachem J; Cowieson, Nathan P; Wilson, Gerald M; Williams, Bryan R G; Gorospe, Myriam; Wilce, Jacqueline A

    2011-02-01

    TIAR and HuR are mRNA-binding proteins that play important roles in the regulation of translation. They both possess three RNA recognition motifs (RRMs) and bind to AU-rich elements (AREs), with seemingly overlapping specificity. Here we show using SPR that TIAR and HuR bind to both U-rich and AU-rich RNA in the nanomolar range, with higher overall affinity for U-rich RNA. However, the higher affinity for U-rich sequences is mainly due to faster association with U-rich RNA, which we propose is a reflection of the higher probability of association. Differences between TIAR and HuR are observed in their modes of binding to RNA. TIAR is able to bind deoxy-oligonucleotides with nanomolar affinity, whereas HuR affinity is reduced to a micromolar level. Studies with U-rich DNA reveal that TIAR binding depends less on the 2'-hydroxyl group of RNA than HuR binding. Finally we show that SAXS data, recorded for the first two domains of TIAR in complex with RNA, are more consistent with a flexible, elongated shape and not the compact shape that the first two domains of Hu proteins adopt upon binding to RNA. We thus propose that these triple-RRM proteins, which compete for the same binding sites in cells, interact with their targets in fundamentally different ways.

  7. Origin of spinel-rich chondrules and inclusions in carbonaceous and ordinary chondrites

    NASA Technical Reports Server (NTRS)

    Kornacki, A. S.; Fegley, B., Jr.

    1984-01-01

    The evaluation of three models of the origin of spinel-rich chondrules and inclusions presented here includes new calculations of the major-element refractory mineral condensation sequence from a gas of solar composition over a wide pressure interval. Condensation calculations show that spinel-rich chondrules did not crystallize from metastable liquid condensates, and that spinel-rich inclusions are not aggregates of refractory nebular condensates. It is proposed that spinel-rich objects are fractionated distillation residues of small aggregates of primitive dust that lost Ca, Si-rich partial melts by evaporation, ablation, or splashing during collisions. This model also explains why spinel-rich chondrules and inclusions (1) are usually smaller than melilite-rich chondrules and inclusions; (2) often have highly fractionated trace-element compositions; and (3) usually do not contain Pt-metal nuggets even when they are more enriched in the Pt-group metals than nugget-bearing melilite-rich objects.

  8. Genomic Characterization of Non–Small-Cell Lung Cancer in African Americans by Targeted Massively Parallel Sequencing

    PubMed Central

    Araujo, Luiz H.; Timmers, Cynthia; Bell, Erica Hlavin; Shilo, Konstantin; Lammers, Philip E.; Zhao, Weiqiang; Natarajan, Thanemozhi G.; Miller, Clinton J.; Zhang, Jianying; Yilmaz, Ayse S.; Liu, Tom; Coombes, Kevin; Amann, Joseph; Carbone, David P.

    2015-01-01

    Purpose Technologic advances have enabled the comprehensive analysis of genetic perturbations in non–small-cell lung cancer (NSCLC); however, African Americans have often been underrepresented in these studies. This ethnic group has higher lung cancer incidence and mortality rates, and some studies have suggested a lower incidence of epidermal growth factor receptor mutations. Herein, we report the most in-depth molecular profile of NSCLC in African Americans to date. Methods A custom panel was designed to cover the coding regions of 81 NSCLC-related genes and 40 ancestry-informative markers. Clinical samples were sequenced on a massively parallel sequencing instrument, and anaplastic lymphoma kinase translocation was evaluated by fluorescent in situ hybridization. Results The study cohort included 99 patients (61% males, 94% smokers) comprising 31 squamous and 68 nonsquamous cell carcinomas. We detected 227 nonsilent variants in the coding sequence, including 24 samples with nonoverlapping, classic driver alterations. The frequency of driver mutations was not significantly different from that of whites, and no association was found between genetic ancestry and the presence of somatic mutations. Copy number alteration analysis disclosed distinguishable amplifications in the 3q chromosome arm in squamous cell carcinomas and pointed toward a handful of targetable alterations. We also found frequent SMARCA4 mutations and protein loss, mostly in driver-negative tumors. Conclusion Our data suggest that African American ancestry may not be significantly different from European/white background for the presence of somatic driver mutations in NSCLC. Furthermore, we demonstrated that using a comprehensive genotyping approach could identify numerous targetable alterations, with potential impact on therapeutic decisions. PMID:25918285

  9. Fungal diversity in grape must and wine fermentation assessed by massive sequencing, quantitative PCR and DGGE

    PubMed Central

    Wang, Chunxiao; García-Fernández, David; Mas, Albert; Esteve-Zarzoso, Braulio

    2015-01-01

    The diversity of fungi in grape must and during wine fermentation was investigated in this study by culture-dependent and culture-independent techniques. Carignan and Grenache grapes were harvested from three vineyards in the Priorat region (Spain) in 2012, and nine samples were selected from the grape must after crushing and during wine fermentation. From culture-dependent techniques, 362 isolates were randomly selected and identified by 5.8S-ITS-RFLP and 26S-D1/D2 sequencing. Meanwhile, genomic DNA was extracted directly from the nine samples and analyzed by qPCR, DGGE and massive sequencing. The results indicated that grape must after crushing harbored a high species richness of fungi with Aspergillus tubingensis, Aureobasidium pullulans, or Starmerella bacillaris as the dominant species. As fermentation proceeded, the species richness decreased, and yeasts such as Hanseniaspora uvarum, Starmerella bacillaris and Saccharomyces cerevisiae successively occupied the must samples. The “terroir” characteristics of the fungus population are more related to the location of the vineyard than to grape variety. Sulfur dioxide treatment caused a low effect on yeast diversity by similarity analysis. Because of the existence of large population of fungi on grape berries, massive sequencing was more appropriate to understand the fungal community in grape must after crushing than the other techniques used in this study. Suitable target sequences and databases were necessary for accurate evaluation of the community and the identification of species by the 454 pyrosequencing of amplicons. PMID:26557110

  10. Direct evidence for sequence-dependent attraction between double-stranded DNA controlled by methylation.

    PubMed

    Yoo, Jejoong; Kim, Hajin; Aksimentiev, Aleksei; Ha, Taekjip

    2016-03-22

    Although proteins mediate highly ordered DNA organization in vivo, theoretical studies suggest that homologous DNA duplexes can preferentially associate with one another even in the absence of proteins. Here we combine molecular dynamics simulations with single-molecule fluorescence resonance energy transfer experiments to examine the interactions between duplex DNA in the presence of spermine, a biological polycation. We find that AT-rich DNA duplexes associate more strongly than GC-rich duplexes, regardless of the sequence homology. Methyl groups of thymine act as a steric block, relocating spermine from major grooves to interhelical regions, thereby increasing DNA-DNA attraction. Indeed, methylation of cytosines makes attraction between GC-rich DNA as strong as that between AT-rich DNA. Recent genome-wide chromosome organization studies showed that remote contact frequencies are higher for AT-rich and methylated DNA, suggesting that direct DNA-DNA interactions that we report here may play a role in the chromosome organization and gene regulation.

  11. Direct evidence for sequence-dependent attraction between double-stranded DNA controlled by methylation

    NASA Astrophysics Data System (ADS)

    Yoo, Jejoong; Kim, Hajin; Aksimentiev, Aleksei; Ha, Taekjip

    2016-03-01

    Although proteins mediate highly ordered DNA organization in vivo, theoretical studies suggest that homologous DNA duplexes can preferentially associate with one another even in the absence of proteins. Here we combine molecular dynamics simulations with single-molecule fluorescence resonance energy transfer experiments to examine the interactions between duplex DNA in the presence of spermine, a biological polycation. We find that AT-rich DNA duplexes associate more strongly than GC-rich duplexes, regardless of the sequence homology. Methyl groups of thymine act as a steric block, relocating spermine from major grooves to interhelical regions, thereby increasing DNA-DNA attraction. Indeed, methylation of cytosines makes attraction between GC-rich DNA as strong as that between AT-rich DNA. Recent genome-wide chromosome organization studies showed that remote contact frequencies are higher for AT-rich and methylated DNA, suggesting that direct DNA-DNA interactions that we report here may play a role in the chromosome organization and gene regulation.

  12. Identification of MicroRNAs and their Targets Associated with Embryo Abortion during Chrysanthemum Cross Breeding via High-Throughput Sequencing.

    PubMed

    Zhang, Fengjiao; Dong, Wen; Huang, Lulu; Song, Aiping; Wang, Haibin; Fang, Weimin; Chen, Fadi; Teng, Nianjun

    2015-01-01

    MicroRNAs (miRNAs) are important regulators in plant development. They post-transcriptionally regulate gene expression during various biological and metabolic processes by binding to the 3'-untranslated region of target mRNAs to facilitate mRNA degradation or inhibit translation. Chrysanthemum (Chrysanthemum morifolium) is one of the most important ornamental flowers with increasing demand each year. However, embryo abortion is the main reason for chrysanthemum cross breeding failure. To date, there have been no experiments examining the expression of miRNAs associated with chrysanthemum embryo development. Therefore, we sequenced three small RNA libraries to identify miRNAs and their functions. Our results will provide molecular insights into chrysanthemum embryo abortion. Three small RNA libraries were built from normal chrysanthemum ovules at 12 days after pollination (DAP), and normal and abnormal chrysanthemum ovules at 18 DAP. We validated 228 miRNAs with significant changes in expression frequency during embryonic development. Comparative profiling revealed that 69 miRNAs exhibited significant differential expression between normal and abnormal embryos at 18 DAP. In addition, a total of 1037 miRNA target genes were predicted, and their annotations were defined by transcriptome data. Target genes associated with metabolic pathways were most highly represented according to the annotation. Moreover, 52 predicted target genes were identified to be associated with embryonic development, including 31 transcription factors and 21 additional genes. Gene ontology (GO) annotation also revealed that high-ranking miRNA target genes related to cellular processes and metabolic processes were involved in transcription regulation and the embryo developmental process. The present study generated three miRNA libraries and gained information on miRNAs and their targets in the chrysanthemum embryo. These results enrich the growing database of new miRNAs and lay the foundation

  13. Spectroscopy of neutron-rich nuclei at REX-ISOLDE with MINIBALL

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Kroell, Th.

    2007-08-15

    We report on 'safe' Coulomb excitation of neutron-rich nuclei. The radioactive nuclei have been produced by ISOLDE at CERN and postaccelerated by the REX-ISOLDE facility. The γ rays emitted by the decay of excited states have been detected by the MINIBALL array. Recent results are presented and compared to theoretical models.

  14. Characterisation of secretory calcium-binding phosphoprotein-proline-glutamine-rich 1: a novel basal lamina component expressed at cell-tooth interfaces.

    PubMed

    Moffatt, Pierre; Wazen, Rima M; Dos Santos Neves, Juliana; Nanci, Antonio

    2014-12-01

    Functional genomic screening of the rat enamel organ (EO) has led to the identification of a number of secreted proteins expressed during the maturation stage of amelogenesis, including amelotin (AMTN) and odontogenic ameloblast-associated (ODAM). In this study, we characterise the gene, protein and pattern of expression of a related protein called secretory calcium-binding phosphoprotein-proline-glutamine-rich 1 (SCPPPQ1). The Scpppq1 gene resides within the secretory calcium-binding phosphoprotein (Scpp) cluster. SCPPPQ1 is a highly conserved, 75-residue, secreted protein rich in proline, leucine, glutamine and phenylalanine. In silico data mining has revealed no correlation to any known sequences. Northern blotting of various rat tissues suggests that the expression of Scpppq1 is restricted to tooth and associated tissues. Immunohistochemical analyses show that the protein is expressed during the late maturation stage of amelogenesis and in the junctional epithelium where it localises to an atypical basal lamina at the cell-tooth interface. This discrete localisation suggests that SCPPPQ1, together with AMTN and ODAM, participates in structuring the basal lamina and in mediating attachment of epithelia cells to mineralised tooth surfaces.

  15. Silurian sequence stratigraphy in the North American craton, Great Lakes area

    USGS Publications Warehouse

    Shaver, R.H.

    1996-01-01

    A notable circumstance of late Early through Late Silurian sedimentation on the Great Lakes area craton is that at least two and possibly three cycles of third-order duration (if eustatically considered) are recognized in basin and shallow-platform settings alike. Both virtually pure and siliciclastic-rich carbonate rocks exist in parts of platform-situated sections in contrast to siliciclastic-rich to evaporite-dominated basin sections. Knowledge of the reef history, together with evidence of incidental periodic incursions of siliciclastic sediments, permitted understanding of a regional event or sequence stratigraphy more than 15 years ago before conventional biostratigraphic and physical stratigraphic evidence became adequate to corroborate. This midwestern US and Ontario Silurian record has become strategic for testing different schools of thought that champion either tectonism or eustasy to explain cyclical sequences.

  16. Bioinformatics Pipelines for Targeted Resequencing and Whole-Exome Sequencing of Human and Mouse Genomes: A Virtual Appliance Approach for Instant Deployment

    PubMed Central

    Saeed, Isaam; Wong, Stephen Q.; Mar, Victoria; Goode, David L.; Caramia, Franco; Doig, Ken; Ryland, Georgina L.; Thompson, Ella R.; Hunter, Sally M.; Halgamuge, Saman K.; Ellul, Jason; Dobrovic, Alexander; Campbell, Ian G.; Papenfuss, Anthony T.; McArthur, Grant A.; Tothill, Richard W.

    2014-01-01

    Targeted resequencing by massively parallel sequencing has become an effective and affordable way to survey small to large portions of the genome for genetic variation. Despite the rapid development in open source software for analysis of such data, the practical implementation of these tools through construction of sequencing analysis pipelines still remains a challenging and laborious activity, and a major hurdle for many small research and clinical laboratories. We developed TREVA (Targeted REsequencing Virtual Appliance), making pre-built pipelines immediately available as a virtual appliance. Based on virtual machine technologies, TREVA is a solution for rapid and efficient deployment of complex bioinformatics pipelines to laboratories of all sizes, enabling reproducible results. The analyses that are supported in TREVA include: somatic and germline single-nucleotide and insertion/deletion variant calling, copy number analysis, and cohort-based analyses such as pathway and significantly mutated genes analyses. TREVA is flexible and easy to use, and can be customised by Linux-based extensions if required. TREVA can also be deployed on the cloud (cloud computing), enabling instant access without investment overheads for additional hardware. TREVA is available at http://bioinformatics.petermac.org/treva/. PMID:24752294

  17. Spectrum of mutations in leiomyosarcomas identified by clinical targeted next-generation sequencing.

    PubMed

    Lee, Paul J; Yoo, Naomi S; Hagemann, Ian S; Pfeifer, John D; Cottrell, Catherine E; Abel, Haley J; Duncavage, Eric J

    2017-02-01

    Recurrent genomic mutations in uterine and non-uterine leiomyosarcomas have not been well established. Using a next generation sequencing (NGS) panel of common cancer-associated genes, 25 leiomyosarcomas arising from multiple sites were examined to explore genetic alterations, including single nucleotide variants (SNV), small insertions/deletions (indels), and copy number alterations (CNA). Sequencing showed 86 non-synonymous, coding region somatic variants within 151 gene targets in 21 cases, with a mean of 4.1 variants per case; 4 cases had no putative mutations in the panel of genes assayed. The most frequently altered genes were TP53 (36%), ATM and ATRX (16%), and EGFR and RB1 (12%). CNA were identified in 85% of cases, with the most frequent copy number losses observed in chromosomes 10 and 13 including PTEN and RB1; the most frequent gains were seen in chromosomes 7 and 17. Our data show that deletions in canonical cancer-related genes are common in leiomyosarcomas. Further, the spectrum of gene mutations observed shows that defects in DNA repair and chromosomal maintenance are central to the biology of leiomyosarcomas, and that activating mutations observed in other common cancer types are rare in leiomyosarcomas.
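
    The per-case figures quoted in this abstract can be cross-checked with a few lines of arithmetic. This is only a consistency sketch; the variable names are illustrative, and it assumes the mean of 4.1 is taken over the 21 cases that carried variants rather than over all 25.

```python
# Counts taken from the abstract above.
total_cases = 25          # leiomyosarcomas examined
cases_with_variants = 21  # cases with at least one somatic variant
variants = 86             # non-synonymous coding variants found

# Assumption: the reported mean is per variant-carrying case.
mean_per_case = variants / cases_with_variants
cases_without_variants = total_cases - cases_with_variants

print(round(mean_per_case, 1), cases_without_variants)
```

    Under that assumption the result matches the reported mean of 4.1 variants per case and the 4 cases without putative mutations.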

  18. Less is more in mammalian phylogenomics: AT-rich genes minimize tree conflicts and unravel the root of placental mammals.

    PubMed

    Romiguier, Jonathan; Ranwez, Vincent; Delsuc, Frédéric; Galtier, Nicolas; Douzery, Emmanuel J P

    2013-09-01

    Despite the rapid increase of size in phylogenomic data sets, a number of important nodes on animal phylogeny are still unresolved. Among these, the rooting of the placental mammal tree is still a controversial issue. One difficulty lies in the pervasive phylogenetic conflicts among genes, with each one telling its own story, which may be reliable or not. Here, we identified a simple criterion, that is, the GC content, which substantially helps in determining which gene trees best reflect the species tree. We assessed the ability of 13,111 coding sequence alignments to correctly reconstruct the placental phylogeny. We found that GC-rich genes induced a higher amount of conflict among gene trees and performed worse than AT-rich genes in retrieving well-supported, consensual nodes on the placental tree. We interpret this GC effect mainly as a consequence of genome-wide variations in recombination rate. Indeed, recombination is known to drive GC-content evolution through GC-biased gene conversion and might be problematic for phylogenetic reconstruction, for instance, in an incomplete lineage sorting context. When we focused on the AT-richest fraction of the data set, the resolution level of the placental phylogeny was greatly increased, and a strong support was obtained in favor of an Afrotheria rooting, that is, Afrotheria as the sister group of all other placentals. We show that in mammals most conflicts among gene trees, which have so far hampered the resolution of the placental tree, are concentrated in the GC-rich regions of the genome. We argue that the GC content, because it is a reliable indicator of the long-term recombination rate, is an informative criterion that could help in identifying the most reliable molecular markers for species tree inference.
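
    The selection criterion described above, ranking genes by GC content and keeping the AT-richest fraction, can be sketched as follows. This is a minimal illustration and not the authors' pipeline: the function names, the toy alignments and the 50% cutoff are all assumptions made for the example, and the abstract does not say how GC content was computed (for instance, over which codon positions).

```python
def gc_content(seq: str) -> float:
    """Fraction of G and C among unambiguous bases (A, C, G, T) in seq."""
    seq = seq.upper()
    gc = seq.count("G") + seq.count("C")
    at = seq.count("A") + seq.count("T")
    total = gc + at
    return gc / total if total else 0.0


def at_richest(alignments: dict[str, list[str]], fraction: float = 0.25) -> list[str]:
    """Return names of the AT-richest `fraction` of alignments (lowest mean GC first)."""
    mean_gc = {
        name: sum(gc_content(s) for s in seqs) / len(seqs)
        for name, seqs in alignments.items()
        if seqs
    }
    ranked = sorted(mean_gc, key=mean_gc.get)  # ascending GC = AT-richest first
    keep = max(1, round(len(ranked) * fraction))
    return ranked[:keep]


# Hypothetical toy data: gene name -> aligned sequences (gaps are ignored).
toy = {
    "geneA": ["ATATATGC", "ATATTTGC"],
    "geneB": ["GCGCGCAT", "GCGGGCAT"],
    "geneC": ["ATGCATGC", "ATGCATGC"],
    "geneD": ["AAAATTTT", "AAAATTTA"],
}
selected = at_richest(toy, fraction=0.5)
print(selected)
```

    In a real analysis the retained alignments would then be passed to tree inference; that step is outside the scope of this sketch.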

  19. Age effects on discrimination of timing in auditory sequences

    NASA Astrophysics Data System (ADS)

    Fitzgibbons, Peter J.; Gordon-Salant, Sandra

    2004-08-01

    The experiments examined age-related changes in temporal sensitivity to increments in the interonset intervals (IOI) of components in tonal sequences. Discrimination was examined using reference sequences consisting of five 50-ms tones separated by silent intervals; tone frequencies were either fixed at 4 kHz or varied within a 2-4-kHz range to produce spectrally complex patterns. The tonal IOIs within the reference sequences were either equal (200 or 600 ms) or varied individually with an average value of 200 or 600 ms to produce temporally complex patterns. The difference limen (DL) for increments of IOI was measured. Comparison sequences featured either equal increments in all tonal IOIs or increments in a single target IOI, with the sequential location of the target changing randomly across trials. Four groups of younger and older adults with and without sensorineural hearing loss participated. Results indicated that DLs for uniform changes of sequence rate were smaller than DLs for single target intervals, with the largest DLs observed for single targets embedded within temporally complex sequences. Older listeners performed more poorly than younger listeners in all conditions, but the largest age-related differences were observed for temporally complex stimulus conditions. No systematic effects of hearing loss were observed.

  20. Resistance gene enrichment sequencing (RenSeq) enables reannotation of the NB-LRR gene family from sequenced plant genomes and rapid mapping of resistance loci in segregating populations

    PubMed Central

    Jupe, Florian; Witek, Kamil; Verweij, Walter; Śliwka, Jadwiga; Pritchard, Leighton; Etherington, Graham J; Maclean, Dan; Cock, Peter J; Leggett, Richard M; Bryan, Glenn J; Cardle, Linda; Hein, Ingo; Jones, Jonathan DG

    2013-01-01

    Summary RenSeq is a NB-LRR (nucleotide binding-site leucine-rich repeat) gene-targeted, Resistance gene enrichment and sequencing method that enables discovery and annotation of pathogen resistance gene family members in plant genome sequences. We successfully applied RenSeq to the sequenced potato Solanum tuberosum clone DM, and increased the number of identified NB-LRRs from 438 to 755. The majority of these identified R gene loci reside in poorly or previously unannotated regions of the genome. Sequence and positional details on the 12 chromosomes have been established for 704 NB-LRRs and can be accessed through a genome browser that we provide. We compared these NB-LRR genes and the corresponding oligonucleotide baits with the highest sequence similarity and demonstrated that ∼80% sequence identity is sufficient for enrichment. Analysis of the sequenced tomato S. lycopersicum ‘Heinz 1706’ extended the NB-LRR complement to 394 loci. We further describe a methodology that applies RenSeq to rapidly identify molecular markers that co-segregate with a pathogen resistance trait of interest. In two independent segregating populations involving the wild Solanum species S. berthaultii (Rpi-ber2) and S. ruiz-ceballosii (Rpi-rzc1), we were able to apply RenSeq successfully to identify markers that co-segregate with resistance towards the late blight pathogen Phytophthora infestans. These SNP identification workflows were designed as easy-to-adapt Galaxy pipelines. PMID:23937694

  1. Efficient strategy for the molecular diagnosis of intellectual disability using targeted high-throughput sequencing

    PubMed Central

    Redin, Claire; Gérard, Bénédicte; Lauer, Julia; Herenger, Yvan; Muller, Jean; Quartier, Angélique; Masurel-Paulet, Alice; Willems, Marjolaine; Lesca, Gaétan; El-Chehadeh, Salima; Le Gras, Stéphanie; Vicaire, Serge; Philipps, Muriel; Dumas, Michaël; Geoffroy, Véronique; Feger, Claire; Haumesser, Nicolas; Alembik, Yves; Barth, Magalie; Bonneau, Dominique; Colin, Estelle; Dollfus, Hélène; Doray, Bérénice; Delrue, Marie-Ange; Drouin-Garraud, Valérie; Flori, Elisabeth; Fradin, Mélanie; Francannet, Christine; Goldenberg, Alice; Lumbroso, Serge; Mathieu-Dramard, Michèle; Martin-Coignard, Dominique; Lacombe, Didier; Morin, Gilles; Polge, Anne; Sukno, Sylvie; Thauvin-Robinet, Christel; Thevenon, Julien; Doco-Fenzy, Martine; Genevieve, David; Sarda, Pierre; Edery, Patrick; Isidor, Bertrand; Jost, Bernard; Olivier-Faivre, Laurence; Mandel, Jean-Louis; Piton, Amélie

    2014-01-01

    Background Intellectual disability (ID) is characterised by an extreme genetic heterogeneity. Several hundred genes have been associated to monogenic forms of ID, considerably complicating molecular diagnostics. Trio-exome sequencing was recently proposed as a diagnostic approach, yet remains costly for a general implementation. Methods We report the alternative strategy of targeted high-throughput sequencing of 217 genes in which mutations had been reported in patients with ID or autism as the major clinical concern. We analysed 106 patients with ID of unknown aetiology following array-CGH analysis and other genetic investigations. Ninety per cent of these patients were males, and 75% sporadic cases. Results We identified 26 causative mutations: 16 in X-linked genes (ATRX, CUL4B, DMD, FMR1, HCFC1, IL1RAPL1, IQSEC2, KDM5C, MAOA, MECP2, SLC9A6, SLC16A2, PHF8) and 10 de novo in autosomal-dominant genes (DYRK1A, GRIN1, MED13L, TCF4, RAI1, SHANK3, SLC2A1, SYNGAP1). We also detected four possibly causative mutations (eg, in NLGN3) requiring further investigations. We present detailed reasoning for assigning causality for each mutation, and associated patients’ clinical information. Some genes were hit more than once in our cohort, suggesting they correspond to more frequent ID-associated conditions (KDM5C, MECP2, DYRK1A, TCF4). We highlight some unexpected genotype to phenotype correlations, with causative mutations being identified in genes associated to defined syndromes in patients deviating from the classic phenotype (DMD, TCF4, MECP2). We also bring additional supportive (HCFC1, MED13L) or unsupportive (SHROOM4, SRPX2) evidences for the implication of previous candidate genes or mutations in cognitive disorders. Conclusions With a diagnostic yield of 25% targeted sequencing appears relevant as a first intention test for the diagnosis of ID, but importantly will also contribute to a better understanding regarding the specific contribution of the many genes

  2. Analysis of Genes Involved in Body Weight Regulation by Targeted Re-Sequencing.

    PubMed

    Volckmar, Anna-Lena; Han, Chung Ting; Pütter, Carolin; Haas, Stefan; Vogel, Carla I G; Knoll, Nadja; Struve, Christoph; Göbel, Maria; Haas, Katharina; Herrfurth, Nikolas; Jarick, Ivonne; Grallert, Harald; Schürmann, Annette; Al-Hasani, Hadi; Hebebrand, Johannes; Sauer, Sascha; Hinney, Anke

    2016-01-01

    Genes involved in body weight regulation that were previously investigated in genome-wide association studies (GWAS) and in animal models were target-enriched followed by massive parallel next generation sequencing. We enriched and re-sequenced continuous genomic regions comprising FTO, MC4R, TMEM18, SDCCAG8, TKNS, MSRA and TBC1D1 in a screening sample of 196 extremely obese children and adolescents with age and sex specific body mass index (BMI) ≥ 99th percentile and 176 lean adults (BMI ≤ 15th percentile). 22 variants were confirmed by Sanger sequencing. Genotyping was performed in up to 705 independent obesity trios (extremely obese child and both parents), 243 extremely obese cases and 261 lean adults. We detected 20 different non-synonymous variants, one frame shift and one nonsense mutation in the 7 continuous genomic regions in study groups of different weight extremes. For SNP Arg695Cys (rs58983546) in TBC1D1 we detected nominal association with obesity (pTDT = 0.03 in 705 trios). Eleven of the variants were rare, thus were only detected heterozygously in up to ten individual(s) of the complete screening sample of 372 individuals. Two of them (in FTO and MSRA) were found in lean individuals, nine in extremely obese. In silico analyses of the 11 variants did not reveal functional implications for the mutations. Concordant with our hypothesis we detected a rare variant that potentially leads to loss of FTO function in a lean individual. For TBC1D1, contrary to our hypothesis, the loss of function variant (Arg443Stop) was found in an obese individual. Functional in vitro studies are warranted.

  3. Direct detection of RNA in vitro and in situ by target-primed RCA: The impact of E. coli RNase III on the detection efficiency of RNA sequences distanced far from the 3'-end.

    PubMed

    Merkiene, Egle; Gaidamaviciute, Edita; Riauba, Laurynas; Janulaitis, Arvydas; Lagunavicius, Arunas

    2010-08-01

    We improved the target RNA-primed RCA technique for direct detection and analysis of RNA in vitro and in situ. Previously we showed that the 3'→5' single-stranded RNA exonucleolytic activity of Phi29 DNA polymerase converts the target RNA into a primer and uses it for RCA initiation. However, in some cases, the single-stranded RNA exoribonucleolytic activity of the polymerase is hindered by strong double-stranded structures at the 3'-end of target RNAs. We demonstrate that in such hampered cases, the double-stranded RNA-specific Escherichia coli RNase III efficiently assists Phi29 DNA polymerase in converting the target RNA into a primer. These observations extend the target RNA-primed RCA possibilities to test RNA sequences distanced far from the 3'-end and customize this technique for the inner RNA sequence analysis.

  4. Fluorescence self-quenching assay for the detection of target collagen sequences using a short probe peptide.

    PubMed

    Nian, Linge; Hu, Yue; Fu, Caihong; Song, Chen; Wang, Jie; Xiao, Jianxi

    2018-01-01

    The development of novel assays to detect collagen fragments is of utmost importance for diagnostic, prognostic and therapeutic decisions in various collagen-related diseases, and one essential question is to discover probe peptides that can specifically recognize target collagen sequences. Herein we have developed the fluorescence self-quenching assay as a convenient tool to screen the capability of a series of fluorescent probe peptides of variable lengths to bind with target collagen peptides. We have revealed that the targeting ability of probe peptides is length-dependent, and have discovered a relatively short probe peptide, FAM-G(POG)8, capable of identifying the target peptide. We have further demonstrated that the fluorescence self-quenching assay together with this short probe peptide can be applied to specifically detect the desired collagen fragment in complex biological media. The fluorescence self-quenching assay provides a powerful new tool to discover effective peptides for the recognition of collagen biomarkers, and it may have great potential to identify probe peptides for various protein biomarkers involved in pathological conditions.

  5. Targeted next generation sequencing of well-differentiated/dedifferentiated liposarcoma reveals novel gene amplifications and mutations.

    PubMed

    Somaiah, Neeta; Beird, Hannah C; Barbo, Andrea; Song, Juhee; Mills Shaw, Kenna R; Wang, Wei-Lien; Eterovic, Karina; Chen, Ken; Lazar, Alexander; Conley, Anthony P; Ravi, Vinod; Hwu, Patrick; Futreal, Andrew; Simon, George; Meric-Bernstam, Funda; Hong, David

    2018-04-13

    Well-differentiated/dedifferentiated liposarcoma is a common soft tissue sarcoma with approximately 1500 new cases per year. Surgery is the mainstay of treatment but recurrences are frequent and systemic options are limited. 'Tumor genotyping' is becoming more common in clinical practice as it offers the hope of personalized targeted therapy. We wanted to evaluate the results and the clinical utility of available next-generation sequencing panels in WD/DD liposarcoma. Patients who had their tumor sequenced by either FoundationOne (n = 13) or the institutional T200/T200.1 panels (n = 7) were included in this study. Significant copy number alterations were identified, but mutations were infrequent. Out of the 27 mutations detected in 7 samples, 8 (CTNNB1, MECOM, ZNF536, EGFR, EML4, CSMD3, PBRM1, PPP1R3A) were identified as deleterious (on Condel, PolyPhen and SIFT) and a truncating mutation was found in NF2. Of these, EGFR and NF2 are potential driver mutations and have not been reported previously in liposarcoma. MDM2 and CDK4 amplification was universally present in all the tested samples and multiple other recurrent genes with high amplification or high deletion were detected. Many of these targets are potentially actionable. Eight patients went on to receive an MDM2 inhibitor with a median time to progression of 23 months (95% CI: 10-83 months).

  6. Exome Sequencing Provides Evidence of Polygenic Adaptation to a Fat-Rich Animal Diet in Indigenous Siberian Populations.

    PubMed

    Hsieh, PingHsun; Hallmark, Brian; Watkins, Joseph; Karafet, Tatiana M; Osipova, Ludmila P; Gutenkunst, Ryan N; Hammer, Michael F

    2017-11-01

    Siberia is one of the coldest environments on Earth and has great seasonal temperature variation. Long-term settlement in northern Siberia undoubtedly required biological adaptation to severe cold stress, dramatic variation in photoperiod, and limited food resources. In addition, recent archeological studies show that humans first occupied Siberia at least 45,000 years ago; yet our understanding of the demographic history of modern indigenous Siberians remains incomplete. In this study, we use whole-exome sequencing data from the Nganasans and Yakuts to infer the evolutionary history of these two indigenous Siberian populations. Recognizing the complexity of the adaptive process, we designed a model-based test to systematically search for signatures of polygenic selection. Our approach accounts for stochasticity in the demographic process and the hitchhiking effect of classic selective sweeps, as well as potential biases resulting from recombination rate and mutation rate heterogeneity. Our demographic inference shows that the Nganasans and Yakuts diverged ∼12,000-13,000 years ago from East-Asian ancestors in a process involving continuous gene flow. Our polygenic selection scan identifies seven candidate gene sets with Siberian-specific signals. Three of these gene sets are related to diet, especially to fat metabolism, consistent with the hypothesis of adaptation to a fat-rich animal diet. Additional testing rejects the effect of hitchhiking and favors a model in which selection yields small allele frequency changes at multiple unlinked genes. © The Author 2017. Published by Oxford University Press on behalf of the Society for Molecular Biology and Evolution. All rights reserved. For permissions, please e-mail: journals.permissions@oup.com.

  7. Modified Cross-Linking, Ligation, and Sequencing of Hybrids (qCLASH) Identifies Kaposi's Sarcoma-Associated Herpesvirus MicroRNA Targets in Endothelial Cells.

    PubMed

    Gay, Lauren A; Sethuraman, Sunantha; Thomas, Merin; Turner, Peter C; Renne, Rolf

    2018-04-15

    Kaposi's sarcoma (KS) tumors are derived from endothelial cells and express Kaposi's sarcoma-associated herpesvirus (KSHV) microRNAs (miRNAs). Although miRNA targets have been identified in B cell lymphoma-derived cells and epithelial cells, little has been done to characterize the KSHV miRNA targetome in endothelial cells. A recent innovation in the identification of miRNA targetomes, cross-linking, ligation, and sequencing of hybrids (CLASH), unambiguously identifies miRNAs and their targets by ligating the two species while both species are still bound within the RNA-induced silencing complex (RISC). We developed a streamlined quick CLASH (qCLASH) protocol that requires a lower cell input than the original method and therefore has the potential to be used on patient biopsy samples. Additionally, we developed a fast-growing, KSHV-negative endothelial cell line derived from telomerase-immortalized vein endothelial long-term culture (TIVE-LTC) cells. qCLASH was performed on uninfected cells and cells infected with either wild-type KSHV or a mutant virus lacking miR-K12-11/11*. More than 1,400 cellular targets of KSHV miRNAs were identified. Many of the targets identified by qCLASH lacked a canonical seed sequence match. Additionally, most target regions in mRNAs originated from the coding DNA sequence (CDS) rather than the 3' untranslated region (UTR). This set of genes includes some that were previously identified in B cells and some new genes that warrant further study. Pathway analysis of endothelial cell targets showed enrichment in cell cycle control, apoptosis, and glycolysis pathways, among others. Characterization of these new targets and the functional consequences of their repression will be important in furthering our understanding of the role of KSHV miRNAs in oncogenesis. IMPORTANCE KS lesions consist of endothelial cells latently infected with KSHV. Cells that make up these lesions express KSHV miRNAs. 
Identification of the targets of KSHV miRNAs will

  8. Extrinsic Sources of Scatter in the Richness-mass Relation of Galaxy Clusters

    NASA Astrophysics Data System (ADS)

    Rozo, Eduardo; Rykoff, Eli; Koester, Benjamin; Nord, Brian; Wu, Hao-Yi; Evrard, August; Wechsler, Risa

    2011-10-01

    Maximizing the utility of upcoming photometric cluster surveys requires a thorough understanding of the richness-mass relation of galaxy clusters. We use Monte Carlo simulations to study the impact of various sources of observational scatter on this relation. Cluster ellipticity, photometric errors, photometric redshift errors, and cluster-to-cluster variations in the properties of red-sequence galaxies contribute negligible noise. Miscentering, however, can be important, and likely contributes to the scatter in the richness-mass relation of galaxy maxBCG clusters at the low-mass end, where centering is more difficult. We also investigate the impact of projection effects under several empirically motivated assumptions about cluster environments. Using Sloan Digital Sky Survey data and the maxBCG cluster catalog, we demonstrate that variations in cluster environments can rarely (≈1%-5% of the time) result in significant richness boosts. Due to the steepness of the mass/richness function, the corresponding fraction of optically selected clusters that suffer from these projection effects is ≈5%-15%. We expect these numbers to be generic in magnitude, but a precise determination requires detailed, survey-specific modeling.

  9. Sequence investigation of 34 forensic autosomal STRs with massively parallel sequencing.

    PubMed

    Zhang, Suhua; Niu, Yong; Bian, Yingnan; Dong, Rixia; Liu, Xiling; Bao, Yun; Jin, Chao; Zheng, Hancheng; Li, Chengtao

    2018-05-01

    STRs vary not only in the length of the repeat units and the number of repeats but also in the region with which they conform to an incremental repeat pattern. Massively parallel sequencing (MPS) offers new possibilities in the analysis of STRs since it can simultaneously sequence multiple targets in a single reaction and capture potential internal sequence variations. Here, we sequenced 34 STRs applied in the forensic community of China with a custom-designed panel. MPS performance was evaluated by sequencing read analysis, a concordance study and sensitivity testing. High-coverage sequencing data were obtained to determine the constitute ratios and heterozygous balance. No actual inconsistent genotypes were observed between capillary electrophoresis (CE) and MPS, demonstrating the reliability of the panel and the MPS technology. With the sequencing data from the 200 investigated individuals, 346 and 418 alleles were obtained via CE and MPS technologies, respectively, at the 34 STRs, indicating that MPS technology provides higher discrimination than CE detection. The whole study demonstrated that STR genotyping with the custom panel and MPS technology has the potential not only to reveal length and sequence variations but also to satisfy the demands of high throughput and high multiplexing with acceptable sensitivity.

  10. Targeted Next Generation Sequencing in Patients with Inborn Errors of Metabolism

    PubMed Central

    Yubero, Dèlia; Brandi, Núria; Ormazabal, Aida; Garcia-Cazorla, Àngels; Pérez-Dueñas, Belén; Campistol, Jaime; Ribes, Antonia; Palau, Francesc

    2016-01-01

    Background: Next-generation sequencing (NGS) technology has advanced genetic diagnosis and is becoming increasingly inexpensive and fast. To evaluate the utility of NGS in the clinical field, a targeted genetic panel approach was designed for the diagnosis of a set of inborn errors of metabolism (IEM). The final aim of the study was to compare the findings for the diagnostic yield of NGS in patients who presented with consistent clinical and biochemical suspicion of IEM with those obtained for patients who did not have specific biomarkers. Methods: The subjects studied (n = 146) were classified into two categories: Group 1 (n = 81), which consisted of patients with clinical and biochemical suspicion of IEM, and Group 2 (n = 65), which consisted of IEM cases with clinical suspicion and unspecific biomarkers. A total of 171 genes were analyzed using a custom targeted panel of genes followed by Sanger validation. Results: Genetic diagnosis was achieved in 50% of patients (73/146). In addition, the diagnostic yield obtained for Group 1 was 78% (63/81), and this rate decreased to 15.4% (10/65) in Group 2 (χ² = 76.171; p < 0.0001). Conclusions: A rapid and effective genetic diagnosis was achieved in our cohort, particularly in the group that had both clinical and biochemical indications for the diagnosis. PMID:27243974
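    The diagnostic yields quoted in this abstract can be re-derived from the reported counts. The short Python sketch below does only that arithmetic; the helper name `yield_pct` is hypothetical, and the reported test statistic is not recomputed because the abstract does not say which test variant was used.

```python
def yield_pct(diagnosed, total):
    """Diagnostic yield as a percentage, rounded to one decimal place."""
    return round(100 * diagnosed / total, 1)

# (diagnosed, total) counts as reported in the abstract
group1 = (63, 81)   # clinical and biochemical suspicion of IEM
group2 = (10, 65)   # clinical suspicion, unspecific biomarkers

# The overall cohort is the sum of the two groups
overall = (group1[0] + group2[0], group1[1] + group2[1])

print("Group 1:", yield_pct(*group1))   # 77.8, reported as 78%
print("Group 2:", yield_pct(*group2))   # 15.4
print("Overall:", overall, yield_pct(*overall))  # (73, 146) 50.0
```

    The group counts add up to the overall figures given in the Results (73 diagnoses among 146 subjects).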

  11. Molecular Diagnosis of Infantile Mitochondrial Disease with Targeted Next-Generation Sequencing

    PubMed Central

    Calvo, Sarah E.; Compton, Alison G.; Hershman, Steven G.; Lim, Sze Chern; Lieber, Daniel S.; Tucker, Elena J.; Laskowski, Adrienne; Garone, Caterina; Liu, Shangtao; Jaffe, David B.; Christodoulou, John; Fletcher, Janice M.; Bruno, Damien L; Goldblatt, Jack; DiMauro, Salvatore; Thorburn, David R.; Mootha, Vamsi K.

    2012-01-01

    Advances in next-generation sequencing (NGS) promise to facilitate diagnosis of inherited disorders. While in research settings NGS has pinpointed causal alleles using segregation in large families, the key challenge for clinical diagnosis is application to single individuals. To explore its diagnostic utility, we performed targeted NGS in 42 unrelated infants with clinical and biochemical evidence of mitochondrial oxidative phosphorylation disease, who were refractory to traditional molecular diagnosis. These devastating mitochondrial disorders are characterized by phenotypic and genetic heterogeneity, with over 100 causal genes identified to date. We performed “MitoExome” sequencing of the mitochondrial DNA (mtDNA) and exons of ~1000 nuclear genes encoding mitochondrial proteins and prioritized rare mutations predicted to disrupt function. Since patients and controls harbored a comparable number of such heterozygous alleles, we could not prioritize dominant acting genes. However, patients showed a five-fold enrichment of genes with two such mutations that could underlie recessive disease. In total, 23/42 (55%) patients harbored such recessive genes or pathogenic mtDNA variants. Firm diagnoses were enabled in 10 patients (24%) who had mutations in genes previously linked to disease. 13 patients (31%) had mutations in nuclear genes never linked to disease. The pathogenicity of two such genes, NDUFB3 and AGK, was supported by cDNA complementation and evidence from multiple patients, respectively. The results underscore the immediate potential and challenges of deploying NGS in clinical settings. PMID:22277967

  12. PhytoCRISP-Ex: a web-based and stand-alone application to find specific target sequences for CRISPR/CAS editing.

    PubMed

    Rastogi, Achal; Murik, Omer; Bowler, Chris; Tirichine, Leila

    2016-07-01

    With the emerging interest in phytoplankton research, the need to establish genetic tools for the functional characterization of genes is indispensable. The CRISPR/Cas9 system is now well recognized as an efficient and accurate reverse genetic tool for genome editing. Several computational tools have been published allowing researchers to find candidate target sequences for the engineering of the CRISPR vectors, while searching possible off-targets for the predicted candidates. These tools provide built-in genome databases of common model organisms that are used for CRISPR target prediction. Although their predictions are highly sensitive, their limited applicability to non-model genomes, most notably those of protists, makes their design inadequate. This motivated us to design a new CRISPR target finding tool, PhytoCRISP-Ex. Our software offers CRISPR target predictions using an extended list of phytoplankton genomes and also delivers a user-friendly standalone application that can be used for any genome. The software attempts to integrate, for the first time, most available phytoplankton genome information and provide a web-based platform for Cas9 target prediction within them with high sensitivity. By offering a standalone version, PhytoCRISP-Ex can be used with any organism, which widens its applicability in high-throughput pipelines. PhytoCRISP-Ex surpasses existing tools by computing the availability of restriction sites over the most probable Cas9 cleavage sites, which can be ideal for mutant screens. PhytoCRISP-Ex is a simple, fast and accurate web interface with 13 pre-indexed and presently updating phytoplankton genomes. The software was also designed as a UNIX-based standalone application that allows the user to search for target sequences in the genomes of a variety of other species.
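    As a rough illustration of what "finding candidate target sequences" involves, the Python sketch below scans both strands of a DNA string for a 20-nt protospacer followed by an NGG PAM, the commonly used SpCas9 convention. This is an assumption for illustration only: the abstract does not state PhytoCRISP-Ex's selection criteria, and the function names here are hypothetical and not part of that tool. Off-target searching and restriction-site checks are not covered.

```python
import re

def reverse_complement(seq):
    """Return the reverse complement of an upper-case DNA string."""
    return seq.translate(str.maketrans("ACGT", "TGCA"))[::-1]

def find_cas9_targets(seq, protospacer_len=20):
    """List (strand, start, protospacer, pam) for every site followed by NGG.

    'start' is 0-based and, for the '-' strand, is given in
    reverse-complement coordinates.
    """
    seq = seq.upper()
    # Lookahead so that overlapping candidate sites are all reported
    pattern = re.compile(r"(?=([ACGT]{%d})([ACGT]GG))" % protospacer_len)
    hits = []
    for strand, s in (("+", seq), ("-", reverse_complement(seq))):
        for m in pattern.finditer(s):
            hits.append((strand, m.start(), m.group(1), m.group(2)))
    return hits

# Toy input: 20 A's followed by a TGG PAM
example = "A" * 20 + "TGG"
for hit in find_cas9_targets(example):
    print(hit)
```

    A real pipeline would add an off-target search against the whole genome, which is the step the abstract says existing tools restrict to built-in model-organism databases.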

  13. Investigating the Feasibility of Targeted Next-Generation Sequencing to Guide the Treatment of Head and Neck Squamous Cell Carcinoma.

    PubMed

    Lim, Sun Min; Cho, Sang Hee; Hwang, In Gyu; Choi, Jae Woo; Chang, Hyun; Ahn, Myung-Ju; Park, Keon Uk; Kim, Ji-Won; Ko, Yoon Ho; Ahn, Hee Kyung; Cho, Byoung Chul; Nam, Byung-Ho; Chun, Sang Hoon; Hong, Ji Hyung; Kwon, Jung Hye; Choi, Jong Gwon; Kang, Eun Joo; Yun, Tak; Lee, Keun-Wook; Kim, Joo-Hang; Kim, Jin Soo; Lee, Hyun Woo; Kim, Min Kyoung; Jung, Dongmin; Kim, Ji Eun; Keam, Bhumsuk; Yun, Hwan Jung; Kim, Sangwoo; Kim, Hye Ryun

    2018-05-09

    Head and neck squamous cell carcinoma (HNSCC) is a deadly disease in which precision medicine needs to be incorporated. We aimed to implement next-generation sequencing (NGS) in determining actionable targets to guide appropriate molecular targeted therapy in HNSCC patients. Ninety-three tumors and matched blood samples underwent targeted sequencing of 244 genes using the Illumina HiSeq 2500 platform with an average depth of coverage of greater than 1,000×. Clinicopathological data from patients were obtained from 17 centers in Korea, and were analyzed in correlation with NGS data. Ninety-two of the 93 tumors were amenable to data analysis. TP53 was the most common mutation, occurring in 47 (51%) patients, followed by CDKN2A (n=23, 25%), CCND1 (n=22, 24%) and PIK3CA (n=19, 21%). The total mutational burden was similar between human papillomavirus (HPV)-negative vs. positive tumors, although TP53, CDKN2A and CCND1 gene alterations occurred more frequently in HPV-negative tumors. HPV-positive tumors were significantly associated with immune signature-related genes compared to HPV-negative tumors. Mutations of NOTCH1 (p=0.027), CDKN2A (p<0.001) and TP53 (p=0.038) were significantly associated with poorer overall survival. FAT1 mutations were highly enriched in cisplatin responders, and potentially targetable alterations such as PIK3CA E545K and CDKN2A R58X were noted in 14 (15%) patients. We found several targetable genetic alterations, and our findings suggest that implementation of precision medicine in HNSCC is feasible. The predictive value of each targetable alteration should be assessed in a future umbrella trial using matched molecular targeted agents.

  14. Diversity Measures in Environmental Sequences Are Highly Dependent on Alignment Quality—Data from ITS and New LSU Primers Targeting Basidiomycetes

    PubMed Central

    Fischer, Christiane; Daniel, Rolf; Wubet, Tesfaye

    2012-01-01

    The ribosomal DNA comprising the ITS1-5.8S-ITS2 regions is widely used as a fungal marker in molecular ecology and systematics but cannot be aligned with confidence across genetically distant taxa. In order to study the diversity of Agaricomycotina in forest soils, we designed primers targeting the more alignable 28S (LSU) gene, which should be more useful for phylogenetic analyses of the detected taxa. This paper compares the performance of the established ITS1F/4B primer pair, which targets basidiomycetes, to that of two new pairs. Key factors in the comparison were the diversity covered, off-target amplification, rarefaction at different Operational Taxonomic Unit (OTU) cutoff levels, sensitivity of the method used to process the alignment to missing data and insecure positional homology, and the congruence of monophyletic clades with OTU assignments and BLAST-derived OTU names. The ITS primer pair yielded no off-target amplification but also exhibited the least fidelity to the expected phylogenetic groups. The LSU primers gave complementary pictures of diversity, but were more sensitive to modifications of the alignment such as the removal of difficult-to-align stretches. The LSU primers also yielded greater numbers of singletons and had a greater tendency to produce OTUs containing sequences from a wider variety of species as judged by BLAST similarity. We introduced some new parameters to describe alignment heterogeneity based on Shannon entropy and the extent and contents of the OTUs in a phylogenetic tree space. Our results suggest that ITS should not be used when calculating phylogenetic trees from genetically distant sequences obtained from environmental DNA extractions and that it is inadvisable to define OTUs on the basis of very heterogeneous alignments. PMID:22363808
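    The abstract mentions heterogeneity parameters based on Shannon entropy without giving their definition. The Python sketch below shows only the standard per-column Shannon entropy, H = -Σ p·log2(p), for a multiple sequence alignment; treating gaps as missing data and the function names are assumptions made here, not the authors' method.

```python
import math
from collections import Counter

def column_entropy(column, ignore="-"):
    """Shannon entropy (bits) of one alignment column, skipping gap symbols."""
    symbols = [c for c in column.upper() if c not in ignore]
    if not symbols:
        return 0.0
    total = len(symbols)
    counts = Counter(symbols)
    return -sum((n / total) * math.log2(n / total) for n in counts.values())

def alignment_entropy(aligned_seqs):
    """Per-column entropies for equal-length aligned sequences."""
    return [column_entropy("".join(col)) for col in zip(*aligned_seqs)]

# Toy alignment: two conserved columns, two variable ones
toy = ["ACGT", "ACGA", "ACCA", "AC-T"]
print(alignment_entropy(toy))
```

    Conserved columns score 0 bits, and a column split evenly between two bases scores 1 bit, so higher values flag the hard-to-align stretches the abstract refers to.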

  15. Short communication: Validation of 4 candidate causative trait variants in 2 cattle breeds using targeted sequence imputation.

    PubMed

    Pausch, Hubert; Wurmser, Christine; Reinhardt, Friedrich; Emmerling, Reiner; Fries, Ruedi

    2015-06-01

    Most association studies for pinpointing trait-associated variants are performed within breed. The availability of sequence data from key ancestors of several cattle breeds now enables immediate assessment of the frequency of trait-associated variants in populations different from the mapping population and their imputation into large validation populations. The objective of this study was to validate the effects of 4 putatively causative variants on milk production traits, male fertility, and stature in German Fleckvieh and Holstein-Friesian animals using targeted sequence imputation. We used whole-genome sequence data of 456 animals to impute 4 missense mutations in DGAT1, GHR, PRLR, and PROP1 into 10,363 Fleckvieh and 8,812 Holstein animals. The accuracy of the imputed genotypes exceeded 95% for all variants. Association testing with imputed variants revealed consistent antagonistic effects of the DGAT1 p.A232K and GHR p.F279Y variants on milk yield and protein and fat contents, respectively, in both breeds. The allele frequency of both polymorphisms has changed considerably in the past 20 yr, indicating that they were targets of recent selection for milk production traits. The PRLR p.S18N variant was associated with yield traits in Fleckvieh but not in Holstein, suggesting that it may be in linkage disequilibrium with a mutation affecting yield traits rather than being causal. The reported effects of the PROP1 p.H173R variant on milk production, male fertility, and stature could not be confirmed. Our results demonstrate that population-wide imputation of candidate causal variants from sequence data is feasible, enabling their rapid validation in large independent populations. Copyright © 2015 American Dairy Science Association. Published by Elsevier Inc. All rights reserved.

  16. Isolation of a new class of cysteine-glycine-proline-rich beta-proteins (beta-keratins) and their expression in snake epidermis.

    PubMed

    Dalla Valle, Luisa; Nardi, Alessia; Alibardi, Lorenzo

    2010-03-01

    Scales of snakes contain hard proteins (beta-keratins), now referred to as keratin-associated beta-proteins. In the present study we report the isolation, sequencing, and expression of a new group of these proteins from snake epidermis, designated cysteine-glycine-proline-rich proteins. One deduced protein from expressed mRNAs contains 128 amino acids (12.5 kDa) with a theoretical pI at 7.95, containing 10.2% cysteine and 15.6% glycine. The sequences of two more snake cysteine-proline-rich proteins have been identified from genomic DNA. In situ hybridization shows that the messengers for these proteins are present in the suprabasal and early differentiating beta-cells of the renewing scale epidermis. The present study shows that snake scales, as previously seen in scales of lizards, contain cysteine-rich beta-proteins in addition to glycine-rich beta-proteins. These keratin-associated beta-proteins mix with intermediate filament keratins (alpha-keratins) to produce the resistant corneous layer of snake scales. The specific proportion of these two subfamilies of proteins in different scales can determine various degrees of hardness in scales.

  17. Signal sequence-independent targeting of MID2 mRNA to the endoplasmic reticulum by the yeast RNA-binding protein Khd1p.

    PubMed

    Syed, Muhammad Ibrahim; Moorthy, Balaji T; Jenner, Andreas; Fetka, Ingrid; Jansen, Ralf-Peter

    2018-05-17

    Localization of mRNAs depends on specific RNA-binding proteins (RBPs) and critically contributes not only to cell polarization but also to basal cell function. The yeast RBP Khd1p binds to several hundred mRNAs, the majority of which encode secreted or membrane proteins. We demonstrate that a subfraction of Khd1p associates with artificial liposomes and endoplasmic reticulum (ER), and that Khd1p endomembrane association is partially dependent on its binding to RNA. ER targeting of at least two mRNAs, MID2 and SLG1/WSC1, requires KHD1 but is independent of their translation. Together, our results suggest interdependence of Khd1p and mRNA for their targeting to the ER and present additional evidence for signal sequence-independent, RBP-mediated mRNA targeting. © 2018 Federation of European Biochemical Societies.

  18. Galectin-8 regulates targeting of Gp135/podocalyxin and lumen formation at the apical surface of renal epithelial cells.

    PubMed

    Lim, HooiCheng; Yu, Chun-Ying; Jou, Tzuu-Shuh

    2017-11-01

    Establishment of apical-basal polarity, through correct targeting of polarity determinants to distinct domains of the plasma membrane, is a fundamental process for the development of functioning epithelial tubules. Here we report that galectin (Gal)-8 regulates apical-basal polarity of Madin-Darby canine kidney (MDCK) cells via apical targeting of 135-kDa glycoprotein (Gp135). Gal-8 interacts with newly synthesized Gp135 in a glycan-dependent manner. Gal-8 knockdown induces aberrant lumens at the lateral domain and mistargeting of Gp135 to this structure, thus disrupting the kidney epithelial polarity of MDCK cells, which organize lumens at the apical surface. The O-glycosylation deletion mutant of Gp135 phenocopies the effect of Gal-8 knockdown, which suggests that Gal-8 is the decoding machinery for the apical sorting signals of Gp135 residing at its O-glycosylation-rich region. Collectively, our results reveal a new role of Gal-8 in the development of luminal organs by regulating targeting of apical polarity protein Gp135. © FASEB.

  19. Novel myosin mutations for hereditary hearing loss revealed by targeted genomic capture and massively parallel sequencing

    PubMed Central

    Brownstein, Zippora; Abu-Rayyan, Amal; Karfunkel-Doron, Daphne; Sirigu, Serena; Davidov, Bella; Shohat, Mordechai; Frydman, Moshe; Houdusse, Anne; Kanaan, Moien; Avraham, Karen B

    2014-01-01

    Hereditary hearing loss is genetically heterogeneous, with a large number of genes and mutations contributing to this sensory, often monogenic, disease. The number of genes, as well as their large size, precludes comprehensive genetic diagnosis of all known deafness genes. A combination of targeted genomic capture and massively parallel sequencing (MPS), also referred to as next-generation sequencing, was applied to determine the deafness-causing genes in hearing-impaired individuals from Israeli Jewish and Palestinian Arab families. Among the mutations detected, we identified nine novel mutations in the genes encoding myosin VI, myosin VIIA and myosin XVA, doubling the number of myosin mutations in the Middle East. Myosin VI mutations were identified in this population for the first time. Modeling of the mutations provided predicted mechanisms for the damage they inflict in the molecular motors, leading to impaired function and thus deafness. The myosin mutations span all regions of these molecular motors, leading to a wide range of hearing phenotypes, reinforcing the key role of this family of proteins in auditory function. This study demonstrates that multiple mutations responsible for hearing loss can be identified in a relatively straightforward manner by targeted-gene MPS technology and concludes that this is the optimal genetic diagnostic approach for identification of mutations responsible for hearing loss. PMID:24105371

  20. Kit for detecting nucleic acid sequences using competitive hybridization probes

    DOEpatents

    Lucas, Joe N.; Straume, Tore; Bogen, Kenneth T.

    2001-01-01

    A kit is provided for detecting a target nucleic acid sequence in a sample, the kit comprising: a first hybridization probe which includes a nucleic acid sequence that is sufficiently complementary to selectively hybridize to a first portion of the target sequence, the first hybridization probe including a first complexing agent for forming a binding pair with a second complexing agent; and a second hybridization probe which includes a nucleic acid sequence that is sufficiently complementary to selectively hybridize to a second portion of the target sequence to which the first hybridization probe does not selectively hybridize, the second hybridization probe including a detectable marker; a third hybridization probe which includes a nucleic acid sequence that is sufficiently complementary to selectively hybridize to a first portion of the target sequence, the third hybridization probe including the same detectable marker as the second hybridization probe; and a fourth hybridization probe which includes a nucleic acid sequence that is sufficiently complementary to selectively hybridize to a second portion of the target sequence to which the third hybridization probe does not selectively hybridize, the fourth hybridization probe including the first complexing agent for forming a binding pair with the second complexing agent; wherein the first and second hybridization probes are capable of simultaneously hybridizing to the target sequence and the third and fourth hybridization probes are capable of simultaneously hybridizing to the target sequence, the detectable marker is not present on the first or fourth hybridization probes and the first, second, third, and fourth hybridization probes each include a competitive nucleic acid sequence which is sufficiently complementary to a third portion of the target sequence that the competitive sequences of the first, second, third, and fourth hybridization probes compete with each other to hybridize to the third portion of 
the

  1. The GC-rich mitochondrial and plastid genomes of the green alga Coccomyxa give insight into the evolution of organelle DNA nucleotide landscape

    DOE PAGES

    Smith, David Roy; Burki, Fabien; Yamada, Takashi; ...

    2011-08-26

    Most of the available mitochondrial and plastid genome sequences are biased towards adenine and thymine (AT) over guanine and cytosine (GC). Examples of GC-rich organelle DNAs are limited to a small but eclectic list of species, including certain green algae. Here, to gain insight into the evolution of organelle nucleotide landscape, we present the GC-rich mitochondrial and plastid DNAs from the trebouxiophyte green alga Coccomyxa sp. C-169. We compare these sequences with other GC-rich organelle DNAs and argue that the forces biasing them towards G and C are nonadaptive and linked to the metabolic and/or life history features of this species. The Coccomyxa organelle genomes are also used for phylogenetic analyses, which highlight the complexities in trying to resolve the interrelationships among the core chlorophyte green algae, but ultimately favour a sister relationship between the Ulvophyceae and Chlorophyceae, with the Trebouxiophyceae branching at the base of the chlorophyte crown.

  2. The GC-Rich Mitochondrial and Plastid Genomes of the Green Alga Coccomyxa Give Insight into the Evolution of Organelle DNA Nucleotide Landscape

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Smith, David Roy; Burki, Fabien; Yamada, Takashi

    2011-05-13

    Most of the available mitochondrial and plastid genome sequences are biased towards adenine and thymine (AT) over guanine and cytosine (GC). Examples of GC-rich organelle DNAs are limited to a small but eclectic list of species, including certain green algae. Here, to gain insight into the evolution of organelle nucleotide landscape, we present the GC-rich mitochondrial and plastid DNAs from the trebouxiophyte green alga Coccomyxa sp. C-169. We compare these sequences with other GC-rich organelle DNAs and argue that the forces biasing them towards G and C are nonadaptive and linked to the metabolic and/or life history features of this species. The Coccomyxa organelle genomes are also used for phylogenetic analyses, which highlight the complexities in trying to resolve the interrelationships among the core chlorophyte green algae, but ultimately favour a sister relationship between the Ulvophyceae and Chlorophyceae, with the Trebouxiophyceae branching at the base of the chlorophyte crown.

  3. Calcium Sensing Receptor Mutations Implicated in Pancreatitis and Idiopathic Epilepsy Syndrome Disrupt an Arginine-rich Retention Motif

    PubMed Central

    Stepanchick, Ann; McKenna, Jennifer; McGovern, Olivia; Huang, Ying; Breitwieser, Gerda E.

    2010-01-01

    Calcium sensing receptor (CaSR) mutations implicated in familial hypocalciuric hypercalcemia, pancreatitis and idiopathic epilepsy syndrome map to an extended arginine-rich region in the proximal carboxyl terminus. Arginine-rich motifs mediate endoplasmic reticulum retention and/or retrieval of multisubunit proteins so we asked whether these mutations, R886P, R896H or R898Q, altered CaSR targeting to the plasma membrane. Targeting was enhanced by all three mutations, and Ca2+-stimulated ERK1/2 phosphorylation was increased for R896H and R898Q. To define the role of the extended arginine-rich region in CaSR trafficking, we independently determined the contributions of R890/R891 and/or R896/K897/R898 motifs by mutation to alanine. Disruption of the motif(s) significantly increased surface expression and function relative to wt CaSR. The arginine-rich region is flanked by phosphorylation sites at S892 (protein kinase C) and S899 (protein kinase A). The phosphorylation state of S899 regulated recognition of the arginine-rich region; S899D showed increased surface localization. CaSR assembles in the endoplasmic reticulum as a covalent disulfide-linked dimer and we determined whether retention requires the presence of arginine-rich regions in both subunits. A single arginine-rich region within the dimer was sufficient to confer intracellular retention comparable to wt CaSR. We have identified an extended arginine-rich region in the proximal carboxyl terminus of CaSR (residues R890 - R898) which fosters intracellular retention of CaSR and is regulated by phosphorylation. Mutation(s) identified in chronic pancreatitis and idiopathic epilepsy syndrome therefore increase plasma membrane targeting of CaSR, likely contributing to the altered Ca2+ signaling characteristic of these diseases. PMID:20798521

  4. The complete chloroplast genome sequence of the medicinal plant Salvia miltiorrhiza.

    PubMed

    Qian, Jun; Song, Jingyuan; Gao, Huanhuan; Zhu, Yingjie; Xu, Jiang; Pang, Xiaohui; Yao, Hui; Sun, Chao; Li, Xian'en; Li, Chuyuan; Liu, Juyan; Xu, Haibin; Chen, Shilin

    2013-01-01

    Salvia miltiorrhiza is an important medicinal plant with great economic and medicinal value. The complete chloroplast (cp) genome sequence of Salvia miltiorrhiza, the first sequenced member of the Lamiaceae family, is reported here. The genome is 151,328 bp in length and exhibits a typical quadripartite structure of the large (LSC, 82,695 bp) and small (SSC, 17,555 bp) single-copy regions, separated by a pair of inverted repeats (IRs, 25,539 bp). It contains 114 unique genes, including 80 protein-coding genes, 30 tRNAs and four rRNAs. The genome structure, gene order, GC content and codon usage are similar to the typical angiosperm cp genomes. Four forward, three inverted and seven tandem repeats were detected in the Salvia miltiorrhiza cp genome. Simple sequence repeat (SSR) analysis among the 30 asterid cp genomes revealed that most SSRs are AT-rich, which contribute to the overall AT richness of these cp genomes. Additionally, fewer SSRs are distributed in the protein-coding sequences compared to the non-coding regions, indicating an uneven distribution of SSRs within the cp genomes. Entire cp genome comparison of Salvia miltiorrhiza and three other Lamiales cp genomes showed a high degree of sequence similarity and a relatively high divergence of intergenic spacers. Sequence divergence analysis discovered the ten most divergent and ten most conserved genes as well as their length variation, which will be helpful for phylogenetic studies in asterids. Our analysis also supports that both regional and functional constraints affect gene sequence evolution. Further, phylogenetic analysis demonstrated a sister relationship between Salvia miltiorrhiza and Sesamum indicum. The complete cp genome sequence of Salvia miltiorrhiza reported in this paper will facilitate population, phylogenetic and cp genetic engineering studies of this medicinal plant.
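
    The region lengths and gene counts quoted above can be cross-checked with simple arithmetic. The sketch below uses only figures from the abstract; the variable names are illustrative and not taken from the paper.

```python
# Region lengths in bp, as quoted in the abstract.
lsc = 82_695   # large single-copy region
ssc = 17_555   # small single-copy region
ir = 25_539    # one inverted repeat; the quadripartite structure has two copies

# A quadripartite cp genome is LSC + SSC + 2 x IR.
genome_length = lsc + ssc + 2 * ir

# Unique genes: protein-coding + tRNA + rRNA.
gene_total = 80 + 30 + 4

# Share of the genome occupied by the two inverted repeats.
ir_fraction = 2 * ir / genome_length

print(genome_length)          # 151328, matching the reported genome size
print(gene_total)             # 114, matching the reported unique gene count
print(round(ir_fraction, 3))  # 0.338
```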

  5. Improving mapping and SNP-calling performance in multiplexed targeted next-generation sequencing

    PubMed Central

    2012-01-01

    Background Compared to classical genotyping, targeted next-generation sequencing (tNGS) can be custom-designed to interrogate entire genomic regions of interest, in order to detect novel as well as known variants. To bring down the per-sample cost, one approach is to pool barcoded NGS libraries before sample enrichment. Still, we lack a complete understanding of how this multiplexed tNGS approach and the varying performance of the ever-evolving analytical tools can affect the quality of variant discovery. Therefore, we evaluated the impact of different software tools and analytical approaches on the discovery of single nucleotide polymorphisms (SNPs) in multiplexed tNGS data. To generate our own test model, we combined a sequence capture method with NGS in three experimental stages of increasing complexity (E. coli genes, multiplexed E. coli, and multiplexed HapMap BRCA1/2 regions). Results We successfully enriched barcoded NGS libraries instead of genomic DNA, achieving reproducible coverage profiles (Pearson correlation coefficients of up to 0.99) across multiplexed samples, with <10% strand bias. However, the SNP calling quality was substantially affected by the choice of tools and mapping strategy. With the aim of reducing computational requirements, we compared conventional whole-genome mapping and SNP-calling with a new faster approach: target-region mapping with subsequent ‘read-backmapping’ to the whole genome to reduce the false detection rate. Consequently, we developed a combined mapping pipeline, which includes standard tools (BWA, SAMtools, etc.), and tested it on public HiSeq2000 exome data from the 1000 Genomes Project. Our pipeline saved 12 hours of run time per Hiseq2000 exome sample and detected ~5% more SNPs than the conventional whole genome approach. This suggests that more potential novel SNPs may be discovered using both approaches than with just the conventional approach. Conclusions We recommend applying our general

  6. Identification of microRNAs and their targets in Finger millet by high throughput sequencing.

    PubMed

    Usha, S; Jyothi, M N; Sharadamma, N; Dixit, Rekha; Devaraj, V R; Nagesh Babu, R

    2015-12-15

    MicroRNAs are short non-coding RNAs which play an important role in regulating gene expression by mRNA cleavage or by translational repression. The majority of identified miRNAs are evolutionarily conserved; however, others are expressed in a species-specific manner. Finger millet is an important cereal crop; nonetheless, no practical information is available on its microRNAs to date. In this study, we have identified 95 conserved microRNAs belonging to 39 families and 3 novel microRNAs by high throughput sequencing. For the identified conserved and novel miRNAs, a total of 507 targets were predicted. Eleven miRNAs were validated, and their tissue specificity was determined, by stem-loop RT-qPCR and Northern blot. GO analyses revealed that the miRNA targets are involved in a wide range of regulatory functions. This study indicates that Finger millet contains a large number of known and novel miRNAs, which may play an important role in growth and development. Copyright © 2015 Elsevier B.V. All rights reserved.

  7. Mechanisms of haplotype divergence at the RGA08 nucleotide-binding leucine-rich repeat gene locus in wild banana (Musa balbisiana).

    PubMed

    Baurens, Franc-Christophe; Bocs, Stéphanie; Rouard, Mathieu; Matsumoto, Takashi; Miller, Robert N G; Rodier-Goud, Marguerite; MBéguié-A-MBéguié, Didier; Yahiaoui, Nabila

    2010-07-16

    Comparative sequence analysis of complex loci such as resistance gene analog clusters allows estimating the degree of sequence conservation and mechanisms of divergence at the intraspecies level. In banana (Musa sp.), two diploid wild species Musa acuminata (A genome) and Musa balbisiana (B genome) contribute to the polyploid genome of many cultivars. The M. balbisiana species is associated with vigour and tolerance to pests and disease and little is known on the genome structure and haplotype diversity within this species. Here, we compare two genomic sequences of 253 and 223 kb corresponding to two haplotypes of the RGA08 resistance gene analog locus in M. balbisiana "Pisang Klutuk Wulung" (PKW). Sequence comparison revealed two regions of contrasting features. The first is a highly colinear gene-rich region where the two haplotypes diverge only by single nucleotide polymorphisms and two repetitive element insertions. The second corresponds to a large cluster of RGA08 genes, with 13 and 18 predicted RGA genes and pseudogenes spread over 131 and 152 kb respectively on each haplotype. The RGA08 cluster is enriched in repetitive element insertions, in duplicated non-coding intergenic sequences including low complexity regions and shows structural variations between haplotypes. Although some allelic relationships are retained, a large diversity of RGA08 genes occurs in this single M. balbisiana genotype, with several RGA08 paralogs specific to each haplotype. The RGA08 gene family has evolved by mechanisms of unequal recombination, intragenic sequence exchange and diversifying selection. An unequal recombination event taking place between duplicated non-coding intergenic sequences resulted in a different RGA08 gene content between haplotypes pointing out the role of such duplicated regions in the evolution of RGA clusters. Based on the synonymous substitution rate in coding sequences, we estimated a 1 million year divergence time for these M. balbisiana haplotypes. A

  8. Mechanisms of haplotype divergence at the RGA08 nucleotide-binding leucine-rich repeat gene locus in wild banana (Musa balbisiana)

    PubMed Central

    2010-01-01

    Background Comparative sequence analysis of complex loci such as resistance gene analog clusters allows estimating the degree of sequence conservation and mechanisms of divergence at the intraspecies level. In banana (Musa sp.), two diploid wild species Musa acuminata (A genome) and Musa balbisiana (B genome) contribute to the polyploid genome of many cultivars. The M. balbisiana species is associated with vigour and tolerance to pests and disease and little is known on the genome structure and haplotype diversity within this species. Here, we compare two genomic sequences of 253 and 223 kb corresponding to two haplotypes of the RGA08 resistance gene analog locus in M. balbisiana "Pisang Klutuk Wulung" (PKW). Results Sequence comparison revealed two regions of contrasting features. The first is a highly colinear gene-rich region where the two haplotypes diverge only by single nucleotide polymorphisms and two repetitive element insertions. The second corresponds to a large cluster of RGA08 genes, with 13 and 18 predicted RGA genes and pseudogenes spread over 131 and 152 kb respectively on each haplotype. The RGA08 cluster is enriched in repetitive element insertions, in duplicated non-coding intergenic sequences including low complexity regions and shows structural variations between haplotypes. Although some allelic relationships are retained, a large diversity of RGA08 genes occurs in this single M. balbisiana genotype, with several RGA08 paralogs specific to each haplotype. The RGA08 gene family has evolved by mechanisms of unequal recombination, intragenic sequence exchange and diversifying selection. An unequal recombination event taking place between duplicated non-coding intergenic sequences resulted in a different RGA08 gene content between haplotypes pointing out the role of such duplicated regions in the evolution of RGA clusters. Based on the synonymous substitution rate in coding sequences, we estimated a 1 million year divergence time for these M

  9. Species identification in mixed tuna samples with next-generation sequencing targeting two short cytochrome b gene fragments.

    PubMed

    Kappel, Kristina; Haase, Ilka; Käppel, Christine; Sotelo, Carmen G; Schröder, Ute

    2017-11-01

    Conventional Sanger sequencing of PCR products is the gold standard for species authentication of seafood products. However, this method is inappropriate for the analysis of products that might contain mixtures of species, such as tinned tuna. The purpose of this study was to test whether next-generation sequencing (NGS) can be a solution for the authentication of mixed products. Nine tuna samples containing mixtures of up to four species were prepared and subjected to an NGS approach targeting two short cytochrome b gene (cytb) fragments on the Illumina MiSeq platform. Sequence recovery was precise and admixtures of as low as 1% could be identified, depending on the species composition of the mixtures. Duplicate samples as well as two individual NGS runs produced very similar results. A first test of three commercial tinned tuna samples indicated the presence of different species in the same tin, although this is forbidden by EU law. Copyright © 2017 Elsevier Ltd. All rights reserved.

  10. Insertion Sequences

    PubMed Central

    Mahillon, Jacques; Chandler, Michael

    1998-01-01

    Insertion sequences (ISs) constitute an important component of most bacterial genomes. Over 500 individual ISs have been described in the literature to date, and many more are being discovered in the ongoing prokaryotic and eukaryotic genome-sequencing projects. The last 10 years have also seen some striking advances in our understanding of the transposition process itself. Not least of these has been the development of various in vitro transposition systems for both prokaryotic and eukaryotic elements and, for several of these, a detailed understanding of the transposition process at the chemical level. This review presents a general overview of the organization and function of insertion sequences of eubacterial, archaebacterial, and eukaryotic origins with particular emphasis on bacterial elements and on different aspects of the transposition mechanism. It also attempts to provide a framework for classification of these elements by assigning them to various families or groups. A total of 443 members of the collection have been grouped in 17 families based on combinations of the following criteria: (i) similarities in genetic organization (arrangement of open reading frames); (ii) marked identities or similarities in the enzymes which mediate the transposition reactions, the recombinases/transposases (Tpases); (iii) similar features of their ends (terminal IRs); and (iv) fate of the nucleotide sequence of their target sites (generation of a direct target duplication of determined length). A brief description of the mechanism(s) involved in the mobility of individual ISs in each family and of the structure-function relationships of the individual Tpases is included where available. PMID:9729608

  11. DrugBank: a knowledgebase for drugs, drug actions and drug targets

    PubMed Central

    Wishart, David S.; Knox, Craig; Guo, An Chi; Cheng, Dean; Shrivastava, Savita; Tzur, Dan; Gautam, Bijaya; Hassanali, Murtaza

    2008-01-01

    DrugBank is a richly annotated resource that combines detailed drug data with comprehensive drug target and drug action information. Since its first release in 2006, DrugBank has been widely used to facilitate in silico drug target discovery, drug design, drug docking or screening, drug metabolism prediction, drug interaction prediction and general pharmaceutical education. The latest version of DrugBank (release 2.0) has been expanded significantly over the previous release. With ∼4900 drug entries, it now contains 60% more FDA-approved small molecule and biotech drugs including 10% more ‘experimental’ drugs. Significantly, more protein target data has also been added to the database, with the latest version of DrugBank containing three times as many non-redundant protein or drug target sequences as before (1565 versus 524). Each DrugCard entry now contains more than 100 data fields with half of the information being devoted to drug/chemical data and the other half devoted to pharmacological, pharmacogenomic and molecular biological data. A number of new data fields, including food–drug interactions, drug–drug interactions and experimental ADME data have been added in response to numerous user requests. DrugBank has also significantly improved the power and simplicity of its structure query and text query searches. DrugBank is available at http://www.drugbank.ca PMID:18048412

  12. [Application of targeted capture technology and next generation sequencing in molecular diagnosis of inherited myopathy].

    PubMed

    Fu, Xiaona; Liu, Aijie; Yang, Haipo; Wei, Cuijie; Ding, Juan; Wang, Shuang; Wang, Jingmin; Yuan, Yun; Jiang, Yuwu; Xiong, Hui

    2015-10-01

    To elucidate the usefulness of next generation sequencing for diagnosis of inherited myopathy, and to analyze the relationship between clinical phenotype and genotype in inherited myopathy. Related genes were selected for the SureSelect target enrichment system kit (Panel Version 1 and Panel Version 2). A total of 134 patients with a clinical diagnosis of inherited myopathy underwent next generation sequencing in the Department of Pediatrics, Peking University First Hospital from January 2013 to June 2014. Clinical information and gene detection results of the patients were collected and analyzed. Of the 134 patients (89 males and 45 females; age at visit from 6 months to 26 years, average 6 years and 1 month), 77 underwent next generation sequencing by Panel Version 1 in 2013, and 57 underwent next generation sequencing by Panel Version 2 in 2014. Gene detection revealed that 74 patients had pathogenic gene mutations, a positive rate of genetic diagnosis of 55.22%. One patient was diagnosed with metabolic myopathy, 5 with congenital myopathy, and 68 with muscular dystrophy, including 22 with congenital muscular dystrophy 1A (MDC1A), 11 with Ullrich congenital muscular dystrophy (UCMD), 6 with Bethlem myopathy (BM), 12 with Duchenne muscular dystrophy (DMD) caused by point mutations in the DMD gene, 5 with LMNA-related congenital muscular dystrophy (L-CMD), 1 with Emery-Dreifuss muscular dystrophy (EDMD), 7 with alpha-dystroglycanopathy (α-DG), and 4 with limb-girdle muscular dystrophy (LGMD). Next generation sequencing plays an important role in the diagnosis of inherited myopathy. Analysis of clinical and biological information was essential for screening pathogenic genes of inherited myopathy.
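
    The cohort figures in this abstract are internally consistent, which a short tally makes explicit. The sketch below uses only the counts quoted above; the dictionary keys are shorthand labels chosen for illustration.

```python
total_patients = 134

# Two independent ways the abstract splits the cohort.
by_panel = {"Panel Version 1": 77, "Panel Version 2": 57}
by_sex = {"male": 89, "female": 45}

# Muscular dystrophy subtypes, as listed in the abstract.
muscular_dystrophy = {
    "MDC1A": 22, "UCMD": 11, "BM": 6, "DMD": 12,
    "L-CMD": 5, "EDMD": 1, "alpha-DG": 7, "LGMD": 4,
}

# Diagnostic groups among patients with pathogenic mutations.
diagnosed = {
    "metabolic myopathy": 1,
    "congenital myopathy": 5,
    "muscular dystrophy": sum(muscular_dystrophy.values()),
}

positive = sum(diagnosed.values())
positive_rate = 100 * positive / total_patients

print(sum(by_panel.values()), sum(by_sex.values()))  # 134 134
print(diagnosed["muscular dystrophy"])               # 68
print(positive, round(positive_rate, 2))             # 74 55.22
```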

  13. Low-Energy Electron-Induced Strand Breaks in Telomere-Derived DNA Sequences-Influence of DNA Sequence and Topology.

    PubMed

    Rackwitz, Jenny; Bald, Ilko

    2018-03-26

    During cancer radiation therapy high-energy radiation is used to reduce tumour tissue. The irradiation produces a shower of secondary low-energy (<20 eV) electrons, which are able to damage DNA very efficiently by dissociative electron attachment. Recently, it was suggested that low-energy electron-induced DNA strand breaks strongly depend on the specific DNA sequence with a high sensitivity of G-rich sequences. Here, we use DNA origami platforms to expose G-rich telomere sequences to low-energy (8.8 eV) electrons to determine absolute cross sections for strand breakage and to study the influence of sequence modifications and topology of telomeric DNA on the strand breakage. We find that the telomeric DNA 5'-(TTA GGG)2 is more sensitive to low-energy electrons than an intermixed sequence 5'-(TGT GTG A)2, confirming the unique electronic properties resulting from G-stacking. With increasing length of the oligonucleotide (i.e., going from 5'-(GGG ATT)2 to 5'-(GGG ATT)4), both the variety of topology and the electron-induced strand break cross sections increase. Addition of K+ ions decreases the strand break cross section for all sequences that are able to fold G-quadruplexes or G-intermediates, whereas the strand break cross section for the intermixed sequence remains unchanged. These results indicate that telomeric DNA is rather sensitive towards low-energy electron-induced strand breakage suggesting significant telomere shortening that can also occur during cancer radiation therapy. © 2018 Wiley-VCH Verlag GmbH & Co. KGaA, Weinheim.

  14. Geologic map showing springs rich in carbon dioxide or chloride in California

    USGS Publications Warehouse

    Barnes, Ivan; Irwin, William P.; Gibson, H.A.

    1975-01-01

    Carbon dioxide- and chloride-rich springs occur in all geologic provinces in California, but are most abundant in the Coast Ranges and the Great Valley. The carbon-dioxide-rich springs issue mainly from Franciscan terrane; they also are rich in boron and are of the metamorphic type (White, 1957). Based on isotopic data, either the carbon dioxide or the water, or both, may be of metamorphic origin. Because of high magnesium values, the water of many of the carbon-dioxide-rich springs is thought to have passed through serpentinite. The chloride-rich waters are most common in rocks of the Great Valley sequence. Nearly all are more dilute than present-day sea water. The similarity in isotopic compositions of the metamorphic carbon-dioxide-rich water and the chloride-rich water may indicate a similar extent of water-rock interaction.

  15. Two DNA-binding factors recognize specific sequences at silencers, upstream activating sequences, autonomously replicating sequences, and telomeres in Saccharomyces cerevisiae

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Buchman, A.R.; Kimmerly, W.J.; Rine, J.

    1988-01-01

    Two DNA-binding factors from Saccharomyces cerevisiae have been characterized, GRFI (general regulatory factor I) and ABFI (ARS-binding factor I), that recognize specific sequences within diverse genetic elements. GRFI bound to sequences at the negative regulatory elements (silencers) of the silent mating type loci HML E and HMR E and to the upstream activating sequence (UAS) required for transcription of the MATα genes. A putative conserved UAS located at genes involved in translation (RPG box) was also recognized by GRFI. In addition, GRFI bound with high affinity to sequences within the (C1-3A)-repeat region at yeast telomeres. Binding sites for GRFI with the highest affinity appeared to be of the form 5'-(A/G)(A/C)ACCCAN NCA(T/C)(T/C)-3', where N is any nucleotide. ABFI-binding sites were located next to autonomously replicating sequences (ARSs) at controlling elements of the silent mating type loci HMR E, HMR I, and HML I and were associated with ARS1, ARS2, and the 2μm plasmid ARS. Two tandem ABFI binding sites were found between the HIS3 and DED1 genes, several kilobase pairs from any ARS, indicating that ABFI-binding sites are not restricted to ARSs. The sequences recognized by ABFI showed partial dyad-symmetry and appeared to be variations of the consensus 5'-TATCATTNNNNACGA-3'. GRFI and ABFI were both abundant DNA-binding factors and did not appear to be encoded by the SIR genes, whose products are required for repression of the silent mating type loci. Together, these results indicate that both GRFI and ABFI play multiple roles within the cell.
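
    The two consensus sequences quoted above translate directly into regular expressions. The sketch below is illustrative only: it reads the GRFI consensus as having two adjacent N positions (ignoring the space printed in the record), scans the forward strand only, and uses a made-up test sequence. The names GRFI_SITE, ABFI_SITE, find_sites and toy_seq are hypothetical and do not come from the paper.

```python
import re

# 5'-(A/G)(A/C)ACCCANNCA(T/C)(T/C)-3', with N = any nucleotide (assumed reading).
GRFI_SITE = r"[AG][AC]ACCCA[ACGT]{2}CA[TC][TC]"
# 5'-TATCATTNNNNACGA-3'
ABFI_SITE = r"TATCATT[ACGT]{4}ACGA"


def find_sites(pattern, seq):
    """Return (0-based start, matched text) for every hit, including overlapping ones."""
    # The lookahead lets the regex engine report matches that overlap each other.
    lookahead = re.compile(f"(?=({pattern}))")
    return [(m.start(), m.group(1)) for m in lookahead.finditer(seq.upper())]


# Made-up sequence with one site of each kind planted in it.
toy_seq = "GG" + "AAACCCATACATT" + "CC" + "TATCATTGCGCACGA" + "GG"

print(find_sites(GRFI_SITE, toy_seq))  # [(2, 'AAACCCATACATT')]
print(find_sites(ABFI_SITE, toy_seq))  # [(17, 'TATCATTGCGCACGA')]
```

    A real scan would also search the reverse complement, since a binding site can lie on either strand.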

  16. Exome sequencing of a multigenerational human pedigree.

    PubMed

    Hedges, Dale J; Hedges, Dale; Burges, Dan; Powell, Eric; Almonte, Cherylyn; Huang, Jia; Young, Stuart; Boese, Benjamin; Schmidt, Mike; Pericak-Vance, Margaret A; Martin, Eden; Zhang, Xinmin; Harkins, Timothy T; Züchner, Stephan

    2009-12-14

    Over the next few years, the efficient use of next-generation sequencing (NGS) in human genetics research will depend heavily upon the effective mechanisms for the selective enrichment of genomic regions of interest. Recently, comprehensive exome capture arrays have become available for targeting approximately 33 Mb or approximately 180,000 coding exons across the human genome. Selective genomic enrichment of the human exome offers an attractive option for new experimental designs aiming to quickly identify potential disease-associated genetic variants, especially in family-based studies. We have evaluated a 2.1 M feature human exome capture array on eight individuals from a three-generation family pedigree. We were able to cover up to 98% of the targeted bases at a long-read sequence read depth of ≥3, 86% at a read depth of ≥10, and over 50% of all targets were covered with ≥20 reads. We identified up to 14,284 SNPs and small indels per individual exome, with up to 1,679 of these representing putative novel polymorphisms. Applying the conservative genotype calling approach HCDiff, the average rate of detection of a variant allele based on Illumina 1 M BeadChips genotypes was 95.2% at ≥10x sequence. Further, we propose an advantageous genotype calling strategy for low covered targets that empirically determines cut-off thresholds at a given coverage depth based on existing genotype data. Application of this method was able to detect >99% of SNPs covered ≥8x. Our results offer guidance for "real-world" applications in human genetics and provide further evidence that microarray-based exome capture is an efficient and reliable method to enrich for chromosomal regions of interest in next-generation sequencing experiments.
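
    Coverage figures of the kind reported above (share of targeted bases at or above a given read depth) come from a simple per-base tally. The following is a minimal sketch with made-up depths; fraction_covered and toy_depths are hypothetical names, and the numbers are not data from the study.

```python
def fraction_covered(depths, threshold):
    """Fraction of positions whose read depth is at least `threshold`."""
    if not depths:
        raise ValueError("depths must not be empty")
    return sum(d >= threshold for d in depths) / len(depths)


# Made-up per-base read depths over a tiny target region.
toy_depths = [0, 2, 3, 5, 9, 10, 12, 20, 25, 40]

for threshold in (3, 10, 20):
    print(threshold, fraction_covered(toy_depths, threshold))
# 3 0.8
# 10 0.5
# 20 0.3
```

    Note that the abstract's figure for a depth of 20 is stated per target rather than per base, so reproducing it would require aggregating depths within each target first.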

  17. A Selective Surface-Enhanced Raman Scattering Sensor for Mercury(II) Based on a Porous Polymer Material and the Target-Mediated Displacement of a T-Rich Strand

    NASA Astrophysics Data System (ADS)

    Kang, Y.; Zhang, L.; Zhang, H.; Wu, T.; Du, Y.

    2017-05-01

    A sensitive and selective surface-enhanced Raman scattering (SERS) sensor for mercury(II) was fabricated based on the target-mediated displacement of a T-rich oligonucleotide strand. A DNA/aptamer duplex was prepared by the hybridization between a tetramethylrhodamine(TMR)-labeled thymine(T)-rich Hg2+-specific aptamer (denoted as TMR-aptamer) and a thiolated adenine-rich capturing DNA. The duplex can be immobilized onto the SERS substrate of the Ag-moiety modified glycidyl methacrylate-ethylene dimethacrylate (denoted as Ag-GMA-EDMA) via self-assembly by the thiol anchor, in which the TMR-aptamer exists in a double-stranded chain. In this case, the label of the TMR moiety approaches the substrate surface and produces a strong SERS signal. Upon the addition of the target, a pair of TMR-aptamers could cooperatively coordinate with Hg2+ to form a stable duplex-like structure mediated by the T-Hg2+-T complex between two adjacent strands, which triggers the release of the TMR-aptamer from the SERS substrate surface, thus drawing the TMR tags away from the substrate with a significant decrease in the SERS signal. This optical sensor shows a sensitive response to Hg2+ in a concentration from 5 nM to 2.0 μM with a detection limit of 2.5 nM. The prepared sensor is negligibly responsive to other metal ions, can be easily regenerated, and shows good performance in real sample analysis.

  18. Identification of Direct Target Genes Using Joint Sequence and Expression Likelihood with Application to DAF-16

    PubMed Central

    Yu, Ron X.; Liu, Jie; True, Nick; Wang, Wei

    2008-01-01

    A major challenge in the post-genome era is to reconstruct regulatory networks from the biological knowledge accumulated up to date. The development of tools for identifying direct target genes of transcription factors (TFs) is critical to this endeavor. Given a set of microarray experiments, a probabilistic model called TRANSMODIS has been developed which can infer the direct targets of a TF by integrating sequence motif, gene expression and ChIP-chip data. The performance of TRANSMODIS was first validated on a set of transcription factor perturbation experiments (TFPEs) involving Pho4p, a well studied TF in Saccharomyces cerevisiae. TRANSMODIS removed elements of arbitrariness in manual target gene selection process and produced results that concur with one's intuition. TRANSMODIS was further validated on a genome-wide scale by comparing it with two other methods in Saccharomyces cerevisiae. The usefulness of TRANSMODIS was then demonstrated by applying it to the identification of direct targets of DAF-16, a critical TF regulating ageing in Caenorhabditis elegans. We found that 189 genes were tightly regulated by DAF-16. In addition, DAF-16 has differential preference for motifs when acting as an activator or repressor, which awaits experimental verification. TRANSMODIS is computationally efficient and robust, making it a useful probabilistic framework for finding immediate targets. PMID:18350157

  19. Fine tuning cellular recognition: The function of the leucine rich repeat (LRR) trans-membrane protein, LRT, in muscle targeting to tendon cells.

    PubMed

    Gilsohn, Eli; Volk, Talila

    2010-01-01

    The formation of complex tissues during embryonic development is often accompanied by directed cellular migration towards a target tissue. Specific mutual recognition between the migrating cell and its target tissue leads to the arrest of the cell migratory behavior and subsequent contact formation between the two interacting cell types. Recent studies implicated a novel family of surface proteins containing a trans-membrane domain and single leucine-rich repeat (LRR) domain in inter-cellular recognition and the arrest of cell migration. Here, we describe the involvement of a novel LRR surface protein, LRT, in targeting migrating muscles towards their corresponding tendon cells in the Drosophila embryo. LRT is specifically expressed by the target tendon cells and is essential for arresting the migratory behavior of the muscle cells. Additional studies in Drosophila S2 cultured cells suggest that LRT forms a protein complex with the Roundabout (Robo) receptor, essential for guiding muscles towards their tendon partners. Genetic analysis supports a model in which LRT performs its activity non-autonomously through its interaction with the Robo receptors expressed on the muscle surfaces. These results suggest a novel mechanism of intercellular recognition through interactions between LRR family members and Robo receptors.

  20. RNA sequencing and pathway analysis identify tumor necrosis factor alpha driven small proline-rich protein dysregulation in chronic rhinosinusitis.

    PubMed

    Ramakrishnan, Vijay R; Gonzalez, Joseph R; Cooper, Sarah E; Barham, Henry P; Anderson, Catherine B; Larson, Eric D; Cool, Carlyne D; Diller, John D; Jones, Kenneth; Kinnamon, Sue C

    2017-09-01

    Chronic rhinosinusitis (CRS) is a heterogeneous inflammatory disorder in which many pathways contribute to end-organ disease. Small proline-rich proteins (SPRR) are polypeptides that have recently been shown to contribute to epithelial biomechanical properties relevant in T-helper type 2 inflammation. There is evidence that genetic polymorphism in SPRR genes may predict the development of asthma in children with atopy and, correlatively, that expression of SPRRs is increased under allergic conditions, which leads to epithelial barrier dysfunction in atopic disease. RNAs from uncinate tissue specimens from patients with CRS and control subjects were compared by RNA sequencing by using Ingenuity Pathway Analysis (n = 4 each), and quantitative polymerase chain reaction (PCR) (n = 15). A separate cohort of archived sinus tissue was examined by immunohistochemistry (n = 19). A statistically significant increase of SPRR expression in CRS sinus tissue was identified that was not a result of atopic presence. SPRR1 and SPRR2A expressions were markedly increased in patients with CRS (p < 0.01) on RNA sequencing, with confirmation by using real-time PCR. Immunohistochemistry of archived surgical samples demonstrated staining of SPRR proteins within squamous epithelium of both groups. Pathway analysis indicated tumor necrosis factor (TNF) alpha as a master regulator of the SPRR gene products. Expression of SPRR1 and of SPRR2A is increased in mucosal samples from patients with CRS and appeared as a downstream result of TNF alpha modulation, which possibly resulted in epithelial barrier dysfunction.

  1. Mesoarchean Banded Iron Formation sequences in Dixon Island-Cleaverville Formation, Pilbara Australia: Oxygenic signal from DXCL project

    NASA Astrophysics Data System (ADS)

    Kiyokawa, S.; Ito, T.; Ikehara, M.; Yamaguchi, K. E.; Naraoka, H.; Onoue, T.; Horie, K.; Sakamoto, R.; Aihara, Y.; Miki, T.

    2013-12-01

    The 3.2-3.1 Ga Dixon Island and Cleaverville formations contain a well-preserved Banded Iron Formation (BIF) within a hydrothermal oceanic sequence deposited in an oceanic island arc setting (Kiyokawa et al., 2002, 2006, 2012). The stratigraphy of the Dixon Island (3195±15 Ma) and Cleaverville (3108±13 Ma) formations preserves a record of environmental conditions on the Mesoarchean ocean floor. These formations consist of volcano-sedimentary sequences with hydrothermal chert, black shale and banded iron formation toward the top. Scientific drilling by the DXCL project in 2007 and 2011 clarified the detailed lithology of the BIF sequence. Four holes were drilled at coastal sites, listed here from stratigraphic bottom to top: the DX site (100 m) in the Dixon Island Formation, and the CL2 (40 m), CL1 (60 m) and CL3 (200 m) sites in the Cleaverville Formation. Coarsening- and thickening-upward black shale-BIF sequences are well preserved in the core samples. The Dixon Island Formation consists of komatiite-rhyolite sequences with many hydrothermal veins, overlain by very finely laminated cherty rocks. The Cleaverville Formation contains black shale, fragment-bearing pyroclastic beds, white chert, greenish shale and BIF. The CL3 core, which was drilled through the BIF, shows siderite-chert beds above the black shale, preceding the magnetite-laminated beds. U-Pb SHRIMP dating gives 3195±15 Ma for the tuff in the lower Dixon Island Formation and 3108±13 Ma for the pyroclastic sequence below the Cleaverville BIF. The sedimentation rate of these sequences is 2-8 cm per 1000 years. Throughout the whole section, the organic-carbon-rich black shales below the BIF have similar organic content and 13C isotope values (around -30 per mil). There is a very weak sulfur MIF signal (less than 0.2%) in these black shale sequences. Our results show that thick organic-rich sediments may have triggered the formation of iron-rich siderite and magnetite beds. The stratigraphy of this sequence closely resembles other Iron

  2. Side chain-side chain interactions of arginine with tyrosine and aspartic acid in Arg/Gly/Tyr-rich domains within plant glycine-rich RNA binding proteins.

    PubMed

    Kumaki, Yasuhiro; Nitta, Katsutoshi; Hikichi, Kunio; Matsumoto, Takeshi; Matsushima, Norio

    2004-07-01

    Plant glycine-rich RNA-binding proteins (GRRBPs) contain a glycine-rich region at the C-terminus whose structure is quite unknown. The C-terminal glycine-rich part is interposed with arginine and tyrosine (arginine/glycine/tyrosine (RGY)-rich domain). Comparative sequence analysis of forty-one GRRBPs revealed that the RGY-rich domain contains multiple repeats of Tyr-(Xaa)h-(Arg)k-(Xaa)l, where Xaa is mainly Gly, "k" is 1 or 2, and "h" and "l" range from 0 to 10. Two peptides, 1 (G1G2Y3G4G5G6R7R8D9G10) and 2 (G1G2R3R4D5G6G7Y8G9G10), corresponding to sections of the RGY-rich domain in Zea mays RAB15, were selected for CD and NMR experiments. The CD spectra indicate a unique, positive band near 228 nm in both peptides that has been ascribed to tyrosine residues in ordered structures. The pH titration by NMR revealed that a side chain-side chain interaction, presumably an H-Nepsilon...O=Cgamma hydrogen bonding interaction in the salt bridge, occurs between Arg (i) and Asp (i + 2). 1D GOESY experiments indicated the presence of NOE between the aromatic side chain proton and the arginine side chain proton in the two peptides suggesting strongly that the Arg (i) aromatic side chain interacts directly with the Tyr (i +/- 4 or i +/- 5) side chain.
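
The repeat pattern Tyr-(Xaa)h-(Arg)k-(Xaa)l described above can be sketched as a regular-expression scan. This is only an illustration, not the authors' analysis: it assumes that Xaa stands for any residue other than Arg or Tyr, and it ignores the trailing (Xaa)l spacer because l may be 0. The function name `find_rgy_units` is hypothetical.

```python
import re

# Assumption (not from the abstract): Xaa is any residue except Arg (R) or Tyr (Y).
# h ranges from 0 to 10 and k is 1 or 2; the lookahead rejects runs of three or more Arg.
RGY_UNIT = re.compile(r"Y(?P<h>[^RY]{0,10})(?P<k>R{1,2})(?!R)")


def find_rgy_units(seq):
    """Return (tyr_position, h, k) for each Tyr-(Xaa)h-(Arg)k unit; positions are 1-based."""
    return [
        (m.start() + 1, len(m.group("h")), len(m.group("k")))
        for m in RGY_UNIT.finditer(seq.upper())
    ]


# The two peptides quoted in the abstract, in one-letter code.
peptide_1 = "GGYGGGRRDG"
peptide_2 = "GGRRDGGYGG"

print(find_rgy_units(peptide_1))
print(find_rgy_units(peptide_2))
```

Under these assumptions peptide 1 contains one unit starting at Tyr3, while peptide 2 on its own contains none, because its arginines precede the tyrosine.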

  3. SSMART: Sequence-structure motif identification for RNA-binding proteins.

    PubMed

    Munteanu, Alina; Mukherjee, Neelanjan; Ohler, Uwe

    2018-06-11

    RNA-binding proteins (RBPs) regulate every aspect of RNA metabolism and function. There are hundreds of RBPs encoded in the eukaryotic genomes, and each recognizes its RNA targets through a specific mixture of RNA sequence and structure properties. For most RBPs, however, only a primary sequence motif has been determined, while the structure of the binding sites is uncharacterized. We developed SSMART, an RNA motif finder that simultaneously models the primary sequence and the structural properties of the RNA target sites. The sequence-structure motifs are represented as consensus strings over a degenerate alphabet, extending the IUPAC codes for nucleotides to account for secondary structure preferences. Evaluation on synthetic data showed that SSMART is able to recover both sequence and structure motifs implanted into 3'UTR-like sequences, for various degrees of structured/unstructured binding sites. In addition, we successfully used SSMART on high-throughput in vivo and in vitro data, showing that we not only recover the known sequence motif, but also gain insight into the structural preferences of the RBP. Availability: SSMART is freely available at https://ohlerlab.mdc-berlin.de/software/SSMART_137/. Supplementary data are available at Bioinformatics online.
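
To make the idea of a consensus string over a degenerate alphabet concrete, the sketch below matches a consensus written in the standard IUPAC nucleotide codes against an RNA sequence. It covers only the sequence half of the representation: the abstract does not specify SSMART's structure-aware extension of the alphabet, so none is modeled here. The names `consensus_to_regex` and `find_sites` and the example sequence are hypothetical.

```python
import re

# Standard IUPAC nucleotide codes, written for RNA (U instead of T).
IUPAC_RNA = {
    "A": "A", "C": "C", "G": "G", "U": "U",
    "R": "AG", "Y": "CU", "S": "CG", "W": "AU",
    "K": "GU", "M": "AC", "B": "CGU", "D": "AGU",
    "H": "ACU", "V": "ACG", "N": "ACGU",
}


def consensus_to_regex(consensus):
    """Translate an IUPAC consensus string into a regular-expression pattern string."""
    return "".join("[" + IUPAC_RNA[symbol] + "]" for symbol in consensus.upper())


def find_sites(consensus, rna):
    """Return 0-based start positions of all (possibly overlapping) consensus matches."""
    # Wrapping the pattern in a lookahead lets overlapping sites be reported.
    lookahead = "(?=" + consensus_to_regex(consensus) + ")"
    return [m.start() for m in re.finditer(lookahead, rna.upper())]


# Hypothetical example: a degenerate 8-mer searched in a made-up RNA fragment.
print(find_sites("UGUANAUA", "CCUGUAAAUAGGUGUACAUA"))
```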

  4. Comparison of simple sequence repeats in 19 Archaea.

    PubMed

    Trivedi, S

    2006-12-05

    All organisms that have been studied until now have been found to have differential distribution of simple sequence repeats (SSRs), with more SSRs in intergenic than in coding sequences. SSR distribution was investigated in Archaea genomes where complete chromosome sequences of 19 Archaea were analyzed with the program SPUTNIK to find di- to penta-nucleotide repeats. The number of repeats was determined for the complete chromosome sequences and for the coding and non-coding sequences. Different from what has been found for other groups of organisms, there is an abundance of SSRs in coding regions of the genome of some Archaea. Dinucleotide repeats were rare and CG repeats were found in only two Archaea. In general, trinucleotide repeats are the most abundant SSR motifs; however, pentanucleotide repeats are abundant in some Archaea. Some of the tetranucleotide and pentanucleotide repeat motifs are organism specific. In general, repeats are short and CG-rich repeats are present in Archaea having a CG-rich genome. Among the 19 Archaea, SSR density was not correlated with genome size or with optimum growth temperature. Pentanucleotide density had an inverse correlation with the CG content of the genome.
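
A search for di- to penta-nucleotide repeats of the kind described above can be approximated with a backreference regular expression. This sketch is not the SPUTNIK algorithm, which the abstract does not describe; it finds only perfect tandem repeats, and the minimum of three repeat units is an assumed threshold. The function name `find_ssrs` is hypothetical.

```python
import re


def find_ssrs(seq, min_unit=2, max_unit=5, min_repeats=3):
    """Return (start, unit, repeat_count) for perfect tandem repeats; start is 0-based."""
    seq = seq.upper()
    hits = []
    for unit_len in range(min_unit, max_unit + 1):
        # A unit of unit_len bases followed by at least (min_repeats - 1) more copies.
        pattern = re.compile(r"([ACGT]{%d})\1{%d,}" % (unit_len, min_repeats - 1))
        for m in pattern.finditer(seq):
            unit = m.group(1)
            # Skip units that are themselves repeats of a shorter motif
            # (for example "AA", or "CGCG" which is really a CG repeat).
            if any(
                unit == unit[:d] * (unit_len // d)
                for d in range(1, unit_len)
                if unit_len % d == 0
            ):
                continue
            hits.append((m.start(), unit, len(m.group(0)) // unit_len))
    return hits


# Made-up sequence containing four tandem copies of CG.
print(find_ssrs("TTCGCGCGCGAA"))
```

Running the same scan separately on coding and non-coding regions would give the per-region repeat counts that the abstract compares.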

  5. Targeted next generation sequencing of well-differentiated/dedifferentiated liposarcoma reveals novel gene amplifications and mutations

    PubMed Central

    Somaiah, Neeta; Beird, Hannah C; Barbo, Andrea; Song, Juhee; Mills Shaw, Kenna R.; Wang, Wei-Lien; Eterovic, Karina; Chen, Ken; Lazar, Alexander; Conley, Anthony P.; Ravi, Vinod; Hwu, Patrick; Futreal, Andrew; Simon, George; Meric-Bernstam, Funda; Hong, David

    2018-01-01

    Well-differentiated/dedifferentiated liposarcoma is a common soft tissue sarcoma with approximately 1500 new cases per year. Surgery is the mainstay of treatment but recurrences are frequent and systemic options are limited. ‘Tumor genotyping’ is becoming more common in clinical practice as it offers the hope of personalized targeted therapy. We wanted to evaluate the results and the clinical utility of available next-generation sequencing panels in WD/DD liposarcoma. Patients who had their tumor sequenced by either FoundationOne (n = 13) or the institutional T200/T200.1 panels (n = 7) were included in this study. Significant copy number alterations were identified, but mutations were infrequent. Out of the 27 mutations detected in 7 samples, 8 (CTNNB1, MECOM, ZNF536, EGFR, EML4, CSMD3, PBRM1, PPP1R3A) were identified as deleterious (on Condel, PolyPhen and SIFT) and a truncating mutation was found in NF2. Of these, EGFR and NF2 are potential driver mutations and have not been reported previously in liposarcoma. MDM2 and CDK4 amplification was universally present in all the tested samples and multiple other recurrent genes with high amplification or high deletion were detected. Many of these targets are potentially actionable. Eight patients went on to receive an MDM2 inhibitor with a median time to progression of 23 months (95% CI: 10-83 months). PMID:29731991

  6. Targeted exon sequencing in Usher syndrome type I.

    PubMed

    Bujakowska, Kinga M; Consugar, Mark; Place, Emily; Harper, Shyana; Lena, Jaclyn; Taub, Daniel G; White, Joseph; Navarro-Gomez, Daniel; Weigel DiFranco, Carol; Farkas, Michael H; Gai, Xiaowu; Berson, Eliot L; Pierce, Eric A

    2014-12-02

    Patients with Usher syndrome type I (USH1) have retinitis pigmentosa, profound congenital hearing loss, and vestibular ataxia. This syndrome is currently thought to be associated with at least six genes, which are encoded by over 180 exons. Here, we present the use of state-of-the-art techniques in the molecular diagnosis of a cohort of 47 USH1 probands. The cohort was studied with selective exon capture and next-generation sequencing of currently known inherited retinal degeneration genes, comparative genomic hybridization, and Sanger sequencing of new USH1 exons identified by human retinal transcriptome analysis. With this approach, we were able to genetically solve 14 of the 47 probands by confirming the biallelic inheritance of mutations. We detected two likely pathogenic variants in an additional 19 patients, for whom family members were not available for cosegregation analysis to confirm biallelic inheritance. Ten patients, in addition to primary disease-causing mutations, carried rare likely pathogenic USH1 alleles or variants in other genes associated with deaf-blindness, which may influence disease phenotype. Twenty-one of the identified mutations were novel among the 33 definite or likely solved patients. Here, we also present a clinical description of the studied cohort at their initial visits. We found a remarkable genetic heterogeneity in the studied USH1 cohort with multiplicity of mutations, of which many were novel. No obvious influence of genotype on phenotype was found, possibly due to small sample sizes of the genotypes under study. Copyright 2014 The Association for Research in Vision and Ophthalmology, Inc.

  7. Targeted Exon Sequencing in Usher Syndrome Type I

    PubMed Central

    Bujakowska, Kinga M.; Consugar, Mark; Place, Emily; Harper, Shyana; Lena, Jaclyn; Taub, Daniel G.; White, Joseph; Navarro-Gomez, Daniel; Weigel DiFranco, Carol; Farkas, Michael H.; Gai, Xiaowu; Berson, Eliot L.; Pierce, Eric A.

    2014-01-01

    Purpose. Patients with Usher syndrome type I (USH1) have retinitis pigmentosa, profound congenital hearing loss, and vestibular ataxia. This syndrome is currently thought to be associated with at least six genes, which are encoded by over 180 exons. Here, we present the use of state-of-the-art techniques in the molecular diagnosis of a cohort of 47 USH1 probands. Methods. The cohort was studied with selective exon capture and next-generation sequencing of currently known inherited retinal degeneration genes, comparative genomic hybridization, and Sanger sequencing of new USH1 exons identified by human retinal transcriptome analysis. Results. With this approach, we were able to genetically solve 14 of the 47 probands by confirming the biallelic inheritance of mutations. We detected two likely pathogenic variants in an additional 19 patients, for whom family members were not available for cosegregation analysis to confirm biallelic inheritance. Ten patients, in addition to primary disease–causing mutations, carried rare likely pathogenic USH1 alleles or variants in other genes associated with deaf-blindness, which may influence disease phenotype. Twenty-one of the identified mutations were novel among the 33 definite or likely solved patients. Here, we also present a clinical description of the studied cohort at their initial visits. Conclusions. We found a remarkable genetic heterogeneity in the studied USH1 cohort with multiplicity of mutations, of which many were novel. No obvious influence of genotype on phenotype was found, possibly due to small sample sizes of the genotypes under study. PMID:25468891

  8. Isolation of nucleotide binding site-leucine rich repeat and kinase resistance gene analogues from sugarcane (Saccharum spp.).

    PubMed

    Glynn, Neil C; Comstock, Jack C; Sood, Sushma G; Dang, Phat M; Chaparro, Jose X

    2008-01-01

    Resistance gene analogues (RGAs) have been isolated from many crops and offer potential in breeding for disease resistance through marker-assisted selection, either as closely linked or as perfect markers. Many R-gene sequences contain kinase domains, and indeed kinase genes have been reported as being proximal to R-genes, making kinase analogues an additionally promising target. The first step towards utilizing RGAs as markers for disease resistance is isolation and characterization of the sequences. Sugarcane clone US01-1158 was identified as resistant to yellow leaf caused by the sugarcane yellow leaf virus (SCYLV) and moderately resistant to rust caused by Puccinia melanocephala Sydow & Sydow. Degenerate primers that had previously proved useful for isolating RGAs and kinase analogues in wheat and soybean were used to amplify DNA from sugarcane (Saccharum spp.) clone US-01-1158. Sequences generated from 1512 positive clones were assembled into 134 contigs of between two and 105 sequences. Comparison of the contig consensuses with the NCBI sequence database using BLASTx showed that 20 had sequence homology to nucleotide binding site and leucine rich repeat (NBS-LRR) RGAs, and eight to kinase genes. Alignment of the deduced amino acid sequences with similar sequences from the NCBI database allowed the identification of several conserved domains. The alignment and resulting phenetic tree showed that many of the sequences had greater similarity to sequences from other species than to one another. The use of degenerate primers is a useful method for isolating novel sugarcane RGA and kinase gene analogues. Further studies are needed to evaluate the role of these genes in disease resistance.

  9. New insights into the paleolake sequence of Baumkirchen (Austria): multiple lake phases and a minor ice advance during MIS 4?

    NASA Astrophysics Data System (ADS)

    Barrett, Samuel; Starnberger, Reinhard; Spötl, Christoph; Brauer, Achim; Tjallingii, Rik; Dulski, Peter; Abfalterer, Christof

    2015-04-01

    The sequence of pre-LGM lacustrine sediments at Baumkirchen (Austria) provides a key record in Alpine Quaternary stratigraphy. These sediments from within the boundary of the Alps potentially provide unique insights into the regional paleoclimate. Recent drilling revealed at least ~250m (the base was not reached) of almost entirely mm- to cm-scale lacustrine sediments. The laminated sediments are comprised of alternations between clayey silt and event layers of medium silt to fine sand. The sequence is interrupted only by a short section of gravel supported in an unlaminated clay-rich matrix. Optically stimulated luminescence dating identifies two distinct sequences: the upper sequence spanning mid-late Marine Isotope Stage (MIS) 3 (~33 to ~45 ka BP), agreeing with existing calibrated radiocarbon ages, and the lower section dating to MIS 4 (~59 to ~73 ka BP). Whether the hiatus is an erosional unconformity, or if the sequences represent two separate lake phases is unclear. Although the precise location of the hiatus is hard to identify, the gravel-rich section lies at the very top of the lower sequence. Pebbles in these gravels are largely angular and contain a significant proportion of non-local, regional lithologies. Such gravels are absent in the remainder of the entire 250 m-thick sequence and hence suggest a unique event rather than e.g. an interfingering local delta gravel foresets with the basin sediments. The gravels are therefore likely to be ice-rafted debris from icebergs from nearby glaciers calving into the lake. This therefore represents the first sedimentological evidence of a MIS 4 ice advance in the Eastern Alps. X-ray fluorescence analysis (ITRAX core scanning) of event layers indicates a strong change in the geochemical composition from generally K, Zr and Ti-rich layers in the upper sequence to mainly Ca and/or Si-rich layers in the lower sequence. X-ray diffraction analysis shows the Ca and Si signals to be controlled by carbonate (both calcite

  10. Chemical profiles along olivine crystallographic axes: a record of the melt-rock interaction sequence forming Hole U1309D Olivine-rich troctolites (Atlantis Massif, MAR, 30°N)

    NASA Astrophysics Data System (ADS)

    Ferrando, Carlotta; Godard, Marguerite; Ildefonse, Benoit; Rampone, Elisabetta

    2017-04-01

    The gabbroic section drilled at IODP Hole U1309D (Mid-Atlantic Ridge, IODP Expeditions 304, 305) comprises a whole range of modes from primitive olivine-rich troctolites to evolved gabbros. These series occur as discrete alternating intervals of variable composition and thickness at different depths. High MgO contents and a relatively large proportion of olivine-rich lithologies (up to 90% modal olivine) characterize this gabbroic section. Contacts between olivine-rich troctolites and neighboring coarse grained olivine gabbros are sharp, with the exception of the contacts between olivine-rich intervals and cross-cutting gabbroic veins, which are diffuse and characterized by progressive variations in plagioclase content. Olivine-rich troctolites are heterogeneously distributed along the borehole and show variable modal composition: centimeter to decimeter scale dunitic (90% olivine), troctolitic (enriched in plagioclase) and wehrlitic (enriched in clinopyroxene) domains were identified. Previous in-situ trace element geochemistry and crystallographic preferred orientation measurements of olivine-rich troctolites indicated that they record extensive melt impregnation of pre-existing olivine-rich material, either mantle rocks or dunitic cumulate. We performed a detailed multi-scale petro-structural and geochemical study on selected samples of well-preserved olivine-rich troctolites with the aim to unravel the sequence of re-equilibration processes and better constrain the local conditions driving the formation of these rocks. Processed EBSD maps show variable textures at single sample scale. All identified domains are characterized by coarse grained and deformed olivines, and small rounded undeformed olivines. Coarse grained and small rounded olivines have the same major and trace element compositions. Small olivines are interpreted as relicts after dissolution of coarse grained olivines. Clinopyroxene, plagioclase, and minor orthopyroxene are present as interstitial

  11. Targeted next generation sequencing of parotid gland cancer uncovers genetic heterogeneity.

    PubMed

    Grünewald, Inga; Vollbrecht, Claudia; Meinrath, Jeannine; Meyer, Moritz F; Heukamp, Lukas C; Drebber, Uta; Quaas, Alexander; Beutner, Dirk; Hüttenbrink, Karl-Bernd; Wardelmann, Eva; Hartmann, Wolfgang; Büttner, Reinhard; Odenthal, Margarete; Stenner, Markus

    2015-07-20

    Salivary gland cancer represents a heterogeneous group of malignant tumors. Due to their low incidence and the existence of multiple morphologically defined subtypes, these tumors are still poorly understood with regard to their molecular pathogenesis and therapeutically relevant genetic alterations. Performing a systematic and comprehensive study covering 13 subtypes of salivary gland cancer, next generation sequencing was done on 84 tissue samples of parotid gland cancer using multiplex PCR for enrichment of cancer related gene loci covering hotspots of 46 cancer genes. Mutations were identified in 22 different genes. The most frequent alterations affected TP53, followed by RAS genes, PIK3CA, SMAD4 and members of the ERB family. HRAS mutations accounted for more than 90% of RAS mutations, occurring especially in epithelial-myoepithelial carcinomas and salivary duct carcinomas. Additional mutations in PIK3CA also affected particularly epithelial-myoepithelial carcinomas and salivary duct carcinomas, occurring simultaneously with HRAS mutations in almost all cases, pointing to an unknown and therapeutically relevant molecular constellation. Interestingly, 14% of tumors revealed mutations in surface growth factor receptor genes including ALK, HER2, ERBB4, FGFR, cMET and RET, which might prove to be targetable by new therapeutic agents. 6% of tumors revealed mutations in SMAD4. In summary, our data provide novel insight into the fundamental molecular heterogeneity of salivary gland cancer, relevant in terms of tumor classification and the establishment of targeted therapeutic concepts.

  12. Solid phase sequencing of double-stranded nucleic acids

    DOEpatents

    Fu, Dong-Jing; Cantor, Charles R.; Koster, Hubert; Smith, Cassandra L.

    2002-01-01

    This invention relates to methods for detecting and sequencing of target double-stranded nucleic acid sequences, to nucleic acid probes and arrays of probes useful in these methods, and to kits and systems which contain these probes. Useful methods involve hybridizing the nucleic acids or nucleic acids which represent complementary or homologous sequences of the target to an array of nucleic acid probes. These probes comprise a single-stranded portion, an optional double-stranded portion and a variable sequence within the single-stranded portion. The molecular weights of the hybridized nucleic acids of the set can be determined by mass spectroscopy, and the sequence of the target determined from the molecular weights of the fragments. Nucleic acids whose sequences can be determined include nucleic acids in biological samples such as patient biopsies and environmental samples. Probes may be fixed to a solid support such as a hybridization chip to facilitate automated determination of molecular weights and identification of the target sequence.
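
One generic way a sequence can be read from fragment molecular weights is a mass ladder: if the fragments form a nested set that differ by one nucleotide each, every mass difference identifies a base. The sketch below illustrates that general principle only; it is not taken from the patent, whose exact procedure the abstract does not give. The residue masses are commonly quoted average values for nucleotides within a DNA strand, and the function name `read_ladder`, the tolerance and the starting mass are hypothetical.

```python
# Commonly quoted average masses (Da) of nucleotide residues within a DNA strand.
RESIDUE_MASS = {"A": 313.21, "C": 289.18, "G": 329.21, "T": 304.20}


def read_ladder(masses, tolerance=0.5):
    """Infer the bases added along a nested fragment ladder from successive mass differences."""
    ordered = sorted(masses)
    bases = []
    for lighter, heavier in zip(ordered, ordered[1:]):
        delta = heavier - lighter
        # Pick the residue whose mass is closest to the observed difference.
        base = min(RESIDUE_MASS, key=lambda b: abs(RESIDUE_MASS[b] - delta))
        if abs(RESIDUE_MASS[base] - delta) > tolerance:
            raise ValueError("mass difference %.2f matches no single nucleotide" % delta)
        bases.append(base)
    return "".join(bases)


# Hypothetical ladder: an arbitrary starting fragment of 3000.0 Da extended by G, A, T, C.
ladder = [3000.0]
for base in "GATC":
    ladder.append(ladder[-1] + RESIDUE_MASS[base])

print(read_ladder(ladder))
```

The smallest gap between two residue masses here is about 9 Da (A versus T), which is why the assumed tolerance of 0.5 Da is enough to keep the assignment unambiguous in this toy case.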

  13. Combined Targeted DNA Sequencing in Non-Small Cell Lung Cancer (NSCLC) Using UNCseq and NGScopy, and RNA Sequencing Using UNCqeR for the Detection of Genetic Aberrations in NSCLC

    PubMed Central

    Walter, Vonn; Patel, Nirali M.; Eberhard, David A.; Hayward, Michele C.; Salazar, Ashley H.; Jo, Heejoon; Soloway, Matthew G.; Wilkerson, Matthew D.; Parker, Joel S.; Yin, Xiaoying; Zhang, Guosheng; Siegel, Marni B.; Rosson, Gary B.; Earp, H. Shelton; Sharpless, Norman E.; Gulley, Margaret L.; Weck, Karen E.

    2015-01-01

    The recent FDA approval of the MiSeqDx platform provides a unique opportunity to develop targeted next generation sequencing (NGS) panels for human disease, including cancer. We have developed a scalable, targeted panel-based assay termed UNCseq, which involves a NGS panel of over 200 cancer-associated genes and a standardized downstream bioinformatics pipeline for detection of single nucleotide variations (SNV) as well as small insertions and deletions (indel). In addition, we developed a novel algorithm, NGScopy, designed for samples with sparse sequencing coverage to detect large-scale copy number variations (CNV), similar to human SNP Array 6.0 as well as small-scale intragenic CNV. Overall, we applied this assay to 100 snap-frozen lung cancer specimens lacking same-patient germline DNA (07–0120 tissue cohort) and validated our results against Sanger sequencing, SNP Array, and our recently published integrated DNA-seq/RNA-seq assay, UNCqeR, where RNA-seq of same-patient tumor specimens confirmed SNV detected by DNA-seq, if RNA-seq coverage depth was adequate. In addition, we applied the UNCseq assay on an independent lung cancer tumor tissue collection with available same-patient germline DNA (11–1115 tissue cohort) and confirmed mutations using assays performed in a CLIA-certified laboratory. We conclude that UNCseq can identify SNV, indel, and CNV in tumor specimens lacking germline DNA in a cost-efficient fashion. PMID:26076459

  14. A carrot leucine-rich-repeat protein that inhibits ice recrystallization.

    PubMed

    Worrall, D; Elias, L; Ashford, D; Smallwood, M; Sidebottom, C; Lillford, P; Telford, J; Holt, C; Bowles, D

    1998-10-02

    Many organisms adapted to live at subzero temperatures express antifreeze proteins that improve their tolerance to freezing. Although structurally diverse, all antifreeze proteins interact with ice surfaces, depress the freezing temperature of aqueous solutions, and inhibit ice crystal growth. A protein purified from carrot shares these functional features with antifreeze proteins of fish. Expression of the carrot complementary DNA in tobacco resulted in the accumulation of antifreeze activity in the apoplast of plants grown at greenhouse temperatures. The sequence of carrot antifreeze protein is similar to that of polygalacturonase inhibitor proteins and contains leucine-rich repeats.

  15. Targeted exome sequencing for the identification of a protective variant against Internet gaming disorder at rs2229910 of neurotrophic tyrosine kinase receptor, type 3 (NTRK3): A pilot study

    PubMed Central

    Kim, Jeong-Yu; Jeong, Jo-Eun; Rhee, Je-Keun; Cho, Hyun; Chun, Ji-Won; Kim, Tae-Min; Choi, Sam-Wook; Choi, Jung-Seok; Kim, Dai-Jin

    2016-01-01

    Background and aims Internet gaming disorder (IGD) has gained recognition as a potential new diagnosis in the fifth revision of the Diagnostic and Statistical Manual of Mental Disorders, but genetic evidence supporting this disorder remains scarce. Methods In this study, targeted exome sequencing was conducted in 30 IGD patients and 30 control subjects with a focus on genes linked to various neurotransmitters associated with substance and non-substance addictions, depression, and attention deficit hyperactivity disorder. Results rs2229910 of neurotrophic tyrosine kinase receptor, type 3 (NTRK3) was the only single nucleotide polymorphism (SNP) that exhibited a significantly different minor allele frequency in IGD subjects compared to controls (p = .01932), suggesting that this SNP has a protective effect against IGD (odds ratio = 0.1541). The presence of this potentially protective allele was also associated with less time spent on Internet gaming and lower scores on the Young’s Internet Addiction Test and Korean Internet Addiction Proneness Scale for Adults. Conclusions The results of this first targeted exome sequencing study of IGD subjects indicate that rs2229910 of NTRK3 is a genetic variant that is significantly related to IGD. These findings may have significant implications for future research investigating the genetics of IGD and other behavioral addictions. PMID:27826991
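
An allelic odds ratio such as the one reported above is computed from a 2x2 table of minor and major allele counts in cases and controls. The sketch below shows that standard calculation. The abstract does not give the underlying allele counts, so the numbers used here are hypothetical and are not meant to reproduce the reported odds ratio of 0.1541; the function names are also illustrative.

```python
def minor_allele_frequency(minor_count, major_count):
    """Fraction of all observed alleles that are the minor allele."""
    return minor_count / (minor_count + major_count)


def allelic_odds_ratio(case_minor, case_major, control_minor, control_major):
    """Odds of carrying the minor allele in cases divided by the same odds in controls."""
    return (case_minor * control_major) / (case_major * control_minor)


# Hypothetical counts: 30 cases and 30 controls contribute 60 alleles each.
case_minor, case_major = 4, 56
control_minor, control_major = 12, 48

print(minor_allele_frequency(case_minor, case_major))
print(minor_allele_frequency(control_minor, control_major))
print(allelic_odds_ratio(case_minor, case_major, control_minor, control_major))
```

An odds ratio below 1 means the minor allele is less common among cases than among controls, which is the sense in which the abstract calls the variant protective.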

  16. Pure Perceptual-Based Sequence Learning: A Role for Visuospatial Attention

    ERIC Educational Resources Information Center

    Remillard, Gilbert

    2009-01-01

    Learning the structure of a sequence of target locations when target location is not the response dimension and the sequence of target locations is uncorrelated with the sequence of responses is called pure perceptual-based sequence learning. The paradigm introduced by G. Remillard (2003) was used to determine whether orienting of visuospatial…

  17. Culture-Independent Identification of Periodontitis-Associated Porphyromonas and Tannerella Populations by Targeted Molecular Analysis

    PubMed Central

    de Lillo, A.; Booth, V.; Kyriacou, L.; Weightman, A. J.; Wade, W. G.

    2004-01-01

    Periodontitis is the commonest bacterial disease of humans and is the major cause of adult tooth loss. About half of the oral microflora is unculturable; and 16S rRNA PCR, cloning, and sequencing techniques have demonstrated the high level of species richness of the oral microflora. In the present study, a PCR primer set specific for the genera Porphyromonas and Tannerella was designed and used to analyze the bacterial populations in subgingival plaque samples from inflamed shallow and deep sites in subjects with periodontitis and shallow sites in age- and sex-matched controls. A total of 308 clones were sequenced and found to belong to one of six Porphyromonas or Tannerella species or phylotypes, one of which, Porphyromonas P3, was novel. Tannerella forsythensis was found in significantly higher proportions in patients than in controls. Porphyromonas catoniae and Tannerella phylotype BU063 appeared to be associated with shallow sites. Targeted culture-independent molecular ecology studies have a valuable role to play in the identification of bacterial targets for further investigations of the pathogenesis of bacterial infections. PMID:15583276

  18. Seasonal diversity and dynamics of haptophytes in the Skagerrak, Norway, explored by high-throughput sequencing

    PubMed Central

    Egge, Elianne Sirnæs; Johannessen, Torill Vik; Andersen, Tom; Eikrem, Wenche; Bittner, Lucie; Larsen, Aud; Sandaa, Ruth-Anne; Edvardsen, Bente

    2015-01-01

    Microalgae in the division Haptophyta play key roles in the marine ecosystem and in global biogeochemical processes. Despite their ecological importance, knowledge on seasonal dynamics, community composition and abundance at the species level is limited due to their small cell size and few morphological features visible under the light microscope. Here, we present unique data on haptophyte seasonal diversity and dynamics from two annual cycles, with the taxonomic resolution and sampling depth obtained with high-throughput sequencing. From outer Oslofjorden, S Norway, nano- and picoplanktonic samples were collected monthly for 2 years, and the haptophytes targeted by amplification of RNA/cDNA with Haptophyta-specific 18S rDNA V4 primers. We obtained 156 operational taxonomic units (OTUs), from c. 400,000 454-pyrosequencing reads, after rigorous bioinformatic filtering and clustering at 99.5%. Most OTUs represented uncultured and/or not yet 18S rDNA-sequenced species. Haptophyte OTU richness and community composition exhibited high temporal variation and significant yearly periodicity. Richness was highest in September–October (autumn) and lowest in April–May (spring). Some taxa were detected all year, such as Chrysochromulina simplex, Emiliania huxleyi and Phaeocystis cordata, whereas most calcifying coccolithophores only appeared from summer to early winter. We also revealed the seasonal dynamics of OTUs representing putative novel classes (clades HAP-3–5) or orders (clades D, E, F). Season, light and temperature accounted for 29% of the variation in OTU composition. Residual variation may be related to biotic factors, such as competition and viral infection. This study provides new, in-depth knowledge on seasonal diversity and dynamics of haptophytes in North Atlantic coastal waters. PMID:25893259

  19. Targeted sequencing for high-resolution evolutionary analyses following genome duplication in salmonid fish: Proof of concept for key components of the insulin-like growth factor axis.

    PubMed

    Lappin, Fiona M; Shaw, Rebecca L; Macqueen, Daniel J

    2016-12-01

    High-throughput sequencing has revolutionised comparative and evolutionary genome biology. It has now become relatively commonplace to generate multiple genomes and/or transcriptomes to characterize the evolution of large taxonomic groups of interest. Nevertheless, such efforts may be unsuited to some research questions or remain beyond the scope of some research groups. Here we show that targeted high-throughput sequencing offers a viable alternative to study genome evolution across a vertebrate family of great scientific interest. Specifically, we exploited sequence capture and Illumina sequencing to characterize the evolution of key components from the insulin-like growth factor (IGF) signalling axis of salmonid fish at unprecedented phylogenetic resolution. The IGF axis represents a central governor of vertebrate growth and its core components were expanded by whole genome duplication in the salmonid ancestor ~95Ma. Using RNA baits synthesised to genes encoding the complete family of IGF binding proteins (IGFBP) and an IGF hormone (IGF2), we captured, sequenced and assembled orthologous and paralogous exons from species representing all ten salmonid genera. This approach generated 299 novel sequences, most as complete or near-complete protein-coding sequences. Phylogenetic analyses confirmed congruent evolutionary histories for all nineteen recognized salmonid IGFBP family members and identified novel salmonid-specific IGF2 paralogues. Moreover, we reconstructed the evolution of duplicated IGF axis paralogues across a replete salmonid phylogeny, revealing complex historic selection regimes - both ancestral to salmonids and lineage-restricted - that frequently involved asymmetric paralogue divergence under positive and/or relaxed purifying selection. Our findings add to an emerging literature highlighting diverse applications for targeted sequencing in comparative-evolutionary genomics. We also set out a viable approach to obtain large sets of nuclear genes for any

  20. Chromosomal localization and sequence analysis of a human episomal sequence with in vitro differentiating activity

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Boccaccio, C.; Deshatrette, J.; Meunier-Rotival, M.

    1994-05-01

    The genomic fragment carrying the human activator of liver function, previously described as an episome capable of inducing differentiation upon transfection into a dedifferentiated rat hepatoma cell line, was mapped on human chromosome 12q24.2-12q24.3. This chromosomal location was indistinguishable by in situ hybridization from that of the gene coding for the hepatic transcription factor HNF1. The sequence of the integrated form of the episome as well as its flanking sequences show that it is rich in retroposons. It contains a human ribosomal protein L21 processed pseudogene, one truncated L1Hs sequence, and 10 Alu repeats, which belong to different subfamilies.

  1. Targeted next generation sequencing of the entire vitamin D receptor gene reveals polymorphisms correlated with vitamin D deficiency among older Filipino women with and without fragility fracture.

    PubMed

    Zumaraga, Mark Pretzel; Medina, Paul Julius; Recto, Juan Miguel; Abrahan, Lauro; Azurin, Edelyn; Tanchoco, Celeste C; Jimeno, Cecilia A; Palmes-Saloma, Cynthia

    2017-03-01

    This study aimed to discover genetic variants in the entire 101-kb vitamin D receptor (VDR) gene for vitamin D deficiency in a group of postmenopausal Filipino women using a targeted next generation sequencing (TNGS) approach in a case-control study design. A total of 50 women with and without osteoporotic fracture seen at the Philippine Orthopedic Center were included. Blood samples were collected for determination of serum vitamin D, calcium, phosphorus, glucose, blood urea nitrogen, creatinine, aspartate aminotransferase, alanine aminotransferase and as the primary source for targeted VDR gene sequencing using the Ion Torrent Personal Genome Machine. The variant calling was based on the GATK best practice workflow and annotated using the ANNOVAR tool. A total of 1496 unique variants in the whole 101-kb VDR gene were identified. Novel sequence variations not registered in the dbSNP database were found among cases and controls at a rate of 23.1% and 16.6% of total discovered variants, respectively. One disease-associated enhancer showed a statistically significant association with low serum 25-hydroxy vitamin D levels (Pearson chi-square P-value=0.009). The transcription factor binding site prediction program PROMO predicted the disruption of three transcription factor binding sites in this enhancer region. These findings show the power of TNGS in identifying sequence variations in a very large gene and the surprising results obtained in this study greatly expand the catalog of known VDR sequence variants that may represent an important clue in the emergence of vitamin D deficiency. Such information will also provide the additional guidance necessary toward personalized nutritional advice to reach sufficient vitamin D status. Copyright © 2016 Elsevier Inc. All rights reserved.

  2. Richness in Functional Connectivity Depends on the Neuronal Integrity within the Posterior Cingulate Cortex

    PubMed Central

    Lord, Anton R.; Li, Meng; Demenescu, Liliana R.; van den Meer, Johan; Borchardt, Viola; Krause, Anna Linda; Heinze, Hans-Jochen; Breakspear, Michael; Walter, Martin

    2017-01-01

    The brain's connectivity skeleton, a rich club of strongly interconnected members, was initially shown to exist in human structural networks, but recent evidence suggests a functional counterpart. This rich club typically includes key regions (or hubs) from multiple canonical networks, reducing the cost of inter-network communication. The posterior cingulate cortex (PCC), a hub node embedded within the default mode network, is known to facilitate communication between brain networks and is a key member of the “rich club.” Here, we assessed how metabolic signatures of neuronal integrity and cortical thickness influence the global extent of a functional rich club as measured using the functional rich club coefficient (fRCC). Rich club estimation was performed on functional connectivity of resting state brain signals acquired at 3T in 48 healthy adult subjects. Magnetic resonance spectroscopy was measured in the same session using a point resolved spectroscopy sequence. We confirmed convergence of functional rich club with a previously established structural rich club. N-acetyl aspartate (NAA) in the PCC is significantly correlated with age (p = 0.001), while the rich club coefficient showed no effect of age (p = 0.106). In addition, we found a significant quadratic relationship between fRCC and NAA concentration in PCC (p = 0.009). Furthermore, cortical thinning in the PCC was correlated with a reduced rich club coefficient after accounting for age and NAA. In conclusion, we found that the fRCC is related to a marker of neuronal integrity in a key region of the cingulate cortex. Furthermore, cortical thinning in the same area was observed, suggesting that both cortical thinning and neuronal integrity in the hub regions influence functional integration at a whole-brain level. PMID:28439224
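    For readers unfamiliar with the measure, the following is a minimal sketch of the standard unweighted rich-club coefficient, phi(k) = 2E/(N(N-1)), where N is the number of nodes with degree greater than k and E is the number of edges among them. It is not the authors' fRCC pipeline, which operates on functional connectivity data; the function name and the toy graph are hypothetical.

```python
def rich_club_coefficient(edges, k):
    """Unweighted rich-club coefficient phi(k) of an undirected graph.

    edges: iterable of (u, v) node pairs (hypothetical toy input).
    Returns None when fewer than two nodes have degree > k.
    """
    # Build adjacency sets, ignoring self-loops and duplicate edges.
    neighbors = {}
    for u, v in edges:
        if u == v:
            continue
        neighbors.setdefault(u, set()).add(v)
        neighbors.setdefault(v, set()).add(u)

    # The "rich" nodes are those with degree strictly greater than k.
    rich = {n for n, nb in neighbors.items() if len(nb) > k}
    n_rich = len(rich)
    if n_rich < 2:
        return None

    # Each edge inside the rich set is seen from both endpoints, so halve.
    e_rich = sum(1 for u in rich for v in neighbors[u] if v in rich) // 2
    return 2 * e_rich / (n_rich * (n_rich - 1))


# Toy graph: a triangle (a, b, c) with one pendant node on a and one on b.
toy_edges = [("a", "b"), ("b", "c"), ("a", "c"), ("a", "d"), ("b", "e")]
phi_0 = rich_club_coefficient(toy_edges, 0)  # all five nodes
phi_1 = rich_club_coefficient(toy_edges, 1)  # only the triangle
```

    In practice the raw coefficient is usually normalised against degree-preserving random networks before interpretation; that step is omitted here.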

  3. Richness in Functional Connectivity Depends on the Neuronal Integrity within the Posterior Cingulate Cortex.

    PubMed

    Lord, Anton R; Li, Meng; Demenescu, Liliana R; van den Meer, Johan; Borchardt, Viola; Krause, Anna Linda; Heinze, Hans-Jochen; Breakspear, Michael; Walter, Martin

    2017-01-01

    The brain's connectivity skeleton, a rich club of strongly interconnected members, was initially shown to exist in human structural networks, but recent evidence suggests a functional counterpart. This rich club typically includes key regions (or hubs) from multiple canonical networks, reducing the cost of inter-network communication. The posterior cingulate cortex (PCC), a hub node embedded within the default mode network, is known to facilitate communication between brain networks and is a key member of the "rich club." Here, we assessed how metabolic signatures of neuronal integrity and cortical thickness influence the global extent of a functional rich club as measured using the functional rich club coefficient (fRCC). Rich club estimation was performed on functional connectivity of resting state brain signals acquired at 3T in 48 healthy adult subjects. Magnetic resonance spectroscopy was measured in the same session using a point resolved spectroscopy sequence. We confirmed convergence of functional rich club with a previously established structural rich club. N-acetyl aspartate (NAA) in the PCC is significantly correlated with age (p = 0.001), while the rich club coefficient showed no effect of age (p = 0.106). In addition, we found a significant quadratic relationship between fRCC and NAA concentration in PCC (p = 0.009). Furthermore, cortical thinning in the PCC was correlated with a reduced rich club coefficient after accounting for age and NAA. In conclusion, we found that the fRCC is related to a marker of neuronal integrity in a key region of the cingulate cortex. Furthermore, cortical thinning in the same area was observed, suggesting that both cortical thinning and neuronal integrity in the hub regions influence functional integration at a whole-brain level.

  4. Drug-administration sequence of target-controlled propofol and remifentanil influences the onset of rocuronium. A double-blind, randomized trial.

    PubMed

    Na, H S; Hwang, J W; Park, S H; Oh, A Y; Park, H P; Jeon, Y T; Do, S H

    2012-05-01

    Remifentanil is known to cause bradycardia and hypotension, as well as decreases in cardiac output (CO). We hypothesized that hemodynamic suppression by remifentanil would affect the onset time of rocuronium. This study investigated whether the onset of rocuronium was influenced by the drug-administration sequence during induction of anesthesia with target-controlled infusion of propofol and remifentanil. Healthy adult patients (n = 126) undergoing elective surgery under general anesthesia were randomized into two groups according to drug-administration sequence. In the Remi-Pro-Rocu group (n = 62), remifentanil was infused first, followed by propofol, and rocuronium was administered last. In the Pro-Rocu-Remi group (n = 64), propofol, rocuronium, and remifentanil were given in that order. As a primary outcome, the onset time of rocuronium was measured. Mean arterial pressure (MAP), heart rate (HR), CO, and stroke volume were recorded before anesthesia (T1), at injection of rocuronium (T2), and immediately before and after intubation (T3 and T4). In the Remi-Pro-Rocu group, the onset of rocuronium was delayed significantly compared with the Pro-Rocu-Remi group [median (interquartile range); 130 (105-150) vs. 90 (71-100) s, P < 0.001]. At the time of rocuronium injection (T2), MAP, HR, and CO were significantly lower in the Remi-Pro-Rocu group than in the Pro-Rocu-Remi group (P < 0.001). The onset time of rocuronium is prolonged significantly by early administration of remifentanil during target-controlled infusion of propofol and remifentanil, and this may be due to the decreased CO caused by remifentanil. © 2012 The Authors. Acta Anaesthesiologica Scandinavica © 2012 The Acta Anaesthesiologica Scandinavica Foundation.

  5. Solid-Phase Synthesis of Difficult Purine-Rich PNAs through Selective Hmb Incorporation: Application to the Total Synthesis of Cell Penetrating Peptide-PNAs

    PubMed Central

    Tailhades, Julien; Takizawa, Hotake; Gait, Michael J.; Wellings, Don A.; Wade, John D.; Aoki, Yoshitsugu; Shabanpoor, Fazel

    2017-01-01

    Antisense oligonucleotide (ASO)-based drug development is gaining significant momentum following the recent FDA approval of Eteplirsen (an ASO based on phosphorodiamidate morpholino) and Spinraza (2′-O-methoxyethyl-phosphorothioate) in late 2016. Their attractiveness is mainly due to the backbone modifications which have improved the in vivo characteristics of oligonucleotide drugs. Another class of ASO, based on peptide nucleic acid (PNA) chemistry, is also gaining popularity as a platform for development of gene-specific therapy for various disorders. However, the chemical synthesis of long PNAs, which are more target-specific, remains an ongoing challenge. Most of the reported methodology for the solid-phase synthesis of PNA suffers from poor coupling efficiency, which limits production to short PNA sequences of less than 15 residues. Here, we have studied the effect of backbone modifications with Hmb (2-hydroxy-4-methoxybenzyl) and Dmb (2,4-dimethoxybenzyl) to ameliorate difficult couplings and reduce “on-resin” aggregation. We first synthesized a library of PNA dimers incorporating either Hmb or Dmb and identified that Hmb is superior to Dmb in terms of its ease of removal. Subsequently, we used Hmb backbone modification to synthesize a 22-mer purine-rich PNA, targeting dystrophin RNA splicing, which could not be synthesized by standard coupling methodology. Hmb backbone modification allowed this difficult PNA to be synthesized and then extended to include a cell-penetrating peptide on the same solid support. This approach provides a novel and straightforward strategy for facile solid-phase synthesis of difficult purine-rich PNA sequences. PMID:29094037

  6. Solid-Phase Synthesis of Difficult Purine-Rich PNAs through Selective Hmb Incorporation: Application to the Total Synthesis of Cell Penetrating Peptide-PNAs.

    PubMed

    Tailhades, Julien; Takizawa, Hotake; Gait, Michael J; Wellings, Don A; Wade, John D; Aoki, Yoshitsugu; Shabanpoor, Fazel

    2017-01-01

    Antisense oligonucleotide (ASO)-based drug development is gaining significant momentum following the recent FDA approval of Eteplirsen (an ASO based on phosphorodiamidate morpholino) and Spinraza (2'-O-methoxyethyl-phosphorothioate) in late 2016. Their attractiveness is mainly due to the backbone modifications which have improved the in vivo characteristics of oligonucleotide drugs. Another class of ASO, based on peptide nucleic acid (PNA) chemistry, is also gaining popularity as a platform for development of gene-specific therapy for various disorders. However, the chemical synthesis of long PNAs, which are more target-specific, remains an ongoing challenge. Most of the reported methodology for the solid-phase synthesis of PNA suffers from poor coupling efficiency, which limits production to short PNA sequences of less than 15 residues. Here, we have studied the effect of backbone modifications with Hmb (2-hydroxy-4-methoxybenzyl) and Dmb (2,4-dimethoxybenzyl) to ameliorate difficult couplings and reduce "on-resin" aggregation. We first synthesized a library of PNA dimers incorporating either Hmb or Dmb and identified that Hmb is superior to Dmb in terms of its ease of removal. Subsequently, we used Hmb backbone modification to synthesize a 22-mer purine-rich PNA, targeting dystrophin RNA splicing, which could not be synthesized by standard coupling methodology. Hmb backbone modification allowed this difficult PNA to be synthesized and then extended to include a cell-penetrating peptide on the same solid support. This approach provides a novel and straightforward strategy for facile solid-phase synthesis of difficult purine-rich PNA sequences.

  7. Solid-phase synthesis of difficult purine-rich PNAs through selective Hmb incorporation: Application to the total synthesis of cell penetrating peptide-PNAs

    NASA Astrophysics Data System (ADS)

    Tailhades, Julien; Takizawa, Hotake; Gait, Michael J.; Wellings, Don A.; Wade, John D.; Aoki, Yoshitsugu; Shabanpoor, Fazel

    2017-10-01

    Antisense oligonucleotide (ASO)-based drug development is gaining significant momentum following the recent FDA approval of Eteplirsen (an ASO based on phosphorodiamidate morpholino) and Spinraza (2’-O-methoxyethyl-phosphorothioate) in late 2016. Their attractiveness is mainly due to the backbone modifications which have improved the in vivo characteristics of oligonucleotide drugs. Another class of ASO, based on peptide nucleic acid (PNA) chemistry, is also gaining popularity as a platform for development of gene-specific therapy for various disorders. However, the chemical synthesis of long PNAs, which are more target-specific, remains an ongoing challenge. Most of the reported methodology for the solid-phase synthesis of PNA suffers from poor coupling efficiency, which limits production to short PNA sequences of less than 15 residues. Here we have studied the effect of backbone modifications with Hmb (2-hydroxy-4-methoxybenzyl) and Dmb (2,4-dimethoxybenzyl) to ameliorate difficult couplings and reduce “on-resin” aggregation. We first synthesized a library of PNA dimers incorporating either Hmb or Dmb and identified that Hmb is superior to Dmb in terms of its ease of removal. Subsequently, we used Hmb backbone modification to synthesize a 22-mer purine-rich PNA, targeting dystrophin RNA splicing, which could not be synthesized by standard coupling methodology. Hmb backbone modification allowed this difficult PNA to be synthesized and then extended to include a cell-penetrating peptide on the same solid support. This approach provides a novel and straightforward strategy for facile solid-phase synthesis of difficult purine-rich PNA sequences.

  8. Phylogenomics from Whole Genome Sequences Using aTRAM.

    PubMed

    Allen, Julie M; Boyd, Bret; Nguyen, Nam-Phuong; Vachaspati, Pranjal; Warnow, Tandy; Huang, Daisie I; Grady, Patrick G S; Bell, Kayce C; Cronk, Quentin C B; Mugisha, Lawrence; Pittendrigh, Barry R; Leonardi, M Soledad; Reed, David L; Johnson, Kevin P

    2017-09-01

    Novel sequencing technologies are rapidly expanding the size of data sets that can be applied to phylogenetic studies. Currently the most commonly used phylogenomic approaches involve some form of genome reduction. While these approaches make assembling phylogenomic data sets more economical for organisms with large genomes, they reduce the genomic coverage and thereby the long-term utility of the data. Currently, for organisms with moderate to small genomes (<1000 Mbp) it is feasible to sequence the entire genome at modest coverage (10-30×). Computational challenges for handling these large data sets can be alleviated by assembling targeted reads, rather than assembling the entire genome, to produce a phylogenomic data matrix. Here we demonstrate the use of automated Target Restricted Assembly Method (aTRAM) to assemble 1107 single-copy ortholog genes from whole genome sequencing of sucking lice (Anoplura) and out-groups. We developed a pipeline to extract exon sequences from the aTRAM assemblies by annotating them with respect to the original target protein. We aligned these protein sequences with the inferred amino acids and then performed phylogenetic analyses on both the concatenated matrix of genes and on each gene separately in a coalescent analysis. Finally, we tested the limits of successful assembly in aTRAM by assembling 100 genes from close- to distantly related taxa at high to low levels of coverage. Both the concatenated analysis and the coalescent-based analysis produced the same tree topology, which was consistent with previously published results and resolved weakly supported nodes. These results demonstrate that this approach is successful at developing phylogenomic data sets from raw genome sequencing reads. Further, we found that with coverages above 5-10×, aTRAM was successful at assembling 80-90% of the contigs for both close and distantly related taxa. As sequencing costs continue to decline, we expect full genome sequencing
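    As a rough illustration of the coverage figures quoted in this abstract, mean sequencing depth is commonly estimated as (number of reads × read length) / genome size. The sketch below applies that textbook relation; the function names, the 100 bp read length and the read counts are assumptions for illustration and are not taken from the aTRAM study.

```python
import math


def mean_coverage(n_reads, read_length_bp, genome_size_bp):
    """Expected mean depth: total sequenced bases divided by genome size."""
    return n_reads * read_length_bp / genome_size_bp


def reads_needed(target_coverage, read_length_bp, genome_size_bp):
    """Smallest whole number of reads that reaches the target mean depth."""
    return math.ceil(target_coverage * genome_size_bp / read_length_bp)


# Hypothetical example: a 1000 Mbp genome sequenced with 100 bp reads.
genome_bp = 1_000_000_000
depth = mean_coverage(100_000_000, 100, genome_bp)   # depth from 1e8 reads
reads_for_10x = reads_needed(10, 100, genome_bp)     # lower end of 10-30x
reads_for_30x = reads_needed(30, 100, genome_bp)     # upper end of 10-30x
```

    This is a mean over the whole genome; actual depth at any given locus varies, which is one reason assembly success is reported as a percentage of contigs rather than as all-or-nothing.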

  9. Efficient strategy for the molecular diagnosis of intellectual disability using targeted high-throughput sequencing.

    PubMed

    Redin, Claire; Gérard, Bénédicte; Lauer, Julia; Herenger, Yvan; Muller, Jean; Quartier, Angélique; Masurel-Paulet, Alice; Willems, Marjolaine; Lesca, Gaétan; El-Chehadeh, Salima; Le Gras, Stéphanie; Vicaire, Serge; Philipps, Muriel; Dumas, Michaël; Geoffroy, Véronique; Feger, Claire; Haumesser, Nicolas; Alembik, Yves; Barth, Magalie; Bonneau, Dominique; Colin, Estelle; Dollfus, Hélène; Doray, Bérénice; Delrue, Marie-Ange; Drouin-Garraud, Valérie; Flori, Elisabeth; Fradin, Mélanie; Francannet, Christine; Goldenberg, Alice; Lumbroso, Serge; Mathieu-Dramard, Michèle; Martin-Coignard, Dominique; Lacombe, Didier; Morin, Gilles; Polge, Anne; Sukno, Sylvie; Thauvin-Robinet, Christel; Thevenon, Julien; Doco-Fenzy, Martine; Genevieve, David; Sarda, Pierre; Edery, Patrick; Isidor, Bertrand; Jost, Bernard; Olivier-Faivre, Laurence; Mandel, Jean-Louis; Piton, Amélie

    2014-11-01

    Intellectual disability (ID) is characterised by an extreme genetic heterogeneity. Several hundred genes have been associated with monogenic forms of ID, considerably complicating molecular diagnostics. Trio-exome sequencing was recently proposed as a diagnostic approach, yet remains costly for a general implementation. We report the alternative strategy of targeted high-throughput sequencing of 217 genes in which mutations had been reported in patients with ID or autism as the major clinical concern. We analysed 106 patients with ID of unknown aetiology following array-CGH analysis and other genetic investigations. Ninety per cent of these patients were males, and 75% sporadic cases. We identified 26 causative mutations: 16 in X-linked genes (ATRX, CUL4B, DMD, FMR1, HCFC1, IL1RAPL1, IQSEC2, KDM5C, MAOA, MECP2, SLC9A6, SLC16A2, PHF8) and 10 de novo in autosomal-dominant genes (DYRK1A, GRIN1, MED13L, TCF4, RAI1, SHANK3, SLC2A1, SYNGAP1). We also detected four possibly causative mutations (eg, in NLGN3) requiring further investigations. We present detailed reasoning for assigning causality for each mutation, and associated patients' clinical information. Some genes were hit more than once in our cohort, suggesting they correspond to more frequent ID-associated conditions (KDM5C, MECP2, DYRK1A, TCF4). We highlight some unexpected genotype to phenotype correlations, with causative mutations being identified in genes associated with defined syndromes in patients deviating from the classic phenotype (DMD, TCF4, MECP2). We also bring additional supportive (HCFC1, MED13L) or unsupportive (SHROOM4, SRPX2) evidence for the implication of previous candidate genes or mutations in cognitive disorders. With a diagnostic yield of 25%, targeted sequencing appears relevant as a first-intention test for the diagnosis of ID, but importantly will also contribute to a better understanding regarding the specific contribution of the many genes implicated in ID and autism. Published by the

  10. Conserved sequences in the current strains of HIV-1 subtype A in Russia are effectively targeted by artificial RNAi in vitro.

    PubMed

    Tchurikov, Nickolai A; Fedoseeva, Daria M; Gashnikova, Natalya M; Sosin, Dmitri V; Gorbacheva, Maria A; Alembekov, Ildar R; Chechetkin, Vladimir R; Kravatsky, Yuri V; Kretova, Olga V

    2016-05-25

    Highly active antiretroviral therapy has greatly reduced the morbidity and mortality of AIDS. However, many of the antiretroviral drugs are toxic with long-term use, and all currently used anti-HIV agents generate drug-resistant mutants. Therefore, there is a great need for new approaches to AIDS therapy. RNAi is a powerful means of inhibiting HIV-1 production in human cells. We propose to use RNAi for gene therapy of HIV/AIDS. Previously we identified a number of new biologically active siRNAs targeting several moderately conserved regions in HIV-1 transcripts. Here we analyze the heterogeneity of nucleotide sequences in three RNAi targets in sequences encoding the reverse transcriptase and integrase domains of current isolates of HIV-1 subtype A in Russia. These data were used to generate genetic constructs expressing short hairpin RNAs 28-30 bp in length that could be processed in cells into siRNAs. After transfection of the constructs we observed siRNAs that efficiently attacked the selected targets. We expect that targeting several viral genes important for HIV-1 reproduction will help overcome the problem of viral adaptation and will prevent the appearance of RNAi escape mutants in current virus strains, an important feature of gene therapy of HIV/AIDS. Copyright © 2016 Elsevier B.V. All rights reserved.

  11. Targeted genotyping-by-sequencing permits cost-effective identification and discrimination of pasture grass species and cultivars.

    PubMed

    Pembleton, Luke W; Drayton, Michelle C; Bain, Melissa; Baillie, Rebecca C; Inch, Courtney; Spangenberg, German C; Wang, Junping; Forster, John W; Cogan, Noel O I

    2016-05-01

    A targeted amplicon-based genotyping-by-sequencing approach has permitted cost-effective and accurate discrimination between ryegrass species (perennial, Italian and inter-species hybrid), and identification of cultivars based on bulked samples. Perennial ryegrass and Italian ryegrass are the most important temperate forage species for global agriculture, and are represented in the commercial pasture seed market by numerous cultivars each composed of multiple highly heterozygous individuals. Previous studies have identified difficulties in the use of morphophysiological criteria to discriminate between these two closely related taxa. Recently, a highly multiplexed single nucleotide polymorphism (SNP)-based genotyping assay has been developed that permits accurate differentiation between both species and cultivars of ryegrasses at the genetic level. This assay has since been further developed into an amplicon-based genotyping-by-sequencing (GBS) approach implemented on a second-generation sequencing platform, allowing accelerated throughput and ca. sixfold reduction in cost. Using the GBS approach, 63 cultivars of perennial, Italian and interspecific hybrid ryegrasses, as well as intergeneric Festulolium hybrids, were genotyped. The genetic relationships between cultivars were interpreted in terms of known breeding histories and indistinct species boundaries within the Lolium genus, as well as suitability of current cultivar registration methodologies. An example of applicability to quality assurance and control (QA/QC) of seed purity is also described. Rapid, low-cost genotypic assays provide new opportunities for breeders to more fully explore genetic diversity within breeding programs, allowing the combination of novel unique genetic backgrounds. Such tools also offer the potential to more accurately define cultivar identities, allowing protection of varieties in the commercial market and supporting processes of cultivar accreditation and quality assurance.

  12. IMM estimator with out-of-sequence measurements

    NASA Astrophysics Data System (ADS)

    Bar-Shalom, Yaakov; Chen, Huimin

    2004-08-01

    In multisensor tracking systems that operate in a centralized information processing architecture, measurements from the same target obtained by different sensors can arrive at the processing center out of sequence. In order to avoid either a delay in the output or the need for reordering and reprocessing an entire sequence of measurements, such measurements have to be processed as out-of-sequence measurements (OOSM). Recent work developed procedures for incorporating OOSMs into a Kalman filter (KF). Since the state of the art tracker for real (maneuvering) targets is the Interacting Multiple Model (IMM) estimator, this paper presents the algorithm for incorporating OOSMs into an IMM estimator. Both data association and estimation are considered. Simulation results are presented for two realistic problems using measurements from two airborne GMTI sensors. It is shown that the proposed algorithm for incorporating OOSMs into an IMM estimator yields practically the same performance as the reordering and in-sequence reprocessing of the measurements.

  13. RFDT: A Rotation Forest-based Predictor for Predicting Drug-Target Interactions Using Drug Structure and Protein Sequence Information.

    PubMed

    Wang, Lei; You, Zhu-Hong; Chen, Xing; Yan, Xin; Liu, Gang; Zhang, Wei

    2018-01-01

    Identification of interactions between drugs and target proteins plays an important role in discovering new drug candidates. However, identifying drug-target interactions experimentally remains extremely time-consuming, expensive and challenging. Therefore, new computational methods to predict potential drug-target interactions (DTI) are urgently needed. In this article, a novel computational model is developed for predicting potential drug-target interactions under the theory that each drug-target interaction pair can be represented by the structural properties of drugs and evolutionary information derived from proteins. Specifically, the protein sequences are encoded as a Position-Specific Scoring Matrix (PSSM) descriptor, which contains biological evolutionary information, and the drug molecules are encoded as a fingerprint feature vector, which represents the existence of certain functional groups or fragments. Four benchmark datasets involving enzymes, ion channels, GPCRs and nuclear receptors are independently used for establishing predictive models with the Rotation Forest (RF) model. The proposed method achieved prediction accuracies of 91.3%, 89.1%, 84.1% and 71.1% for the four datasets, respectively. In order to make our method more persuasive, we compared our classifier with the state-of-the-art Support Vector Machine (SVM) classifier. We also compared the proposed method with other excellent methods. Experimental results demonstrate that the proposed method is effective in the prediction of DTI, and can provide assistance for new drug research and development. Copyright © Bentham Science Publishers.

  14. Neutron-rich isotope production using a uranium carbide - carbon nanotubes SPES target prototype

    NASA Astrophysics Data System (ADS)

    Corradetti, S.; Biasetto, L.; Manzolaro, M.; Scarpa, D.; Carturan, S.; Andrighetto, A.; Prete, G.; Vasquez, J.; Zanonato, P.; Colombo, P.; Jost, C. U.; Stracener, D. W.

    2013-05-01

    The SPES (Selective Production of Exotic Species) project, under development at the Istituto Nazionale di Fisica Nucleare - Laboratori Nazionali di Legnaro (INFN-LNL), is a new-generation Isotope Separation On-Line (ISOL) facility for the production of radioactive ion beams by means of the proton-induced fission of uranium. In the framework of the research on the SPES target, seven uranium carbide discs, obtained by reacting uranium oxide with graphite and carbon nanotubes, were irradiated with protons at the Holifield Radioactive Ion Beam Facility (HRIBF) of Oak Ridge National Laboratory (ORNL). In the following, the yields of several fission products obtained during the experiment are presented and discussed. The experimental results are then compared to those obtained using a standard uranium carbide target. The reported data highlight the capability of the new type of SPES target to produce and release isotopes of interest for the nuclear physics community.

  15. Studies of neutron-rich nuclei far from stability at TRISTAN

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Gill, R.L.

    The ISOL facility, TRISTAN, is a user facility located at Brookhaven National Laboratory's High Flux Beam Reactor. Short-lived, neutron-rich nuclei, far from stability, are produced by thermal neutron fission of 235U. An extensive array of experimental end stations are available for nuclear structure studies. These studies are augmented by a variety of long-lived ion sources suitable for use at a reactor facility. Some recent results at TRISTAN are presented as examples of using an ISOL facility to study series of nuclei, whereby an effective means of conducting nuclear structure investigations is available.

  16. Coulomb Excitation of Neutron-Rich Cd Isotopes at REX-ISOLDE

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Kroell, Th.; Behrens, T.; Kruecken, R.

    2005-11-21

    We report on the 'safe' Coulomb excitation of neutron-rich Cd isotopes in the vicinity of the doubly magic nucleus 132Sn. The radioactive nuclei have been produced by ISOLDE at CERN and postaccelerated by the REX-ISOLDE facility. The γ-decay of excited states has been detected by the MINIBALL array. Preliminary results for the B(E2) values of 122,124Cd are consistent with expectations from phenomenological systematics.

  17. Coulomb excitation of neutron-rich Cd isotopes at REX-ISOLDE

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Kroell, Th.; Behrens, T.; Kruecken, R.

    2006-04-26

    We report on the 'safe' Coulomb excitation of neutron-rich Cd isotopes in the vicinity of the doubly magic nucleus 132Sn. The radioactive nuclei have been produced by ISOLDE at CERN and postaccelerated by the REX-ISOLDE facility. The γ-decay of excited states has been detected by the MINIBALL array. Preliminary results for the B(E2) values of 122,124Cd are consistent with expectations from phenomenological systematics.

  18. Isolation of a new class of cysteine–glycine–proline-rich beta-proteins (beta-keratins) and their expression in snake epidermis

    PubMed Central

    Dalla Valle, Luisa; Nardi, Alessia; Alibardi, Lorenzo

    2010-01-01

    Scales of snakes contain hard proteins (beta-keratins), now referred to as keratin-associated beta-proteins. In the present study we report the isolation, sequencing, and expression of a new group of these proteins from snake epidermis, designated cysteine–glycine–proline-rich proteins. One deduced protein from expressed mRNAs contains 128 amino acids (12.5 kDa) with a theoretical pI at 7.95, containing 10.2% cysteine and 15.6% glycine. The sequences of two more snake cysteine–proline-rich proteins have been identified from genomic DNA. In situ hybridization shows that the messengers for these proteins are present in the suprabasal and early differentiating beta-cells of the renewing scale epidermis. The present study shows that snake scales, as previously seen in scales of lizards, contain cysteine-rich beta-proteins in addition to glycine-rich beta-proteins. These keratin-associated beta-proteins mix with intermediate filament keratins (alpha-keratins) to produce the resistant corneous layer of snake scales. The specific proportion of these two subfamilies of proteins in different scales can determine various degrees of hardness in scales. PMID:20070430

  19. Exploring N-Rich Phases in Li(x)N(y) Clusters for Hydrogen Storage at Nanoscale.

    PubMed

    Bhattacharya, Amrita; Bhattacharya, Saswata

    2015-09-17

    We have used a cascade genetic algorithm and ab initio atomistic thermodynamics under the framework of first-principles-based hybrid density functional theory to study the (meta-)stability of a wide range of Li(x)N(y) clusters. We found that a hybrid xc-functional is essential to address this problem, as a local/semilocal functional fails to give even a qualitatively correct prediction. Most importantly, we find that although in bulk lithium nitride the Li-rich phase, that is, Li3N, is the stable stoichiometry, in small Li(x)N(y) clusters N-rich phases are more stable at thermodynamic equilibrium. We further show that these N-rich clusters are promising hydrogen storage materials because of their easy adsorption and desorption at low (≤300 K) and moderately high (≥600 K) temperatures, respectively.

  20. Sequence-Specific Targeting of Bacterial Resistance Genes Increases Antibiotic Efficacy

    PubMed Central

    Wong, Michael; Daly, Seth M.; Greenberg, David E.; Toprak, Erdal

    2016-01-01

    The lack of effective and well-tolerated therapies against antibiotic-resistant bacteria is a global public health problem leading to prolonged treatment and increased mortality. To improve the efficacy of existing antibiotic compounds, we introduce a new method for strategically inducing antibiotic hypersensitivity in pathogenic bacteria. Following the systematic verification that the AcrAB-TolC efflux system is one of the major determinants of the intrinsic antibiotic resistance levels in Escherichia coli, we have developed a short antisense oligomer designed to inhibit the expression of acrA and increase antibiotic susceptibility in E. coli. By employing this strategy, we can inhibit E. coli growth using 2- to 40-fold lower antibiotic doses, depending on the antibiotic compound utilized. The sensitizing effect of the antisense oligomer is highly specific to the targeted gene’s sequence, which is conserved in several bacterial genera, and the oligomer does not have any detectable toxicity against human cells. Finally, we demonstrate that antisense oligomers improve the efficacy of antibiotic combinations, allowing the combined use of even antagonistic antibiotic pairs that are typically not favored due to their reduced activities. PMID:27631336