Sample records for synthase seq id

  1. Monoterpene synthases from common sage (Salvia officinalis)

    DOEpatents

    Croteau, Rodney Bruce; Wise, Mitchell Lynn; Katahira, Eva Joy; Savage, Thomas Jonathan

    1999-01-01

    cDNAs encoding (+)-bornyl diphosphate synthase, 1,8-cineole synthase and (+)-sabinene synthase from common sage (Salvia officinalis) have been isolated and sequenced, and the corresponding amino acid sequences have been determined. Accordingly, isolated DNA sequences (SEQ ID No:1; SEQ ID No:3 and SEQ ID No:5) are provided which code for the expression of (+)-bornyl diphosphate synthase (SEQ ID No:2), 1,8-cineole synthase (SEQ ID No:4) and (+)-sabinene synthase (SEQ ID No:6), respectively, from sage (Salvia officinalis). In other aspects, replicable recombinant cloning vehicles are provided which code for (+)-bornyl diphosphate synthase, 1,8-cineole synthase or (+)-sabinene synthase, or for a base sequence sufficiently complementary to at least a portion of (+)-bornyl diphosphate synthase, 1,8-cineole synthase or (+)-sabinene synthase DNA or RNA to enable hybridization therewith. In yet other aspects, modified host cells are provided that have been transformed, transfected, infected and/or injected with a recombinant cloning vehicle and/or DNA sequence encoding (+)-bornyl diphosphate synthase, 1,8-cineole synthase or (+)-sabinene synthase. Thus, systems and methods are provided for the recombinant expression of the aforementioned recombinant monoterpene synthases that may be used to facilitate their production, isolation and purification in significant amounts. Recombinant (+)-bornyl diphosphate synthase, 1,8-cineole synthase and (+)-sabinene synthase may be used to obtain expression or enhanced expression of (+)-bornyl diphosphate synthase, 1,8-cineole synthase and (+)-sabinene synthase in plants in order to enhance the production of monoterpenoids, or may be otherwise employed for the regulation or expression of (+)-bornyl diphosphate synthase, 1,8-cineole synthase and (+)-sabinene synthase, or the production of their products.

  2. Nucleic and amino acid sequences relating to a novel transketolase, and methods for the expression thereof

    DOEpatents

    Croteau, Rodney Bruce; Wildung, Mark Raymond; Lange, Bernd Markus; McCaskill, David G.

    2001-01-01

    cDNAs encoding 1-deoxyxylulose-5-phosphate synthase from peppermint (Mentha piperita) have been isolated and sequenced, and the corresponding amino acid sequences have been determined. Accordingly, isolated DNA sequences (SEQ ID NO:3, SEQ ID NO:5, SEQ ID NO:7) are provided which code for the expression of 1-deoxyxylulose-5-phosphate synthase from plants. In another aspect the present invention provides for isolated, recombinant DXPS proteins, such as the proteins having the sequences set forth in SEQ ID NO:4, SEQ ID NO:6 and SEQ ID NO:8. In other aspects, replicable recombinant cloning vehicles are provided which code for plant 1-deoxyxylulose-5-phosphate synthases, or for a base sequence sufficiently complementary to at least a portion of 1-deoxyxylulose-5-phosphate synthase DNA or RNA to enable hybridization therewith. In yet other aspects, modified host cells are provided that have been transformed, transfected, infected and/or injected with a recombinant cloning vehicle and/or DNA sequence encoding a plant 1-deoxyxylulose-5-phosphate synthase. Thus, systems and methods are provided for the recombinant expression of the aforementioned recombinant 1-deoxyxylulose-5-phosphate synthase that may be used to facilitate its production, isolation and purification in significant amounts. Recombinant 1-deoxyxylulose-5-phosphate synthase may be used to obtain expression or enhanced expression of 1-deoxyxylulose-5-phosphate synthase in plants in order to enhance the production of 1-deoxyxylulose-5-phosphate, or its derivatives such as isopentenyl diphosphate (IPP), or may be otherwise employed for the regulation or expression of 1-deoxyxylulose-5-phosphate synthase, or the production of its products.

  3. Isolation and bacterial expression of a sesquiterpene synthase cDNA clone from peppermint (Mentha x piperita, L.) that produces the aphid alarm pheromone (E)-β-farnesene

    DOEpatents

    Croteau, Rodney Bruce; Wildung, Mark Raymond; Crock, John E.

    1999-01-01

    A cDNA encoding (E)-β-farnesene synthase from peppermint (Mentha piperita) has been isolated and sequenced, and the corresponding amino acid sequence has been determined. Accordingly, an isolated DNA sequence (SEQ ID NO:1) is provided which codes for the expression of (E)-β-farnesene synthase (SEQ ID NO:2), from peppermint (Mentha piperita). In other aspects, replicable recombinant cloning vehicles are provided which code for (E)-β-farnesene synthase, or for a base sequence sufficiently complementary to at least a portion of (E)-β-farnesene synthase DNA or RNA to enable hybridization therewith. In yet other aspects, modified host cells are provided that have been transformed, transfected, infected and/or injected with a recombinant cloning vehicle and/or DNA sequence encoding (E)-β-farnesene synthase. Thus, systems and methods are provided for the recombinant expression of the aforementioned recombinant (E)-β-farnesene synthase that may be used to facilitate its production, isolation and purification in significant amounts. Recombinant (E)-β-farnesene synthase may be used to obtain expression or enhanced expression of (E)-β-farnesene synthase in plants in order to enhance the production of (E)-β-farnesene, or may be otherwise employed for the regulation or expression of (E)-β-farnesene synthase, or the production of its product.

  4. Isolation and bacterial expression of a sesquiterpene synthase cDNA clone from peppermint (Mentha x piperita, L.) that produces the aphid alarm pheromone (E)-β-farnesene

    DOEpatents

    Croteau, Rodney Bruce; Crock, John E.

    2005-01-25

    A cDNA encoding (E)-β-farnesene synthase from peppermint (Mentha piperita) has been isolated and sequenced, and the corresponding amino acid sequence has been determined. Accordingly, an isolated DNA sequence (SEQ ID NO:1) is provided which codes for the expression of (E)-β-farnesene synthase (SEQ ID NO:2), from peppermint (Mentha piperita). In other aspects, replicable recombinant cloning vehicles are provided which code for (E)-β-farnesene synthase, or for a base sequence sufficiently complementary to at least a portion of (E)-β-farnesene synthase DNA or RNA to enable hybridization therewith. In yet other aspects, modified host cells are provided that have been transformed, transfected, infected and/or injected with a recombinant cloning vehicle and/or DNA sequence encoding (E)-β-farnesene synthase. Thus, systems and methods are provided for the recombinant expression of the aforementioned recombinant (E)-β-farnesene synthase that may be used to facilitate its production, isolation and purification in significant amounts. Recombinant (E)-β-farnesene synthase may be used to obtain expression or enhanced expression of (E)-β-farnesene synthase in plants in order to enhance the production of (E)-β-farnesene, or may be otherwise employed for the regulation or expression of (E)-β-farnesene synthase, or the production of its product.

  5. Geranyl diphosphate synthase from mint

    DOEpatents

    Croteau, Rodney Bruce; Wildung, Mark Raymond; Burke, Charles Cullen; Gershenzon, Jonathan

    1999-01-01

    A cDNA encoding geranyl diphosphate synthase from peppermint has been isolated and sequenced, and the corresponding amino acid sequence has been determined. Accordingly, an isolated DNA sequence (SEQ ID No:1) is provided which codes for the expression of geranyl diphosphate synthase (SEQ ID No:2) from peppermint (Mentha piperita). In other aspects, replicable recombinant cloning vehicles are provided which code for geranyl diphosphate synthase or for a base sequence sufficiently complementary to at least a portion of the geranyl diphosphate synthase DNA or RNA to enable hybridization therewith (e.g., antisense geranyl diphosphate synthase RNA or fragments of complementary geranyl diphosphate synthase DNA which are useful as polymerase chain reaction primers or as probes for geranyl diphosphate synthase or related genes). In yet other aspects, modified host cells are provided that have been transformed, transfected, infected and/or injected with a recombinant cloning vehicle and/or DNA sequence encoding geranyl diphosphate synthase. Thus, systems and methods are provided for the recombinant expression of geranyl diphosphate synthase that may be used to facilitate the production, isolation and purification of significant quantities of recombinant geranyl diphosphate synthase for subsequent use, to obtain expression or enhanced expression of geranyl diphosphate synthase in plants in order to enhance the production of monoterpenoids, to produce geranyl diphosphate in cancerous cells as a precursor to monoterpenoids having anti-cancer properties or may be otherwise employed for the regulation or expression of geranyl diphosphate synthase or the production of geranyl diphosphate.

  6. Geranyl diphosphate synthase from mint

    DOEpatents

    Croteau, R.B.; Wildung, M.R.; Burke, C.C.; Gershenzon, J.

    1999-03-02

    A cDNA encoding geranyl diphosphate synthase from peppermint has been isolated and sequenced, and the corresponding amino acid sequence has been determined. Accordingly, an isolated DNA sequence (SEQ ID No:1) is provided which codes for the expression of geranyl diphosphate synthase (SEQ ID No:2) from peppermint (Mentha piperita). In other aspects, replicable recombinant cloning vehicles are provided which code for geranyl diphosphate synthase or for a base sequence sufficiently complementary to at least a portion of the geranyl diphosphate synthase DNA or RNA to enable hybridization therewith (e.g., antisense geranyl diphosphate synthase RNA or fragments of complementary geranyl diphosphate synthase DNA which are useful as polymerase chain reaction primers or as probes for geranyl diphosphate synthase or related genes). In yet other aspects, modified host cells are provided that have been transformed, transfected, infected and/or injected with a recombinant cloning vehicle and/or DNA sequence encoding geranyl diphosphate synthase. Thus, systems and methods are provided for the recombinant expression of geranyl diphosphate synthase that may be used to facilitate the production, isolation and purification of significant quantities of recombinant geranyl diphosphate synthase for subsequent use, to obtain expression or enhanced expression of geranyl diphosphate synthase in plants in order to enhance the production of monoterpenoids, to produce geranyl diphosphate in cancerous cells as a precursor to monoterpenoids having anti-cancer properties or may be otherwise employed for the regulation or expression of geranyl diphosphate synthase or the production of geranyl diphosphate. 5 figs.

  7. Cell culture compositions

    DOEpatents

    Dunn-Coleman, Nigel; Goedegebuur, Frits; Ward, Michael; Yiao, Jian

    2014-03-18

    The present invention provides a novel endoglucanase nucleic acid sequence, designated egl6 (SEQ ID NO:1 encodes the full length endoglucanase; SEQ ID NO:4 encodes the mature form), and the corresponding endoglucanase VI amino acid sequence ("EGVI"; SEQ ID NO:3 is the signal sequence; SEQ ID NO:2 is the mature sequence). The invention also provides expression vectors and host cells comprising a nucleic acid sequence encoding EGVI, recombinant EGVI proteins and methods for producing the same.

  8. Oncoprotein protein kinase

    DOEpatents

    Karin, Michael; Hibi, Masahiko; Lin, Anning

    2002-01-29

    The present invention provides an isolated polynucleotide encoding a c-Jun peptide consisting of about amino acid residues 33 to 79 as set forth in SEQ ID NO: 10 or conservative variations thereof. The invention also provides a method for producing a peptide of SEQ ID NO:1 comprising (a) culturing a host cell containing a polynucleotide encoding a c-Jun peptide consisting of about amino acid residues 33 to 79 as set forth in SEQ ID NO: 10 under conditions which allow expression of the polynucleotide; and (b) obtaining the peptide of SEQ ID NO:1.

  9. Carbohydrate degrading polypeptide and uses thereof

    DOEpatents

    Sagt, Cornelis Maria Jacobus; Schooneveld-Bergmans, Margot Elisabeth Francoise; Roubos, Johannes Andries; Los, Alrik Pieter

    2015-10-20

    The invention relates to a polypeptide having carbohydrate material degrading activity which comprises the amino acid sequence set out in SEQ ID NO: 2 or an amino acid sequence encoded by the nucleotide sequence of SEQ ID NO: 1 or SEQ ID NO: 4, or a variant polypeptide or variant polynucleotide thereof, wherein the variant polypeptide has at least 96% sequence identity with the sequence set out in SEQ ID NO: 2 or the variant polynucleotide encodes a polypeptide that has at least 96% sequence identity with the sequence set out in SEQ ID NO: 2. The invention features the full length coding sequence of the novel gene as well as the amino acid sequence of the full-length functional protein and functional equivalents of the gene or the amino acid sequence. The invention also relates to methods for using the polypeptide in industrial processes. Also included in the invention are cells transformed with a polynucleotide according to the invention suitable for producing these proteins.
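
    Several records in this list define variant polypeptides by a minimum percent sequence identity to SEQ ID NO: 2 (96% in the record above). As a rough illustration only, the sketch below computes percent identity for two sequences that are assumed to be already aligned (equal length, '-' for gaps) and divides identical residues by the total number of alignment columns. The function names, the toy sequences, and this particular definition of identity are assumptions for illustration; the patents themselves may specify a different alignment program or denominator.

```python
def percent_identity(aligned_a: str, aligned_b: str) -> float:
    """Percent identity over the columns of a pairwise alignment.

    Assumptions (not taken from any record): both inputs are pre-aligned,
    have the same length, use '-' for gaps, and identity is the number of
    identical non-gap columns divided by the total alignment length.
    """
    if len(aligned_a) != len(aligned_b):
        raise ValueError("aligned sequences must have the same length")
    if not aligned_a:
        raise ValueError("alignment is empty")
    matches = sum(
        1
        for x, y in zip(aligned_a.upper(), aligned_b.upper())
        if x == y and x != "-"
    )
    return 100.0 * matches / len(aligned_a)


def meets_threshold(aligned_a: str, aligned_b: str, threshold: float = 96.0) -> bool:
    """True if the aligned pair reaches the given percent-identity cutoff."""
    return percent_identity(aligned_a, aligned_b) >= threshold


if __name__ == "__main__":
    # Hypothetical toy peptides, not real SEQ ID entries.
    reference = "ACDEFGHIKL"
    variant = "ACDEFGHIKV"
    print(percent_identity(reference, variant))
    print(meets_threshold(reference, variant, threshold=96.0))
```

    In practice the alignment step (for example a global Needleman-Wunsch alignment) determines the result as much as the counting step does, so the same pair of sequences can score differently under different tools and gap settings.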

  10. Novel family of terpene synthases evolved from trans-isoprenyl diphosphate synthases in a flea beetle

    PubMed Central

    Beran, Franziska; Rahfeld, Peter; Luck, Katrin; Nagel, Raimund; Vogel, Heiko; Wielsch, Natalie; Irmisch, Sandra; Ramasamy, Srinivasan; Gershenzon, Jonathan; Heckel, David G.; Köllner, Tobias G.

    2016-01-01

    Sesquiterpenes play important roles in insect communication, for example as pheromones. However, no sesquiterpene synthases, the enzymes involved in construction of the basic carbon skeleton, have been identified in insects to date. We investigated the biosynthesis of the sesquiterpene (6R,7S)-himachala-9,11-diene in the crucifer flea beetle Phyllotreta striolata, a compound previously identified as a male-produced aggregation pheromone in several Phyllotreta species. A (6R,7S)-himachala-9,11-diene–producing sesquiterpene synthase activity was detected in crude beetle protein extracts, but only when (Z,E)-farnesyl diphosphate [(Z,E)-FPP] was offered as a substrate. No sequences resembling sesquiterpene synthases from plants, fungi, or bacteria were found in the P. striolata transcriptome, but we identified nine divergent putative trans-isoprenyl diphosphate synthase (trans-IDS) transcripts. Four of these putative trans-IDSs exhibited terpene synthase (TPS) activity when heterologously expressed. Recombinant PsTPS1 converted (Z,E)-FPP to (6R,7S)-himachala-9,11-diene and other sesquiterpenes observed in beetle extracts. RNAi-mediated knockdown of PsTPS1 mRNA in P. striolata males led to reduced emission of aggregation pheromone, confirming a significant role of PsTPS1 in pheromone biosynthesis. Two expressed enzymes showed genuine IDS activity, with PsIDS1 synthesizing (E,E)-FPP, whereas PsIDS3 produced neryl diphosphate, (Z,Z)-FPP, and (Z,E)-FPP. In a phylogenetic analysis, the PsTPS enzymes and PsIDS3 were clearly separated from a clade of known coleopteran trans-IDS enzymes including PsIDS1 and PsIDS2. However, the exon–intron structures of IDS and TPS genes in P. striolata are conserved, suggesting that this TPS gene family evolved from trans-IDS ancestors. PMID:26936952

  11. Human jagged polypeptide, encoding nucleic acids and methods of use

    DOEpatents

    Li, Linheng; Hood, Leroy

    2000-01-01

    The present invention provides an isolated polypeptide exhibiting substantially the same amino acid sequence as JAGGED, or an active fragment thereof, provided that the polypeptide does not have the amino acid sequence of SEQ ID NO:5 or SEQ ID NO:6. The invention further provides an isolated nucleic acid molecule containing a nucleotide sequence encoding substantially the same amino acid sequence as JAGGED, or an active fragment thereof, provided that the nucleotide sequence does not encode the amino acid sequence of SEQ ID NO:5 or SEQ ID NO:6. Also provided herein is a method of inhibiting differentiation of hematopoietic progenitor cells by contacting the progenitor cells with an isolated JAGGED polypeptide, or active fragment thereof. The invention additionally provides a method of diagnosing Alagille Syndrome in an individual. The method consists of detecting an Alagille Syndrome disease-associated mutation linked to a JAGGED locus.

  12. Methods of diagnosing Alagille syndrome

    DOEpatents

    Li, Linheng; Hood, Leroy; Krantz, Ian D.; Spinner, Nancy B.

    2004-03-09

    The present invention provides an isolated polypeptide exhibiting substantially the same amino acid sequence as JAGGED, or an active fragment thereof, provided that the polypeptide does not have the amino acid sequence of SEQ ID NO:5 or SEQ ID NO:6. The invention further provides an isolated nucleic acid molecule containing a nucleotide sequence encoding substantially the same amino acid sequence as JAGGED, or an active fragment thereof, provided that the nucleotide sequence does not encode the amino acid sequence of SEQ ID NO:5 or SEQ ID NO:6. Also provided herein is a method of inhibiting differentiation of hematopoietic progenitor cells by contacting the progenitor cells with an isolated JAGGED polypeptide, or active fragment thereof. The invention additionally provides a method of diagnosing Alagille Syndrome in an individual. The method consists of detecting an Alagille Syndrome disease-associated mutation linked to a JAGGED locus.

  13. Polypeptide having or assisting in carbohydrate material degrading activity and uses thereof

    DOEpatents

    Schooneveld-Bergmans, Margot Elisabeth Francoise; Heijne, Wilbert Herman Marie; Los, Alrik Pieter

    2016-02-16

    The invention relates to a polypeptide which comprises the amino acid sequence set out in SEQ ID NO: 2 or an amino acid sequence encoded by the nucleotide sequence of SEQ ID NO: 1, or a variant polypeptide or variant polynucleotide thereof, wherein the variant polypeptide has at least 76% sequence identity with the sequence set out in SEQ ID NO: 2 or the variant polynucleotide encodes a polypeptide that has at least 76% sequence identity with the sequence set out in SEQ ID NO: 2. The invention features the full length coding sequence of the novel gene as well as the amino acid sequence of the full-length functional polypeptide and functional equivalents of the gene or the amino acid sequence. The invention also relates to methods for using the polypeptide in industrial processes. Also included in the invention are cells transformed with a polynucleotide according to the invention suitable for producing these proteins.

  14. Polypeptide having beta-glucosidase activity and uses thereof

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Schooneveld-Bergmans, Margot Elisabeth Francoise; Heijne, Wilbert Herman Marie; De Jong, Rene Marcel

    The invention relates to a polypeptide comprising the amino acid sequence set out in SEQ ID NO: 2 or an amino acid sequence encoded by the nucleotide sequence of SEQ ID NO: 1, or a variant polypeptide or variant polynucleotide thereof, wherein the variant polypeptide has at least 96% sequence identity with the sequence set out in SEQ ID NO: 2 or the variant polynucleotide encodes a polypeptide that has at least 96% sequence identity with the sequence set out in SEQ ID NO: 2. The invention features the full length coding sequence of the novel gene as well as the amino acid sequence of the full-length functional polypeptide and functional equivalents of the gene or the amino acid sequence. The invention also relates to methods for using the polypeptide in industrial processes. Also included in the invention are cells transformed with a polynucleotide according to the invention suitable for producing these proteins.

  15. Polypeptide having swollenin activity and uses thereof

    DOEpatents

    Schooneveld-Bergmans, Margot Elisabeth Francoise; Heijne, Wilbert Herman Marie; Vlasie, Monica D; Damveld, Robbertus Antonius

    2015-11-04

    The invention relates to a polypeptide comprising the amino acid sequence set out in SEQ ID NO: 2 or an amino acid sequence encoded by the nucleotide sequence of SEQ ID NO: 1, or a variant polypeptide or variant polynucleotide thereof, wherein the variant polypeptide has at least 73% sequence identity with the sequence set out in SEQ ID NO: 2 or the variant polynucleotide encodes a polypeptide that has at least 73% sequence identity with the sequence set out in SEQ ID NO: 2. The invention features the full length coding sequence of the novel gene as well as the amino acid sequence of the full-length functional polypeptide and functional equivalents of the gene or the amino acid sequence. The invention also relates to methods for using the polypeptide in industrial processes. Also included in the invention are cells transformed with a polynucleotide according to the invention suitable for producing these proteins.

  16. Polypeptide having beta-glucosidase activity and uses thereof

    DOEpatents

    Schooneveld-Bergmans, Margot Elisabeth Francoise; Heijne, Wilbert Herman Marie; De Jong, Rene Marcel; Damveld, Robbertus Antonius

    2015-09-01

    The invention relates to a polypeptide comprising the amino acid sequence set out in SEQ ID NO: 2 or an amino acid sequence encoded by the nucleotide sequence of SEQ ID NO: 1, or a variant polypeptide or variant polynucleotide thereof, wherein the variant polypeptide has at least 70% sequence identity with the sequence set out in SEQ ID NO: 2 or the variant polynucleotide encodes a polypeptide that has at least 70% sequence identity with the sequence set out in SEQ ID NO: 2. The invention features the full length coding sequence of the novel gene as well as the amino acid sequence of the full-length functional polypeptide and functional equivalents of the gene or the amino acid sequence. The invention also relates to methods for using the polypeptide in industrial processes. Also included in the invention are cells transformed with a polynucleotide according to the invention suitable for producing these proteins.

  17. Polypeptide having cellobiohydrolase activity and uses thereof

    DOEpatents

    Sagt, Cornelis Maria Jacobus; Schooneveld-Bergmans, Margot Elisabeth Francoise; Roubos, Johannes Andries; Los, Alrik Pieter

    2015-09-15

    The invention relates to a polypeptide comprising the amino acid sequence set out in SEQ ID NO: 2 or an amino acid sequence encoded by the nucleotide sequence of SEQ ID NO: 1, or a variant polypeptide or variant polynucleotide thereof, wherein the variant polypeptide has at least 93% sequence identity with the sequence set out in SEQ ID NO: 2 or the variant polynucleotide encodes a polypeptide that has at least 93% sequence identity with the sequence set out in SEQ ID NO: 2. The invention features the full length coding sequence of the novel gene as well as the amino acid sequence of the full-length functional polypeptide and functional equivalents of the gene or the amino acid sequence. The invention also relates to methods for using the polypeptide in industrial processes. Also included in the invention are cells transformed with a polynucleotide according to the invention suitable for producing these proteins.

  18. Polypeptide having acetyl xylan esterase activity and uses thereof

    DOEpatents

    Schooneveld-Bergmans, Margot Elisabeth Francoise; Heijne, Wilbert Herman Marie; Los, Alrik Pieter

    2015-10-20

    The invention relates to a polypeptide comprising the amino acid sequence set out in SEQ ID NO: 2 or an amino acid sequence encoded by the nucleotide sequence of SEQ ID NO: 1, or a variant polypeptide or variant polynucleotide thereof, wherein the variant polypeptide has at least 82% sequence identity with the sequence set out in SEQ ID NO: 2 or the variant polynucleotide encodes a polypeptide that has at least 82% sequence identity with the sequence set out in SEQ ID NO: 2. The invention features the full length coding sequence of the novel gene as well as the amino acid sequence of the full-length functional polypeptide and functional equivalents of the gene or the amino acid sequence. The invention also relates to methods for using the polypeptide in industrial processes. Also included in the invention are cells transformed with a polynucleotide according to the invention suitable for producing these proteins.

  19. Polypeptide having carbohydrate degrading activity and uses thereof

    DOEpatents

    Schooneveld-Bergmans, Margot Elisabeth Francoise; Heijne, Wilbert Herman Marie; Vlasie, Monica Diana; Damveld, Robbertus Antonius

    2015-08-18

    The invention relates to a polypeptide comprising the amino acid sequence set out in SEQ ID NO: 2 or an amino acid sequence encoded by the nucleotide sequence of SEQ ID NO: 1, or a variant polypeptide or variant polynucleotide thereof, wherein the variant polypeptide has at least 73% sequence identity with the sequence set out in SEQ ID NO: 2 or the variant polynucleotide encodes a polypeptide that has at least 73% sequence identity with the sequence set out in SEQ ID NO: 2. The invention features the full length coding sequence of the novel gene as well as the amino acid sequence of the full-length functional polypeptide and functional equivalents of the gene or the amino acid sequence. The invention also relates to methods for using the polypeptide in industrial processes. Also included in the invention are cells transformed with a polynucleotide according to the invention suitable for producing these proteins.

  20. A non-parametric peak calling algorithm for DamID-Seq.

    PubMed

    Li, Renhua; Hempel, Leonie U; Jiang, Tingbo

    2015-01-01

    Protein-DNA interactions play a significant role in gene regulation and expression. In order to identify transcription factor binding sites (TFBS) of doublesex (DSX), an important transcription factor in sex determination, we applied the DNA adenine methylation identification (DamID) technology to the fat body tissue of Drosophila, followed by deep sequencing (DamID-Seq). One feature of DamID-Seq data is that induced adenine methylation signals are not assured to be symmetrically distributed at TFBS, which renders the existing peak calling algorithms for ChIP-Seq, including SPP and MACS, inappropriate for DamID-Seq data. This challenged us to develop a new algorithm for peak calling. A challenge in peak calling based on sequence data is estimating the averaged behavior of background signals. We applied a bootstrap resampling method to short sequence reads in the control (Dam only). After data quality check and mapping reads to a reference genome, the peak calling procedure comprises the following steps: 1) reads resampling; 2) reads scaling (normalization) and computing signal-to-noise fold changes; 3) filtering; 4) calling peaks based on a statistically significant threshold. This is a non-parametric method for peak calling (NPPC). We also used irreproducible discovery rate (IDR) analysis, as well as ChIP-Seq data to compare the peaks called by the NPPC. We identified approximately 6,000 peaks for DSX, which point to 1,225 genes related to the fat body tissue difference between female and male Drosophila. Statistical evidence from IDR analysis indicated that these peaks are reproducible across biological replicates. In addition, these peaks are comparable to those identified by use of ChIP-Seq on S2 cells, in terms of peak number, location, and peak width.
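
    The four steps listed in this abstract (resampling, scaling and fold change, filtering, thresholding) can be sketched roughly as follows. This is not the authors' NPPC implementation: the fixed genomic bins, the pseudocount, the minimum-read filter, and the empirical-quantile threshold are all assumptions chosen to keep the example short, and every function and parameter name is hypothetical.

```python
import random
from collections import Counter


def bootstrap_background(control_bins, n_bins, n_boot=200, seed=0):
    """Step 1: resample control (Dam-only) reads with replacement.

    control_bins holds one bin index per mapped control read. Returns the
    per-replicate bin counts and their per-bin mean (the background estimate).
    """
    rng = random.Random(seed)
    replicates = []
    for _ in range(n_boot):
        draw = rng.choices(control_bins, k=len(control_bins))
        counts = Counter(draw)
        replicates.append([counts.get(b, 0) for b in range(n_bins)])
    mean = [sum(rep[b] for rep in replicates) / n_boot for b in range(n_bins)]
    return replicates, mean


def call_peaks(sample_counts, control_bins, alpha=0.01, min_reads=5,
               pseudo=1.0, n_boot=200, seed=0):
    """Return indices of bins called as peaks (illustrative sketch only)."""
    n_bins = len(sample_counts)
    replicates, background = bootstrap_background(
        control_bins, n_bins, n_boot, seed
    )

    # Step 2: scale the control to the sample's library size, then compute
    # a signal-to-noise fold change per bin (pseudocount avoids division by 0).
    scale = sum(sample_counts) / len(control_bins)
    fold = [
        (s + pseudo) / (scale * bg + pseudo)
        for s, bg in zip(sample_counts, background)
    ]

    # Assumed null distribution: fold change of each bootstrap replicate
    # relative to the mean background; threshold at its (1 - alpha) quantile.
    null = sorted(
        (scale * rep[b] + pseudo) / (scale * background[b] + pseudo)
        for rep in replicates
        for b in range(n_bins)
    )
    threshold = null[min(len(null) - 1, int((1 - alpha) * len(null)))]

    # Step 3: filter low-coverage bins. Step 4: keep bins above the threshold.
    return [
        b for b in range(n_bins)
        if sample_counts[b] >= min_reads and fold[b] > threshold
    ]


if __name__ == "__main__":
    # Toy data: 20 bins, uniform control, one enriched bin in the sample.
    control = [b for b in range(20) for _ in range(10)]
    sample = [10] * 20
    sample[7] = 200
    print(call_peaks(sample, control))
```

    Because the threshold comes from resampled control data rather than a fitted distribution, the sketch makes no symmetry or parametric assumption about the signal shape, which is the property the abstract emphasizes for DamID-Seq.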

  1. Cysteine-containing peptide tag for site-specific conjugation of proteins

    DOEpatents

    Backer, Marina V.; Backer, Joseph M.

    2008-04-08

    The present invention is directed to a biological conjugate, comprising: (a) a targeting moiety comprising a polypeptide having an amino acid sequence comprising the polypeptide sequence of SEQ ID NO:2 and the polypeptide sequence of a selected targeting protein; and (b) a binding moiety bound to the targeting moiety; the biological conjugate having a covalent bond between the thiol group of SEQ ID NO:2 and a functional group in the binding moiety. The present invention is directed to a biological conjugate, comprising: (a) a targeting moiety comprising a polypeptide having an amino acid sequence comprising the polypeptide sequence of SEQ ID NO:2 and the polypeptide sequence of a selected targeting protein; and (b) a binding moiety that comprises an adapter protein, the adapter protein having a thiol group; the biological conjugate having a disulfide bond between the thiol group of SEQ ID NO:2 and the thiol group of the adapter protein. The present invention is also directed to biological sequences employed in the above biological conjugates, as well as pharmaceutical preparations and methods using the above biological conjugates.

  2. Cysteine-containing peptide tag for site-specific conjugation of proteins

    DOEpatents

    Backer, Marina V.; Backer, Joseph M.

    2010-10-05

    The present invention is directed to a biological conjugate, comprising: (a) a targeting moiety comprising a polypeptide having an amino acid sequence comprising the polypeptide sequence of SEQ ID NO:2 and the polypeptide sequence of a selected targeting protein; and (b) a binding moiety bound to the targeting moiety; the biological conjugate having a covalent bond between the thiol group of SEQ ID NO:2 and a functional group in the binding moiety. The present invention is directed to a biological conjugate, comprising: (a) a targeting moiety comprising a polypeptide having an amino acid sequence comprising the polypeptide sequence of SEQ ID NO:2 and the polypeptide sequence of a selected targeting protein; and (b) a binding moiety that comprises an adapter protein, the adapter protein having a thiol group; the biological conjugate having a disulfide bond between the thiol group of SEQ ID NO:2 and the thiol group of the adapter protein. The present invention is also directed to biological sequences employed in the above biological conjugates, as well as pharmaceutical preparations and methods using the above biological conjugates.

  3. The Sucrose Synthase Gene Family in Chinese Pear (Pyrus bretschneideri Rehd.): Structure, Expression, and Evolution.

    PubMed

    Abdullah, Muhammad; Cao, Yungpeng; Cheng, Xi; Meng, Dandan; Chen, Yu; Shakoor, Awais; Gao, Junshan; Cai, Yongping

    2018-05-11

    Sucrose synthase (SS) is a key enzyme involved in sucrose metabolism that is critical in plant growth and development, and particularly quality of the fruit. Sucrose synthase gene families have been identified and characterized in various plants such as tobacco, grape, rice, and Arabidopsis. However, there is still a lack of detailed information about sucrose synthase genes in pear. In the present study, we performed a systematic analysis of the pear (Pyrus bretschneideri Rehd.) genome and reported 30 sucrose synthase genes. Subsequently, analyses of gene structure, phylogenetic relationship, chromosomal localization, gene duplications, promoter regions, collinearity, RNA-Seq data and qRT-PCR were conducted on these sucrose synthase genes. The transcript analysis revealed that 10 PbSS genes (30%) were especially expressed in pear fruit development. Additionally, qRT-PCR analysis verified the RNA-Seq data and showed that PbSS30, PbSS24, and PbSS15 have a potential role in the pear fruit development stages. This study provides important insights into the evolution of the sucrose synthase gene family in pear and will provide assistance for further investigation of sucrose synthase gene functions in the process of fruit development, fruit quality and resistance to environmental stresses.

  4. RNA-Seq in the discovery of a sparsely expressed scent-determining monoterpene synthase in lavender (Lavandula).

    PubMed

    Adal, Ayelign M; Sarker, Lukman S; Malli, Radesh P N; Liang, Ping; Mahmoud, Soheil S

    2018-06-09

    Using RNA-Seq, we cloned and characterized a unique monoterpene synthase responsible for the formation of a scent-determining S-linalool constituent of lavender oils from Lavandula × intermedia. Several species of Lavandula produce essential oils (EOs) consisting mainly of monoterpenes including linalool, one of the most abundant and scent-determining oil constituents. Although R-linalool dominates the EOs of lavenders, varying amounts (depending on the species) of the S-linalool enantiomer can also be found in these plants. Despite its relatively low abundance, S-linalool contributes a sweet, pleasant scent and is an important constituent of lavender EOs. While several terpene synthase genes including R-linalool synthase have been cloned from lavenders, many important terpene synthases including S-linalool synthase have not been described from these plants. In this study, we employed RNA-Seq and other complementary sequencing data to clone and functionally characterize the sparsely expressed S-linalool synthase cDNA (LiS-LINS) from Lavandula × intermedia. Recombinant LiS-LINS catalyzed the conversion of the universal monoterpene precursor geranyl diphosphate to S-linalool as the sole product. Intriguingly, LiS-LINS exhibited very low (~30%) sequence similarity to other Lavandula terpene synthases, including R-linalool synthase. However, the predicted 3D structure of this protein, including the composition and arrangement of amino acids at the active site, is highly homologous to known terpene synthase proteins. LiS-LINS transcripts were detected in flowers, but were much less abundant than those corresponding to LiR-LINS, paralleling the enantiomeric composition of linalool in L. × intermedia oils. These data indicate that production of S-linalool is at least partially controlled at the level of transcription from LiS-LINS. The cloned LiS-LINS cDNA may be used to enhance oil composition in lavenders and other plants through metabolic engineering.

  5. Gene encoding herbicide safener binding protein

    DOEpatents

    Walton, Jonathan D.; Scott-Craig, John S.

    1999-01-01

    The cDNA encoding safener binding protein (SafBP), also referred to as SBP1, is set forth in FIG. 5 and SEQ ID No. 1. The deduced amino acid sequence is provided in FIG. 5 and SEQ ID No. 2. Methods of making and using SBP1 and SafBP to alter a plant's sensitivity to certain herbicides or a plant's responsiveness to certain safeners are also provided, as well as expression vectors, transgenic plants or other organisms transfected with said vectors and seeds from said plants.

  6. Variants of beta-glucosidase

    DOEpatents

    Fidantsef, Ana; Lamsa, Michael; Gorre-Clancy, Brian

    2015-07-14

    The present invention relates to variants of a parent beta-glucosidase, comprising a substitution at one or more positions corresponding to positions 142, 183, 266, and 703 of amino acids 1 to 842 of SEQ ID NO: 2 or corresponding to positions 142, 183, 266, and 705 of amino acids 1 to 844 of SEQ ID NO: 70, wherein the variant has beta-glucosidase activity. The present invention also relates to nucleotide sequences encoding the variant beta-glucosidases and to nucleic acid constructs, vectors, and host cells comprising the nucleotide sequences.

  7. Switchgrass ubiquitin promoter (PVUBI2) and uses thereof

    DOEpatents

    Stewart, C. Neal; Mann, David George James

    2013-12-10

    The subject application provides polynucleotides, compositions thereof and methods for regulating gene expression in a plant. Polynucleotides disclosed herein comprise novel sequences for a promoter isolated from Panicum virgatum (switchgrass) that initiates transcription of an operably linked nucleotide sequence. Thus, various embodiments of the invention comprise the nucleotide sequence of SEQ ID NO: 2 or fragments thereof comprising nucleotides 1 to 692 of SEQ ID NO: 2 that are capable of driving the expression of an operably linked nucleic acid sequence.

  8. Variants of beta-glucosidases

    DOEpatents

    Fidantsef, Ana; Lamsa, Michael; Gorre-Clancy, Brian

    2014-10-07

    The present invention relates to variants of a parent beta-glucosidase, comprising a substitution at one or more positions corresponding to positions 142, 183, 266, and 703 of amino acids 1 to 842 of SEQ ID NO: 2 or corresponding to positions 142, 183, 266, and 705 of amino acids 1 to 844 of SEQ ID NO: 70, wherein the variant has beta-glucosidase activity. The present invention also relates to nucleotide sequences encoding the variant beta-glucosidases and to nucleic acid constructs, vectors, and host cells comprising the nucleotide sequences.

  9. Variants of beta-glucosidase

    DOEpatents

    Fidantsef, Ana [Davis, CA]; Lamsa, Michael [Davis, CA]; Gorre-Clancy, Brian [Elk Grove, CA]

    2009-12-29

    The present invention relates to variants of a parent beta-glucosidase, comprising a substitution at one or more positions corresponding to positions 142, 183, 266, and 703 of amino acids 1 to 842 of SEQ ID NO: 2 or corresponding to positions 142, 183, 266, and 705 of amino acids 1 to 844 of SEQ ID NO: 70, wherein the variant has beta-glucosidase activity. The present invention also relates to nucleotide sequences encoding the variant beta-glucosidases and to nucleic acid constructs, vectors, and host cells comprising the nucleotide sequences.

  10. Variants of glycoside hydrolases

    DOEpatents

    Teter, Sarah; Ward, Connie; Cherry, Joel; Jones, Aubrey; Harris, Paul; Yi, Jung

    2013-02-26

    The present invention relates to variants of a parent glycoside hydrolase, comprising a substitution at one or more positions corresponding to positions 21, 94, 157, 205, 206, 247, 337, 350, 373, 383, 438, 455, 467, and 486 of amino acids 1 to 513 of SEQ ID NO: 2, and optionally further comprising a substitution at one or more positions corresponding to positions 8, 22, 41, 49, 57, 113, 193, 196, 226, 227, 246, 251, 255, 259, 301, 356, 371, 411, and 462 of amino acids 1 to 513 of SEQ ID NO: 2, wherein the variants have glycoside hydrolase activity. The present invention also relates to nucleotide sequences encoding the variant glycoside hydrolases and to nucleic acid constructs, vectors, and host cells comprising the nucleotide sequences.

  11. Variants of glycoside hydrolases

    DOEpatents

    Teter, Sarah [Davis, CA]; Ward, Connie [Hamilton, MT]; Cherry, Joel [Davis, CA]; Jones, Aubrey [Davis, CA]; Harris, Paul [Carnation, WA]; Yi, Jung [Sacramento, CA]

    2011-04-26

    The present invention relates to variants of a parent glycoside hydrolase, comprising a substitution at one or more positions corresponding to positions 21, 94, 157, 205, 206, 247, 337, 350, 373, 383, 438, 455, 467, and 486 of amino acids 1 to 513 of SEQ ID NO: 2, and optionally further comprising a substitution at one or more positions corresponding to positions 8, 22, 41, 49, 57, 113, 193, 196, 226, 227, 246, 251, 255, 259, 301, 356, 371, 411, and 462 of amino acids 1 to 513 of SEQ ID NO: 2, wherein the variants have glycoside hydrolase activity. The present invention also relates to nucleotide sequences encoding the variant glycoside hydrolases and to nucleic acid constructs, vectors, and host cells comprising the nucleotide sequences.

  12. Variants of glycoside hydrolases

    DOEpatents

    Teter, Sarah; Ward, Connie; Cherry, Joel; Jones, Aubrey; Harris, Paul; Yi, Jung

    2017-07-11

    The present invention relates to variants of a parent glycoside hydrolase, comprising a substitution at one or more positions corresponding to positions 21, 94, 157, 205, 206, 247, 337, 350, 373, 383, 438, 455, 467, and 486 of amino acids 1 to 513 of SEQ ID NO: 2, and optionally further comprising a substitution at one or more positions corresponding to positions 8, 22, 41, 49, 57, 113, 193, 196, 226, 227, 246, 251, 255, 259, 301, 356, 371, 411, and 462 of amino acids 1 to 513 of SEQ ID NO: 2, wherein the variants have glycoside hydrolase activity. The present invention also relates to nucleotide sequences encoding the variant glycoside hydrolases and to nucleic acid constructs, vectors, and host cells comprising the nucleotide sequences.

  13. Overexpression of an Isoprenyl Diphosphate Synthase in Spruce Leads to Unexpected Terpene Diversion Products That Function in Plant Defense

    PubMed Central

    Nagel, Raimund; Berasategui, Aileen; Paetz, Christian; Gershenzon, Jonathan; Schmidt, Axel

    2014-01-01

    Spruce (Picea spp.) and other conifers employ terpenoid-based oleoresin as part of their defense against herbivores and pathogens. The short-chain isoprenyl diphosphate synthases (IDS) are situated at critical branch points in terpene biosynthesis, producing the precursors of the different terpenoid classes. To determine the role of IDS and to create altered terpene phenotypes for assessing the defensive role of terpenoids, we overexpressed a bifunctional spruce IDS, a geranyl diphosphate and geranylgeranyl diphosphate synthase in white spruce (Picea glauca) saplings. While transcript level (350-fold), enzyme activity level (7-fold), and in planta geranyl diphosphate and geranylgeranyl diphosphate levels (4- to 8-fold) were significantly increased in the needles of transgenic plants, there was no increase in the major monoterpenes and diterpene acids of the resin and no change in primary isoprenoids, such as sterols, chlorophylls, and carotenoids. Instead, large amounts of geranylgeranyl fatty acid esters, known from various gymnosperm and angiosperm plant species, accumulated in needles and were shown to act defensively in reducing the performance of larvae of the nun moth (Lymantria monacha), a conifer pest in Eurasia. These results show the impact of overexpression of an IDS and the defensive role of an unexpected accumulation product of terpenoid biosynthesis with the potential for a broader function in plant protection. PMID:24346420

  14. Methods and kits for predicting a response to an erythropoietic agent

    DOEpatents

    Merchant, Michael L.; Klein, Jon B.; Brier, Michael E.; Gaweda, Adam E.

    2015-06-16

    Methods for predicting a response to an erythropoietic agent in a subject include providing a biological sample from the subject, and determining an amount in the sample of at least one peptide selected from the group consisting of SEQ ID NOS: 1-17. If there is a measurable difference in the amount of the at least one peptide in the sample, when compared to a control level of the same peptide, the subject is then predicted to have a good response or a poor response to the erythropoietic agent. Kits for predicting a response to an erythropoietic agent are further provided and include one or more antibodies, or fragments thereof, that specifically recognize a peptide of SEQ ID NOS: 1-17.

  15. Elucidation of terpenoid metabolism in Scoparia dulcis by RNA-seq analysis.

    PubMed

    Yamamura, Yoshimi; Kurosaki, Fumiya; Lee, Jung-Bum

    2017-03-07

    Scoparia dulcis biosynthesizes bioactive diterpenes, such as scopadulcic acid B (SDB), which are known for their unique molecular skeleton. Although the biosynthesis of bioactive diterpenes is catalyzed by a sequence of class II and class I diterpene synthases (diTPSs), the mechanisms underlying this process are yet to be fully identified. To elucidate this biosynthetic machinery, we performed a high-throughput RNA-seq analysis, and de novo assembly of clean reads revealed 46,332 unique transcripts and 40,503 two unigenes. We found diTPS genes including a putative syn-copalyl diphosphate synthase (SdCPS2) and two kaurene synthase-like (SdKSLs) genes. In addition, a total of 79 full-length cytochrome P450 (CYP450) genes were discovered. The expression analyses showed that selected CYP450s were associated with the expression patterns of SdCPS2 and SdKSL1, suggesting that these CYP450 candidates are involved in diterpene modification. SdCPS2 represents the first predicted gene to produce syn-copalyl diphosphate in dicots. In addition, SdKSL1 potentially contributes to the SDB biosynthetic pathway. Therefore, these identified genes associated with diterpene biosynthesis can lead to the development of genetic engineering focused on diterpene metabolism in S. dulcis.

  16. Elucidation of terpenoid metabolism in Scoparia dulcis by RNA-seq analysis

    PubMed Central

    Yamamura, Yoshimi; Kurosaki, Fumiya; Lee, Jung-Bum

    2017-01-01

    Scoparia dulcis biosynthesizes bioactive diterpenes, such as scopadulcic acid B (SDB), which are known for their unique molecular skeleton. Although the biosynthesis of bioactive diterpenes is catalyzed by a sequence of class II and class I diterpene synthases (diTPSs), the mechanisms underlying this process are yet to be fully identified. To elucidate this biosynthetic machinery, we performed a high-throughput RNA-seq analysis, and de novo assembly of clean reads revealed 46,332 unique transcripts and 40,503 two unigenes. We found diTPS genes including a putative syn-copalyl diphosphate synthase (SdCPS2) and two kaurene synthase-like (SdKSLs) genes. In addition, a total of 79 full-length cytochrome P450 (CYP450) genes were discovered. The expression analyses showed that selected CYP450s were associated with the expression patterns of SdCPS2 and SdKSL1, suggesting that these CYP450 candidates are involved in diterpene modification. SdCPS2 represents the first predicted gene to produce syn-copalyl diphosphate in dicots. In addition, SdKSL1 potentially contributes to the SDB biosynthetic pathway. Therefore, these identified genes associated with diterpene biosynthesis can lead to the development of genetic engineering focused on diterpene metabolism in S. dulcis. PMID:28266568

  17. Optimization of thermophilic trans-isoprenyl diphosphate synthase expression in Escherichia coli by response surface methodology.

    PubMed

    Piccolomini, Angelica A; Fiabon, Alex; Borrotti, Matteo; De Lucrezia, Davide

    2017-01-01

    We optimized the heterologous expression of trans-isoprenyl diphosphate synthase (IDS), the key enzyme involved in the biosynthesis of trans-polyisoprene. trans-Polyisoprene is a particularly valuable compound due to its superior stiffness, excellent insulation, and low thermal expansion coefficient. Currently, trans-polyisoprene is mainly produced through chemical synthesis and no biotechnological processes have been established so far for its large-scale production. In this work, we employed D-optimal design and response surface methodology to optimize the expression of the thermophilic enzyme IDS from Thermococcus kodakaraensis. The design of experiments took into account six factors (preinduction cell density, inducer concentration, postinduction temperature, salt concentration, alternative carbon source, and protein inhibitor) and seven culture media (LB, NZCYM, TB, M9, Ec, Ac, and EDAVIS) at five different pH points. By screening only 109 experimental points, we were able to improve IDS production by 48% in closed-batch fermentation. © 2015 International Union of Biochemistry and Molecular Biology, Inc.

  18. Investigating sesquiterpene biosynthesis in Ginkgo biloba: molecular cloning and functional characterization of (E,E)-farnesol and α-bisabolene synthases.

    PubMed

    Parveen, Iffat; Wang, Mei; Zhao, Jianping; Chittiboyina, Amar G; Tabanca, Nurhayat; Ali, Abbas; Baerson, Scott R; Techen, Natascha; Chappell, Joe; Khan, Ikhlas A; Pan, Zhiqiang

    2015-11-01

    Ginkgo biloba is one of the oldest living tree species and has been extensively investigated as a source of bioactive natural compounds, including bioactive flavonoids, diterpene lactones, terpenoids and polysaccharides which accumulate in foliar tissues. Despite this chemical diversity, relatively few enzymes associated with any biosynthetic pathway from ginkgo have been characterized to date. In the present work, predicted transcripts potentially encoding enzymes associated with the biosynthesis of diterpenoid and terpenoid compounds, including putative terpene synthases, were first identified by mining publicly-available G. biloba RNA-seq data sets. Recombinant enzyme studies with two of the TPS-like sequences led to the identification of GbTPS1 and GbTPS2, encoding farnesol and bisabolene synthases, respectively. Additionally, the phylogenetic analysis revealed the two terpene synthase genes as primitive genes that might have evolved from an ancestral diterpene synthase.

  19. Nucleic Acid Encoding A Lectin-Derived Progenitor Cell Preservation Factor

    DOEpatents

    Colucci, M. Gabriella; Chrispeels, Maarten J.; Moore, Jeffrey G.

    2001-10-30

    The invention relates to an isolated nucleic acid molecule that encodes a protein that is effective to preserve progenitor cells, such as hematopoietic progenitor cells. The nucleic acid comprises a sequence defined by SEQ ID NO:1, a homolog thereof, or a fragment thereof. The encoded protein has an amino acid sequence that comprises a sequence defined by SEQ ID NO:2, a homolog thereof, or a fragment thereof that contains an amino acid sequence TNNVLQVT. Methods of using the encoded protein for preserving progenitor cells in vitro, ex vivo, and in vivo are also described. The invention, therefore, includes methods such as myeloablation therapies for cancer treatment wherein myeloid reconstitution is facilitated by means of the specified protein. Other therapeutic utilities are also enabled through the invention, for example, expanding progenitor cell populations ex vivo to increase chances of engraftment, improving conditions for transporting and storing progenitor cells, and facilitating gene therapy to treat and cure a broad range of life-threatening hematologic diseases.

  20. Exome Pool-Seq in neurodevelopmental disorders.

    PubMed

    Popp, Bernt; Ekici, Arif B; Thiel, Christian T; Hoyer, Juliane; Wiesener, Antje; Kraus, Cornelia; Reis, André; Zweier, Christiane

    2017-12-01

    High throughput sequencing has greatly advanced disease gene identification, especially in heterogeneous entities. Despite falling costs this is still an expensive and laborious technique, particularly when studying large cohorts. To address this problem we applied Exome Pool-Seq as an economic and fast screening technology in neurodevelopmental disorders (NDDs). Sequencing of 96 individuals can be performed in eight pools of 12 samples on less than one Illumina sequencer lane. In a pilot study with 96 cases we identified 27 variants, likely or possibly affecting function. Twenty five of these were identified in 923 established NDD genes (based on SysID database, status November 2016) (ACTB, AHDC1, ANKRD11, ATP6V1B2, ATRX, CASK, CHD8, GNAS, IFIH1, KCNQ2, KMT2A, KRAS, MAOA, MED12, MED13L, RIT1, SETD5, SIN3A, TCF4, TRAPPC11, TUBA1A, WAC, ZBTB18, ZMYND11), two in 543 (SysID) candidate genes (ZNF292, BPTF), and additionally a de novo loss-of-function variant in LRRC7, not previously implicated in NDDs. Most of them were confirmed to be de novo, but we also identified X-linked or autosomal-dominantly or autosomal-recessively inherited variants. With a detection rate of 28%, Exome Pool-Seq achieves comparable results to individual exome analyses but reduces costs by >85%. Compared with other large scale approaches using Molecular Inversion Probes (MIP) or gene panels, it allows flexible re-analysis of data. Exome Pool-Seq is thus well suited for large-scale, cost-efficient and flexible screening in characterized but heterogeneous entities like NDDs.
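
    The pooling arithmetic quoted in this abstract can be checked with a short sketch. The function names are hypothetical and do not come from the paper. The expected allele fraction assumes diploid autosomal loci and equal DNA input from every pool member, and the detection rate assumes each of the 27 variants lies in a different case; the abstract states neither assumption.

```python
from fractions import Fraction


def pools_needed(individuals: int, pool_size: int) -> int:
    """Number of pools required to hold all individuals (ceiling division)."""
    return -(-individuals // pool_size)


def het_allele_fraction(pool_size: int, ploidy: int = 2) -> Fraction:
    """Expected read fraction of a heterozygous variant carried by one pool member.

    Assumes equal DNA contribution per individual (an illustrative assumption).
    """
    return Fraction(1, pool_size * ploidy)


def detection_rate(variants: int, cases: int) -> float:
    """Fraction of cases with a reported variant, assuming one variant per case."""
    return variants / cases


if __name__ == "__main__":
    print("pools:", pools_needed(96, 12))
    print("expected heterozygous allele fraction:", het_allele_fraction(12))
    print("detection rate (%):", round(100 * detection_rate(27, 96)))
```

    Under these assumptions a single heterozygous carrier contributes 1 of 24 alleles in a 12-sample pool, so a variant caller used on pooled data has to be sensitive to allele fractions of roughly 4%.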

  1. Targeted Integration of RNA-Seq and Metabolite Data to Elucidate Curcuminoid Biosynthesis in Four Curcuma Species.

    PubMed

    Li, Donghan; Ono, Naoaki; Sato, Tetsuo; Sugiura, Tadao; Altaf-Ul-Amin, Md; Ohta, Daisaku; Suzuki, Hideyuki; Arita, Masanori; Tanaka, Ken; Ma, Zhiqiang; Kanaya, Shigehiko

    2015-05-01

    Curcuminoids, namely curcumin and its analogs, are secondary metabolites that act as the primary active constituents of turmeric (Curcuma longa). The contents of these curcuminoids vary among species in the genus Curcuma. For this reason, we compared two wild strains and two cultivars to understand the differences in the synthesis of curcuminoids. Because the fluxes of metabolic reactions depend on the amounts of their substrate and the activity of the catalysts, we analyzed the metabolite concentrations and gene expression of related enzymes. We developed a method based on RNA sequencing (RNA-Seq) analysis that focuses on a specific set of genes to detect expression differences between species in detail. We developed a 'selection-first' method for RNA-Seq analysis in which short reads are mapped to selected enzymes in the target biosynthetic pathways in order to reduce the effect of mapping errors. Using this method, we found that the difference in the contents of curcuminoids among the species, as measured by gas chromatography-mass spectrometry, could be explained by the changes in the expression of genes encoding diketide-CoA synthase, and curcumin synthase at the branching point of the curcuminoid biosynthesis pathway. © The Author 2015. Published by Oxford University Press on behalf of Japanese Society of Plant Physiologists.

  2. Statistical models for RNA-seq data derived from a two-condition 48-replicate experiment.

    PubMed

    Gierliński, Marek; Cole, Christian; Schofield, Pietà; Schurch, Nicholas J; Sherstnev, Alexander; Singh, Vijender; Wrobel, Nicola; Gharbi, Karim; Simpson, Gordon; Owen-Hughes, Tom; Blaxter, Mark; Barton, Geoffrey J

    2015-11-15

    High-throughput RNA sequencing (RNA-seq) is now the standard method to determine differential gene expression. Identifying differentially expressed genes crucially depends on estimates of read-count variability. These estimates are typically based on statistical models such as the negative binomial distribution, which is employed by the tools edgeR, DESeq and cuffdiff. Until now, the validity of these models has usually been tested on either low-replicate RNA-seq data or simulations. A 48-replicate RNA-seq experiment in yeast was performed and data tested against theoretical models. The observed gene read counts were consistent with both log-normal and negative binomial distributions, while the mean-variance relation followed the line of constant dispersion parameter of ∼0.01. The high-replicate data also allowed for strict quality control and screening of 'bad' replicates, which can drastically affect the gene read-count distribution. RNA-seq data have been submitted to the ENA archive with project ID PRJEB5348. Contact: g.j.barton@dundee.ac.uk. © The Author 2015. Published by Oxford University Press.
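
    To make the reported mean-variance relation concrete, the sketch below evaluates the usual negative binomial parameterization, variance = mean + dispersion × mean², at the dispersion of about 0.01 quoted in the abstract. The function names are illustrative and not taken from the paper or from edgeR, DESeq or cuffdiff, and the abstract does not spell out which parameterization it uses, so treat this form as an assumption.

```python
def nb_variance(mean: float, dispersion: float = 0.01) -> float:
    """Negative binomial variance: Poisson term (mean) plus dispersion * mean**2."""
    return mean + dispersion * mean ** 2


def squared_cv(mean: float, dispersion: float = 0.01) -> float:
    """Squared coefficient of variation, equal to 1/mean + dispersion."""
    return nb_variance(mean, dispersion) / mean ** 2


if __name__ == "__main__":
    # Compare Poisson variance (equal to the mean) with the negative binomial
    # variance across a range of expression levels.
    for mu in (10.0, 100.0, 1000.0, 10000.0):
        cv = squared_cv(mu) ** 0.5
        print(f"mean={mu:8.0f}  poisson_var={mu:8.0f}  nb_var={nb_variance(mu):12.1f}  cv={cv:.3f}")
```

    For highly expressed genes the 1/mean term vanishes and the coefficient of variation approaches the square root of the dispersion, which is 0.1 for a dispersion of 0.01, i.e. about 10% variability between replicates under this model.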

  3. Statistical models for RNA-seq data derived from a two-condition 48-replicate experiment

    PubMed Central

    Gierliński, Marek; Cole, Christian; Schofield, Pietà; Schurch, Nicholas J.; Sherstnev, Alexander; Singh, Vijender; Wrobel, Nicola; Gharbi, Karim; Simpson, Gordon; Owen-Hughes, Tom; Blaxter, Mark; Barton, Geoffrey J.

    2015-01-01

    Motivation: High-throughput RNA sequencing (RNA-seq) is now the standard method to determine differential gene expression. Identifying differentially expressed genes crucially depends on estimates of read-count variability. These estimates are typically based on statistical models such as the negative binomial distribution, which is employed by the tools edgeR, DESeq and cuffdiff. Until now, the validity of these models has usually been tested on either low-replicate RNA-seq data or simulations. Results: A 48-replicate RNA-seq experiment in yeast was performed and data tested against theoretical models. The observed gene read counts were consistent with both log-normal and negative binomial distributions, while the mean-variance relation followed the line of constant dispersion parameter of ∼0.01. The high-replicate data also allowed for strict quality control and screening of ‘bad’ replicates, which can drastically affect the gene read-count distribution. Availability and implementation: RNA-seq data have been submitted to ENA archive with project ID PRJEB5348. Contact: g.j.barton@dundee.ac.uk PMID:26206307

  4. Key gene regulating cell wall biosynthesis and recalcitrance in Populus, gene Y

    DOEpatents

    Chen, Jay; Engle, Nancy; Gunter, Lee E.; Jawdy, Sara; Tschaplinski, Timothy J.; Tuskan, Gerald A.

    2015-12-08

    This disclosure provides methods and transgenic plants for improved production of renewable biofuels and other plant-derived biomaterials by altering the expression and/or activity of Gene Y, an O-acetyltransferase. This disclosure also provides expression vectors containing a nucleic acid (Gene Y) which encodes the polypeptide of SEQ ID NO: 1 and is operably linked to a heterologous promoter.

  5. Towards the integration, annotation and association of historical microarray experiments with RNA-seq.

    PubMed

    Chavan, Shweta S; Bauer, Michael A; Peterson, Erich A; Heuck, Christoph J; Johann, Donald J

    2013-01-01

    Transcriptome analysis by microarrays has produced important advances in biomedicine. For instance in multiple myeloma (MM), microarray approaches led to the development of an effective disease subtyping via cluster assignment, and a 70 gene risk score. Both enabled an improved molecular understanding of MM, and have provided prognostic information for the purposes of clinical management. Many researchers are now transitioning to Next Generation Sequencing (NGS) approaches and RNA-seq in particular, due to its discovery-based nature, improved sensitivity, and dynamic range. Additionally, RNA-seq allows for the analysis of gene isoforms, splice variants, and novel gene fusions. Given the voluminous amounts of historical microarray data, there is now a need to associate and integrate microarray and RNA-seq data via advanced bioinformatic approaches. Custom software was developed following a model-view-controller (MVC) approach to integrate Affymetrix probe set-IDs, and gene annotation information from a variety of sources. The tool/approach employs an assortment of strategies to integrate, cross reference, and associate microarray and RNA-seq datasets. Output from a variety of transcriptome reconstruction and quantitation tools (e.g., Cufflinks) can be directly integrated, and/or associated with Affymetrix probe set data, as well as necessary gene identifiers and/or symbols from a diversity of sources. Strategies are employed to maximize the annotation and cross referencing process. Custom gene sets (e.g., MM 70 risk score (GEP-70)) can be specified, and the tool can be directly assimilated into an RNA-seq pipeline. A novel bioinformatic approach to aid in the facilitation of both annotation and association of historic microarray data, in conjunction with richer RNA-seq data, is now assisting with the study of MM cancer biology.

  6. Nucleic acid molecules encoding isopentenyl monophosphate kinase, and methods of use

    DOEpatents

    Croteau, Rodney B.; Lange, Bernd M.

    2001-01-01

    A cDNA encoding isopentenyl monophosphate kinase (IPK) from peppermint (Mentha x piperita) has been isolated and sequenced, and the corresponding amino acid sequence has been determined. Accordingly, an isolated DNA sequence (SEQ ID NO:1) is provided which codes for the expression of isopentenyl monophosphate kinase (SEQ ID NO:2), from peppermint (Mentha x piperita). In other aspects, replicable recombinant cloning vehicles are provided which code for isopentenyl monophosphate kinase, or for a base sequence sufficiently complementary to at least a portion of isopentenyl monophosphate kinase DNA or RNA to enable hybridization therewith. In yet other aspects, modified host cells are provided that have been transformed, transfected, infected and/or injected with a recombinant cloning vehicle and/or DNA sequence encoding isopentenyl monophosphate kinase. Thus, systems and methods are provided for the recombinant expression of the aforementioned recombinant isopentenyl monophosphate kinase that may be used to facilitate its production, isolation and purification in significant amounts. Recombinant isopentenyl monophosphate kinase may be used to obtain expression or enhanced expression of isopentenyl monophosphate kinase in plants in order to enhance the production of isopentenyl monophosphate kinase, or isoprenoids derived therefrom, or may be otherwise employed for the regulation or expression of isopentenyl monophosphate kinase, or the production of its products.

  7. Transcriptome profiling of the Australian arid-land plant Eremophila serrulata (A.DC.) Druce (Scrophulariaceae) for the identification of monoterpene synthases.

    PubMed

    Kracht, Octavia Natascha; Ammann, Ann-Christin; Stockmann, Julia; Wibberg, Daniel; Kalinowski, Jörn; Piotrowski, Markus; Kerr, Russell; Brück, Thomas; Kourist, Robert

    2017-04-01

    Plant terpenoids are a large and highly diverse class of metabolites with an important role in the immune defense. They find wide industrial application as active pharmaceutical ingredients, aroma and fragrance compounds. Several Eremophila sp. derived terpenoids have been documented. To elucidate the terpenoid metabolism, the transcriptome of juvenile and mature Eremophila serrulata (A.DC.) Druce (Scrophulariaceae) leaves was sequenced and a transcript library was generated. We report on the first transcriptomic dataset of an Eremophila plant. Illumina MiSeq sequencing (2 × 300 bp) revealed 7,093,266 paired reads, which could be assembled to 34,505 isogroups. To enable detection of terpene biosynthetic genes, leaves were separately treated with methyl jasmonate, a well-documented inducer of plant secondary metabolites. In total, 21 putative terpene synthase genes were detected in the transcriptome data. Two terpene synthase isoenzymatic genes, termed ES01 and ES02, were successfully expressed in E. coli. The resulting proteins catalyzed the conversion of geranyl pyrophosphate, the universal substrate of monoterpene synthases, to myrcene and Z-(β)-ocimene, respectively. The transcriptomic data and the discovery of the first terpene synthases from Eremophila serrulata are the initial step for the understanding of the terpene metabolism in this medicinally important plant genus. Copyright © 2017 Elsevier Ltd. All rights reserved.

  8. Protein modelling of triterpene synthase genes from mangrove plants using Phyre2 and Swiss-model

    NASA Astrophysics Data System (ADS)

    Basyuni, M.; Wati, R.; Sulistiyono, N.; Hayati, R.; Sumardi; Oku, H.; Baba, S.; Sagami, H.

    2018-03-01

    Five oxidosqualene cyclase (OSC) genes from Bruguiera gymnorrhiza, Kandelia candel, and Rhizophora stylosa had previously been cloned and characterized, and were found to encode mono- and multifunctional triterpene synthases. The present study analyzed protein modelling of triterpene synthase genes from mangrove using Phyre2 and Swiss-model. Diversity was noted among the Phyre2 models of the triterpene synthases in sequence identity (38-43%) and number of residues (696-703). RsM2 was distinguishable from the others by its template structure; it used lanosterol synthase as a template (PDB ID: w6j.1.A). By contrast, the other genes used human lanosterol synthase (1w6k.1.A). The predicted binding sites were correlated with the product of each triterpene synthase: the product of BgbAS was β-amyrin, while RsM1 produced a significant amount of β-amyrin. Similarly, the main product of both BgLUS and KcMS was lupeol, whereas that of RsM2 was taraxerol. Homology modelling revealed that 696 residues of BgbAS, BgLUS, RsM1, and RsM2 (91-92% of the amino acid sequence) had been modelled with 100% confidence by the single highest scoring template using Phyre2. This coverage was higher than that of Swiss-model (85-90%). The present study suggested that molecular cloning of triterpene genes provides useful tools for studying the protein modelling related regulation of isoprenoid biosynthesis in mangrove forests.

  9. Effect of biliary drainage on inducible nitric oxide synthase, CD14 and TGR5 expression in obstructive jaundice rats

    PubMed Central

    Wang, Zi-Kai; Xiao, Jian-Guo; Huang, Xue-Fei; Gong, Yi-Chun; Li, Wen

    2013-01-01

    AIM: To investigate the effect of biliary drainage on inducible nitric oxide synthase (iNOS), CD14 and TGR5 expression in rats with obstructive jaundice (OJ). METHODS: Male adult Sprague-Dawley rats were randomly assigned to four groups: OJ, sham operation (SH), internal biliary drainage (ID) and external biliary drainage (ED). Rat models were successfully established by two operations, and the rats were sacrificed for extraction of Kupffer cells (KCs) and liver tissue collection on the 8th and 15th day. KCs were isolated by in situ hepatic perfusion and digested with collagen IV, density gradient centrifuged by percoll reagent and purified by cell culture attachment. The isolated KCs were cultured with the endotoxin lipopolysaccharide (LPS) with and without the addition of ursodeoxycholic acid (UDCA). The expression of iNOS, CD14 and bile acid receptor-TGR5 protein in rat liver tissues was determined by immunohistochemistry. The expression of iNOS and CD14 messenger RNA (mRNA) in the isolated KCs was detected by reverse transcription polymerase chain reaction (PCR) and the TGR5 mRNA level in KCs was measured by real-time quantitative PCR. RESULTS: The iNOS protein was markedly expressed in the liver of OJ rats, but rarely expressed in SH rats. After relief of OJ, the iNOS expression was decidedly suppressed in the ID group (ID vs OJ, P < 0.01), but obviously increased in rats of the ED group (ED vs OJ, P = 0.004). When treated only with LPS, the expression of iNOS mRNA by KCs was increased in the OJ group compared with the SH group (P = 0.004). After relief of biliary obstruction, the iNOS mRNA expression showed slight changes in the ED group (ED vs OJ, P = 0.71), but dropped in the ID group (ID vs OJ, P = 0.001). Compared with LPS treatment alone, the expression of iNOS mRNA was significantly inhibited in all four groups after treatment with both LPS and UDCA (P < 0.01, respectively). 
After bile duct ligation, the CD14 protein expression in rat liver was significantly strengthened (OJ vs SH, P < 0.01), but the CD14 mRNA level by KCs was not up-regulated (OJ vs SH, P = 0.822). After relieving the OJ, the expression of CD14 protein was reduced in the ID group (ID vs OJ, P < 0.01), but not reduced in ED group (ED vs OJ, P = 0.591). And then the CD14 mRNA expression was aggravated by ED (ED vs OJ, P < 0.01), but was not significantly different between the ID group and the SH and OJ groups (ID vs SH, P = 0.944; ID vs OJ, P = 0.513, respectively). The expression of TGR5 protein and mRNA increased significantly in OJ rats (OJ vs SH, P = 0.001, respectively). After relief of OJ, ID could reduce the expression of TGR5 protein and mRNA to the levels of SH group (ID vs SH, P = 0.22 and P = 0.354, respectively), but ED could not (ED vs SH, P = 0.001, respectively). CONCLUSION: ID could be attributed to the regulatory function of activation of KCs and release of inflammatory mediators. PMID:23613625

  10. Comparative de novo transcriptome analysis of male and female Sea buckthorn.

    PubMed

    Bansal, Ankush; Salaria, Mehul; Sharma, Tashil; Stobdan, Tsering; Kant, Anil

    2018-02-01

    Sea buckthorn is a dioecious medicinal plant found at high altitude. The plant has both male and female reproductive organs in separate individuals. In this article, whole transcriptome de novo assemblies of male and female flower bud samples were carried out using Illumina NextSeq 500 platform to determine the role of the genes involved in sex determination. Moreover, genes with differential expression in male and female transcriptomes were identified to understand the underlying sex determination mechanism. The current study showed 63,904 and 62,272 coding sequences (CDS) in female and male transcriptome data sets, respectively. 16,831 common CDS were screened out from both transcriptomes, out of which 625 were upregulated and 491 were found to be downregulated. To understand the potential regulatory roles of differentially expressed genes in metabolic networks and biosynthetic pathways: KEGG mapping, gene ontology, and co-expression network analysis were performed. Comparison with Flowering Interactive Database (FLOR-ID) resulted in eight differentially expressed genes viz. CHD3-type chromatin-remodeling factor PICKLE ( PKL ), phytochrome-associated serine/threonine-protein phosphatase ( FYPP ), protein TOPLESS ( TPL ), sensitive to freezing 6 ( SFR6 ), lysine-specific histone demethylase 1 homolog 1 ( LDL1 ), pre-mRNA-processing-splicing factor 8A ( PRP8A ), sucrose synthase 4 ( SUS4 ), ubiquitin carboxyl-terminal hydrolase 12 ( UBP12 ), known to be broadly involved in flowering, photoperiodism, embryo development, and cold response pathways. Male and female flower bud transcriptome data of Sea buckthorn may provide comprehensive information at genomic level for the identification of genetic regulation involved in sex determination.

  11. Pseudouridine profiling reveals regulated mRNA pseudouridylation in yeast and human cells

    PubMed Central

    Carlile, Thomas M.; Rojas-Duran, Maria F.; Zinshteyn, Boris; Shin, Hakyung; Bartoli, Kristen M.; Gilbert, Wendy V.

    2014-01-01

    Post-transcriptional modification of RNA nucleosides occurs in all living organisms. Pseudouridine, the most abundant modified nucleoside in non-coding RNAs1, enhances the function of transfer RNA and ribosomal RNA by stabilizing RNA structure2–8. mRNAs were not known to contain pseudouridine, but artificial pseudouridylation dramatically affects mRNA function – it changes the genetic code by facilitating non-canonical base pairing in the ribosome decoding center9,10. However, without evidence of naturally occurring mRNA pseudouridylation, its physiological relevance was unclear. Here we present a comprehensive analysis of pseudouridylation in yeast and human RNAs using Pseudo-seq, a genome-wide, single-nucleotide-resolution method for pseudouridine identification. Pseudo-seq accurately identifies known modification sites as well as 100 novel sites in non-coding RNAs, and reveals hundreds of pseudouridylated sites in mRNAs. Genetic analysis allowed us to assign most of the new modification sites to one of seven conserved pseudouridine synthases, Pus1–4, 6, 7 and 9. Notably, the majority of pseudouridines in mRNA are regulated in response to environmental signals, such as nutrient deprivation in yeast and serum starvation in human cells. These results suggest a mechanism for the rapid and regulated rewiring of the genetic code through inducible mRNA modifications. Our findings reveal unanticipated roles for pseudouridylation and provide a resource for identifying the targets of pseudouridine synthases implicated in human disease11–13. PMID:25192136

  12. Id-1 activation of PI3K/Akt/NFkappaB signaling pathway and its significance in promoting survival of esophageal cancer cells.

    PubMed

    Li, Bin; Cheung, Pak Yan; Wang, Xianghong; Tsao, Sai Wah; Ling, Ming Tat; Wong, Yong Chuan; Cheung, Annie L M

    2007-11-01

    Inhibitor of differentiation or DNA binding (Id-1) is a helix-loop-helix protein that is over-expressed in many types of cancer including esophageal cancer. This study aims to investigate its effects on the phosphatidylinositol-3-kinase (PI3K)/Akt/ nuclear factor kappa B (NFkappaB) signaling pathway and the significance in protecting esophageal cancer cells against apoptosis. We found elevated expression of phosphorylated forms of Akt, glycogen synthase kinase 3beta and inhibitor of kappa B, as well as increased nuclear translocation of NFkappaB subunit p65 and NFkappaB DNA-binding activity, in esophageal cancer cells with stable ectopic Id-1 expression. Transient transfection of Id-1 into HEK293 cells confirmed activation of PI3K/Akt/NFkappaB signaling and the effects were counteracted by the PI3K inhibitor LY294002. Treatment with tumor necrosis factor-alpha (TNF-alpha) elicited a significantly weaker apoptotic response, following a marked and sustained activation of Akt and NFkappaB in the Id-1-over-expressing cells, compared with the vector control. The effects of Id-1 on the PI3K/Akt/NFkappaB signaling pathway and apoptosis were reversed in esophageal cancer cells transfected with siRNA against Id-1. In addition, inhibition of PI3K or NFkappaB signaling using the PI3K inhibitor LY294002 or the NFkappaB inhibitor Bay11-7082 increased the sensitivity of Id-1-over-expressing esophageal cancer cells to TNF-alpha-induced apoptosis. Our results provide the first evidence that Id-1 induces the activation of PI3K/Akt/NFkappaB signaling pathway, and protects esophageal cancer cells from TNF-alpha-induced apoptosis in vitro. Inactivation of Id-1 may provide us with a novel strategy to improve the treatment and survival of patients with esophageal cancer.

  13. Influence of coronary artery diameter on eNOS protein content

    NASA Technical Reports Server (NTRS)

    Laughlin, M. H.; Turk, J. R.; Schrage, W. G.; Woodman, C. R.; Price, E. M.

    2003-01-01

    The purpose of this study was to test the hypothesis that the content of endothelial nitric oxide synthase (eNOS) protein (eNOS protein/g total artery protein) increases with decreasing artery diameter in the coronary arterial tree. Content of eNOS protein was determined in porcine coronary arteries with immunoblot analysis. Arteries were isolated in six size categories from each heart: large arteries [301- to 2,500-microm internal diameter (ID)], small arteries (201- to 300-microm ID), resistance arteries (151- to 200-microm ID), large arterioles (101- to 150-microm ID), intermediate arterioles (51- to 100-microm ID), and small arterioles (<50-microm ID). To obtain sufficient protein for analysis from small- and intermediate-sized arterioles, five to seven arterioles 1-2 mm in length were pooled into one sample for each animal. Results establish that the number of smooth muscle cells per endothelial cell decreases from a number of 10 to 15 in large coronary arteries to 1 in the smallest arterioles. Immunohistochemistry revealed that eNOS is located only in endothelial cells in all sizes of coronary artery and in coronary capillaries. Contrary to our hypothesis, eNOS protein content did not increase with decreasing size of coronary artery. Indeed, the smallest coronary arterioles had less eNOS protein per gram of total protein than the large coronary arteries. These results indicate that eNOS protein content is greater in the endothelial cells of conduit arteries, resistance arteries, and large arterioles than in small coronary arterioles.

  14. Modeling, molecular docking, probing catalytic binding mode of acetyl-CoA malate synthase G in Brucella melitensis 16M.

    PubMed

    Adi, Pradeepkiran Jangampalli; Yellapu, Nanda Kumar; Matcha, Bhaskar

    2016-12-01

    Extensive evidence and previous reports indicate that malate synthase G (MSG), an enzyme of the glyoxylate pathway, is a potential virulence factor in several pathogenic organisms, including Brucella melitensis 16M. The lack of crystal structures for promising candidate proteins such as MSG of B. melitensis 16M leaves a large gap in understanding the molecular pathogenesis of brucellosis. In the present study, we constructed a 3-D structure of MSG of Brucella melitensis 16M in MODELLER using the crystal structure of Mycobacterium tuberculosis malate synthase (PDB ID: 2GQ3) as a template. The stereochemical quality of the restrained model was evaluated with the SAVES server; notably, we identified the catalytic functional core domain, located at the 4th cleft, with conserved catalytic amino acids spanning ILE 59 to VAL 586 that account for the function of the protein. Furthermore, virtual screening and docking results reveal that the best lead molecules bind at the core domain pocket of the MSG catalytic residues, and these ligand leads could be the best prospective inhibitors to treat brucellosis.

  15. Contribution of copy number variants involving nonsense-mediated mRNA decay pathway genes to neuro-developmental disorders.

    PubMed

    Nguyen, Lam S; Kim, Hyung-Goo; Rosenfeld, Jill A; Shen, Yiping; Gusella, James F; Lacassie, Yves; Layman, Lawrence C; Shaffer, Lisa G; Gécz, Jozef

    2013-05-01

    The nonsense-mediated mRNA decay (NMD) pathway functions not only to degrade transcripts containing premature termination codons (PTC), but also to regulate the transcriptome. UPF3B and RBM8A, important components of NMD, have been implicated in various forms of intellectual disability (ID) and Thrombocytopenia with Absent Radius (TAR) syndrome, which is also associated with ID. To gauge the contribution of other NMD factors to ID, we performed a comprehensive search for copy number variants (CNVs) of 18 NMD genes among individuals with ID and/or congenital anomalies. We identified 11 cases with heterozygous deletions of the genomic region encompassing UPF2, which encodes for a direct interacting protein of UPF3B. Using RNA-Seq, we showed that the genome-wide consequence of reduced expression of UPF2 is similar to that seen in patients with UPF3B mutations. Out of the 1009 genes found deregulated in patients with UPF2 deletions by at least 2-fold, majority (95%) were deregulated similarly in patients with UPF3B mutations. This supports the major role of deletion of UPF2 in ID. Furthermore, we found that four other NMD genes, UPF3A, SMG6, EIF4A3 and RNPS1 are frequently deleted and/or duplicated in the patients. We postulate that dosage imbalances of these NMD genes are likely to be the causes or act as predisposing factors for neuro-developmental disorders. Our findings further emphasize the importance of NMD pathway(s) in learning and memory.

  16. Understanding the molecular mechanisms underlying the effects of light intensity on flavonoid production by RNA-seq analysis in Epimedium pseudowushanense B.L.Guo

    PubMed Central

    Chen, Haimei; Guo, Baolin; Liu, Chang

    2017-01-01

    Epimedium pseudowushanense B.L.Guo, a light-demanding shade herb, is used in traditional medicine to increase libido and strengthen muscles and bones. The recognition of the health benefits of Epimedium has increased its market demand. However, its resource recycling rate is low and environmentally dependent. Furthermore, its natural sources are endangered, further increasing prices. Commercial cultivation can address these resource constraints. Understanding the effects of environmental factors on the production of its active components would improve the technology for cultivation and germplasm conservation. Here, we studied the effects of light intensities on the flavonoid production and revealed the molecular mechanism using RNA-seq analysis. Plants were exposed to five levels of light intensity from germination to flowering, and the flavonoid contents were measured using HPLC. Quantification of epimedin A, epimedin B, epimedin C, and icariin showed that the flavonoid contents varied with different light intensity levels. The largest amount of epimedin C was produced at light intensity level 4 (I4). Next, the leaves under the treatment of three light intensity levels (“L”, “M” and “H”) with the largest differences in the flavonoid content were subjected to RNA-seq analysis. Transcriptome reconstruction identified 43,657 unigenes. All unigene sequences were annotated by searching against the Nr, Gene Ontology, and Kyoto Encyclopedia of Genes and Genomes (KEGG) databases. In total, 4008, 5260, and 3591 significant differentially expressed genes (DEGs) were identified between the groups L vs. M, M vs. H and L vs. H. Particularly, twenty-one full-length genes involved in flavonoid biosynthesis were identified. The expression levels of the flavonol synthase and chalcone synthase genes were strongly associated with light-induced flavonoid abundance, with the highest expression levels found in the H group. 
Furthermore, 65 transcription factors, including 31 FAR1, 17 MYB-related, 12 bHLH, and 5 WRKY, were differentially expressed after light induction. Finally, a model was proposed to explain the light-induced flavonoid production. This study provided valuable information to improve cultivation practices and produced the first comprehensive resource for E. pseudowushanense transcriptomes. PMID:28786984

  17. Understanding the molecular mechanisms underlying the effects of light intensity on flavonoid production by RNA-seq analysis in Epimedium pseudowushanense B.L.Guo.

    PubMed

    Pan, Junqian; Chen, Haimei; Guo, Baolin; Liu, Chang

    2017-01-01

    Epimedium pseudowushanense B.L.Guo, a light-demanding shade herb, is used in traditional medicine to increase libido and strengthen muscles and bones. The recognition of the health benefits of Epimedium has increased its market demand. However, its resource recycling rate is low and environmentally dependent. Furthermore, its natural sources are endangered, further increasing prices. Commercial cultivation can address these resource constraints. Understanding the effects of environmental factors on the production of its active components would improve the technology for cultivation and germplasm conservation. Here, we studied the effects of light intensities on the flavonoid production and revealed the molecular mechanism using RNA-seq analysis. Plants were exposed to five levels of light intensity from germination to flowering, and the flavonoid contents were measured using HPLC. Quantification of epimedin A, epimedin B, epimedin C, and icariin showed that the flavonoid contents varied with different light intensity levels. The largest amount of epimedin C was produced at light intensity level 4 (I4). Next, the leaves under the treatment of three light intensity levels ("L", "M" and "H") with the largest differences in the flavonoid content were subjected to RNA-seq analysis. Transcriptome reconstruction identified 43,657 unigenes. All unigene sequences were annotated by searching against the Nr, Gene Ontology, and Kyoto Encyclopedia of Genes and Genomes (KEGG) databases. In total, 4008, 5260, and 3591 significant differentially expressed genes (DEGs) were identified between the groups L vs. M, M vs. H and L vs. H. Particularly, twenty-one full-length genes involved in flavonoid biosynthesis were identified. The expression levels of the flavonol synthase and chalcone synthase genes were strongly associated with light-induced flavonoid abundance, with the highest expression levels found in the H group. 
Furthermore, 65 transcription factors, including 31 FAR1, 17 MYB-related, 12 bHLH, and 5 WRKY, were differentially expressed after light induction. Finally, a model was proposed to explain the light-induced flavonoid production. This study provided valuable information to improve cultivation practices and produced the first comprehensive resource for E. pseudowushanense transcriptomes.

  18. RNA-Seq mediated root transcriptome analysis of Chlorophytum borivilianum for identification of genes involved in saponin biosynthesis.

    PubMed

    Kumar, Sunil; Kalra, Shikha; Singh, Baljinder; Kumar, Avneesh; Kaur, Jagdeep; Singh, Kashmir

    2016-01-01

    Chlorophytum borivilianum is an important species of the Liliaceae family, owing to its vital medicinal properties. Plant roots are used for aphrodisiac, adaptogen, anti-aging, health-restorative and health-promoting purposes. Saponins are considered to be the principal bioactive components responsible for the wide variety of pharmacological properties of this plant. In the present study, we performed de novo root transcriptome sequencing of C. borivilianum using the Illumina HiSeq 2000 platform to gain molecular insight into saponin biosynthesis. A total of 33,963,356 high-quality reads were obtained after quality filtration. Sequences were assembled using various programs, which generated 97,344 transcripts with a size range of 100-5,216 bp and an N50 value of 342. Data were analyzed against non-redundant protein, gene ontology (GO), and enzyme commission (EC) databases. All the genes involved in saponin biosynthesis, along with five full-length genes, namely farnesyl pyrophosphate synthase, cycloartenol synthase, β-amyrin synthase, cytochrome p450, and sterol-3-glucosyltransferase, were identified. Reads per exon kilobase per million (RPKM)-based comparative expression profiling was done to study the differential regulation of the genes. In silico expression analysis of seven selected genes of the saponin biosynthetic pathway was validated by qRT-PCR.

  19. MITIE: Simultaneous RNA-Seq-based transcript identification and quantification in multiple samples.

    PubMed

    Behr, Jonas; Kahles, André; Zhong, Yi; Sreedharan, Vipin T; Drewe, Philipp; Rätsch, Gunnar

    2013-10-15

    High-throughput sequencing of mRNA (RNA-Seq) has led to tremendous improvements in the detection of expressed genes and reconstruction of RNA transcripts. However, the extensive dynamic range of gene expression, technical limitations and biases, as well as the observed complexity of the transcriptional landscape, pose profound computational challenges for transcriptome reconstruction. We present the novel framework MITIE (Mixed Integer Transcript IdEntification) for simultaneous transcript reconstruction and quantification. We define a likelihood function based on the negative binomial distribution, use a regularization approach to select a few transcripts collectively explaining the observed read data and show how to find the optimal solution using Mixed Integer Programming. MITIE can (i) take advantage of known transcripts, (ii) reconstruct and quantify transcripts simultaneously in multiple samples, and (iii) resolve the location of multi-mapping reads. It is designed for genome- and assembly-based transcriptome reconstruction. We present an extensive study based on realistic simulated RNA-Seq data. When compared with state-of-the-art approaches, MITIE proves to be significantly more sensitive and overall more accurate. Moreover, MITIE yields substantial performance gains when used with multiple samples. We applied our system to 38 Drosophila melanogaster modENCODE RNA-Seq libraries and estimated the sensitivity of reconstructing omitted transcript annotations and the specificity with respect to annotated transcripts. Our results corroborate that a well-motivated objective paired with appropriate optimization techniques lead to significant improvements over the state-of-the-art in transcriptome reconstruction. MITIE is implemented in C++ and is available from http://bioweb.me/mitie under the GPL license.

  20. Impact of genomic polymorphism on arterial hypertension after aortic coarctation repair.

    PubMed

    Hager, Alfred; Bildau, Judith; Kreuder, Joachim; Kaemmerer, Harald; Hess, John

    2011-08-18

    Even after repair of aortic coarctation without restenosis there is a high incidence of arterial hypertension. This study was performed to assess the contribution of several inherited gene polymorphisms, which are known to be related to essential hypertension. 122 patients aged 17-72 years, 46 women, and 2-27 years after repair of isolated aortic coarctation without restenosis were investigated. Genomic polymorphisms of angiotensin converting enzyme (ACE I/D), angiotensinogen (AGT, c.704C>T), angiotensin II receptor type 1 (AGTR1, c.1166A>C), aldosterone synthase (CYP11B2, c.-344C>T), endothelin 1 (EDN1, EDN1/ex5-c.5665G>T), G protein (GNB3, c.825C>T), G protein-coupled receptor kinase 4 (GRK4, c.679C>T), fibrillin 1 (FBN1, VNTR(TAAA)) and two polymorphisms each of the β1 adrenoreceptor (ADRB1, c.145G>A and c.1165C>G), β2 adrenoreceptor (ADRB2, c.46A>G and c.79C>G), and endothelial NO synthase (NOS3, intron 4 I/D and NOS3, c.894G>T) were determined by PCR amplification and fragment length analysis. Patients were classified "normotensive" if they were not on antihypertensive drugs and showed normal blood pressure both on ambulatory measurement and exercise test. None of the investigated genomic polymorphisms could be related to hypertension. Only patients with the ACE I/I genotype had a less pronounced nocturnal dipping and patients with an ADRB1 c.1165 C/C genotype had a higher systolic and mean blood pressure at night. Development of late hypertension after aortic coarctation repair could not be related to the investigated genomic polymorphisms. The correlation of the ACE I/D and the ADRB1 c.1165C>G polymorphism to nocturnal dipping and blood pressure at nighttime needs further confirmation. Copyright © 2010 Elsevier Ireland Ltd. All rights reserved.

  1. Compositions and methods for improved protein production

    DOEpatents

    Bodie, Elizabeth A [San Carlos, CA]; Kim, Steve [San Francisco, CA]

    2012-07-10

    The present invention relates to the identification of novel nucleic acid sequences, designated herein as 7p, 8k, 7E, 9G, 8Q and 203, in a host cell which effect protein production. The present invention also provides host cells having a mutation or deletion of part or all of the gene encoding 7p, 8k, 7E, 9G, 8Q and 203, which are presented in FIG. 1, and are SEQ ID NOS.: 1-6, respectively. The present invention also provides host cells further comprising a nucleic acid encoding a desired heterologous protein such as an enzyme.

  2. Compositions and methods for improved protein production

    DOEpatents

    Bodie, Elizabeth A.; Kim, Steve Sungjin

    2014-06-03

    The present invention relates to the identification of novel nucleic acid sequences, designated herein as 7p, 8k, 7E, 9G, 8Q and 203, in a host cell which effect protein production. The present invention also provides host cells having a mutation or deletion of part or all of the gene encoding 7p, 8k, 7E, 9G, 8Q and 203, which are presented in FIG. 1, and are SEQ ID NOS.: 1-6, respectively. The present invention also provides host cells further comprising a nucleic acid encoding a desired heterologous protein such as an enzyme.

  3. In silico prediction of inhibitory effects of pyrazol-5-one and indazole derivatives on GSK3β kinase enzyme

    NASA Astrophysics Data System (ADS)

    Wang, Fangfang; Liu, Mengmeng; Liu, Jianling

    2012-09-01

    Glycogen synthase kinase-3 beta (GSK3β) plays an important role in a diverse number of regulatory pathways by phosphorylation of several different cellular targets, and its inhibitors have been evaluated as promising drug candidates. In this work, 192 3-aryl-4-(arylhydrazono)-1H-pyrazol-5-one analogs (AHP) and indazole (ID) derivatives possessing selective binding affinity for GSK3β kinase were studied using the 3D-QSAR/CoMFA/CoMSIA methodologies. The obtained CoMFA/CoMSIA models exhibit both good internal and external predictive abilities, i.e., R²cv = 0.551 and R²pred = 0.698 for AHP derivatives and R²cv = 0.511 and R²pred = 0.791 for ID analogs. Of paramount interest is the observation derived from the combination of molecular dynamics and molecular docking studies that Val135 and Asp133 are responsible for the binding recognition for AHP molecules, while residues Val135 and Pro136 are mainly involved in the specific ligand-kinase interactions for ID analogs. The developed models may be helpful for the rational design of novel potent GSK3β inhibitors.

  4. Increased chalcone synthase (CHS) expression is associated with dicamba resistance in Kochia scoparia.

    PubMed

    Pettinga, Dean J; Ou, Junjun; Patterson, Eric L; Jugulam, Mithila; Westra, Philip; Gaines, Todd A

    2017-10-30

    Resistance to the synthetic auxin herbicide dicamba is increasingly problematic in Kochia scoparia. The resistance mechanism in an inbred dicamba-resistant K. scoparia line (9425R) was investigated using physiological and transcriptomics (RNA-Seq) approaches. No differences were found in dicamba absorption or metabolism between 9425R and a dicamba-susceptible line, but 9425R was found to have significantly reduced dicamba translocation. Known auxin-responsive genes ACC synthase (ACS) and indole-3-acetic acid amino synthetase (GH3) were transcriptionally induced following dicamba treatment in dicamba-susceptible K. scoparia but not in 9425R. Chalcone synthase (CHS), the gene regulating synthesis of the flavonols quercetin and kaempferol, was found to have twofold higher transcription in 9425R both without and 12 h after dicamba treatment. Increased CHS transcription co-segregated with dicamba resistance in a forward genetics screen using an F2 population. Prior work has shown that the flavonols quercetin and kaempferol compete with auxin for intercellular movement and vascular loading via ATP-binding cassette subfamily B (ABCB) membrane transporters. The results of this study support a model in which constitutively increased CHS expression in the meristem produces more flavonols that would compete with dicamba for intercellular transport by ABCB transporters, resulting in reduced dicamba translocation. © 2017 Society of Chemical Industry.

  5. Potential antimicrobial agents from triazole-functionalized 2H-benzo[b][1,4]oxazin-3(4H)-ones.

    PubMed

    Bollu, Rajitha; Banu, Saleha; Bantu, Rajashaker; Reddy, A Gopi; Nagarapu, Lingaiah; Sirisha, K; Kumar, C Ganesh; Gunda, Shravan Kumar; Shaik, Kamal

    2017-12-01

    A series of substituted triazole functionalized 2H-benzo[b][1,4]oxazin-3(4H)-ones were synthesized by employing click chemistry and further characterized based on 1H NMR, 13C NMR, IR and mass spectral studies. All the synthesized derivatives were screened for their in vitro antimicrobial activities. Further, molecular docking studies were accomplished to explore the binding interactions between 1,2,3-triazol-4-yl-2H-benzo[b][1,4]oxazin-3(4H)-one and the active site of Staphylococcus aureus (CrtM) dehydrosqualene synthase (PDB ID: 2ZCS). These docking studies revealed that the synthesized derivatives showed high binding energies and strong H-bond interactions with the dehydrosqualene synthase validating the observed antimicrobial activity data. Based on antimicrobial activity and docking studies, the compounds 9c, 9d and 9e were identified as promising antimicrobial leads. Copyright © 2017 Elsevier Ltd. All rights reserved.

  6. TAF1 Variants Are Associated with Dysmorphic Features, Intellectual Disability, and Neurological Manifestations.

    PubMed

    O'Rawe, Jason A; Wu, Yiyang; Dörfel, Max J; Rope, Alan F; Au, P Y Billie; Parboosingh, Jillian S; Moon, Sungjin; Kousi, Maria; Kosma, Konstantina; Smith, Christopher S; Tzetis, Maria; Schuette, Jane L; Hufnagel, Robert B; Prada, Carlos E; Martinez, Francisco; Orellana, Carmen; Crain, Jonathan; Caro-Llopis, Alfonso; Oltra, Silvestre; Monfort, Sandra; Jiménez-Barrón, Laura T; Swensen, Jeffrey; Ellingwood, Sara; Smith, Rosemarie; Fang, Han; Ospina, Sandra; Stegmann, Sander; Den Hollander, Nicolette; Mittelman, David; Highnam, Gareth; Robison, Reid; Yang, Edward; Faivre, Laurence; Roubertie, Agathe; Rivière, Jean-Baptiste; Monaghan, Kristin G; Wang, Kai; Davis, Erica E; Katsanis, Nicholas; Kalscheuer, Vera M; Wang, Edith H; Metcalfe, Kay; Kleefstra, Tjitske; Innes, A Micheil; Kitsiou-Tzeli, Sophia; Rosello, Monica; Keegan, Catherine E; Lyon, Gholson J

    2015-12-03

    We describe an X-linked genetic syndrome associated with mutations in TAF1 and manifesting with global developmental delay, intellectual disability (ID), characteristic facial dysmorphology, generalized hypotonia, and variable neurologic features, all in male individuals. Simultaneous studies using diverse strategies led to the identification of nine families with overlapping clinical presentations and affected by de novo or maternally inherited single-nucleotide changes. Two additional families harboring large duplications involving TAF1 were also found to share phenotypic overlap with the probands harboring single-nucleotide changes, but they also demonstrated a severe neurodegeneration phenotype. Functional analysis with RNA-seq for one of the families suggested that the phenotype is associated with downregulation of a set of genes notably enriched with genes regulated by E-box proteins. In addition, knockdown and mutant studies of this gene in zebrafish have shown a quantifiable, albeit small, effect on a neuronal phenotype. Our results suggest that mutations in TAF1 play a critical role in the development of this X-linked ID syndrome. Copyright © 2015 The Authors. Published by Elsevier Inc. All rights reserved.

  7. Genotype-phenotype characterization in 13 individuals with chromosome Xp11.22 duplications.

    PubMed

    Grams, Sarah E; Argiropoulos, Bob; Lines, Matthew; Chakraborty, Pranesh; Mcgowan-Jordan, Jean; Geraghty, Michael T; Tsang, Marilyn; Eswara, Marthand; Tezcan, Kamer; Adams, Kelly L; Linck, Leesa; Himes, Patricia; Kostiner, Dana; Zand, Dina J; Stalker, Heather; Driscoll, Daniel J; Huang, Taosheng; Rosenfeld, Jill A; Li, Xu; Chen, Emily

    2016-04-01

    We report 13 new individuals with duplications in Xp11.22-p11.23. The index family has one male and two female members in three generations with mild to severe intellectual disability (ID), speech delay, dysmorphic features, early puberty, constipation, and/or hand and foot abnormalities. Affected individuals were found to have two small duplications in Xp11.22 at nucleotide positions (hg19) 50,112,063-50,456,458 bp (distal) and 53,160,114-53,713,154 bp (proximal). Collectively, these two regions include 14 RefSeq genes, prompting collection of a larger cohort of patients in an attempt to delineate critical genes associated with the observed phenotype. In total, we have collected data on nine individuals with duplications overlapping the distal duplication region containing SHROOM4 and DGKK and eight individuals with duplications overlapping the proximal region including HUWE1. Duplications of HUWE1 have been previously associated with non-syndromic ID. Our data, with previously published reports, suggest that duplications involving SHROOM4 and DGKK may represent a new syndromic X-linked ID critical region associated with mild to severe ID, speech delay +/- dysarthria, attention deficit disorder, precocious puberty, constipation, and motor delay. We frequently observed foot abnormalities, 5th finger clinodactyly, tapering fingers, constipation, and exercise intolerance in patients with duplications of these two genes. Regarding duplications including the proximal region, our observations agree with previous studies, which have found associations with intellectual disability. In addition, expressive language delay, failure to thrive, motor delay, and 5th finger clinodactyly were frequently observed in patients with the proximal duplication. © 2015 Wiley Periodicals, Inc.
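
    The two duplication intervals quoted in this abstract can be sized with simple interval arithmetic. The sketch below is illustrative only: it assumes the hg19 coordinates are 1-based and inclusive (the abstract does not say), and the helper names are hypothetical.

```python
# Interval arithmetic for the two Xp11.22 duplications quoted in the abstract.
# Assumption: coordinates are 1-based and inclusive (not stated in the source).

def interval_length(start: int, end: int) -> int:
    """Number of bases covered by an inclusive interval."""
    return end - start + 1

def overlaps(a: tuple, b: tuple) -> bool:
    """True if two inclusive intervals share at least one base."""
    return a[0] <= b[1] and b[0] <= a[1]

DISTAL = (50_112_063, 50_456_458)    # region containing SHROOM4 and DGKK
PROXIMAL = (53_160_114, 53_713_154)  # region including HUWE1

distal_bp = interval_length(*DISTAL)
proximal_bp = interval_length(*PROXIMAL)
gap_bp = PROXIMAL[0] - DISTAL[1] - 1  # bases lying strictly between the two

print(f"distal: {distal_bp:,} bp, proximal: {proximal_bp:,} bp")
print(f"overlap: {overlaps(DISTAL, PROXIMAL)}, gap: {gap_bp:,} bp")
```

    Under that coordinate assumption the two duplications are separate, non-overlapping segments, which is consistent with the abstract treating them as distinct distal and proximal regions.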

  8. Biallelic missense variants in ZBTB11 can cause intellectual disability in human.

    PubMed

    Fattahi, Zohreh; Sheikh, Taimoor I; Musante, Luciana; Rasheed, Memoona; Taskiran, Ibrahim Ihsan; Harripaul, Ricardo; Hu, Hao; Kazeminasab, Somayeh; Alam, Muhammad Rizwan; Hosseini, Masoumeh; Larti, Farzaneh; Ghaderi, Zhila; Celik, Arzu; Ayub, Muhammad; Ansar, Muhammad; Haddadi, Mohammad; Wienker, Thomas F; Ropers, Hans Hilger; Kahrizi, Kimia; Vincent, John B; Najmabadi, H

    2018-06-08

    Exploring genes and pathways underlying Intellectual Disability (ID) provides insight into brain development and function, clarifying the complex puzzle of how cognition develops. As part of ongoing systematic studies to identify candidate ID genes, linkage analysis and next-generation sequencing revealed ZBTB11 as a novel candidate ID gene. ZBTB11 encodes a less-studied transcription regulator, and the two missense variants identified in this study may disrupt canonical Zn2+-binding residues of its C2H2 zinc finger domain, possibly altering DNA binding. Using HEK293T cells transfected with wild-type and mutant GFP-ZBTB11 constructs, we found that the ZBTB11 mutants were excluded from the nucleolus, where the wild-type recombinant protein is predominantly localized. Pathway analysis applied to ChIP-seq data deposited in the ENCODE database supports the localization of ZBTB11 in nucleoli, highlighting associated pathways such as rRNA synthesis, ribosomal assembly, RNA modification, and stress sensing, and provides a direct link between subcellular ZBTB11 location and its function. Furthermore, considering the report of prominent brain and spinal cord degeneration in a zebrafish Zbtb11 mutant, we investigated ZBTB11-ortholog knockdown in Drosophila melanogaster brain by targeting RNAi using the UAS/Gal4 system. The observed reduction of the mushroom body to approximately a third of its size - possibly through neuronal reduction or degeneration - may affect neuronal circuits in the brain that are required for adaptive behavior, specifying the role of this gene in the nervous system. In conclusion, we report two ID families segregating ZBTB11 biallelic mutations disrupting Zn2+-binding motifs, and provide functional evidence linking ZBTB11 dysfunction to this phenotype.

  9. TnSeq of Mycobacterium tuberculosis clinical isolates reveals strain-specific antibiotic liabilities

    PubMed Central

    Carey, Allison F.; Rock, Jeremy M.; Krieger, Inna V.; Gagneux, Sebastien; Sacchettini, James C.; Fortune, Sarah M.

    2018-01-01

    Although Mycobacterium tuberculosis (Mtb) was once considered a phenotypically monomorphic bacterium, a growing body of work demonstrates heterogeneity among Mtb strains in clinically relevant characteristics, including virulence and response to antibiotics. However, the genetic and molecular basis for most phenotypic differences among Mtb strains remains unknown. To investigate the basis of strain variation in Mtb, we performed genome-wide transposon mutagenesis coupled with next-generation sequencing (TnSeq) for a panel of Mtb clinical isolates and the reference strain H37Rv to compare genetic requirements for in vitro growth across these strains. We developed an analytic approach to identify quantitative differences in genetic requirements between these genetically diverse strains, which vary in genomic structure and gene content. Using this methodology, we found differences between strains in their requirements for genes involved in fundamental cellular processes, including redox homeostasis and central carbon metabolism. Among the genes with differential requirements were katG, which encodes the activator of the first-line antitubercular agent isoniazid, and glcB, which encodes malate synthase, the target of a novel small-molecule inhibitor. Differences among strains in their requirement for katG and glcB predicted differences in their response to these antimicrobial agents. Importantly, these strain-specific differences in antibiotic response could not be predicted by genetic variants identified through whole genome sequencing or by gene expression analysis. Our results provide novel insight into the basis of variation among Mtb strains and demonstrate that TnSeq is a scalable method to predict clinically important phenotypic differences among Mtb strains. PMID:29505613

  10. RNA-Seq analysis and transcriptome assembly for blackberry (Rubus sp. Var. Lochness) fruit.

    PubMed

    Garcia-Seco, Daniel; Zhang, Yang; Gutierrez-Mañero, Francisco J; Martin, Cathie; Ramos-Solano, Beatriz

    2015-01-22

    There is an increasing interest in berries, especially blackberries, in the diet because of recent reports of their health benefits due to their high content of flavonoids. A broad range of genomic tools are available for other Rosaceae species but these tools are still lacking in the Rubus genus, thus limiting gene discovery and the breeding of improved varieties. De novo RNA-seq of ripe blackberries grown under field conditions was performed using Illumina HiSeq 2000. Almost 9 billion nucleotide bases were sequenced in total. Following assembly, 42,062 consensus sequences were detected. For functional annotation, 33,040 (NR), 32,762 (NT), 21,932 (Swiss-Prot), 20,134 (KEGG), 13,676 (COG), 24,168 (GO) consensus sequences were annotated using different databases; in total 34,552 annotated sequences were identified. For protein prediction analysis, the number of coding DNA sequences (CDS) that mapped to the protein database was 32,540. Non-redundant (NR) annotation showed that 25,418 genes (73.5%) had the highest similarity with Fragaria vesca subspecies vesca. Reanalysis was undertaken by aligning the reads with this reference genome for a deeper analysis of the transcriptome. We demonstrated that de novo assembly using Trinity, followed by annotation with BLAST against different databases, was complementary to alignment to the reference sequence using SOAPaligner/SOAP2. The Fragaria reference genome belongs to a species in the same family as blackberry (Rosaceae) but to a different genus. Since blackberries are tetraploids, the possibility of artefactual gene chimeras resulting from mis-assembly was tested with one of the genes sequenced by RNA-seq, Chalcone Synthase (CHS). cDNAs encoding this protein were cloned and sequenced. Primers designed to the assembled sequences accurately distinguished different contigs, at least for chalcone synthase genes. We prepared and analysed transcriptome data from ripe blackberries, for which prior genomic information was limited. This new sequence information will improve the knowledge of this important and healthy fruit, providing an invaluable new tool for biological research.
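
    The annotation counts in this abstract add up to far more than the 34,552 annotated sequences reported in total, because one consensus sequence can be annotated in several databases. A minimal sketch of that bookkeeping follows; the counts are copied from the abstract, while the toy contig IDs are hypothetical and only illustrate the set-union idea.

```python
# Counts copied from the abstract above.
ASSEMBLED = 42_062          # consensus sequences after assembly
ANNOTATED_TOTAL = 34_552    # sequences annotated in at least one database
PER_DATABASE = {
    "NR": 33_040, "NT": 32_762, "Swiss-Prot": 21_932,
    "KEGG": 20_134, "COG": 13_676, "GO": 24_168,
}

# Share of assembled sequences that received any annotation.
annotated_pct = 100 * ANNOTATED_TOTAL / ASSEMBLED

# Summing per-database hits counts shared sequences more than once.
summed_hits = sum(PER_DATABASE.values())

# Toy illustration with hypothetical contig IDs: the total is the size of the
# union of the per-database sets, not the sum of their sizes.
toy_hits = {"NR": {"c1", "c2", "c3"}, "KEGG": {"c2", "c3"}, "GO": {"c3", "c4"}}
toy_union = set().union(*toy_hits.values())

print(f"annotated: {annotated_pct:.1f}% of assembled sequences")
print(f"sum of per-database hits: {summed_hits:,} vs. total {ANNOTATED_TOTAL:,}")
print(f"toy union size: {len(toy_union)}")
```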

  11. The mecillinam resistome reveals a role for peptidoglycan endopeptidases in stimulating cell wall synthesis in Escherichia coli.

    PubMed

    Lai, Ghee Chuan; Cho, Hongbaek; Bernhardt, Thomas G

    2017-07-01

    Bacterial cells are typically surrounded by a net-like macromolecule called the cell wall constructed from the heteropolymer peptidoglycan (PG). Biogenesis of this matrix is the target of penicillin and related beta-lactams. These drugs inhibit the transpeptidase activity of PG synthases called penicillin-binding proteins (PBPs), preventing the crosslinking of nascent wall material into the existing network. The beta-lactam mecillinam specifically targets the PBP2 enzyme in the cell elongation machinery of Escherichia coli. Low-throughput selections for mecillinam resistance have historically been useful in defining mechanisms involved in cell wall biogenesis and the killing activity of beta-lactam antibiotics. Here, we used transposon-sequencing (Tn-Seq) as a high-throughput method to identify nearly all mecillinam resistance loci in the E. coli genome, providing a comprehensive resource for uncovering new mechanisms underlying PG assembly and drug resistance. Induction of the stringent response or the Rcs envelope stress response has been previously implicated in mecillinam resistance. We therefore also performed the Tn-Seq analysis in mutants defective for these responses in addition to wild-type cells. Thus, the utility of the dataset was greatly enhanced by determining the stress response dependence of each resistance locus in the resistome. Reasoning that stress response-independent resistance loci are those most likely to identify direct modulators of cell wall biogenesis, we focused our downstream analysis on this subset of the resistome. Characterization of one of these alleles led to the surprising discovery that the overproduction of endopeptidase enzymes that cleave crosslinks in the cell wall promotes mecillinam resistance by stimulating PG synthesis by a subset of PBPs. Our analysis of this activation mechanism suggests that, contrary to the prevailing view in the field, PG synthases and PG cleaving enzymes need not function in multi-enzyme complexes to expand the cell wall matrix.

  12. Multi-step splicing of sphingomyelin synthase linear and circular RNAs.

    PubMed

    Filippenkov, Ivan B; Sudarkina, Olga Yu; Limborska, Svetlana A; Dergunova, Lyudmila V

    2018-05-15

    The SGMS1 gene encodes the enzyme sphingomyelin synthase 1 (SMS1), which is involved in the regulation of lipid metabolism, apoptosis, intracellular vesicular transport and other significant processes. The SGMS1 gene is located on chromosome 10 and has a size of 320 kb. Previously, we showed that dozens of alternative transcripts of the SGMS1 gene are present in various human tissues. In addition to mRNAs that provide synthesis of the SMS1 protein, this gene participates in the synthesis of non-coding transcripts, including circular RNAs (circRNAs), which include exons of the 5'-untranslated region (5'-UTR) and are highly represented in the brain. In this study, using the high-throughput technology RNA-CaptureSeq, many new SGMS1 transcripts were identified, including both intronic unspliced RNAs (premature RNAs) and RNAs formed via alternative splicing. Recursive exons (RS-exons) that can participate in the multi-step splicing of long introns of the gene were also identified. These exons participate in the formation of circRNAs. Thus, multi-step splicing may provide a variety of linear and circular RNAs of eukaryotic genes in tissues. Copyright © 2018 Elsevier B.V. All rights reserved.

  13. Contemporary microbiology and identification of Corynebacteria spp. causing infections in human.

    PubMed

    Zasada, A A; Mosiej, E

    2018-06-01

    Corynebacterium is a genus of bacteria of growing clinical importance. Progress in medicine results in a growing population of immunocompromised patients and a growing number of infections caused by opportunistic pathogens. New infections caused by new Corynebacterium species and by species previously regarded as commensal micro-organisms have been described. In parallel with changes in Corynebacteria infections, the microbiological laboratory diagnostic possibilities are changing, but identification of this group of bacteria to the species level remains difficult. In the paper, we present various manual, semi-automated and automated assays used in clinical laboratories for Corynebacterium identification, such as API Coryne, RapID CB Plus, BBL Crystal Gram Positive ID System, MICRONAUT-RPO, VITEK 2, BD Phoenix System, Sherlock Microbial ID System, MicroSeq Microbial Identification System, Biolog Microbial Identification Systems, MALDI-TOF MS systems, polymerase chain reaction (PCR)-based and sequencing-based assays. The presented assays are based on various properties, such as biochemical tests, specific DNA sequences, composition of cellular fatty acids and protein profiles, and have specific limitations. The number of opportunistic infections caused by Corynebacteria is increasing due to an increase in the number of immunocompromised patients. New Corynebacterium species and new human infections caused by this group of bacteria have been described recently. However, identification of Corynebacteria is still a challenge despite the application of sophisticated laboratory methods. In the study we present the possibilities and limitations of various commercial systems for identification of Corynebacteria. © 2018 The Society for Applied Microbiology.

  14. Massively Parallel Sequencing of Patients with Intellectual Disability, Congenital Anomalies and/or Autism Spectrum Disorders with a Targeted Gene Panel

    PubMed Central

    Brett, Maggie; McPherson, John; Zang, Zhi Jiang; Lai, Angeline; Tan, Ee-Shien; Ng, Ivy; Ong, Lai-Choo; Cham, Breana; Tan, Patrick; Rozen, Steve; Tan, Ene-Choo

    2014-01-01

    Developmental delay and/or intellectual disability (DD/ID) affects 1–3% of all children. At least half of these are thought to have a genetic etiology. Recent studies have shown that massively parallel sequencing (MPS) using a targeted gene panel is particularly suited for diagnostic testing for genetically heterogeneous conditions. We report on our experiences with using massively parallel sequencing of a targeted gene panel of 355 genes for investigating the genetic etiology of eight patients with a wide range of phenotypes including DD/ID, congenital anomalies and/or autism spectrum disorder. Targeted sequence enrichment was performed using the Agilent SureSelect Target Enrichment Kit and sequenced on the Illumina HiSeq2000 using paired-end reads. For all eight patients, 81–84% of the targeted regions achieved read depths of at least 20×, with average read depths overlapping targets ranging from 322× to 798×. Causative variants were successfully identified in two of the eight patients: a nonsense mutation in the ATRX gene and a canonical splice site mutation in the L1CAM gene. In a third patient, a canonical splice site variant in the USP9X gene could likely explain all or some of her clinical phenotypes. These results confirm the value of targeted MPS for investigating DD/ID in children for diagnostic purposes. However, targeted gene MPS was less likely to provide a genetic diagnosis for children whose phenotype includes autism. PMID:24690944
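
    The coverage figures in this abstract (81–84% of targeted regions at a depth of at least 20×, and average depths of 322× to 798×) are two different summaries of per-base read depth. The sketch below shows how each is computed; the depth values are a hypothetical toy example, not data from the study, and the function names are illustrative.

```python
def breadth_at_depth(depths, min_depth=20):
    """Fraction of target positions covered by at least `min_depth` reads."""
    if not depths:
        raise ValueError("depths must not be empty")
    covered = sum(1 for d in depths if d >= min_depth)
    return covered / len(depths)

def mean_depth(depths):
    """Average read depth across target positions."""
    if not depths:
        raise ValueError("depths must not be empty")
    return sum(depths) / len(depths)

# Hypothetical per-base depths over a tiny target region (not study data).
toy_depths = [0, 5, 19, 20, 35, 400, 800, 1200, 310, 22]

breadth = breadth_at_depth(toy_depths, min_depth=20)
average = mean_depth(toy_depths)
print(f"breadth at >=20x: {breadth:.0%}, mean depth: {average:.1f}x")
```

    The toy example shows why both numbers are reported: a few very deeply covered positions can push the mean depth into the hundreds while a sizeable share of positions still falls below the 20× threshold.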

  15. Alteration of development and gene expression induced by in ovo-nanoinjection of 3-hydroxybenzo[c]phenanthrene into Japanese medaka (Oryzias latipes) embryos.

    PubMed

    Chen, Kun; Tsutsumi, Yuki; Yoshitake, Shuhei; Qiu, Xuchun; Xu, Hai; Hashiguchi, Yasuyuki; Honda, Masato; Tashiro, Kosuke; Nakayama, Kei; Hano, Takeshi; Suzuki, Nobuo; Hayakawa, Kazuichi; Shimasaki, Yohei; Oshima, Yuji

    2017-01-01

    Benzo[c]phenanthrene (BcP) is a highly toxic polycyclic aromatic hydrocarbon (PAH) found throughout the environment. In fish, it is metabolized to 3-hydroxybenzo[c]phenanthrene (3-OHBcP). In the present study, we observed the effects of 1 nM 3-OHBcP on the development and gene expression of Japanese medaka (Oryzias latipes) embryos. Embryos were nanoinjected with the chemical after fertilization. Survival, developmental stage, and heart rate of the embryos were observed, and gene expression differences were quantified by messenger RNA sequencing (mRNA-Seq). The exposure to 1 nM 3-OHBcP accelerated the development of medaka embryos on the 1st, 4th, and 6th days post fertilization (dpf), and increased heart rates significantly on the 5th dpf. Physical development differences of exposed medaka embryos were consistent with the gene expression profiles of the mRNA-Seq results for the 3rd dpf, which show that the expression of 780 genes differed significantly between the solvent control and 1 nM 3-OHBcP exposure groups. The most obvious expression changes in the exposure group were found for genes involved in organ formation (eye, muscle, heart), energy supply (ATPase and ATP synthase), and stress response (heat shock protein genes). The acceleration of development and increased heart rate, which were consistent with the changes in mRNA expression, suggested that 3-OHBcP affects the development of medaka embryos. Observation of developmental stages and heart beat, combined with in ovo-nanoinjection and mRNA-Seq, may be an efficient approach to evaluating the effects of chemicals on embryos. Copyright © 2016 Elsevier B.V. All rights reserved.

  16. 1-deoxy-d-xylulose-5-phosphate reductoisomerases and method of use

    DOEpatents

    Croteau, Rodney B.; Lange, Bernd M.

    2001-01-01

    The present invention relates to isolated DNA sequences which code for the expression of plant 1-deoxy-D-xylulose-5-phosphate reductoisomerase protein, such as the sequence presented in SEQ ID NO:1 which encodes a 1-deoxy-D-xylulose-5-phosphate reductoisomerase protein from peppermint (Mentha x piperita). Additionally, the present invention relates to isolated plant 1-deoxy-D-xylulose-5-phosphate reductoisomerase protein. In other aspects, the present invention is directed to replicable recombinant cloning vehicles comprising a nucleic acid sequence which codes for a plant 1-deoxy-D-xylulose-5-phosphate reductoisomerase, to modified host cells transformed, transfected, infected and/or injected with a recombinant cloning vehicle and/or DNA sequence of the invention.

  17. 1-deoxy-D-xylulose-5-phosphate reductoisomerases, and methods of use

    DOEpatents

    Croteau, Rodney B.; Lange, Bernd M.

    2002-07-16

    The present invention relates to isolated DNA sequences which code for the expression of plant 1-deoxy-D-xylulose-5-phosphate reductoisomerase protein, such as the sequence presented in SEQ ID NO:1 which encodes a 1-deoxy-D-xylulose-5-phosphate reductoisomerase protein from peppermint (Mentha x piperita). Additionally, the present invention relates to isolated plant 1-deoxy-D-xylulose-5-phosphate reductoisomerase protein. In other aspects, the present invention is directed to replicable recombinant cloning vehicles comprising a nucleic acid sequence which codes for a plant 1-deoxy-D-xylulose-5-phosphate reductoisomerase, to modified host cells transformed, transfected, infected and/or injected with a recombinant cloning vehicle and/or DNA sequence of the invention.

  18. Virtual Screening of Novel Glucosamine-6-Phosphate Synthase Inhibitors.

    PubMed

    Lather, Amit; Sharma, Sunil; Khatkar, Anurag

    2018-01-01

    Infections caused by microorganisms are the major cause of death today. The tremendous and improper use of antimicrobial agents leads to antimicrobial resistance. Various currently available antimicrobial drugs are inadequate to control the infections and lead to various adverse drug reactions. Efforts based on computer-aided drug design (CADD) can mine a large number of databases to generate new, potent hits and minimize the time and money required for the discovery of newer antimicrobials. Pharmaceutical sciences have also progressed with advances in drug design concepts. The current research article focuses on the study of various G-6-P synthase inhibitors from a literature-cited molecular database. Docking analysis was conducted and ADMET data of various molecules were evaluated by Schrodinger Glide and PreADMET software, respectively. Here, the results show the efficacy of various inhibitors towards the enzyme G-6-P synthase. Docking scores, binding energy and ADMET data of various molecules showed good inhibitory potential toward G-6-P synthase as compared to standard antibiotics. The novel antimicrobial drug target G-6-P synthase has not been extensively explored for its application in antimicrobial therapy, so the work done so far has proved highly essential. This article has helped drug researchers and scientists to explore this wonderful antimicrobial drug target intensively. The Schrodinger, Inc. (New York, USA) software was utilized to carry out the computational calculations and docking studies. The hardware configuration was an Intel® Core(TM) i5-4210U CPU @ 2.40 GHz with 4.0 GB RAM under a 64-bit Windows operating system. The ADMET data were calculated by using the PreADMET tool (PreADMET ver. 2.0). All the computational work was completed in the Laboratory for Enzyme Inhibition Studies, Department of Pharmaceutical Sciences, M.D. University, Rohtak, INDIA. Molecular docking studies were carried out to identify the binding affinities and interactions between the inhibitors and the target protein (G-6-P synthase) by using Glide software (Schrodinger Inc. U.S.A.-Maestro version 10.2). Grid-based Ligand Docking with Energetics (Glide) is one of the most accurate docking programs available for ligand-protein and protein-protein binding studies. A library of hundreds of available ligands was docked against the target protein G-6-P synthase, PDB ID 1moq. Results of docking are shown in Table 1 and Table 2. Results of G-6-P synthase docking showed that some compounds had docking scores and binding energies (kJ/mol) comparable to those of standard antibiotics. Many of the ligands showed hydrogen bond interactions, hydrophobic interactions, electrostatic interactions, ionic interactions and π-π stacking with the various amino acid residues in the binding pockets of G-6-P synthase. The docking study estimated free energy of binding, binding pose and Glide score, and all these parameters provide a promising tool for the discovery of new potent natural inhibitors of G-6-P synthase. These G-6-P synthase inhibitors could further be used as antimicrobials. Here, inhibitors from various classes of molecules were docked in the binding cavity of G-6-P synthase, providing a detailed binding analysis and new insights. ADME and toxicity prediction of these compounds will further encourage us to study these compounds in vivo. This information may support the further development of effective antimicrobials against several microbial infections. Copyright © Bentham Science Publishers.

  19. Transcriptome Analysis Revealed Highly Expressed Genes Encoding Secondary Metabolite Pathways and Small Cysteine-Rich Proteins in the Sclerotium of Lignosus rhinocerotis

    PubMed Central

    Yap, Hui-Yeng Y.; Chooi, Yit-Heng; Fung, Shin-Yee; Ng, Szu-Ting; Tan, Chon-Seng; Tan, Nget-Hong

    2015-01-01

    Lignosus rhinocerotis (Cooke) Ryvarden (tiger milk mushroom) has long been known for its nutritional and medicinal benefits among the local communities in Southeast Asia. However, the molecular and genetic basis of its medicinal and nutraceutical properties at the transcriptional level has not been investigated. In this study, the transcriptome of L. rhinocerotis sclerotium, the part with medicinal value, was analyzed using the high-throughput Illumina HiSeq™ platform with good sequencing quality and alignment results. A total of 3,673, 117, and 59,649 events of alternative splicing, novel transcripts, and SNP variation were found to enrich its current genome database. A large number of transcripts were expressed and involved in the processing of gene information and carbohydrate metabolism. A few highly expressed genes encoding the cysteine-rich cerato-platanin, hydrophobins, and sugar-binding lectins were identified and their possible roles in L. rhinocerotis were discussed. Genes encoding enzymes involved in the biosynthesis of glucans, six gene clusters encoding four terpene synthases and one each of non-ribosomal peptide synthetase and polyketide synthase, and 109 transcribed cytochrome P450 sequences were also identified in the transcriptome. The data from this study form a valuable foundation for future research in the exploitation of this mushroom in pharmacological and industrial applications. PMID:26606395

  20. CozE is a member of the MreCD complex that directs cell elongation in Streptococcus pneumoniae.

    PubMed

    Fenton, Andrew K; El Mortaji, Lamya; Lau, Derek T C; Rudner, David Z; Bernhardt, Thomas G

    2016-12-12

    Most bacterial cells are surrounded by a peptidoglycan cell wall that is essential for their integrity. The major synthases of this exoskeleton are called penicillin-binding proteins (PBPs) [1,2]. Surprisingly little is known about how cells control these enzymes, given their importance as drug targets. In the model Gram-negative bacterium Escherichia coli, outer membrane lipoproteins are critical activators of the class A PBPs (aPBPs) [3,4], bifunctional synthases capable of polymerizing and crosslinking peptidoglycan to build the exoskeletal matrix [1]. Regulators of PBP activity in Gram-positive bacteria have yet to be discovered but are likely to be distinct due to the absence of an outer membrane. To uncover Gram-positive PBP regulatory factors, we used transposon-sequencing (Tn-Seq) [5] to screen for mutations affecting the growth of Streptococcus pneumoniae cells when the aPBP synthase PBP1a was inactivated. Our analysis revealed a set of genes that were essential for growth in wild-type cells yet dispensable when pbp1a was deleted. The proteins encoded by these genes include the conserved cell wall elongation factors MreC and MreD [2,6,7], as well as a membrane protein of unknown function (SPD_0768) that we have named CozE (coordinator of zonal elongation). Our results indicate that CozE is a member of the MreCD complex of S. pneumoniae that directs the activity of PBP1a to the midcell plane where it promotes zonal cell elongation and normal morphology. CozE homologues are broadly distributed among bacteria, suggesting that they represent a widespread family of morphogenic proteins controlling cell wall biogenesis by the PBPs.

  1. Anthranilate synthase subunit organization in Chromobacterium violaceum.

    PubMed

    Carminatti, C A; Oliveira, I L; Recouvreux, D O S; Antônio, R V; Porto, L M

    2008-09-16

    Tryptophan is an aromatic amino acid used for protein synthesis and cellular growth. Chromobacterium violaceum ATCC 12472 uses two tryptophan molecules to synthesize violacein, a secondary metabolite of pharmacological interest. The genome analysis of this bacterium revealed that the genes trpA-F and pabA-B encode the enzymes of the tryptophan pathway in which the first reaction is the conversion of chorismate to anthranilate by anthranilate synthase (AS), an enzyme complex. In the present study, the organization and structure of AS protein subunits from C. violaceum were analyzed using bioinformatics tools available on the Web. We showed by calculating molecular masses that AS in C. violaceum is composed of alpha (TrpE) and beta (PabA) subunits. This is in agreement with values determined experimentally. Catalytic and regulatory sites of the AS subunits were identified. The TrpE and PabA subunits contribute to the catalytic site while the TrpE subunit is involved in the allosteric site. Protein models for the TrpE and PabA subunits were built by restraint-based homology modeling using AS enzyme, chains A and B, from Salmonella typhimurium (PDB ID 1I1Q).
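
    This abstract mentions calculating molecular masses of the AS subunits but does not name the tool used. As a rough illustration of the underlying arithmetic only, the sketch below sums standard average residue masses and adds one water for the free termini. The function name and the test peptide are hypothetical; no TrpE or PabA sequence is included here.

```python
# Average residue masses in daltons (amino acid minus one water).
RESIDUE_MASS = {
    "G": 57.0519, "A": 71.0788, "S": 87.0782, "P": 97.1167, "V": 99.1326,
    "T": 101.1051, "C": 103.1388, "L": 113.1594, "I": 113.1594, "N": 114.1038,
    "D": 115.0886, "Q": 128.1307, "K": 128.1741, "E": 129.1155, "M": 131.1926,
    "H": 137.1411, "F": 147.1766, "R": 156.1875, "Y": 163.1760, "W": 186.2132,
}
WATER_MASS = 18.01528  # added once for the N-terminal H and C-terminal OH

def average_mass(sequence: str) -> float:
    """Average molecular mass (Da) of an unmodified linear peptide."""
    seq = sequence.strip().upper()
    if not seq:
        raise ValueError("sequence must not be empty")
    unknown = set(seq) - RESIDUE_MASS.keys()
    if unknown:
        raise ValueError(f"unsupported residue codes: {sorted(unknown)}")
    return sum(RESIDUE_MASS[aa] for aa in seq) + WATER_MASS

# Hypothetical test peptide (glycyl-alanine), used only to show the call.
print(f"{average_mass('GA'):.2f} Da")
```

    Applied to full-length subunit sequences, the same sum gives a theoretical mass that can be compared against experimentally determined values, ignoring cofactors and post-translational modifications.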

  2. Crystal structure of plant acetohydroxyacid synthase, the target for several commercial herbicides.

    PubMed

    Garcia, Mario Daniel; Wang, Jian-Guo; Lonhienne, Thierry; Guddat, Luke William

    2017-07-01

    Acetohydroxyacid synthase (AHAS, EC 2.2.1.6) is the first enzyme in the branched-chain amino acid biosynthesis pathway. Five of the most widely used commercial herbicides (i.e. sulfonylureas, imidazolinones, triazolopyrimidines, pyrimidinyl-benzoates and sulfonylamino-carbonyl-triazolinones) target this enzyme. Here we have determined the first crystal structure of a plant AHAS in the absence of any inhibitor (2.9 Å resolution) and it shows that the herbicide-binding site adopts a folded state even in the absence of an inhibitor. This is unexpected because the equivalent regions for herbicide binding in uninhibited Saccharomyces cerevisiae AHAS crystal structures are either disordered, or adopt a different fold when the herbicide is not present. In addition, the structure provides an explanation as to why some herbicides are more potent inhibitors of Arabidopsis thaliana AHAS compared to AHASs from other species (e.g. S. cerevisiae). The elucidation of the native structure of plant AHAS provides a new platform for future rational structure-based herbicide design efforts. The coordinates and structure factors for uninhibited AtAHAS have been deposited in the Protein Data Bank (www.pdb.org) with the PDB ID code 5K6Q. © 2017 Federation of European Biochemical Societies.

  3. Inhibition of glycogen synthase kinase-3beta downregulates total tau proteins in cultured neurons and its reversal by the blockade of protein phosphatase-2A.

    PubMed

    Martin, Ludovic; Magnaudeix, Amandine; Esclaire, Françoise; Yardin, Catherine; Terro, Faraj

    2009-02-03

    In tauopathies such as Alzheimer's disease (AD), the molecular mechanisms of tau protein aggregation into neurofibrillary tangles (NFTs) and their contribution to neurodegeneration are not yet understood. It was recently demonstrated that tau, regardless of its aggregation, might represent a key mediator of neurodegeneration. Therefore, reduction of tau levels might represent a mechanism of neuroprotection. Glycogen synthase kinase-3beta (GSK3beta) and protein phosphatase-2A (PP2A) are key enzymes involved in the regulation of tau phosphorylation, and have been suggested to be involved in the abnormal tau phosphorylation and aggregation in AD. Connections between PP2A and GSK3beta signaling have been reported. We have previously demonstrated that exposure of cultured cortical neurons to lithium decreased tau protein expression and provided neuroprotection against Abeta. Since lithium is not a specific inhibitor of GSK3beta (ID50=2.0 mM), whether or not the lithium-induced tau decrease involves GSK3beta remained to be determined. For that purpose, cultured cortical neurons were exposed to 6-bromo-indirubin-3'-oxime (6-BIO), a more selective and potent GSK3beta inhibitor (ID50=1.5 microM), or to lithium. Analysis of tau levels and phosphorylation by western-blot assays showed that lithium and 6-BIO dose-dependently decreased both tau protein levels and tau phosphorylation. Conversely, inhibition of cyclin-dependent kinase-5 (CDK5) by roscovitine decreased phosphorylated tau but failed to alter tau protein levels. These data indicate that GSK3beta might be selectively involved in the regulation of tau protein levels. Moreover, inhibition of PP2A by okadaic acid, but not that of PP2B (protein phosphatase-2B)/calcineurin by FK506, dose-dependently reversed the lithium-induced tau decrease. These data indicate that GSK3beta regulates both tau phosphorylation and total tau levels through PP2A.

  4. RNA-Seq Analysis Provides Insights for Understanding Photoautotrophic Polyhydroxyalkanoate Production in Recombinant Synechocystis Sp.

    PubMed Central

    Lau, Nyok-Sean; Foong, Choon Pin; Kurihara, Yukio; Sudesh, Kumar; Matsui, Minami

    2014-01-01

    The photosynthetic cyanobacterium, Synechocystis sp. strain 6803, is a potential platform for the production of various chemicals and biofuels. In this study, direct photosynthetic production of a biopolymer, polyhydroxyalkanoate (PHA), in genetically engineered Synechocystis sp. achieved as high as 14 wt%. This is the highest production reported in Synechocystis sp. under photoautotrophic cultivation conditions without the addition of a carbon source. The addition of acetate increased PHA accumulation to 41 wt%, and this value is comparable to the highest production obtained with cyanobacteria. Transcriptome analysis by RNA-seq coupled with real-time PCR was performed to understand the global changes in transcript levels of cells subjected to conditions suitable for photoautotrophic PHA biosynthesis. There was lower expression of most PHA synthesis-related genes in recombinant Synechocystis sp. with higher PHA accumulation suggesting that the concentration of these enzymes is not the limiting factor to achieving high PHA accumulation. In order to cope with the higher PHA production, cells may utilize enhanced photosynthesis to drive the product formation. Results from this study suggest that the total flux of carbon is the possible driving force for the biosynthesis of PHA and the polymerizing enzyme, PHA synthase, is not the only critical factor affecting PHA-synthesis. Knowledge of the regulation or control points of the biopolymer production pathways will facilitate the further use of cyanobacteria for biotechnological applications. PMID:24466058

  5. A novel plasma circular RNA circFARSA is a potential biomarker for non-small cell lung cancer.

    PubMed

    Hang, Dong; Zhou, Jing; Qin, Na; Zhou, Wen; Ma, Hongxia; Jin, Guangfu; Hu, Zhibin; Dai, Juncheng; Shen, Hongbing

    2018-06-01

    Emerging evidence indicates that circular RNAs (circRNAs) are implicated in cancer development. This study aimed to evaluate whether circulating circRNAs may serve as novel biomarkers for non-small cell lung cancer (NSCLC). We used RNA sequencing (RNA-seq) and quantitative real-time PCR to explore cancer-related circRNAs. Bioinformatics and functional analyses were performed to reveal biological effects of circRNAs on lung cancer cells. A total of 5471 distinct circRNAs were identified by total RNA-seq, in which 185 were differentially expressed between cancerous and adjacent normal tissues. A circRNA derived from exon 5-7 of the FARSA gene, termed circFARSA, was observed to increase in cancerous tissues (P = 0.016), and was more abundant in patients' plasma than controls (P < 0.001). Overexpression of circFARSA in A549 cell line significantly promoted cell migration and invasion. In silico analysis suggested that circFARSA might sponge miR-330-5p and miR-326, thereby relieving their inhibitory effects on oncogene fatty acid synthase. Summarily, this study revealed circRNA profile of NSCLC for the first time and provided the evidence of plasma circFARSA as a potential noninvasive biomarker for this malignancy. © 2018 The Authors. Cancer Medicine published by John Wiley & Sons Ltd.

  6. The PhoP-Dependent ncRNA Mcr7 Modulates the TAT Secretion System in Mycobacterium tuberculosis

    PubMed Central

    Benjak, Andrej; Uplekar, Swapna; Rougemont, Jacques; Guilhot, Christophe; Malaga, Wladimir; Martín, Carlos; Cole, Stewart T.

    2014-01-01

    The PhoPR two-component system is essential for virulence in Mycobacterium tuberculosis where it controls expression of approximately 2% of the genes, including those for the ESX-1 secretion apparatus, a major virulence determinant. Mutations in phoP lead to compromised production of pathogen-specific cell wall components and attenuation both ex vivo and in vivo. Using antibodies against the native protein in ChIP-seq experiments (chromatin immunoprecipitation followed by high-throughput sequencing) we demonstrated that PhoP binds to at least 35 loci on the M. tuberculosis genome. The PhoP regulon comprises several transcriptional regulators as well as genes for polyketide synthases and PE/PPE proteins. Integration of ChIP-seq results with high-resolution transcriptomic analysis (RNA-seq) revealed that PhoP controls 30 genes directly, whilst regulatory cascades are responsible for signal amplification and downstream effects through proteins like EspR, which controls Esx1 function, via regulation of the espACD operon. The most prominent site of PhoP regulation was located in the intergenic region between rv2395 and PE_PGRS41, where the mcr7 gene codes for a small non-coding RNA (ncRNA). Northern blot experiments confirmed the absence of Mcr7 in an M. tuberculosis phoP mutant as well as low-level expression of the ncRNA in M. tuberculosis complex members other than M. tuberculosis. By means of genetic and proteomic analyses we demonstrated that Mcr7 modulates translation of the tatC mRNA thereby impacting the activity of the Twin Arginine Translocation (Tat) protein secretion apparatus. As a result, secretion of the immunodominant Ag85 complex and the beta-lactamase BlaC is affected, among others. Mcr7, the first ncRNA of M. tuberculosis whose function has been established, therefore represents a missing link between the PhoPR two-component system and the downstream functions necessary for successful infection of the host. PMID:24874799

  7. The Chlamydomonas genome project: a decade on

    PubMed Central

    Blaby, Ian K.; Blaby-Haas, Crysten; Tourasse, Nicolas; Hom, Erik F. Y.; Lopez, David; Aksoy, Munevver; Grossman, Arthur; Umen, James; Dutcher, Susan; Porter, Mary; King, Stephen; Witman, George; Stanke, Mario; Harris, Elizabeth H.; Goodstein, David; Grimwood, Jane; Schmutz, Jeremy; Vallon, Olivier; Merchant, Sabeeha S.; Prochnik, Simon

    2014-01-01

    The green alga Chlamydomonas reinhardtii is a popular unicellular organism for studying photosynthesis, cilia biogenesis and micronutrient homeostasis. Ten years since its genome project was initiated, an iterative process of improvements to the genome and gene predictions has propelled this organism to the forefront of the “omics” era. Housed at Phytozome, the Joint Genome Institute’s (JGI) plant genomics portal, the most up-to-date genomic data include a genome arranged on chromosomes and high-quality gene models with alternative splice forms supported by an abundance of RNA-Seq data. Here, we present the past, present and future of Chlamydomonas genomics. Specifically, we detail progress on genome assembly and gene model refinement, discuss resources for gene annotations, functional predictions and locus ID mapping between versions and, importantly, outline a standardized framework for naming genes. PMID:24950814

  8. Three Mutations in the Bilateral Frontoparietal Polymicrogyria Gene GPR56 in Pakistani Intellectual Disability Families.

    PubMed

    Sawal, Humaira Aziz; Harripaul, Ricardo; Mikhailov, Anna; Vleuten, Kayla; Naeem, Farooq; Nasr, Tanveer; Hassan, Muhammad Jawad; Vincent, John B; Ayub, Muhammad; Rafiq, Muhammad Arshad

    2018-06-01

    Bilateral frontoparietal polymicrogyria (BFPP, MIM 606854) is a heterogeneous autosomal recessive disorder of abnormal cortical lamination, leading to moderate-to-severe intellectual disability (ID), seizure disorder, and motor difficulties, and caused by mutations in the G protein-coupled receptor 56 (GPR56) gene. Twenty-eight mutations in 40 different families have been reported in the literature. The clinical and neuroimaging phenotype is consistent in these cases. The BFPP cortex consists of numerous small gyral cells, with scalloping of the cortical-white matter junction. There are also associated white matter, brain stem, and cerebellar changes. GPR56 is a member of an adhesion G protein-coupled receptor family with a very long N-terminal stalk and seven transmembrane domains. In this study, we identified three families from Pakistan, ascertained primarily for ID, with overlapping approximately 1 Mb region (chr16:56,973,335-57,942,866) of homozygosity by descent, including 24 RefSeq genes. We found three GPR56 homozygous mutations, using next-generation sequencing. These mutations include a substitutional variant, c.1460T > C; p.L487P, (chr16:57693480 T > C), a 13-bp insertion causing the frameshift and truncating mutation, p.Leu269Hisfs*21 (NM_005682.6:c.803_804insCCATGGAGGTGCT; Chr16: 57689345_57689346insCCATGGAGGTGCT), and a truncating mutation c.1426C > T; p.Arg476* (Chr16:57693446C > T). These mutations fully segregated with ID in these families and were absent in the Exome Aggregation Consortium database that has approximately 8,000 control samples of South Asian origin. Two of these mutations have been reported in ClinVar database, and the third one has not been reported before. Three families from Pakistan with GPR56 mutations have been reported before. With the addition of our findings, the total number of mutations reported in Pakistani patients now is six. These results increase our knowledge regarding the mutational spectrum of the GPR56 gene causing BFPP/ID.

  9. Structural, kinetic and computational investigation of Vitis vinifera DHDPS reveals new insight into the mechanism of lysine-mediated allosteric inhibition.

    PubMed

    Atkinson, Sarah C; Dogovski, Con; Downton, Matthew T; Czabotar, Peter E; Dobson, Renwick C J; Gerrard, Juliet A; Wagner, John; Perugini, Matthew A

    2013-03-01

    Lysine is one of the most limiting amino acids in plants and its biosynthesis is carefully regulated through inhibition of the first committed step in the pathway catalyzed by dihydrodipicolinate synthase (DHDPS). This is mediated via a feedback mechanism involving the binding of lysine to the allosteric cleft of DHDPS. However, the precise allosteric mechanism is yet to be defined. We present a thorough enzyme kinetic and thermodynamic analysis of lysine inhibition of DHDPS from the common grapevine, Vitis vinifera (Vv). Our studies demonstrate that lysine binding is both tight (relative to bacterial DHDPS orthologs) and cooperative. The crystal structure of the enzyme bound to lysine (2.4 Å) identifies the allosteric binding site and clearly shows a conformational change of several residues within the allosteric and active sites. Molecular dynamics simulations comparing the lysine-bound (PDB ID 4HNN) and lysine free (PDB ID 3TUU) structures show that Tyr132, a key catalytic site residue, undergoes significant rotational motion upon lysine binding. This suggests proton relay through the catalytic triad is attenuated in the presence of lysine. Our study reveals for the first time the structural mechanism for allosteric inhibition of DHDPS from the common grapevine.

  10. ARCPHdb: A comprehensive protein database for SF1 and SF2 helicase from archaea.

    PubMed

    Moukhtar, Mirna; Chaar, Wafi; Abdel-Razzak, Ziad; Khalil, Mohamad; Taha, Samir; Chamieh, Hala

    2017-01-01

    Superfamily 1 and Superfamily 2 helicases, two of the largest helicase protein families, play vital roles in many biological processes including replication, transcription and translation. Study of helicase proteins in the model microorganisms of archaea has largely contributed to the understanding of their function, architecture and assembly. Based on a large phylogenomics approach, we have identified and classified all SF1 and SF2 protein families in ninety-five sequenced archaea genomes. Here we developed an online webserver linked to a specialized protein database named ARCPHdb to provide access to SF1 and SF2 helicase families from archaea. ARCPHdb was implemented using a MySQL relational database. Web interfaces were developed using Netbeans. Data were stored according to UniProt accession numbers, NCBI RefSeq IDs, PDB IDs and Entrez Databases. A user-friendly interactive web interface has been developed to browse, search and download archaeal helicase protein sequences, their available 3D structure models, and related documentation available in the literature provided by ARCPHdb. The database provides direct links to matching external databases. The ARCPHdb is the first online database to compile all protein information on SF1 and SF2 helicase from archaea in one platform. This database provides essential resource information for all researchers interested in the field. Copyright © 2016 Elsevier Ltd. All rights reserved.

  11. RNA-seq Transcriptome Response of Flax (Linum usitatissimum L.) to the Pathogenic Fungus Fusarium oxysporum f. sp. lini

    PubMed Central

    Galindo-González, Leonardo; Deyholos, Michael K.

    2016-01-01

    Fusarium oxysporum f. sp. lini is a hemibiotrophic fungus that causes wilt in flax. Along with rust, fusarium wilt has become an important factor in flax production worldwide. Resistant flax cultivars have been used to manage the disease, but the resistance varies, depending on the interactions between specific cultivars and isolates of the pathogen. This interaction has a strong molecular basis, but no genomic information is available on how the plant responds to attempted infection, to inform breeding programs on potential candidate genes to evaluate or improve resistance across cultivars. In the current study, disease progression in two flax cultivars [Crop Development Center (CDC) Bethune and Lutea], showed earlier disease symptoms and higher susceptibility in the latter cultivar. Chitinase gene expression was also divergent and demonstrated an earlier molecular response in Lutea. The most resistant cultivar (CDC Bethune) was used for a full RNA-seq transcriptome study through a time course at 2, 4, 8, and 18 days post-inoculation (DPI). While over 100 genes were significantly differentially expressed at both 4 and 8 DPI, the broadest deployment of plant defense responses was evident at 18 DPI with transcripts of more than 1,000 genes responding to the treatment. These genes evidenced a reception and transduction of pathogen signals, a large transcriptional reprogramming, induction of hormone signaling, activation of pathogenesis-related genes, and changes in secondary metabolism. Among these, several key genes that consistently appear in studies of plant-pathogen interactions, had increased transcript abundance in our study, and constitute suitable candidates for resistance breeding programs. These included: an induced RPMI-induced protein kinase; transcription factors WRKY3, WRKY70, WRKY75, MYB113, and MYB108; the ethylene response factors ERF1 and ERF14; two genes involved in auxin/glucosinolate precursor synthesis (CYP79B2 and CYP79B3); the flavonoid-related enzymes chalcone synthase, dihydroflavonol reductase and multiple anthocyanidin synthases; and a peroxidase implicated in lignin formation (PRX52). Additionally, regulation of some genes indicated potential pathogen manipulation to facilitate infection; these included four disease resistance proteins that were repressed, indole acetic acid amido/amino hydrolases which were upregulated, activated expansins and glucanases, amino acid transporters and aquaporins, and finally, repression of major latex proteins. PMID:27933082

  12. RNA-seq Transcriptome Response of Flax (Linum usitatissimum L.) to the Pathogenic Fungus Fusarium oxysporum f. sp. lini.

    PubMed

    Galindo-González, Leonardo; Deyholos, Michael K

    2016-01-01

    Fusarium oxysporum f. sp. lini is a hemibiotrophic fungus that causes wilt in flax. Along with rust, fusarium wilt has become an important factor in flax production worldwide. Resistant flax cultivars have been used to manage the disease, but the resistance varies, depending on the interactions between specific cultivars and isolates of the pathogen. This interaction has a strong molecular basis, but no genomic information is available on how the plant responds to attempted infection, to inform breeding programs on potential candidate genes to evaluate or improve resistance across cultivars. In the current study, disease progression in two flax cultivars [Crop Development Center (CDC) Bethune and Lutea], showed earlier disease symptoms and higher susceptibility in the latter cultivar. Chitinase gene expression was also divergent and demonstrated an earlier molecular response in Lutea. The most resistant cultivar (CDC Bethune) was used for a full RNA-seq transcriptome study through a time course at 2, 4, 8, and 18 days post-inoculation (DPI). While over 100 genes were significantly differentially expressed at both 4 and 8 DPI, the broadest deployment of plant defense responses was evident at 18 DPI with transcripts of more than 1,000 genes responding to the treatment. These genes evidenced a reception and transduction of pathogen signals, a large transcriptional reprogramming, induction of hormone signaling, activation of pathogenesis-related genes, and changes in secondary metabolism. Among these, several key genes that consistently appear in studies of plant-pathogen interactions, had increased transcript abundance in our study, and constitute suitable candidates for resistance breeding programs. These included: an induced RPMI-induced protein kinase; transcription factors WRKY3, WRKY70, WRKY75, MYB113, and MYB108; the ethylene response factors ERF1 and ERF14; two genes involved in auxin/glucosinolate precursor synthesis (CYP79B2 and CYP79B3); the flavonoid-related enzymes chalcone synthase, dihydroflavonol reductase and multiple anthocyanidin synthases; and a peroxidase implicated in lignin formation (PRX52). Additionally, regulation of some genes indicated potential pathogen manipulation to facilitate infection; these included four disease resistance proteins that were repressed, indole acetic acid amido/amino hydrolases which were upregulated, activated expansins and glucanases, amino acid transporters and aquaporins, and finally, repression of major latex proteins.

  13. Tetra primer ARMS-PCR relates folate/homocysteine pathway genes and ACE gene polymorphism with coronary artery disease.

    PubMed

    Masud, Rizwan; Qureshi, Irfan Zia

    2011-09-01

    Cardiovascular disorders and coronary artery disease (CAD) are significant contributors to morbidity and mortality in heart patients. As genes of the folate/homocysteine pathway have been linked with vascular disease, we investigated the association of these gene polymorphisms with CAD/myocardial infarction (MI) using the novel approach of tetraprimer ARMS-PCR. A total of 230 participants (129 MI cases, 101 normal subjects) were recruited. We genotyped the rs1801133 and rs1801131 SNPs in 5,10-methylenetetrahydrofolate reductase (MTHFR), the rs1805087 SNP in 5-methyltetrahydrofolate-homocysteine methyltransferase (MTR), the rs662 SNP in paraoxonase 1 (PON1), and the rs5742905 polymorphism in cystathionine beta synthase (CBS). The angiotensin converting enzyme (ACE) insertion/deletion polymorphism was detected through conventional PCR. Covariates included blood pressure, fasting blood sugar, serum cholesterol, and creatinine concentrations. Our results showed that allele frequencies at rs1801133, rs1801131, rs1805087 and the ACE insertion/deletion (I/D) polymorphism varied between cases and controls. Logistic regression, after adjusting for covariates, demonstrated significant associations of rs1801133 and rs1805087 with CAD in the additive, dominant, and genotype models. In contrast, the ACE I/D polymorphism was significantly related with CAD when the recessive model was applied. Gene-gene interaction against the disease status revealed two polymorphism groups: rs1801133, rs662, and rs1805087; and rs1801131, rs662, and ACE I/D. Only the latter interaction maintained significance after adjustment for covariates. Our study concludes that folate pathway variants exert contributory influence on susceptibility to CAD. We further suggest that tetraprimer ARMS-PCR successfully resolves the genotypes in selected samples and might prove to be a superior technique compared to the conventional approach.
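
    The allele-frequency comparison and the additive, dominant, and recessive models named in this abstract can be illustrated with a minimal Python sketch. The function names and the genotype counts below are hypothetical and are not taken from the study; the sketch only shows how a biallelic genotype is commonly coded under each model before being passed to a regression.

```python
def allele_frequency(n_ref_hom, n_het, n_alt_hom):
    """Frequency of the alternate allele from biallelic genotype counts."""
    total_alleles = 2 * (n_ref_hom + n_het + n_alt_hom)
    if total_alleles == 0:
        raise ValueError("no genotypes supplied")
    # Each heterozygote carries one alternate allele, each alt homozygote two.
    return (n_het + 2 * n_alt_hom) / total_alleles


def encode_genotype(alt_allele_count, model):
    """Code a genotype (0, 1 or 2 alternate alleles) under a genetic model."""
    if alt_allele_count not in (0, 1, 2):
        raise ValueError("alt_allele_count must be 0, 1 or 2")
    if model == "additive":
        return alt_allele_count              # effect scales with allele dose
    if model == "dominant":
        return int(alt_allele_count >= 1)    # one copy is enough
    if model == "recessive":
        return int(alt_allele_count == 2)    # two copies required
    raise ValueError(f"unknown model: {model}")


# Hypothetical counts for illustration only (not data from the abstract).
freq = allele_frequency(n_ref_hom=50, n_het=40, n_alt_hom=10)
coded = {m: [encode_genotype(g, m) for g in (0, 1, 2)]
         for m in ("additive", "dominant", "recessive")}
```

    The coded values would then serve as the predictor column in a logistic regression alongside the covariates; that step is omitted here because the abstract does not describe the software used.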

  14. A De novo Transcriptomic Approach to Identify Flavonoids and Anthocyanins “Switch-Off” in Olive (Olea europaea L.) Drupes at Different Stages of Maturation

    PubMed Central

    Iaria, Domenico L.; Chiappetta, Adriana; Muzzalupo, Innocenzo

    2016-01-01

    Highlights: A de novo transcriptome reconstruction of olive drupes was performed in two genotypes. Gene expression was monitored during drupe development in two olive cultivars. Transcripts involved in flavonoid and anthocyanin pathways were analyzed in Cassanese and Leucocarpa cultivars. Both cultivar and developmental stage impact gene expression in Olea europaea fruits. During ripening, the fruits of the olive tree (Olea europaea L.) undergo a progressive chromatic change characterized by the formation of a red-brown “spot” which gradually extends on the epidermis and in the innermost part of the mesocarp. This event finds an exception in the Leucocarpa cultivar, in which we observe a destabilized equilibrium between the metabolisms of chlorophyll and other pigments, particularly the anthocyanins whose switch-off during maturation promotes the white coloration of fruits. Despite its importance, genomic information on the olive tree is still lacking. Different RNA-seq libraries were generated from drupes of “Leucocarpa” and “Cassanese” olive genotypes, sampled at 100 and 130 days after flowering (DAF), and were used in order to identify transcripts involved in the main phenotypic changes of fruits during maturation and their corresponding expression patterns. A total of 103,359 transcripts were obtained and 3792 and 3064 were differentially expressed in “Leucocarpa” and “Cassanese” genotypes, respectively, during 100–130 DAF transition. Among them flavonoid and anthocyanin related transcripts such as phenylalanine ammonia lyase (PAL), cinnamate 4-hydroxylase (C4H), 4-coumarate-CoA ligase (4CL), chalcone synthase (CHS), chalcone isomerase (CHI), flavanone 3-hydroxylase (F3H), flavonol 3′-hydrogenase (F3′H), flavonol 3′5′-hydrogenase (F3′5′H), flavonol synthase (FLS), dihydroflavonol 4-reductase (DFR), anthocyanidin synthase (ANS), UDP-glucose:anthocyanidin:flavonoid glucosyltransferase (UFGT) were identified. These results contribute to reducing the current gap in information regarding metabolic processes, including those linked to fruit pigmentation in the olive. PMID:26834761

  15. The Chlamydomonas genome project: a decade on.

    PubMed

    Blaby, Ian K; Blaby-Haas, Crysten E; Tourasse, Nicolas; Hom, Erik F Y; Lopez, David; Aksoy, Munevver; Grossman, Arthur; Umen, James; Dutcher, Susan; Porter, Mary; King, Stephen; Witman, George B; Stanke, Mario; Harris, Elizabeth H; Goodstein, David; Grimwood, Jane; Schmutz, Jeremy; Vallon, Olivier; Merchant, Sabeeha S; Prochnik, Simon

    2014-10-01

    The green alga Chlamydomonas reinhardtii is a popular unicellular organism for studying photosynthesis, cilia biogenesis, and micronutrient homeostasis. Ten years since its genome project was initiated an iterative process of improvements to the genome and gene predictions has propelled this organism to the forefront of the omics era. Housed at Phytozome, the plant genomics portal of the Joint Genome Institute (JGI), the most up-to-date genomic data include a genome arranged on chromosomes and high-quality gene models with alternative splice forms supported by an abundance of whole transcriptome sequencing (RNA-Seq) data. We present here the past, present, and future of Chlamydomonas genomics. Specifically, we detail progress on genome assembly and gene model refinement, discuss resources for gene annotations, functional predictions, and locus ID mapping between versions and, importantly, outline a standardized framework for naming genes. Copyright © 2014 Elsevier Ltd. All rights reserved.

  16. Single nucleotide resolution RNA-seq uncovers new regulatory mechanisms in the opportunistic pathogen Streptococcus agalactiae.

    PubMed

    Rosinski-Chupin, Isabelle; Sauvage, Elisabeth; Sismeiro, Odile; Villain, Adrien; Da Cunha, Violette; Caliot, Marie-Elise; Dillies, Marie-Agnès; Trieu-Cuot, Patrick; Bouloc, Philippe; Lartigue, Marie-Frédérique; Glaser, Philippe

    2015-05-30

    Streptococcus agalactiae, or Group B Streptococcus, is a leading cause of neonatal infections and an increasing cause of infections in adults with underlying diseases. In an effort to reconstruct the transcriptional networks involved in S. agalactiae physiology and pathogenesis, we performed an extensive and robust characterization of its transcriptome through a combination of differential RNA-sequencing in eight different growth conditions or genetic backgrounds and strand-specific RNA-sequencing. Our study identified 1,210 transcription start sites (TSSs) and 655 transcript ends as well as 39 riboswitches and cis-regulatory regions, 39 cis-antisense non-coding RNAs and 47 small RNAs potentially acting in trans. Among these putative regulatory RNAs, ten were differentially expressed in response to an acid stress and two riboswitches sensed directly or indirectly the pH modification. Strikingly, 15% of the TSSs identified were associated with the incorporation of pseudo-templated nucleotides, showing that reiterative transcription is a pervasive process in S. agalactiae. In particular, 40% of the TSSs upstream genes involved in nucleotide metabolism show reiterative transcription potentially regulating gene expression, as exemplified for pyrG and thyA encoding the CTP synthase and the thymidylate synthase respectively. This comprehensive map of the transcriptome at the single nucleotide resolution led to the discovery of new regulatory mechanisms in S. agalactiae. It also provides the basis for in depth analyses of transcriptional networks in S. agalactiae and of the regulatory role of reiterative transcription following variations of intra-cellular nucleotide pools.

  17. RNA-Seq identification of candidate defense genes targeted by endophytic Bacillus cereus-mediated induced systemic resistance against Meloidogyne incognita in tomato.

    PubMed

    Hu, Haijing; Wang, Cong; Li, Xia; Tang, Yunyun; Wang, Yufang; Chen, Shuanglin; Yan, Shuzhen

    2018-05-08

    The endophytic bacterium Bacillus cereus BCM2 has shown great potential as a defense against the parasitic nematode Meloidogyne incognita. Here, we studied the endophytic bacteria-mediated plant defense against M. incognita and searched for defense-related candidate genes using RNA-Seq. The induced systemic resistance of BCM2 against M. incognita was tested using the split-root method. Pre-inoculated BCM2 on the inducer side was associated with a dramatic reduction in galls and egg masses at the responder side, but inoculated BCM2 alone did not produce the same effect. In order to investigate which plant defense-related genes are specifically activated by BCM2, four RNA samples from tomato roots were sequenced, and high-quality clean bases ranging from 6.64 to 6.75 Gb were obtained for the four samples, with an average of 21,558 total genes. A total of 34 candidate defense-related genes were identified by pair-wise comparison among libraries, representing the targets for BCM2 priming resistance against M. incognita. Functional characterization revealed that the plant-pathogen interaction pathway (ID: ko04626) was significantly enriched for BCM2-mediated M. incognita resistance. This study demonstrates that B. cereus BCM2 maintains a harmonious host-microbe relationship with tomato, but appears to prime the plant, resulting in a more vigorous defense response toward the infecting nematode. This article is protected by copyright. All rights reserved.

  18. Comparative Analysis of Single-Cell RNA Sequencing Methods.

    PubMed

    Ziegenhain, Christoph; Vieth, Beate; Parekh, Swati; Reinius, Björn; Guillaumet-Adkins, Amy; Smets, Martha; Leonhardt, Heinrich; Heyn, Holger; Hellmann, Ines; Enard, Wolfgang

    2017-02-16

    Single-cell RNA sequencing (scRNA-seq) offers new possibilities to address biological and medical questions. However, systematic comparisons of the performance of diverse scRNA-seq protocols are lacking. We generated data from 583 mouse embryonic stem cells to evaluate six prominent scRNA-seq methods: CEL-seq2, Drop-seq, MARS-seq, SCRB-seq, Smart-seq, and Smart-seq2. While Smart-seq2 detected the most genes per cell and across cells, CEL-seq2, Drop-seq, MARS-seq, and SCRB-seq quantified mRNA levels with less amplification noise due to the use of unique molecular identifiers (UMIs). Power simulations at different sequencing depths showed that Drop-seq is more cost-efficient for transcriptome quantification of large numbers of cells, while MARS-seq, SCRB-seq, and Smart-seq2 are more efficient when analyzing fewer cells. Our quantitative comparison offers the basis for an informed choice among six prominent scRNA-seq methods, and it provides a framework for benchmarking further improvements of scRNA-seq protocols. Copyright © 2017 Elsevier Inc. All rights reserved.
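
    The reason UMI-based protocols show less amplification noise can be illustrated with a minimal Python sketch: reads that share the same cell, gene, and UMI are treated as copies of one original molecule and counted once. The function name, barcodes, and reads below are hypothetical, and the sketch ignores UMI sequencing errors, which real pipelines usually handle by merging near-identical UMIs.

```python
from collections import defaultdict


def count_umis(reads):
    """Count distinct UMIs per (cell, gene) pair.

    reads: iterable of (cell_barcode, gene, umi) tuples, one per aligned read.
    """
    umis_seen = defaultdict(set)
    for cell, gene, umi in reads:
        # A repeated (cell, gene, umi) triple is an amplification duplicate,
        # so adding it to the set leaves the count unchanged.
        umis_seen[(cell, gene)].add(umi)
    return {key: len(umis) for key, umis in umis_seen.items()}


# Hypothetical reads for illustration only.
reads = [
    ("cellA", "Actb", "AACG"),
    ("cellA", "Actb", "AACG"),  # duplicate of the read above
    ("cellA", "Actb", "TTGC"),
    ("cellB", "Actb", "AACG"),
]
umi_counts = count_umis(reads)
read_counts = len(reads)
```

    Here four reads collapse to three molecules (two in cellA, one in cellB), which is the quantity a UMI-based method reports in place of the raw read count.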

  19. Homology modeling of Homo sapiens lipoic acid synthase: Substrate docking and insights on its binding mode.

    PubMed

    Krishnamoorthy, Ezhilarasi; Hassan, Sameer; Hanna, Luke Elizabeth; Padmalayam, Indira; Rajaram, Rama; Viswanathan, Vijay

    2017-05-07

    Lipoic acid synthase (LIAS) is an iron-sulfur cluster mitochondrial enzyme which catalyzes the final step in the de novo pathway for the biosynthesis of lipoic acid, a potent antioxidant. Recently there has been significant interest in its role in metabolic diseases, and deficiency in LIAS expression has been linked to conditions such as diabetes, atherosclerosis and neonatal-onset epilepsy, suggesting a strong inverse correlation between LIAS reduction and disease status. In this study we use a bioinformatics approach to predict its structure, which would help in understanding its role. A homology model for LIAS protein was generated using the X-ray crystallographic structure from Thermosynechococcus elongatus BP-1 (PDB ID: 4U0P). The predicted structure has 93% of the residues in the most favoured region of the Ramachandran plot. The active site of LIAS protein was mapped and docked with S-Adenosyl Methionine (SAM) using GOLD software. The LIAS-SAM complex was further refined using molecular dynamics simulation within the subsite 1 and subsite 3 of the active site. To the best of our knowledge, this is the first study to report a reliable homology model of LIAS protein. This study will facilitate a better understanding of the mode of action of the enzyme-substrate complex for future studies in designing drugs that can target LIAS protein. Copyright © 2017 Elsevier Ltd. All rights reserved.

  20. DNA Microarray for Rapid Detection and Identification of Food and Water Borne Bacteria: From Dry to Wet Lab.

    PubMed

    Ranjbar, Reza; Behzadi, Payam; Najafi, Ali; Roudi, Raheleh

    2017-01-01

    A rapid, accurate, flexible and reliable diagnostic method may significantly decrease the costs of diagnosis and treatment. Designing an appropriate microarray chip reduces noise and probable bias in the final result. The aim of this study was to design and construct a DNA Microarray Chip for the rapid detection and identification of 10 important bacterial agents. In the present survey, 10 unique genomic regions relating to 10 pathogenic bacterial agents including Escherichia coli (E.coli), Shigella boydii, Sh.dysenteriae, Sh.flexneri, Sh.sonnei, Salmonella typhi, S.typhimurium, Brucella sp., Legionella pneumophila, and Vibrio cholerae were selected for designing specific long oligo microarray probes. For this purpose, in-silico operations, including use of the NCBI RefSeq database, the PanSeq and Gview servers, AlleleID 7.7 and Oligo Analyzer 3.1, were performed. The in-vitro part of the study comprised the stages of robotic microarray chip probe spotting, bacterial DNA extraction and DNA labeling, hybridization and microarray chip scanning. In the wet-lab section, different tools and apparatus such as Nexterion® Slide E, Qarray mini spotter, NimbleGen kit, TrayMix TM S4, and Innoscan 710 were used. A DNA microarray chip including 10 long oligo microarray probes was designed and constructed for detection and identification of 10 pathogenic bacteria. The DNA microarray chip was capable of identifying all 10 bacterial agents tested simultaneously. The presence of a professional bioinformatician as a probe designer is needed to design appropriate multifunctional microarray probes to increase the accuracy of the outcomes.

  1. Comparative transcriptome investigation of global gene expression changes caused by miR156 overexpression in Medicago sativa.

    PubMed

    Gao, Ruimin; Austin, Ryan S; Amyot, Lisa; Hannoufa, Abdelali

    2016-08-19

    Medicago sativa (alfalfa) is a low-input forage and potential bioenergy crop, and improving its yield and quality has always been a focus of the alfalfa breeding industry. Transgenic alfalfa plants overexpressing a precursor of alfalfa microRNA156 (MsmiR156) were recently generated by our group. These plants (miR156OE) showed enhanced biomass yield, reduced internodal length, increased shoot branching and trichome density, and a delay in flowering time. Transcripts of three SQUAMOSA-PROMOTER BINDING PROTEIN-LIKE (SPL) genes (MsSPL6, MsSPL12, and MsSPL13) were found to be targeted for cleavage by MsmiR156 in alfalfa. To further illustrate the molecular mechanisms underlying the effects of miR156 in alfalfa, two miR156OE genotypes (A11a and A17) were subjected to Next Generation RNA Sequencing with Illumina HiSeq. More than 1.11 billion clean reads were obtained from our available sequenced samples. A total of 160,472 transcripts were generated using Trinity de novo assembly and 4,985 significantly differentially expressed genes were detected in miR156OE plants A11a and A17 using the Medicago truncatula genome as reference. A total of 17 genes (including upregulated, downregulated, and unchanged) were selected for quantitative real-time PCR (qRT-PCR) validation, which showed that gene expression levels were largely consistent between qRT-PCR and RNA-Seq data. In addition to the established SPL genes MsSPL6, MsSPL12 and MsSPL13, four new SPLs (MsSPL2, MsSPL3, MsSPL4 and MsSPL9) were also down-regulated significantly in both miR156OE plants. These seven SPL genes belong to phylogenetic clades VI, IV, VIII, V and VII, which have been reported to be targeted by miR156 in Arabidopsis thaliana. The gene ontology terms identified (electron transporter, starch synthase activity, sucrose transport, sucrose-phosphate synthase activity, chitin binding, sexual reproduction, flavonoid biosynthesis and lignin catabolism) correlate well with the phenotypes of miR156OE alfalfa plants. This is the first report of changes in global gene expression in response to miR156 overexpression in alfalfa. The discovered miR156-targeted SPL genes belonging to different clades indicate miR156 plays fundamental and multifunctional roles in regulating alfalfa plant development.

  2. Comprehensive evaluation of AmpliSeq transcriptome, a novel targeted whole transcriptome RNA sequencing methodology for global gene expression analysis.

    PubMed

    Li, Wenli; Turner, Amy; Aggarwal, Praful; Matter, Andrea; Storvick, Erin; Arnett, Donna K; Broeckel, Ulrich

    2015-12-16

    Whole transcriptome sequencing (RNA-seq) represents a powerful approach for whole transcriptome gene expression analysis. However, RNA-seq carries a few limitations, e.g., the requirement of a significant amount of input RNA and complications caused by non-specific mapping of short reads. The Ion AmpliSeq Transcriptome Human Gene Expression Kit (AmpliSeq) was recently introduced by Life Technologies as a whole-transcriptome, targeted gene quantification kit to overcome these limitations of RNA-seq. To assess the performance of this new methodology, we performed a comprehensive comparison of AmpliSeq with RNA-seq using two well-established next-generation sequencing platforms (Illumina HiSeq and Ion Torrent Proton). We analyzed standard reference RNA samples and RNA samples obtained from human induced pluripotent stem cell derived cardiomyocytes (hiPSC-CMs). Using published data from two standard RNA reference samples, we observed a strong concordance of log2 fold change for all genes when comparing AmpliSeq to Illumina HiSeq (Pearson's r = 0.92) and Ion Torrent Proton (Pearson's r = 0.92). We used ROC, the Matthews correlation coefficient and RMSD to determine the overall performance characteristics. All three statistical methods demonstrate AmpliSeq as a highly accurate method for differential gene expression analysis. Additionally, for genes with high abundance, AmpliSeq outperforms the two RNA-seq methods. When analyzing four closely related hiPSC-CM lines, we show that both AmpliSeq and RNA-seq capture similar global gene expression patterns consistent with known sources of variation. Our study indicates that AmpliSeq excels in the limiting areas of RNA-seq for gene expression quantification analysis. Thus, AmpliSeq stands as a very sensitive and cost-effective approach for very large scale gene expression analysis and mRNA marker screening with high accuracy.
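
    The concordance measures named in this abstract (Pearson's r on log2 fold changes, RMSD, and the Matthews correlation coefficient on differential-expression calls) can be illustrated with a minimal sketch. The function names, the toy fold-change values and the |log2FC| >= 1 call threshold below are assumptions made for illustration; they are not taken from the study.

```python
import math


def pearson_r(x, y):
    """Pearson correlation coefficient between two equal-length sequences."""
    n = len(x)
    mean_x, mean_y = sum(x) / n, sum(y) / n
    cov = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    var_x = sum((a - mean_x) ** 2 for a in x)
    var_y = sum((b - mean_y) ** 2 for b in y)
    return cov / math.sqrt(var_x * var_y)


def rmsd(x, y):
    """Root-mean-square deviation between paired values."""
    return math.sqrt(sum((a - b) ** 2 for a, b in zip(x, y)) / len(x))


def matthews_cc(truth, predicted):
    """Matthews correlation coefficient for paired boolean calls."""
    pairs = list(zip(truth, predicted))
    tp = sum(1 for t, p in pairs if t and p)
    tn = sum(1 for t, p in pairs if not t and not p)
    fp = sum(1 for t, p in pairs if not t and p)
    fn = sum(1 for t, p in pairs if t and not p)
    denom = math.sqrt((tp + fp) * (tp + fn) * (tn + fp) * (tn + fn))
    # By convention, return 0 when any marginal count is zero.
    return (tp * tn - fp * fn) / denom if denom else 0.0


if __name__ == "__main__":
    # Hypothetical log2 fold changes for five genes on two platforms.
    targeted = [2.1, -1.3, 0.2, 3.0, -0.4]
    rnaseq = [1.9, -1.0, 0.1, 2.7, -0.6]
    # Hypothetical rule: call a gene differentially expressed if |log2FC| >= 1.
    calls_targeted = [abs(v) >= 1.0 for v in targeted]
    calls_rnaseq = [abs(v) >= 1.0 for v in rnaseq]
    print("Pearson r:", pearson_r(targeted, rnaseq))
    print("RMSD:", rmsd(targeted, rnaseq))
    print("MCC:", matthews_cc(calls_rnaseq, calls_targeted))
```

    Pearson's r and RMSD compare the continuous fold-change estimates, whereas the MCC compares the binary calls derived from them, so the three measures are complementary.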

  3. Gene Expression Analysis of Plum pox virus (Sharka) Susceptibility/Resistance in Apricot (Prunus armeniaca L.).

    PubMed

    Rubio, Manuel; Ballester, Ana Rosa; Olivares, Pedro Manuel; Castro de Moura, Manuel; Dicenta, Federico; Martínez-Gómez, Pedro

    2015-01-01

    RNA-Seq has proven to be a very powerful tool in the analysis of the Plum pox virus (PPV, sharka disease)/Prunus interaction. This technique is an important complementary tool to other means of studying genomics. In this work an analysis of gene expression of resistance/susceptibility to PPV in apricot is performed. RNA-Seq has been applied to analyse the gene expression changes induced by PPV infection in leaves from two full-sib apricot genotypes, "Rojo Pasión" and "Z506-7", resistant and susceptible to PPV, respectively. Transcriptomic analyses revealed the existence of more than 2,000 genes related to the pathogen response and resistance to PPV in apricot. These results showed that the response to infection by the virus in the susceptible genotype is associated with an induction of genes involved in pathogen resistance such as the allene oxide synthase, S-adenosylmethionine synthetase 2 and the major MLP-like protein 423. Over-expression of the Dicer protein 2a may indicate the suppression of a gene silencing mechanism of the plant by PPV HCPro and P1 PPV proteins. On the other hand, there were 164 genes involved in resistance mechanisms that have been identified in apricot, 49 of which are located in the PPVres region (scaffold 1 positions from 8,050,804 to 8,244,925), which is responsible for PPV resistance in apricot. Among these genes in apricot there are several MATH domain-containing genes, although other genes inside (Pleiotropic drug resistance 9 gene) or outside (CAP, Cysteine-rich secretory proteins, Antigen 5 and Pathogenesis-related 1 protein; and LEA, Late embryogenesis abundant protein) PPVres region could also be involved in the resistance.

  4. Gene Expression Analysis of Plum pox virus (Sharka) Susceptibility/Resistance in Apricot (Prunus armeniaca L.)

    PubMed Central

    Rubio, Manuel; Ballester, Ana Rosa; Olivares, Pedro Manuel; Castro de Moura, Manuel; Dicenta, Federico; Martínez-Gómez, Pedro

    2015-01-01

    RNA-Seq has proven to be a very powerful tool in the analysis of the Plum pox virus (PPV, sharka disease)/Prunus interaction. This technique is an important complementary tool to other means of studying genomics. In this work an analysis of gene expression of resistance/susceptibility to PPV in apricot is performed. RNA-Seq has been applied to analyse the gene expression changes induced by PPV infection in leaves from two full-sib apricot genotypes, “Rojo Pasión” and “Z506-7”, resistant and susceptible to PPV, respectively. Transcriptomic analyses revealed the existence of more than 2,000 genes related to the pathogen response and resistance to PPV in apricot. These results showed that the response to infection by the virus in the susceptible genotype is associated with an induction of genes involved in pathogen resistance such as the allene oxide synthase, S-adenosylmethionine synthetase 2 and the major MLP-like protein 423. Over-expression of the Dicer protein 2a may indicate the suppression of a gene silencing mechanism of the plant by PPV HCPro and P1 PPV proteins. On the other hand, there were 164 genes involved in resistance mechanisms that have been identified in apricot, 49 of which are located in the PPVres region (scaffold 1 positions from 8,050,804 to 8,244,925), which is responsible for PPV resistance in apricot. Among these genes in apricot there are several MATH domain-containing genes, although other genes inside (Pleiotropic drug resistance 9 gene) or outside (CAP, Cysteine-rich secretory proteins, Antigen 5 and Pathogenesis-related 1 protein; and LEA, Late embryogenesis abundant protein) PPVres region could also be involved in the resistance. PMID:26658051

  5. Next-Generation Transcriptome Profiling of the Salmon Louse Caligus rogercresseyi Exposed to Deltamethrin (AlphaMax™): Discovery of Relevant Genes and Sex-Related Differences.

    PubMed

    Chávez-Mardones, Jacqueline; Gallardo-Escárate, Cristian

    2015-12-01

    Sea lice are one of the main parasites affecting the salmon aquaculture industry, causing significant economic losses worldwide. Increased resistance to traditional chemical treatments has created the need to find alternative control methods. Therefore, the objective of this study was to identify the transcriptome response of the salmon louse Caligus rogercresseyi to the delousing drug deltamethrin (AlphaMax™). Through bioassays with different concentrations of deltamethrin, adult salmon lice transcriptomes were sequenced from cDNA libraries on the Illumina MiSeq platform. A total of 78 million reads for females and males were assembled in 30,212 and 38,536 contigs, respectively. De novo assembly yielded 86,878 high-quality contigs and, based on published data, it was possible to annotate and identify relevant genes involved in several biological processes. RNA-seq analysis in conjunction with heatmap hierarchical clustering showed that pyrethroids modify the ectoparasitic transcriptome in adults, affecting molecular processes associated with the nervous system, cuticle formation, oxidative stress, reproduction, and metabolism, among others. Furthermore, sex-related transcriptome differences were observed. Specifically, 534 and 1033 exclusive transcripts were identified for males and females, respectively, and 154 were shared between sexes. For males, estradiol 17-beta-dehydrogenase, sphingolipid delta4-desaturase DES1, ketosamine-3-kinase, and arylsulfatase A, among others, were discovered, while for females, vitellogenin 1, glycoprotein G, transaldolase, and nitric oxide synthase were among those identified. The shared transcripts included annotations for tropomyosin, γ-crystallin A, glutamate receptor-metabotropic, glutathione S-transferase, and carboxypeptidase B. The present study reveals that deltamethrin generates a complex transcriptome response in C. rogercresseyi, thus providing valuable genomic information for developing new delousing drugs.

  6. ChiLin: a comprehensive ChIP-seq and DNase-seq quality control and analysis pipeline.

    PubMed

    Qin, Qian; Mei, Shenglin; Wu, Qiu; Sun, Hanfei; Li, Lewyn; Taing, Len; Chen, Sujun; Li, Fugen; Liu, Tao; Zang, Chongzhi; Xu, Han; Chen, Yiwen; Meyer, Clifford A; Zhang, Yong; Brown, Myles; Long, Henry W; Liu, X Shirley

    2016-10-03

    Transcription factor binding, histone modification, and chromatin accessibility studies are important approaches to understanding the biology of gene regulation. ChIP-seq and DNase-seq have become the standard techniques for studying protein-DNA interactions and chromatin accessibility respectively, and comprehensive quality control (QC) and analysis tools are critical to extracting the most value from these assay types. Although many analysis and QC tools have been reported, few combine ChIP-seq and DNase-seq data analysis and quality control in a unified framework with a comprehensive and unbiased reference of data quality metrics. ChiLin is a computational pipeline that automates the quality control and data analyses of ChIP-seq and DNase-seq data. It is developed using a flexible and modular software framework that can be easily extended and modified. ChiLin is ideal for batch processing of many datasets and is well suited for large collaborative projects involving ChIP-seq and DNase-seq from different designs. ChiLin generates comprehensive quality control reports that include comparisons with historical data derived from over 23,677 public ChIP-seq and DNase-seq samples (11,265 datasets) from eight literature-based classified categories. To the best of our knowledge, this atlas represents the most comprehensive ChIP-seq and DNase-seq related quality metric resource currently available. These historical metrics provide useful heuristic quality references for experiments across all commonly used assay types. Using representative datasets, we demonstrate the versatility of the pipeline by applying it to different assay types of ChIP-seq data. The pipeline software is available open source at https://github.com/cfce/chilin . ChiLin is a scalable and powerful tool to process large batches of ChIP-seq and DNase-seq datasets. The analysis output and quality metrics have been structured into user-friendly directories and reports. We have successfully compiled 23,677 profiles into a comprehensive quality atlas with fine classification for users.

  7. Genome-wide identification and characterisation of human DNA replication origins by initiation site sequencing (ini-seq)

    PubMed Central

    Langley, Alexander R.; Gräf, Stefan; Smith, James C.; Krude, Torsten

    2016-01-01

    Next-generation sequencing has enabled the genome-wide identification of human DNA replication origins. However, different approaches to mapping replication origins, namely (i) sequencing isolated small nascent DNA strands (SNS-seq); (ii) sequencing replication bubbles (bubble-seq) and (iii) sequencing Okazaki fragments (OK-seq), show only limited concordance. To address this controversy, we describe here an independent high-resolution origin mapping technique that we call initiation site sequencing (ini-seq). In this approach, newly replicated DNA is directly labelled with digoxigenin-dUTP near the sites of its initiation in a cell-free system. The labelled DNA is then immunoprecipitated and genomic locations are determined by DNA sequencing. Using this technique we identify >25,000 discrete origin sites at sub-kilobase resolution on the human genome, with high concordance between biological replicates. Most activated origins identified by ini-seq are found at transcriptional start sites and contain G-quadruplex (G4) motifs. They tend to cluster in early-replicating domains, providing a correlation between early replication timing and local density of activated origins. Origins identified by ini-seq show highest concordance with sites identified by SNS-seq, followed by OK-seq and bubble-seq. Furthermore, germline origins identified by positive nucleotide distribution skew jumps overlap with origins identified by ini-seq and OK-seq more frequently and more specifically than do sites identified by either SNS-seq or bubble-seq. PMID:27587586

  8. Genome-wide identification and characterisation of human DNA replication origins by initiation site sequencing (ini-seq).

    PubMed

    Langley, Alexander R; Gräf, Stefan; Smith, James C; Krude, Torsten

    2016-12-01

    Next-generation sequencing has enabled the genome-wide identification of human DNA replication origins. However, different approaches to mapping replication origins, namely (i) sequencing isolated small nascent DNA strands (SNS-seq); (ii) sequencing replication bubbles (bubble-seq) and (iii) sequencing Okazaki fragments (OK-seq), show only limited concordance. To address this controversy, we describe here an independent high-resolution origin mapping technique that we call initiation site sequencing (ini-seq). In this approach, newly replicated DNA is directly labelled with digoxigenin-dUTP near the sites of its initiation in a cell-free system. The labelled DNA is then immunoprecipitated and genomic locations are determined by DNA sequencing. Using this technique we identify >25,000 discrete origin sites at sub-kilobase resolution on the human genome, with high concordance between biological replicates. Most activated origins identified by ini-seq are found at transcriptional start sites and contain G-quadruplex (G4) motifs. They tend to cluster in early-replicating domains, providing a correlation between early replication timing and local density of activated origins. Origins identified by ini-seq show highest concordance with sites identified by SNS-seq, followed by OK-seq and bubble-seq. Furthermore, germline origins identified by positive nucleotide distribution skew jumps overlap with origins identified by ini-seq and OK-seq more frequently and more specifically than do sites identified by either SNS-seq or bubble-seq. © The Author(s) 2016. Published by Oxford University Press on behalf of Nucleic Acids Research.

  9. Diffusible gas transmitter signaling in the copepod crustacean Calanus finmarchicus: identification of the biosynthetic enzymes of nitric oxide (NO), carbon monoxide (CO) and hydrogen sulfide (H2S) using a de novo assembled transcriptome

    PubMed Central

    Christie, Andrew E.; Fontanilla, Tiana M.; Roncalli, Vittoria; Cieslak, Matthew C.; Lenz, Petra H.

    2014-01-01

    Neurochemical signaling is a major component of physiological/behavioral control throughout the animal kingdom. Gas transmitters are perhaps the most ancient class of molecules used by nervous systems for chemical communication. Three gases are generally recognized as being produced by neurons: nitric oxide (NO), carbon monoxide (CO) and hydrogen sulfide (H2S). As part of an ongoing effort to identify and characterize the neurochemical signaling systems of the copepod Calanus finmarchicus, the biomass dominant zooplankton in much of the North Atlantic Ocean, we have mined a de novo assembled transcriptome for sequences encoding the neuronal biosynthetic enzymes of these gases, i.e. nitric oxide synthase (NOS), heme oxygenase (HO) and cystathionine β-synthase (CBS), respectively. Using Drosophila proteins as queries, two NOS-, one HO-, and one CBS-encoding transcripts were identified. Reverse BLAST and structural analyses of the deduced proteins suggest that each is a true member of its respective enzyme family. RNA-Seq data collected from embryos, early nauplii, late nauplii, early copepodites, late copepodites and adults revealed the expression of each transcript to be stage specific: one NOS was restricted primarily to the embryo, whereas the other was absent in the embryo but expressed in all other stages; CBS was not expressed in the embryo but was present in all other stages; and HO was expressed across all developmental stages. Given the importance of gas transmitters in the regulatory control of a number of physiological processes, these data open opportunities for investigating the roles these proteins play under different life-stage and environmental conditions in this ecologically important species. PMID:24747481

  10. The Terpene Synthase Gene Family of Carrot (Daucus carota L.): Identification of QTLs and Candidate Genes Associated with Terpenoid Volatile Compounds

    PubMed Central

    Keilwagen, Jens; Lehnert, Heike; Berner, Thomas; Budahn, Holger; Nothnagel, Thomas; Ulrich, Detlef; Dunemann, Frank

    2017-01-01

    Terpenes are an important group of secondary metabolites in carrots influencing taste and flavor, and some of them might also play a role as bioactive substances with an impact on human physiology and health. Understanding the genetic and molecular basis of terpene synthases (TPS) involved in the biosynthesis of volatile terpenoids will provide insights for improving breeding strategies aimed at quality traits and for developing specific carrot chemotypes possibly useful for pharmaceutical applications. Hence, a combination of terpene metabolite profiling, genotyping-by-sequencing (GBS), and genome-wide association study (GWAS) was used in this work to get insights into the genetic control of terpene biosynthesis in carrots and to identify several TPS candidate genes that might be involved in the production of specific monoterpenes. In a panel of 85 carrot cultivars and accessions, metabolite profiling was used to identify 31 terpenoid volatile organic compounds (VOCs) in carrot leaves and roots, and a GBS approach was used to provide dense genome-wide marker coverage (>168,000 SNPs). Based on these data, a total of 30 quantitative trait loci (QTLs) were identified for 15 terpenoid volatiles. Most QTLs were detected for the monoterpene compounds ocimene, sabinene, β-pinene, borneol and bornyl acetate. By GWAS, we identified four genomic regions on three different carrot chromosomes that are associated with high significance (LOD ≥ 5.91) both with distinct monoterpenes and with TPS candidate genes, which have been identified by homology-based gene prediction utilizing RNA-seq data. In total, 65 TPS candidate gene models in carrot were identified and assigned to known plant TPS subfamilies with the exception of TPS-d and TPS-h. TPS-b was identified as the largest subfamily with 32 TPS candidate genes. PMID:29170675

  11. Chromosome doubling to overcome the chrysanthemum cross barrier based on insight from transcriptomic and proteomic analyses.

    PubMed

    Zhang, Fengjiao; Hua, Lichun; Fei, Jiangsong; Wang, Fan; Liao, Yuan; Fang, Weimin; Chen, Fadi; Teng, Nianjun

    2016-08-09

    Cross breeding is the most commonly used method in chrysanthemum (Chrysanthemum morifolium) breeding; however, cross barriers always exist in these combinations. Many studies have shown that paternal chromosome doubling can often overcome hybridization barriers during cross breeding, although the underlying mechanism has seldom been investigated. In this study, we performed two crosses: C. morifolium (pollen receptor) × diploid C. nankingense (pollen donor) and C. morifolium × tetraploid C. nankingense. Seeds were obtained only from the latter cross. RNA-Seq and isobaric tags for relative and absolute quantitation (iTRAQ) were used to investigate differentially expressed genes and proteins during key embryo development stages in the latter cross. A previously performed cross, C. morifolium × diploid C. nankingense, was compared to our results and revealed that transcription factors (i.e., the agamous-like MADS-box protein AGL80 and the leucine-rich repeat receptor protein kinase EXS), hormone-responsive genes (auxin-binding protein 1), genes and proteins related to metabolism (ATP-citrate synthase, citrate synthase and malate dehydrogenase) and other genes reported to contribute to embryo development (i.e., LEA, elongation factor and tubulin) had higher expression levels in the C. morifolium × tetraploid C. nankingense cross. In contrast, genes related to senescence and cell death were down-regulated in the C. morifolium × tetraploid C. nankingense cross. The data resources helped elucidate the gene and protein expression profiles and identify functional genes during different development stages. When the chromosomes from the male parent are doubled, the genes contributing to normal embryo development are more abundant. However, genes with negative functions were suppressed, suggesting that chromosome doubling may epigenetically inhibit the expression of these genes and allow the embryo to develop normally.

  12. ToNER: A tool for identifying nucleotide enrichment signals in feature-enriched RNA-seq data.

    PubMed

    Promworn, Yuttachon; Kaewprommal, Pavita; Shaw, Philip J; Intarapanich, Apichart; Tongsima, Sissades; Piriyapongsa, Jittima

    2017-01-01

    Biochemical methods are available for enriching 5' ends of RNAs in prokaryotes, which are employed in the differential RNA-seq (dRNA-seq) and the more recent Cappable-seq protocols. Computational methods are needed to locate RNA 5' ends from these data by statistical analysis of the enrichment. Although statistical-based analysis methods have been developed for dRNA-seq, they may not be suitable for Cappable-seq data. The more efficient enrichment method employed in Cappable-seq compared with dRNA-seq could affect data distribution and thus algorithm performance. We present Transformation of Nucleotide Enrichment Ratios (ToNER), a tool for statistical modeling of enrichment from RNA-seq data obtained from enriched and unenriched libraries. The tool calculates nucleotide enrichment scores and determines the global transformation for fitting to the normal distribution using the Box-Cox procedure. From the transformed distribution, sites of significant enrichment are identified. To increase power of detection, meta-analysis across experimental replicates is offered. We tested the tool on Cappable-seq and dRNA-seq data for identifying Escherichia coli transcript 5' ends and compared the results with those from the TSSAR tool, which is designed for analyzing dRNA-seq data. When combining results across Cappable-seq replicates, ToNER detects more known transcript 5' ends than TSSAR. In general, the transcript 5' ends detected by ToNER but not TSSAR occur in regions which cannot be locally modeled by TSSAR. ToNER uses a simple yet robust statistical modeling approach, which can be used for detecting RNA 5' ends from Cappable-seq data, in particular when combining information from experimental replicates. The ToNER tool could potentially be applied for analyzing other RNA-seq datasets in which enrichment for other structural features of RNA is employed. The program is freely available for download at the ToNER webpage (http://www4a.biotec.or.th/GI/tools/toner) and GitHub repository (https://github.com/PavitaKae/ToNER).
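
    The Box-Cox step mentioned in this abstract can be sketched in general terms as follows. This is not ToNER's code: the grid search over lambda, the z-score cutoff and all function names are assumptions chosen for illustration, and the abstract does not say how enrichment scores are computed or how significance is assessed.

```python
import math


def boxcox(values, lam):
    """Box-Cox transform of strictly positive values for a given lambda."""
    if abs(lam) < 1e-12:
        return [math.log(v) for v in values]
    return [(v ** lam - 1.0) / lam for v in values]


def boxcox_loglik(values, lam):
    """Profile log-likelihood of lambda under a normal model (up to a constant)."""
    n = len(values)
    y = boxcox(values, lam)
    mean = sum(y) / n
    var = sum((v - mean) ** 2 for v in y) / n
    return -0.5 * n * math.log(var) + (lam - 1.0) * sum(math.log(v) for v in values)


def fit_lambda(values, lo=-2.0, hi=2.0, steps=401):
    """Pick the lambda on a regular grid that maximises the profile likelihood."""
    grid = [lo + i * (hi - lo) / (steps - 1) for i in range(steps)]
    return max(grid, key=lambda lam: boxcox_loglik(values, lam))


def enriched_sites(ratios, z_cutoff=3.0):
    """Indices whose transformed ratio lies more than z_cutoff SDs above the mean."""
    lam = fit_lambda(ratios)
    y = boxcox(ratios, lam)
    mean = sum(y) / len(y)
    sd = math.sqrt(sum((v - mean) ** 2 for v in y) / len(y))
    return [i for i, v in enumerate(y) if (v - mean) / sd > z_cutoff]


if __name__ == "__main__":
    # Hypothetical per-nucleotide enrichment ratios (enriched / unenriched).
    ratios = [0.8, 1.1, 0.9, 1.3, 1.0, 0.7, 1.2, 25.0, 0.95, 1.05]
    print("lambda:", fit_lambda(ratios))
    print("candidate sites:", enriched_sites(ratios, z_cutoff=2.0))
```

    A single global lambda is fitted to all ratios, matching the abstract's description of a global transformation; input ratios must be strictly positive for the transform to be defined.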

  13. ATACseqQC: a Bioconductor package for post-alignment quality assessment of ATAC-seq data.

    PubMed

    Ou, Jianhong; Liu, Haibo; Yu, Jun; Kelliher, Michelle A; Castilla, Lucio H; Lawson, Nathan D; Zhu, Lihua Julie

    2018-03-01

    ATAC-seq (Assays for Transposase-Accessible Chromatin using sequencing) is a recently developed technique for genome-wide analysis of chromatin accessibility. Compared to earlier methods for assaying chromatin accessibility, ATAC-seq is faster and easier to perform, does not require cross-linking, has higher signal to noise ratio, and can be performed on small cell numbers. However, to ensure a successful ATAC-seq experiment, step-by-step quality assurance processes, including both wet lab quality control and in silico quality assessment, are essential. While several tools have been developed or adopted for assessing read quality, identifying nucleosome occupancy and accessible regions from ATAC-seq data, none of the tools provide a comprehensive set of functionalities for preprocessing and quality assessment of aligned ATAC-seq datasets. We have developed a Bioconductor package, ATACseqQC, for easily generating various diagnostic plots to help researchers quickly assess the quality of their ATAC-seq data. In addition, this package contains functions to preprocess aligned ATAC-seq data for subsequent peak calling. Here we demonstrate the utilities of our package using 25 publicly available ATAC-seq datasets from four studies. We also provide guidelines on what the diagnostic plots should look like for an ideal ATAC-seq dataset. This software package has been used successfully for preprocessing and assessing several in-house and public ATAC-seq datasets. Diagnostic plots generated by this package will facilitate the quality assessment of ATAC-seq data, and help researchers to evaluate their own ATAC-seq experiments as well as select high-quality ATAC-seq datasets from public repositories such as GEO to avoid generating hypotheses or drawing conclusions from low-quality ATAC-seq experiments. The software, source code, and documentation are freely available as a Bioconductor package at https://bioconductor.org/packages/release/bioc/html/ATACseqQC.html .

  14. iSeq: Web-Based RNA-seq Data Analysis and Visualization.

    PubMed

    Zhang, Chao; Fan, Caoqi; Gan, Jingbo; Zhu, Ping; Kong, Lei; Li, Cheng

    2018-01-01

    Transcriptome sequencing (RNA-seq) is becoming a standard experimental methodology for genome-wide characterization and quantification of transcripts at single base-pair resolution. However, downstream analysis of massive amounts of sequencing data can be prohibitively technical for wet-lab researchers. A functionally integrated and user-friendly platform is required to meet this demand. Here, we present iSeq, an R-based Web server, for RNA-seq data analysis and visualization. iSeq is a streamlined Web-based R application under the Shiny framework, featuring a simple user interface and multiple data analysis modules. Users without programming and statistical skills can analyze their RNA-seq data and construct publication-level graphs through a standardized yet customizable analytical pipeline. iSeq is accessible via Web browsers on any operating system at http://iseq.cbi.pku.edu.cn .

  15. Observation weights unlock bulk RNA-seq tools for zero inflation and single-cell applications.

    PubMed

    Van den Berge, Koen; Perraudeau, Fanny; Soneson, Charlotte; Love, Michael I; Risso, Davide; Vert, Jean-Philippe; Robinson, Mark D; Dudoit, Sandrine; Clement, Lieven

    2018-02-26

    Dropout events in single-cell RNA sequencing (scRNA-seq) cause many transcripts to go undetected and induce an excess of zero read counts, leading to power issues in differential expression (DE) analysis. This has triggered the development of bespoke scRNA-seq DE methods to cope with zero inflation. Recent evaluations, however, have shown that dedicated scRNA-seq tools provide no advantage compared to traditional bulk RNA-seq tools. We introduce a weighting strategy, based on a zero-inflated negative binomial model, that identifies excess zero counts and generates gene- and cell-specific weights to unlock bulk RNA-seq DE pipelines for zero-inflated data, boosting performance for scRNA-seq.
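
    One common way to turn a zero-inflated negative binomial (ZINB) model into observation weights is to use the posterior probability that a count was generated by the negative binomial component rather than by the excess-zero component. The sketch below assumes that formulation; the parameter names (`mu`, `theta`, `pi`) and the example values are hypothetical, and in practice these parameters would be estimated per gene and per cell by a fitted model.

```python
import math


def nb_pmf(y, mu, theta):
    """Negative binomial pmf with mean mu > 0 and dispersion theta > 0."""
    log_p = (
        math.lgamma(y + theta) - math.lgamma(theta) - math.lgamma(y + 1)
        + theta * math.log(theta / (theta + mu))
        + y * math.log(mu / (theta + mu))
    )
    return math.exp(log_p)


def zinb_weight(y, mu, theta, pi):
    """Posterior probability that count y comes from the NB component.

    pi is the probability of an excess zero. Positive counts can only come
    from the NB component, so their weight is 1.
    """
    if y > 0:
        return 1.0
    nb_zero = nb_pmf(0, mu, theta)
    return (1.0 - pi) * nb_zero / (pi + (1.0 - pi) * nb_zero)


if __name__ == "__main__":
    # Hypothetical parameters for one gene in three cells: (count, mu, theta, pi).
    cells = [(0, 0.5, 1.0, 0.3), (0, 20.0, 1.0, 0.3), (7, 20.0, 1.0, 0.3)]
    for y, mu, theta, pi in cells:
        print(y, mu, round(zinb_weight(y, mu, theta, pi), 4))
```

    A zero observed where the expected count is high receives a weight near 0 and is down-weighted as a likely dropout, while a zero observed where the expected count is low keeps a weight closer to 1.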

  16. rSeqNP: a non-parametric approach for detecting differential expression and splicing from RNA-Seq data.

    PubMed

    Shi, Yang; Chinnaiyan, Arul M; Jiang, Hui

    2015-07-01

    High-throughput sequencing of transcriptomes (RNA-Seq) has become a powerful tool to study gene expression. Here we present an R package, rSeqNP, which implements a non-parametric approach to test for differential expression and splicing from RNA-Seq data. rSeqNP uses permutation tests to assess statistical significance and can be applied to a variety of experimental designs. By combining information across isoforms, rSeqNP is able to detect more differentially expressed or spliced genes from RNA-Seq data. The R package with its source code and documentation is freely available at http://www-personal.umich.edu/∼jianghui/rseqnp/. Contact: jianghui@umich.edu. Supplementary data are available at Bioinformatics online. © The Author 2015. Published by Oxford University Press. All rights reserved. For Permissions, please e-mail: journals.permissions@oup.com.
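
    The general idea of assessing significance by permutation can be sketched as below. This is a generic two-group permutation test on a difference in means, not rSeqNP's implementation: the abstract does not specify the package's test statistics, and the function name, the expression values and the number of permutations are assumptions for illustration.

```python
import random


def permutation_pvalue(group_a, group_b, n_perm=10000, seed=0):
    """Two-sided permutation p-value for a difference in group means."""
    def mean(values):
        return sum(values) / len(values)

    rng = random.Random(seed)
    observed = abs(mean(group_a) - mean(group_b))
    pooled = list(group_a) + list(group_b)
    n_a = len(group_a)
    hits = 0
    for _ in range(n_perm):
        # Randomly reassign group labels by shuffling the pooled values.
        rng.shuffle(pooled)
        stat = abs(mean(pooled[:n_a]) - mean(pooled[n_a:]))
        if stat >= observed - 1e-12:
            hits += 1
    # Add-one correction so the estimated p-value is never exactly zero.
    return (hits + 1) / (n_perm + 1)


if __name__ == "__main__":
    # Hypothetical normalised expression values for one gene in two conditions.
    control = [10.2, 11.0, 9.8, 10.5, 10.1]
    treated = [14.9, 15.3, 16.1, 14.7, 15.8]
    print("p-value:", permutation_pvalue(control, treated))
```

    Because the null distribution is built from the data themselves, no distributional assumption about the expression values is needed, which is the sense in which such a test is non-parametric.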

  17. RNA-Seq-Based Transcript Structure Analysis with TrBorderExt.

    PubMed

    Wang, Yejun; Sun, Ming-An; White, Aaron P

    2018-01-01

    RNA-Seq has become a routine strategy for genome-wide gene expression comparisons in bacteria. Despite lower resolution in transcript border parsing compared with dRNA-Seq, TSS-EMOTE, Cappable-seq, Term-seq, and others, directional RNA-Seq still illustrates its advantages: low cost, quantification and transcript border analysis with a medium resolution (±10-20 nt). To facilitate mining of directional RNA-Seq datasets especially with respect to transcript structure analysis, we developed a tool, TrBorderExt, which can parse transcript start sites and termination sites accurately in bacteria. A detailed protocol is described in this chapter for how to use the software package step by step to identify bacterial transcript borders from raw RNA-Seq data. The package was developed with Perl and R programming languages, and is accessible freely through the website: http://www.szu-bioinf.org/TrBorderExt .

  18. 50 CFR 12.12 - Appraisement.

    Code of Federal Regulations, 2011 CFR

    2011-10-01

    ... Protection Act, 16 U.S.C. 1361 et seq., and the value of any property seized under the Endangered Species Act, 16 U.S.C. 1531 et seq.; Eagle Protection Act, 16 U.S.C. 668 et seq.; Airborne Hunting Act, 16 U.S.C. 742j-1, et seq.; or the Lacey Act Amendments of 1981, 16 U.S.C. 3371 et seq. If the seized property may...

  19. 50 CFR 12.12 - Appraisement.

    Code of Federal Regulations, 2010 CFR

    2010-10-01

    ... Protection Act, 16 U.S.C. 1361 et seq., and the value of any property seized under the Endangered Species Act, 16 U.S.C. 1531 et seq.; Eagle Protection Act, 16 U.S.C. 668 et seq.; Airborne Hunting Act, 16 U.S.C. 742j-1, et seq.; or the Lacey Act Amendments of 1981, 16 U.S.C. 3371 et seq. If the seized property may...

  20. 50 CFR 12.12 - Appraisement.

    Code of Federal Regulations, 2014 CFR

    2014-10-01

    ... Protection Act, 16 U.S.C. 1361 et seq., and the value of any property seized under the Endangered Species Act, 16 U.S.C. 1531 et seq.; Eagle Protection Act, 16 U.S.C. 668 et seq.; Airborne Hunting Act, 16 U.S.C. 742j-1, et seq.; or the Lacey Act Amendments of 1981, 16 U.S.C. 3371 et seq. If the seized property may...

  1. 50 CFR 12.12 - Appraisement.

    Code of Federal Regulations, 2012 CFR

    2012-10-01

    ... Protection Act, 16 U.S.C. 1361 et seq., and the value of any property seized under the Endangered Species Act, 16 U.S.C. 1531 et seq.; Eagle Protection Act, 16 U.S.C. 668 et seq.; Airborne Hunting Act, 16 U.S.C. 742j-1, et seq.; or the Lacey Act Amendments of 1981, 16 U.S.C. 3371 et seq. If the seized property may...

  2. 50 CFR 12.12 - Appraisement.

    Code of Federal Regulations, 2013 CFR

    2013-10-01

    ... Protection Act, 16 U.S.C. 1361 et seq., and the value of any property seized under the Endangered Species Act, 16 U.S.C. 1531 et seq.; Eagle Protection Act, 16 U.S.C. 668 et seq.; Airborne Hunting Act, 16 U.S.C. 742j-1, et seq.; or the Lacey Act Amendments of 1981, 16 U.S.C. 3371 et seq. If the seized property may...

  3. ToNER: A tool for identifying nucleotide enrichment signals in feature-enriched RNA-seq data

    PubMed Central

    Promworn, Yuttachon; Kaewprommal, Pavita; Shaw, Philip J.; Intarapanich, Apichart; Tongsima, Sissades

    2017-01-01

    Background: Biochemical methods are available for enriching 5′ ends of RNAs in prokaryotes, which are employed in the differential RNA-seq (dRNA-seq) and the more recent Cappable-seq protocols. Computational methods are needed to locate RNA 5′ ends from these data by statistical analysis of the enrichment. Although statistical-based analysis methods have been developed for dRNA-seq, they may not be suitable for Cappable-seq data. The more efficient enrichment method employed in Cappable-seq compared with dRNA-seq could affect data distribution and thus algorithm performance. Results: We present Transformation of Nucleotide Enrichment Ratios (ToNER), a tool for statistical modeling of enrichment from RNA-seq data obtained from enriched and unenriched libraries. The tool calculates nucleotide enrichment scores and determines the global transformation for fitting to the normal distribution using the Box-Cox procedure. From the transformed distribution, sites of significant enrichment are identified. To increase power of detection, meta-analysis across experimental replicates is offered. We tested the tool on Cappable-seq and dRNA-seq data for identifying Escherichia coli transcript 5′ ends and compared the results with those from the TSSAR tool, which is designed for analyzing dRNA-seq data. When combining results across Cappable-seq replicates, ToNER detects more known transcript 5′ ends than TSSAR. In general, the transcript 5′ ends detected by ToNER but not TSSAR occur in regions which cannot be locally modeled by TSSAR. Conclusion: ToNER uses a simple yet robust statistical modeling approach, which can be used for detecting RNA 5′ ends from Cappable-seq data, in particular when combining information from experimental replicates. The ToNER tool could potentially be applied for analyzing other RNA-seq datasets in which enrichment for other structural features of RNA is employed. The program is freely available for download at the ToNER webpage (http://www4a.biotec.or.th/GI/tools/toner) and GitHub repository (https://github.com/PavitaKae/ToNER). PMID:28542466
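
    The Box-Cox step named in this abstract can be sketched in a few lines of Python. The sketch assumes strictly positive, non-constant scores and picks the transformation parameter on a coarse grid by maximising the normal profile log-likelihood. It is not ToNER's code: the abstract does not define the enrichment score, and the function names and the grid from -2.0 to 2.0 are hypothetical choices for illustration.

```python
import math


def box_cox(values, lam):
    """Box-Cox transform of strictly positive values for a given lambda."""
    if abs(lam) < 1e-12:
        return [math.log(v) for v in values]
    return [(v ** lam - 1.0) / lam for v in values]


def box_cox_log_likelihood(values, lam):
    """Profile log-likelihood of lambda under a normal model (constants dropped).

    Assumes the transformed values are not all identical (non-zero variance).
    """
    n = len(values)
    transformed = box_cox(values, lam)
    mu = sum(transformed) / n
    variance = sum((t - mu) ** 2 for t in transformed) / n
    log_jacobian = (lam - 1.0) * sum(math.log(v) for v in values)
    return -0.5 * n * math.log(variance) + log_jacobian


def best_lambda(values, grid=None):
    """Pick lambda on a coarse grid by maximising the profile log-likelihood."""
    if grid is None:
        grid = [i / 10.0 for i in range(-20, 21)]  # -2.0, -1.9, ..., 2.0
    return max(grid, key=lambda lam: box_cox_log_likelihood(values, lam))
```

    After choosing `lam = best_lambda(scores)`, the transformed scores `box_cox(scores, lam)` could be standardised and compared against a normal tail threshold; that thresholding step is left out here because the abstract does not specify it.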

  4. Comprehensive Assessments of RNA-seq by the SEQC Consortium: FDA-Led Efforts Advance Precision Medicine.

    PubMed

    Xu, Joshua; Gong, Binsheng; Wu, Leihong; Thakkar, Shraddha; Hong, Huixiao; Tong, Weida

    2016-03-15

    Studies on gene expression in response to therapy have led to the discovery of pharmacogenomics biomarkers and advances in precision medicine. Whole transcriptome sequencing (RNA-seq) is an emerging tool for profiling gene expression and has received wide adoption in the biomedical research community. However, its value in regulatory decision making requires rigorous assessment and consensus between various stakeholders, including the research community, regulatory agencies, and industry. The FDA-led SEquencing Quality Control (SEQC) consortium has made considerable progress in this direction, and is the subject of this review. Specifically, three RNA-seq platforms (Illumina HiSeq, Life Technologies SOLiD, and Roche 454) were extensively evaluated at multiple sites to assess cross-site and cross-platform reproducibility. The results demonstrated that relative gene expression measurements were consistently comparable across labs and platforms, but not so for the measurement of absolute expression levels. As part of the quality evaluation several studies were included to evaluate the utility of RNA-seq in clinical settings and safety assessment. The neuroblastoma study profiled tumor samples from 498 pediatric neuroblastoma patients by both microarray and RNA-seq. RNA-seq offers more utilities than microarray in determining the transcriptomic characteristics of cancer. However, RNA-seq and microarray-based models were comparable in clinical endpoint prediction, even when including additional features unique to RNA-seq beyond gene expression. The toxicogenomics study compared microarray and RNA-seq profiles of the liver samples from rats exposed to 27 different chemicals representing multiple toxicity modes of action. Cross-platform concordance was dependent on chemical treatment and transcript abundance. 
Though both RNA-seq and microarray are suitable for developing gene expression based predictive models with comparable prediction performance, RNA-seq offers advantages over microarray in profiling genes with low expression. The rat BodyMap study provided a comprehensive rat transcriptomic body map by performing RNA-Seq on 320 samples from 11 organs in either sex of juvenile, adolescent, adult and aged Fischer 344 rats. Lastly, the transferability study demonstrated that signature genes of predictive models are reciprocally transferable between microarray and RNA-seq data for model development using a comprehensive approach with two large clinical data sets. This result suggests continued usefulness of legacy microarray data in the coming RNA-seq era. In conclusion, the SEQC project enhances our understanding of RNA-seq and provides valuable guidelines for RNA-seq based clinical application and safety evaluation to advance precision medicine.

  5. GGRNA: an ultrafast, transcript-oriented search engine for genes and transcripts

    PubMed Central

    Naito, Yuki; Bono, Hidemasa

    2012-01-01

    GGRNA (http://GGRNA.dbcls.jp/) is a Google-like, ultrafast search engine for genes and transcripts. The web server accepts arbitrary words and phrases, such as gene names, IDs, gene descriptions, annotations of gene and even nucleotide/amino acid sequences through one simple search box, and quickly returns relevant RefSeq transcripts. A typical search takes just a few seconds, which dramatically enhances the usability of routine searching. In particular, GGRNA can search sequences as short as 10 nt or 4 amino acids, which cannot be handled easily by popular sequence analysis tools. Nucleotide sequences can be searched allowing up to three mismatches, or the query sequences may contain degenerate nucleotide codes (e.g. N, R, Y, S). Furthermore, Gene Ontology annotations, Enzyme Commission numbers and probe sequences of catalog microarrays are also incorporated into GGRNA, which may help users to conduct searches by various types of keywords. GGRNA web server will provide a simple and powerful interface for finding genes and transcripts for a wide range of users. All services at GGRNA are provided free of charge to all users. PMID:22641850
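
    To make the mismatch-tolerant and degenerate-code search described in this abstract concrete, the following Python sketch scans a target sequence for windows that match a query written with IUPAC nucleotide codes, allowing a bounded number of mismatches. This is only a naive scan under stated assumptions: the abstract does not describe GGRNA's indexing or matching method, the function name `find_matches` is hypothetical, and combining degenerate codes with mismatches in one call is a simplification.

```python
# IUPAC nucleotide codes mapped to the bases they stand for.
IUPAC = {
    "A": "A", "C": "C", "G": "G", "T": "T", "U": "T",
    "R": "AG", "Y": "CT", "S": "CG", "W": "AT", "K": "GT", "M": "AC",
    "B": "CGT", "D": "AGT", "H": "ACT", "V": "ACG", "N": "ACGT",
}


def find_matches(query, target, max_mismatches=0):
    """Return (start, mismatches) for each window of target matching query.

    Hypothetical helper: a naive O(len(query) * len(target)) scan.
    """
    query = query.upper()
    target = target.upper().replace("U", "T")
    hits = []
    for start in range(len(target) - len(query) + 1):
        mismatches = 0
        for q, t in zip(query, target[start:start + len(query)]):
            if t not in IUPAC[q]:
                mismatches += 1
                if mismatches > max_mismatches:
                    break
        else:
            # The inner loop finished without exceeding the mismatch budget.
            hits.append((start, mismatches))
    return hits
```

    For example, `find_matches("ACNT", "ACGTACAT")` reports the windows starting at positions 0 and 4, because `N` accepts any base.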

  6. GGRNA: an ultrafast, transcript-oriented search engine for genes and transcripts.

    PubMed

    Naito, Yuki; Bono, Hidemasa

    2012-07-01

    GGRNA (http://GGRNA.dbcls.jp/) is a Google-like, ultrafast search engine for genes and transcripts. The web server accepts arbitrary words and phrases, such as gene names, IDs, gene descriptions, annotations of gene and even nucleotide/amino acid sequences through one simple search box, and quickly returns relevant RefSeq transcripts. A typical search takes just a few seconds, which dramatically enhances the usability of routine searching. In particular, GGRNA can search sequences as short as 10 nt or 4 amino acids, which cannot be handled easily by popular sequence analysis tools. Nucleotide sequences can be searched allowing up to three mismatches, or the query sequences may contain degenerate nucleotide codes (e.g. N, R, Y, S). Furthermore, Gene Ontology annotations, Enzyme Commission numbers and probe sequences of catalog microarrays are also incorporated into GGRNA, which may help users to conduct searches by various types of keywords. GGRNA web server will provide a simple and powerful interface for finding genes and transcripts for a wide range of users. All services at GGRNA are provided free of charge to all users.

  7. Regulation of CDP-diacylglycerol synthesis and utilization by inositol and choline in Schizosaccharomyces pombe.

    PubMed Central

    Gaynor, P M; Greenberg, M L

    1992-01-01

    CDP-diacylglycerol (CDP-DG) is an important branchpoint intermediate in eucaryotic phospholipid biosynthesis and could be a key regulatory site in phospholipid metabolism. Therefore, we examined the effects of growth phase, phospholipid precursors, and the disruption of phosphatidylcholine (PC) synthesis on the membrane-associated phospholipid biosynthetic enzymes CDP-DG synthase, phosphatidylglycerolphosphate (PGP) synthase, phosphatidylinositol (PI) synthase, and phosphatidylserine (PS) synthase in cell extracts of the fission yeast Schizosaccharomyces pombe. In complete synthetic medium containing inositol, maximal expression of CDP-DG synthase, PGP synthase, PI synthase, and PS synthase in wild-type cells occurred in the exponential phase of growth and decreased two- to fourfold in the stationary phase of growth. In cells starved for inositol, this decrease in PGP synthase, PI synthase, and PS synthase expression was not observed. Starvation for inositol resulted in a twofold derepression of PGP synthase and PS synthase expression, while PI synthase expression decreased initially and then remained constant. Upon the addition of inositol to inositol-starved cells, there was a rapid and continued increase in PI synthase expression. We examined expression of these enzymes in cho2 and cho1 mutants, which are blocked in the methylation pathway for synthesis of PC. Choline starvation resulted in a decrease in PS synthase and CDP-DG synthase expression in cho1 but not cho2 cells. Expression of PGP synthase and PI synthase was not affected by choline starvation. Inositol starvation resulted in a 1.7-fold derepression of PGP synthase expression in cho2 but not cho1 cells when PC was synthesized. PS synthase expression was not depressed, while CDP-DG synthase and PI synthase expression decreased in cho2 and cho1 cells in the absence of inositol. 
These results demonstrate that (i) CDP-DG synthase, PGP synthase, PI synthase, and PS synthase are similarly regulated by growth phase; (ii) inositol affects the expression of PGP synthase, PI synthase, and PS synthase; (iii) disruption of the methylation pathway results in aberrant patterns of regulation of growth phase and phospholipid precursors. Important differences between S. pombe and Saccharomyces cerevisiae with regard to regulation of these enzymes are discussed. PMID:1324908

  8. CLIP-seq analysis of multi-mapped reads discovers novel functional RNA regulatory sites in the human transcriptome.

    PubMed

    Zhang, Zijun; Xing, Yi

    2017-09-19

    Crosslinking or RNA immunoprecipitation followed by sequencing (CLIP-seq or RIP-seq) allows transcriptome-wide discovery of RNA regulatory sites. As CLIP-seq/RIP-seq reads are short, existing computational tools focus on uniquely mapped reads, while reads mapped to multiple loci are discarded. We present CLAM (CLIP-seq Analysis of Multi-mapped reads). CLAM uses an expectation-maximization algorithm to assign multi-mapped reads and calls peaks combining uniquely and multi-mapped reads. To demonstrate the utility of CLAM, we applied it to a wide range of public CLIP-seq/RIP-seq datasets involving numerous splicing factors, microRNAs and m6A RNA methylation. CLAM recovered a large number of novel RNA regulatory sites inaccessible by uniquely mapped reads. The functional significance of these sites was demonstrated by consensus motif patterns and association with alternative splicing (splicing factors), transcript abundance (AGO2) and mRNA half-life (m6A). CLAM provides a useful tool to discover novel protein-RNA interactions and RNA modification sites from CLIP-seq and RIP-seq data, and reveals the significant contribution of repetitive elements to the RNA regulatory landscape of the human transcriptome.
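
    The general idea of using expectation-maximization to distribute multi-mapped reads can be sketched as follows. This is a textbook-style toy model, not CLAM's algorithm: it assumes each read lists its candidate loci without repeats, treats per-locus abundance as the only parameter, and uses a fixed iteration count; the function name `em_assign` is hypothetical.

```python
def em_assign(read_loci, n_iterations=50):
    """Fractionally assign reads to loci with a simple EM loop.

    read_loci: list of lists; each inner list holds one read's candidate loci.
    Returns (abundance, assignments): the per-locus share of reads, and one
    dict per read mapping locus -> posterior weight.
    """
    loci = sorted({locus for candidates in read_loci for locus in candidates})
    abundance = {locus: 1.0 / len(loci) for locus in loci}
    assignments = []
    for _ in range(n_iterations):
        # E-step: split each read across its candidates in proportion to abundance.
        assignments = []
        for candidates in read_loci:
            total = sum(abundance[locus] for locus in candidates)
            assignments.append({locus: abundance[locus] / total for locus in candidates})
        # M-step: a locus's abundance is the average weight it received.
        abundance = {locus: 0.0 for locus in loci}
        for weights in assignments:
            for locus, weight in weights.items():
                abundance[locus] += weight / len(read_loci)
    return abundance, assignments
```

    With three reads unique to locus A, one unique to locus B and one read mapping to both, the shared read ends up weighted towards A, since the uniquely mapped reads already indicate that A is more abundant.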

  9. ALOMYbase, a resource to investigate non-target-site-based resistance to herbicides inhibiting acetolactate-synthase (ALS) in the major grass weed Alopecurus myosuroides (black-grass).

    PubMed

    Gardin, Jeanne Aude Christiane; Gouzy, Jérôme; Carrère, Sébastien; Délye, Christophe

    2015-08-12

    Herbicide resistance in agrestal weeds is a global problem threatening food security. Non-target-site resistance (NTSR) endowed by mechanisms neutralising the herbicide or compensating for its action is considered the most agronomically noxious type of resistance. Contrary to target-site resistance, NTSR mechanisms are far from being fully elucidated. A part of weed response to herbicide stress, NTSR is considered to be largely driven by gene regulation. Our purpose was to establish a transcriptome resource allowing investigation of the transcriptomic bases of NTSR in the major grass weed Alopecurus myosuroides L. (Poaceae) for which almost no genomic or transcriptomic data was available. RNA-Seq was performed from plants in one F2 population that were sensitive or expressing NTSR to herbicides inhibiting acetolactate-synthase. Cloned plants were sampled over seven time-points ranging from before until 73 h after herbicide application. Assembly of over 159M high-quality Illumina reads generated a transcriptomic resource (ALOMYbase) containing 65,558 potentially active contigs (N50 = 1240 nucleotides) predicted to encode 32,138 peptides with 74% GO annotation, of which 2017 were assigned to protein families presumably involved in NTSR. Comparison with the fully sequenced grass genomes indicated good coverage and correct representation of A. myosuroides transcriptome in ALOMYbase. The part of the herbicide transcriptomic response common to the resistant and the sensitive plants was consistent with the expected effects of acetolactate-synthase inhibition, with striking similarities observed with published Arabidopsis thaliana data. A. myosuroides plants with NTSR were first affected by herbicide action like sensitive plants, but ultimately overcame it. Analysis of differences in transcriptomic herbicide response between resistant and sensitive plants did not allow identification of processes directly explaining NTSR. 
Five contigs associated to NTSR in the F2 population studied were tentatively identified. They were predicted to encode three cytochromes P450 (CYP71A, CYP71B and CYP81D), one peroxidase and one disease resistance protein. Our data confirmed that gene regulation is at the root of herbicide response and of NTSR. ALOMYbase proved to be a relevant resource to support NTSR transcriptomic studies, and constitutes a valuable tool for future research aiming at elucidating gene regulations involved in NTSR in A. myosuroides.

  10. SpliceSeq: a resource for analysis and visualization of RNA-Seq data on alternative splicing and its functional impacts.

    PubMed

    Ryan, Michael C; Cleland, James; Kim, RyangGuk; Wong, Wing Chung; Weinstein, John N

    2012-09-15

    SpliceSeq is a resource for RNA-Seq data that provides a clear view of alternative splicing and identifies potential functional changes that result from splice variation. It displays intuitive visualizations and prioritized lists of results that highlight splicing events and their biological consequences. SpliceSeq unambiguously aligns reads to gene splice graphs, facilitating accurate analysis of large, complex transcript variants that cannot be adequately represented in other formats. SpliceSeq is freely available at http://bioinformatics.mdanderson.org/main/SpliceSeq:Overview. The application is a Java program that can be launched via a browser or installed locally. Local installation requires MySQL and Bowtie. Contact: mryan@insilico.us.com. Supplementary data are available at Bioinformatics online.

  11. PRAPI: post-transcriptional regulation analysis pipeline for Iso-Seq.

    PubMed

    Gao, Yubang; Wang, Huiyuan; Zhang, Hangxiao; Wang, Yongsheng; Chen, Jinfeng; Gu, Lianfeng

    2018-05-01

    The single-molecule real-time (SMRT) isoform sequencing (Iso-Seq) based on the Pacific Biosciences (PacBio) platform has received increasing attention for its ability to explore full-length isoforms. Thus, comprehensive tools for Iso-Seq bioinformatics analysis are extremely useful. Here, we present a one-stop solution for Iso-Seq analysis, called PRAPI, to comprehensively analyze alternative transcription initiation (ATI), alternative splicing (AS), alternative cleavage and polyadenylation (APA), natural antisense transcripts (NAT), and circular RNAs (circRNAs). PRAPI is capable of combining Iso-Seq full-length isoforms with short-read data, such as RNA-Seq or polyadenylation site sequencing (PAS-seq), for differential expression analysis of NAT, AS, APA and circRNAs. Furthermore, PRAPI can annotate new genes and correct mis-annotated genes when gene annotation is available. Finally, PRAPI generates high-quality vector graphics to visualize and highlight the Iso-Seq results. The Dockerfile of PRAPI is available at http://www.bioinfor.org/tool/PRAPI. Contact: lfgu@fafu.edu.cn.

  12. SeqLib: a C++ API for rapid BAM manipulation, sequence alignment and sequence assembly

    PubMed Central

    Wala, Jeremiah; Beroukhim, Rameen

    2017-01-01

    We present SeqLib, a C++ API and command line tool that provides a rapid and user-friendly interface to BAM/SAM/CRAM files, global sequence alignment operations and sequence assembly. Four C libraries perform core operations in SeqLib: HTSlib for BAM access, BWA-MEM and BLAT for sequence alignment and Fermi for error correction and sequence assembly. Benchmarking indicates that SeqLib has lower CPU and memory requirements than leading C++ sequence analysis APIs. We demonstrate an example of how minimal SeqLib code can extract, error-correct and assemble reads from a CRAM file and then align with BWA-MEM. SeqLib also provides additional capabilities, including chromosome-aware interval queries and read plotting. Command line tools are available for performing integrated error correction, micro-assemblies and alignment. Availability and Implementation: SeqLib is available on Linux and OSX for the C++98 standard and later at github.com/walaj/SeqLib. SeqLib is released under the Apache2 license. Additional capabilities for BLAT alignment are available under the BLAT license. Contact: jwala@broadinstitue.org; rameen@broadinstitute.org PMID:28011768

  13. Stormbow: A Cloud-Based Tool for Reads Mapping and Expression Quantification in Large-Scale RNA-Seq Studies

    PubMed Central

    Zhao, Shanrong; Prenger, Kurt; Smith, Lance

    2013-01-01

    RNA-Seq is becoming a promising replacement to microarrays in transcriptome profiling and differential gene expression study. Technical improvements have decreased sequencing costs and, as a result, the size and number of RNA-Seq datasets have increased rapidly. However, the increasing volume of data from large-scale RNA-Seq studies poses a practical challenge for data analysis in a local environment. To meet this challenge, we developed Stormbow, a cloud-based software package, to process large volumes of RNA-Seq data in parallel. The performance of Stormbow has been tested by practically applying it to analyse 178 RNA-Seq samples in the cloud. In our test, it took 6 to 8 hours to process an RNA-Seq sample with 100 million reads, and the average cost was $3.50 per sample. Utilizing Amazon Web Services as the infrastructure for Stormbow allows us to easily scale up to handle large datasets with on-demand computational resources. Stormbow is a scalable, cost effective, and open-source based tool for large-scale RNA-Seq data analysis. Stormbow can be freely downloaded and can be used out of box to process Illumina RNA-Seq datasets. PMID:25937948

  14. SeqLib: a C++ API for rapid BAM manipulation, sequence alignment and sequence assembly.

    PubMed

    Wala, Jeremiah; Beroukhim, Rameen

    2017-03-01

    We present SeqLib, a C++ API and command line tool that provides a rapid and user-friendly interface to BAM/SAM/CRAM files, global sequence alignment operations and sequence assembly. Four C libraries perform core operations in SeqLib: HTSlib for BAM access, BWA-MEM and BLAT for sequence alignment and Fermi for error correction and sequence assembly. Benchmarking indicates that SeqLib has lower CPU and memory requirements than leading C++ sequence analysis APIs. We demonstrate an example of how minimal SeqLib code can extract, error-correct and assemble reads from a CRAM file and then align with BWA-MEM. SeqLib also provides additional capabilities, including chromosome-aware interval queries and read plotting. Command line tools are available for performing integrated error correction, micro-assemblies and alignment. SeqLib is available on Linux and OSX for the C++98 standard and later at github.com/walaj/SeqLib. SeqLib is released under the Apache2 license. Additional capabilities for BLAT alignment are available under the BLAT license. Contact: jwala@broadinstitue.org; rameen@broadinstitute.org.

  15. Stormbow: A Cloud-Based Tool for Reads Mapping and Expression Quantification in Large-Scale RNA-Seq Studies.

    PubMed

    Zhao, Shanrong; Prenger, Kurt; Smith, Lance

    2013-01-01

    RNA-Seq is becoming a promising replacement to microarrays in transcriptome profiling and differential gene expression study. Technical improvements have decreased sequencing costs and, as a result, the size and number of RNA-Seq datasets have increased rapidly. However, the increasing volume of data from large-scale RNA-Seq studies poses a practical challenge for data analysis in a local environment. To meet this challenge, we developed Stormbow, a cloud-based software package, to process large volumes of RNA-Seq data in parallel. The performance of Stormbow has been tested by practically applying it to analyse 178 RNA-Seq samples in the cloud. In our test, it took 6 to 8 hours to process an RNA-Seq sample with 100 million reads, and the average cost was $3.50 per sample. Utilizing Amazon Web Services as the infrastructure for Stormbow allows us to easily scale up to handle large datasets with on-demand computational resources. Stormbow is a scalable, cost effective, and open-source based tool for large-scale RNA-Seq data analysis. Stormbow can be freely downloaded and can be used out of box to process Illumina RNA-Seq datasets.

  16. Rapid quantification of mutant fitness in diverse bacteria by sequencing randomly bar-coded transposons

    DOE PAGES

    Wetmore, Kelly M.; Price, Morgan N.; Waters, Robert J.; ...

    2015-05-12

    Transposon mutagenesis with next-generation sequencing (TnSeq) is a powerful approach to annotate gene function in bacteria, but existing protocols for TnSeq require laborious preparation of every sample before sequencing. Thus, the existing protocols are not amenable to the throughput necessary to identify phenotypes and functions for the majority of genes in diverse bacteria. Here, we present a method, random bar code transposon-site sequencing (RB-TnSeq), which increases the throughput of mutant fitness profiling by incorporating random DNA bar codes into Tn5 and mariner transposons and by using bar code sequencing (BarSeq) to assay mutant fitness. RB-TnSeq can be used with any transposon, and TnSeq is performed once per organism instead of once per sample. Each BarSeq assay requires only a simple PCR, and 48 to 96 samples can be sequenced on one lane of an Illumina HiSeq system. We demonstrate the reproducibility and biological significance of RB-TnSeq with Escherichia coli, Phaeobacter inhibens, Pseudomonas stutzeri, Shewanella amazonensis, and Shewanella oneidensis. To demonstrate the increased throughput of RB-TnSeq, we performed 387 successful genome-wide mutant fitness assays representing 130 different bacterium-carbon source combinations and identified 5,196 genes with significant phenotypes across the five bacteria. In P. inhibens, we used our mutant fitness data to identify genes important for the utilization of diverse carbon substrates, including a putative D-mannose isomerase that is required for mannitol catabolism. RB-TnSeq will enable the cost-effective functional annotation of diverse bacteria using mutant fitness profiling. A large challenge in microbiology is the functional assessment of the millions of uncharacterized genes identified by genome sequencing. Transposon mutagenesis coupled to next-generation sequencing (TnSeq) is a powerful approach to assign phenotypes and functions to genes. However, the current strategies for TnSeq are too laborious to be applied to hundreds of experimental conditions across multiple bacteria. Here, we describe an approach, random bar code transposon-site sequencing (RB-TnSeq), which greatly simplifies the measurement of gene fitness by using bar code sequencing (BarSeq) to monitor the abundance of mutants. We performed 387 genome-wide fitness assays across five bacteria and identified phenotypes for over 5,000 genes. RB-TnSeq can be applied to diverse bacteria and is a powerful tool to annotate uncharacterized genes using phenotype data.

  17. Rapid quantification of mutant fitness in diverse bacteria by sequencing randomly bar-coded transposons

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Wetmore, Kelly M.; Price, Morgan N.; Waters, Robert J.

    Transposon mutagenesis with next-generation sequencing (TnSeq) is a powerful approach to annotate gene function in bacteria, but existing protocols for TnSeq require laborious preparation of every sample before sequencing. Thus, the existing protocols are not amenable to the throughput necessary to identify phenotypes and functions for the majority of genes in diverse bacteria. Here, we present a method, random bar code transposon-site sequencing (RB-TnSeq), which increases the throughput of mutant fitness profiling by incorporating random DNA bar codes into Tn5 and mariner transposons and by using bar code sequencing (BarSeq) to assay mutant fitness. RB-TnSeq can be used with any transposon, and TnSeq is performed once per organism instead of once per sample. Each BarSeq assay requires only a simple PCR, and 48 to 96 samples can be sequenced on one lane of an Illumina HiSeq system. We demonstrate the reproducibility and biological significance of RB-TnSeq with Escherichia coli, Phaeobacter inhibens, Pseudomonas stutzeri, Shewanella amazonensis, and Shewanella oneidensis. To demonstrate the increased throughput of RB-TnSeq, we performed 387 successful genome-wide mutant fitness assays representing 130 different bacterium-carbon source combinations and identified 5,196 genes with significant phenotypes across the five bacteria. In P. inhibens, we used our mutant fitness data to identify genes important for the utilization of diverse carbon substrates, including a putative D-mannose isomerase that is required for mannitol catabolism. RB-TnSeq will enable the cost-effective functional annotation of diverse bacteria using mutant fitness profiling. A large challenge in microbiology is the functional assessment of the millions of uncharacterized genes identified by genome sequencing. Transposon mutagenesis coupled to next-generation sequencing (TnSeq) is a powerful approach to assign phenotypes and functions to genes. However, the current strategies for TnSeq are too laborious to be applied to hundreds of experimental conditions across multiple bacteria. Here, we describe an approach, random bar code transposon-site sequencing (RB-TnSeq), which greatly simplifies the measurement of gene fitness by using bar code sequencing (BarSeq) to monitor the abundance of mutants. We performed 387 genome-wide fitness assays across five bacteria and identified phenotypes for over 5,000 genes. RB-TnSeq can be applied to diverse bacteria and is a powerful tool to annotate uncharacterized genes using phenotype data.

  18. SpliceSeq: a resource for analysis and visualization of RNA-Seq data on alternative splicing and its functional impacts

    PubMed Central

    Ryan, Michael C.; Cleland, James; Kim, RyangGuk; Wong, Wing Chung; Weinstein, John N.

    2012-01-01

    Summary: SpliceSeq is a resource for RNA-Seq data that provides a clear view of alternative splicing and identifies potential functional changes that result from splice variation. It displays intuitive visualizations and prioritized lists of results that highlight splicing events and their biological consequences. SpliceSeq unambiguously aligns reads to gene splice graphs, facilitating accurate analysis of large, complex transcript variants that cannot be adequately represented in other formats. Availability and implementation: SpliceSeq is freely available at http://bioinformatics.mdanderson.org/main/SpliceSeq:Overview. The application is a Java program that can be launched via a browser or installed locally. Local installation requires MySQL and Bowtie. Contact: mryan@insilico.us.com Supplementary Information: Supplementary data are available at Bioinformatics online. PMID:22820202

  19. Rapid Quantification of Mutant Fitness in Diverse Bacteria by Sequencing Randomly Bar-Coded Transposons

    PubMed Central

    Wetmore, Kelly M.; Price, Morgan N.; Waters, Robert J.; Lamson, Jacob S.; He, Jennifer; Hoover, Cindi A.; Blow, Matthew J.; Bristow, James; Butland, Gareth

    2015-01-01

    ABSTRACT Transposon mutagenesis with next-generation sequencing (TnSeq) is a powerful approach to annotate gene function in bacteria, but existing protocols for TnSeq require laborious preparation of every sample before sequencing. Thus, the existing protocols are not amenable to the throughput necessary to identify phenotypes and functions for the majority of genes in diverse bacteria. Here, we present a method, random bar code transposon-site sequencing (RB-TnSeq), which increases the throughput of mutant fitness profiling by incorporating random DNA bar codes into Tn5 and mariner transposons and by using bar code sequencing (BarSeq) to assay mutant fitness. RB-TnSeq can be used with any transposon, and TnSeq is performed once per organism instead of once per sample. Each BarSeq assay requires only a simple PCR, and 48 to 96 samples can be sequenced on one lane of an Illumina HiSeq system. We demonstrate the reproducibility and biological significance of RB-TnSeq with Escherichia coli, Phaeobacter inhibens, Pseudomonas stutzeri, Shewanella amazonensis, and Shewanella oneidensis. To demonstrate the increased throughput of RB-TnSeq, we performed 387 successful genome-wide mutant fitness assays representing 130 different bacterium-carbon source combinations and identified 5,196 genes with significant phenotypes across the five bacteria. In P. inhibens, we used our mutant fitness data to identify genes important for the utilization of diverse carbon substrates, including a putative d-mannose isomerase that is required for mannitol catabolism. RB-TnSeq will enable the cost-effective functional annotation of diverse bacteria using mutant fitness profiling. PMID:25968644

  20. Indel detection from DNA and RNA sequencing data with transIndel.

    PubMed

    Yang, Rendong; Van Etten, Jamie L; Dehm, Scott M

    2018-04-19

    Insertions and deletions (indels) are a major class of genomic variation associated with human disease. Indels are primarily detected from DNA sequencing (DNA-seq) data but their transcriptional consequences remain unexplored due to challenges in discriminating medium-sized and large indels from splicing events in RNA-seq data. Here, we developed transIndel, a splice-aware algorithm that parses the chimeric alignments predicted by a short read aligner and reconstructs the mid-sized insertions and large deletions based on the linear alignments of split reads from DNA-seq or RNA-seq data. TransIndel exhibits competitive or superior performance over eight state-of-the-art indel detection tools on benchmarks using both synthetic and real DNA-seq data. Additionally, we applied transIndel to DNA-seq and RNA-seq datasets from 333 primary prostate cancer patients from The Cancer Genome Atlas (TCGA) and 59 metastatic prostate cancer patients from AACR-PCF Stand-Up-To-Cancer (SU2C) studies. TransIndel enhanced the taxonomy of DNA- and RNA-level alterations in prostate cancer by identifying recurrent FOXA1 indels as well as exitron splicing in genes implicated in disease progression. Our study demonstrates that transIndel is a robust tool for elucidation of medium- and large-sized indels from DNA-seq and RNA-seq data. Including RNA-seq in indel discovery efforts leads to significant improvements in sensitivity for identification of medium-sized and large indels missed by DNA-seq, and reveals non-canonical RNA-splicing events in genes associated with disease pathology.

  1. Seq-Well: portable, low-cost RNA sequencing of single cells at high throughput.

    PubMed

    Gierahn, Todd M; Wadsworth, Marc H; Hughes, Travis K; Bryson, Bryan D; Butler, Andrew; Satija, Rahul; Fortune, Sarah; Love, J Christopher; Shalek, Alex K

    2017-04-01

    Single-cell RNA-seq can precisely resolve cellular states, but applying this method to low-input samples is challenging. Here, we present Seq-Well, a portable, low-cost platform for massively parallel single-cell RNA-seq. Barcoded mRNA capture beads and single cells are sealed in an array of subnanoliter wells using a semipermeable membrane, enabling efficient cell lysis and transcript capture. We use Seq-Well to profile thousands of primary human macrophages exposed to Mycobacterium tuberculosis.

  2. ChIP-seq and RNA-seq methods to study circadian control of transcription in mammals

    PubMed Central

    Takahashi, Joseph S.; Kumar, Vivek; Nakashe, Prachi; Koike, Nobuya; Huang, Hung-Chung; Green, Carla B.; Kim, Tae-Kyung

    2015-01-01

    Genome-wide analyses have revolutionized our ability to study the transcriptional regulation of circadian rhythms. The advent of next-generation sequencing methods has facilitated the use of two such technologies, ChIP-seq and RNA-seq. In this chapter, we describe detailed methods and protocols for these two techniques, with emphasis on their usage in circadian rhythm experiments in the mouse liver, a major target organ of the circadian clock system. Critical factors for these methods are highlighted and issues arising with time series samples for ChIP-seq and RNA-seq are discussed. Finally detailed protocols for library preparation suitable for Illumina sequencing platforms are presented. PMID:25662462

  3. MetaRNA-Seq: An Interactive Tool to Browse and Annotate Metadata from RNA-Seq Studies.

    PubMed

    Kumar, Pankaj; Halama, Anna; Hayat, Shahina; Billing, Anja M; Gupta, Manish; Yousri, Noha A; Smith, Gregory M; Suhre, Karsten

    2015-01-01

    The number of RNA-Seq studies has grown in recent years. The design of RNA-Seq studies varies from very simple (e.g., two-condition case-control) to very complicated (e.g., time series involving multiple samples at each time point with separate drug treatments). Most of these publicly available RNA-Seq studies are deposited in NCBI databases, but their metadata are scattered throughout four different databases: Sequence Read Archive (SRA), Biosample, Bioprojects, and Gene Expression Omnibus (GEO). Although the NCBI web interface is able to provide all of the metadata information, it often requires significant effort to retrieve study- or project-level information by traversing through multiple hyperlinks and going to another page. Moreover, project- and study-level metadata lack manual or automatic curation by categories, such as disease type, time series, case-control, or replicate type, which are vital to comprehending any RNA-Seq study. Here we describe "MetaRNA-Seq," a new tool for interactively browsing, searching, and annotating RNA-Seq metadata with the capability of semiautomatic curation at the study level.

  4. GWIPS-viz: development of a ribo-seq genome browser

    PubMed Central

    Michel, Audrey M.; Fox, Gearoid; M. Kiran, Anmol; De Bo, Christof; O’Connor, Patrick B. F.; Heaphy, Stephen M.; Mullan, James P. A.; Donohue, Claire A.; Higgins, Desmond G.; Baranov, Pavel V.

    2014-01-01

    We describe the development of GWIPS-viz (http://gwips.ucc.ie), an online genome browser for viewing ribosome profiling data. Ribosome profiling (ribo-seq) is a recently developed technique that provides genome-wide information on protein synthesis (GWIPS) in vivo. It is based on the deep sequencing of ribosome-protected messenger RNA (mRNA) fragments, which allows the ribosome density along all mRNA transcripts present in the cell to be quantified. Since its inception, ribo-seq has been carried out in a number of eukaryotic and prokaryotic organisms. Owing to the increasing interest in ribo-seq, there is a pertinent demand for a dedicated ribo-seq genome browser. GWIPS-viz is based on The University of California Santa Cruz (UCSC) Genome Browser. Ribo-seq tracks, coupled with mRNA-seq tracks, are currently available for several genomes: human, mouse, zebrafish, nematode, yeast, bacteria (Escherichia coli K12, Bacillus subtilis), human cytomegalovirus and bacteriophage lambda. Our objective is to continue incorporating published ribo-seq data sets so that the wider community can readily view ribosome profiling information from multiple studies without the need to carry out computational processing. PMID:24185699

  5. Impact of artifact removal on ChIP quality metrics in ChIP-seq and ChIP-exo data

    PubMed Central

    Carroll, Thomas S.; Liang, Ziwei; Salama, Rafik; Stark, Rory; de Santiago, Ines

    2014-01-01

    With the advent of ChIP-seq multiplexing technologies and the subsequent increase in ChIP-seq throughput, the development of working standards for the quality assessment of ChIP-seq studies has received significant attention. The ENCODE consortium's large scale analysis of transcription factor binding and epigenetic marks as well as concordant work on ChIP-seq by other laboratories has established a new generation of ChIP-seq quality control measures. The use of these metrics alongside common processing steps has however not been evaluated. In this study, we investigate the effects of blacklisting and removal of duplicated reads on established metrics of ChIP-seq quality and show that the interpretation of these metrics is highly dependent on the ChIP-seq preprocessing steps applied. Further to this we perform the first investigation of the use of these metrics for ChIP-exo data and make recommendations for the adaptation of the NSC statistic to allow for the assessment of ChIP-exo efficiency. PMID:24782889

  6. ChloroSeq, an optimized chloroplast RNA-Seq bioinformatic pipeline, reveals remodeling of the organellar transcriptome under heat stress

    DOE PAGES

    Castandet, Benoît; Hotto, Amber M.; Strickler, Susan R.; ...

    2016-07-06

    Although RNA-Seq has revolutionized transcript analysis, organellar transcriptomes are rarely assessed even when present in published datasets. Here, we describe the development and application of a rapid and convenient method, ChloroSeq, to delineate qualitative and quantitative features of chloroplast RNA metabolism from strand-specific RNA-Seq datasets, including processing, editing, splicing, and relative transcript abundance. The use of a single experiment to analyze systematically chloroplast transcript maturation and abundance is of particular interest due to frequent pleiotropic effects observed in mutants that affect chloroplast gene expression and/or photosynthesis. To illustrate its utility, ChloroSeq was applied to published RNA-Seq datasets derived from Arabidopsis thaliana grown under control and abiotic stress conditions, where the organellar transcriptome had not been examined. The most appreciable effects were found for heat stress, which induces a global reduction in splicing and editing efficiency, and leads to increased abundance of chloroplast transcripts, including genic, intergenic, and antisense transcripts. Moreover, by concomitantly analyzing nuclear transcripts that encode chloroplast gene expression regulators from the same libraries, we demonstrate the possibility of achieving a holistic understanding of the nucleus-organelle system. In conclusion, ChloroSeq thus represents a unique method for streamlining RNA-Seq data interpretation of the chloroplast transcriptome and its regulators.

  7. ChloroSeq, an optimized chloroplast RNA-Seq bioinformatic pipeline, reveals remodeling of the organellar transcriptome under heat stress

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Castandet, Benoît; Hotto, Amber M.; Strickler, Susan R.

    Although RNA-Seq has revolutionized transcript analysis, organellar transcriptomes are rarely assessed even when present in published datasets. Here, we describe the development and application of a rapid and convenient method, ChloroSeq, to delineate qualitative and quantitative features of chloroplast RNA metabolism from strand-specific RNA-Seq datasets, including processing, editing, splicing, and relative transcript abundance. The use of a single experiment to analyze systematically chloroplast transcript maturation and abundance is of particular interest due to frequent pleiotropic effects observed in mutants that affect chloroplast gene expression and/or photosynthesis. To illustrate its utility, ChloroSeq was applied to published RNA-Seq datasets derived from Arabidopsis thaliana grown under control and abiotic stress conditions, where the organellar transcriptome had not been examined. The most appreciable effects were found for heat stress, which induces a global reduction in splicing and editing efficiency, and leads to increased abundance of chloroplast transcripts, including genic, intergenic, and antisense transcripts. Moreover, by concomitantly analyzing nuclear transcripts that encode chloroplast gene expression regulators from the same libraries, we demonstrate the possibility of achieving a holistic understanding of the nucleus-organelle system. In conclusion, ChloroSeq thus represents a unique method for streamlining RNA-Seq data interpretation of the chloroplast transcriptome and its regulators.

  8. rMATS: robust and flexible detection of differential alternative splicing from replicate RNA-Seq data.

    PubMed

    Shen, Shihao; Park, Juw Won; Lu, Zhi-xiang; Lin, Lan; Henry, Michael D; Wu, Ying Nian; Zhou, Qing; Xing, Yi

    2014-12-23

    Ultra-deep RNA sequencing (RNA-Seq) has become a powerful approach for genome-wide analysis of pre-mRNA alternative splicing. We previously developed multivariate analysis of transcript splicing (MATS), a statistical method for detecting differential alternative splicing between two RNA-Seq samples. Here we describe a new statistical model and computer program, replicate MATS (rMATS), designed for detection of differential alternative splicing from replicate RNA-Seq data. rMATS uses a hierarchical model to simultaneously account for sampling uncertainty in individual replicates and variability among replicates. In addition to the analysis of unpaired replicates, rMATS also includes a model specifically designed for paired replicates between sample groups. The hypothesis-testing framework of rMATS is flexible and can assess the statistical significance over any user-defined magnitude of splicing change. The performance of rMATS is evaluated by the analysis of simulated and real RNA-Seq data. rMATS outperformed two existing methods for replicate RNA-Seq data in all simulation settings, and RT-PCR yielded a high validation rate (94%) in an RNA-Seq dataset of prostate cancer cell lines. Our data also provide guiding principles for designing RNA-Seq studies of alternative splicing. We demonstrate that it is essential to incorporate biological replicates in the study design. Of note, pooling RNAs or merging RNA-Seq data from multiple replicates is not an effective approach to account for variability, and the result is particularly sensitive to outliers. The rMATS source code is freely available at rnaseq-mats.sourceforge.net/. As the popularity of RNA-Seq continues to grow, we expect rMATS will be useful for studies of alternative splicing in diverse RNA-Seq projects.

  9. Comparison of RNA-seq and microarray-based models for clinical endpoint prediction.

    PubMed

    Zhang, Wenqian; Yu, Ying; Hertwig, Falk; Thierry-Mieg, Jean; Zhang, Wenwei; Thierry-Mieg, Danielle; Wang, Jian; Furlanello, Cesare; Devanarayan, Viswanath; Cheng, Jie; Deng, Youping; Hero, Barbara; Hong, Huixiao; Jia, Meiwen; Li, Li; Lin, Simon M; Nikolsky, Yuri; Oberthuer, André; Qing, Tao; Su, Zhenqiang; Volland, Ruth; Wang, Charles; Wang, May D; Ai, Junmei; Albanese, Davide; Asgharzadeh, Shahab; Avigad, Smadar; Bao, Wenjun; Bessarabova, Marina; Brilliant, Murray H; Brors, Benedikt; Chierici, Marco; Chu, Tzu-Ming; Zhang, Jibin; Grundy, Richard G; He, Min Max; Hebbring, Scott; Kaufman, Howard L; Lababidi, Samir; Lancashire, Lee J; Li, Yan; Lu, Xin X; Luo, Heng; Ma, Xiwen; Ning, Baitang; Noguera, Rosa; Peifer, Martin; Phan, John H; Roels, Frederik; Rosswog, Carolina; Shao, Susan; Shen, Jie; Theissen, Jessica; Tonini, Gian Paolo; Vandesompele, Jo; Wu, Po-Yen; Xiao, Wenzhong; Xu, Joshua; Xu, Weihong; Xuan, Jiekun; Yang, Yong; Ye, Zhan; Dong, Zirui; Zhang, Ke K; Yin, Ye; Zhao, Chen; Zheng, Yuanting; Wolfinger, Russell D; Shi, Tieliu; Malkas, Linda H; Berthold, Frank; Wang, Jun; Tong, Weida; Shi, Leming; Peng, Zhiyu; Fischer, Matthias

    2015-06-25

    Gene expression profiling is being widely applied in cancer research to identify biomarkers for clinical endpoint prediction. Since RNA-seq provides a powerful tool for transcriptome-based applications beyond the limitations of microarrays, we sought to systematically evaluate the performance of RNA-seq-based and microarray-based classifiers in this MAQC-III/SEQC study for clinical endpoint prediction using neuroblastoma as a model. We generate gene expression profiles from 498 primary neuroblastomas using both RNA-seq and 44 k microarrays. Characterization of the neuroblastoma transcriptome by RNA-seq reveals that more than 48,000 genes and 200,000 transcripts are being expressed in this malignancy. We also find that RNA-seq provides much more detailed information on specific transcript expression patterns in clinico-genetic neuroblastoma subgroups than microarrays. To systematically compare the power of RNA-seq and microarray-based models in predicting clinical endpoints, we divide the cohort randomly into training and validation sets and develop 360 predictive models on six clinical endpoints of varying predictability. Evaluation of factors potentially affecting model performances reveals that prediction accuracies are most strongly influenced by the nature of the clinical endpoint, whereas technological platforms (RNA-seq vs. microarrays), RNA-seq data analysis pipelines, and feature levels (gene vs. transcript vs. exon-junction level) do not significantly affect performances of the models. We demonstrate that RNA-seq outperforms microarrays in determining the transcriptomic characteristics of cancer, while RNA-seq and microarray-based models perform similarly in clinical endpoint prediction. Our findings may be valuable to guide future studies on the development of gene expression-based predictive models and their implementation in clinical practice.
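
    The random division of the cohort into training and validation sets described above can be sketched as follows. This is an illustration only: the abstract gives the cohort size (498) but not the split proportion, so the 50/50 fraction, the fixed seed, the function name, and the sample identifiers are all assumptions.

```python
import random


def split_cohort(sample_ids, train_fraction=0.5, seed=0):
    """Randomly partition sample IDs into (training, validation) lists.

    Assumption: train_fraction and seed are illustrative defaults; the
    abstract does not report the proportions used in the study.
    """
    ids = list(sample_ids)
    rng = random.Random(seed)  # fixed seed so the split is reproducible
    rng.shuffle(ids)
    cut = int(len(ids) * train_fraction)
    return ids[:cut], ids[cut:]


# Hypothetical identifiers for a cohort of 498 samples.
samples = [f"NB{i:03d}" for i in range(1, 499)]
train, valid = split_cohort(samples)
print(len(train), len(valid))
```

    Fixing the split before any model is fitted keeps the validation set untouched during model development, which is the point of holding it out.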

  10. A MBD-seq protocol for large-scale methylome-wide studies with (very) low amounts of DNA.

    PubMed

    Aberg, Karolina A; Chan, Robin F; Shabalin, Andrey A; Zhao, Min; Turecki, Gustavo; Staunstrup, Nicklas Heine; Starnawska, Anna; Mors, Ole; Xie, Lin Y; van den Oord, Edwin Jcg

    2017-09-01

    We recently showed that, after optimization, our methyl-CpG binding domain sequencing (MBD-seq) application approximates the methylome-wide coverage obtained with whole-genome bisulfite sequencing (WGB-seq), but at a cost that enables adequately powered large-scale association studies. A prior drawback of MBD-seq is the relatively large amount of genomic DNA (ideally >1 µg) required to obtain high-quality data. Biomaterials are typically expensive to collect, provide a finite amount of DNA, and may simply not yield sufficient starting material. The ability to use low amounts of DNA will increase the breadth and number of studies that can be conducted. Therefore, we further optimized the enrichment step. With this low starting material protocol, MBD-seq performed as well as, or better than, the protocol requiring ample starting material (>1 µg). Using only 15 ng of DNA as input, there is minimal loss in data quality, achieving 93% of the coverage of WGB-seq (with standard amounts of input DNA) at similar false-positive rates. Furthermore, across a large number of genomic features, the MBD-seq methylation profiles closely tracked those observed for WGB-seq with even slightly larger effect sizes. This suggests that MBD-seq provides similar information about the methylome and classifies methylation status somewhat more accurately. Performance decreases with <15 ng DNA as starting material but, even with as little as 5 ng, MBD-seq still achieves 90% of the coverage of WGB-seq with comparable genome-wide methylation profiles. Thus, the proposed protocol is an attractive option for adequately powered and cost-effective methylome-wide investigations using (very) low amounts of DNA.

  11. Discovering functional modules by topic modeling RNA-Seq based toxicogenomic data.

    PubMed

    Yu, Ke; Gong, Binsheng; Lee, Mikyung; Liu, Zhichao; Xu, Joshua; Perkins, Roger; Tong, Weida

    2014-09-15

    Toxicogenomics (TGx) endeavors to elucidate the underlying molecular mechanisms through exploring gene expression profiles in response to toxic substances. Recently, RNA-Seq has increasingly been regarded as a more powerful alternative to microarrays in TGx studies. However, realizing RNA-Seq's full potential requires novel approaches to extracting information from the complex TGx data. Considering read counts as the number of times a word occurs in a document, gene expression profiles from RNA-Seq are analogous to a word-by-document matrix used in text mining. Topic modeling, which aims to discover the latent structures in text corpora, would be helpful for exploring RNA-Seq-based TGx data. In this study, topic modeling was applied to a typical RNA-Seq-based TGx data set to discover hidden functional modules. The RNA-Seq-based gene expression profiles were transformed into "documents", on which latent Dirichlet allocation (LDA) was used to build a topic model. We found that samples treated with compounds with the same modes of action (MoAs) could be clustered based on topic similarities. The topic most relevant to each cluster was identified as a "marker" topic, which was interpreted by gene enrichment analysis with MoAs and then confirmed by compound and pathway associations mined from the literature. To further validate the "marker" topics, we tested topic transferability from RNA-Seq to microarrays. The RNA-Seq-based gene expression profile of a topic specifically associated with the peroxisome proliferator-activated receptor (PPAR) signaling pathway was used to query samples with similar expression profiles in two different microarray data sets, yielding an accuracy of about 85%. This proof-of-concept study demonstrates the applicability of topic modeling to discover functional modules in RNA-Seq data and suggests a valuable computational tool for leveraging information within TGx data in the RNA-Seq era.
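
    The analogy between read counts and word occurrences can be made concrete with a small sketch. It only shows the data transformation (expression profile to "document", and profiles to a document-by-gene count matrix); fitting the LDA model itself is not shown. The function names, sample labels, gene names, and counts are hypothetical, and the authors' actual preprocessing is not described in the abstract.

```python
from collections import Counter


def profile_to_document(read_counts):
    """Turn {gene: read count} into a bag of words: each gene repeated count times."""
    words = []
    for gene, count in sorted(read_counts.items()):
        words.extend([gene] * int(count))
    return words


def document_term_matrix(profiles):
    """Build a shared gene vocabulary and one row of counts per sample."""
    vocabulary = sorted({gene for profile in profiles.values() for gene in profile})
    matrix = {
        sample: [int(profile.get(gene, 0)) for gene in vocabulary]
        for sample, profile in profiles.items()
    }
    return vocabulary, matrix


# Hypothetical toy profiles; real data would have thousands of genes.
profiles = {
    "sample1": {"Cyp4a1": 3, "Acox1": 1},
    "sample2": {"Acox1": 2, "Gapdh": 4},
}

document = profile_to_document(profiles["sample1"])
vocabulary, matrix = document_term_matrix(profiles)
print(document)
print(vocabulary, matrix)
```

    Counting the words of the generated document with Counter recovers the original read counts, which is what makes the two representations interchangeable as input to a topic model.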

  12. Advantages of RNA-seq compared to RNA microarrays for transcriptome profiling of anterior cruciate ligament tears.

    PubMed

    Rai, Muhammad Farooq; Tycksen, Eric D; Sandell, Linda J; Brophy, Robert H

    2018-01-01

    Microarrays and RNA-seq are at the forefront of high throughput transcriptome analyses. Since these methodologies are based on different principles, there are concerns about the concordance of data between the two techniques. The concordance of RNA-seq and microarrays for genome-wide analysis of differential gene expression has not been rigorously assessed in clinically derived ligament tissues. To demonstrate the concordance between RNA-seq and microarrays and to assess potential benefits of RNA-seq over microarrays, we assessed differences in transcript expression in anterior cruciate ligament (ACL) tissues based on time-from-injury. ACL remnants were collected from patients with an ACL tear at the time of ACL reconstruction. RNA prepared from torn ACL remnants was subjected to Agilent microarrays (N = 24) and RNA-seq (N = 8). The correlation of biological replicates in RNA-seq and microarrays data was similar (0.98 vs. 0.97), demonstrating that each platform has high internal reproducibility. Correlations between the RNA-seq data and the individual microarrays were low, but correlations between the RNA-seq values and the geometric mean of the microarrays values were moderate. The cross-platform concordance for differentially expressed transcripts or enriched pathways was linearly correlated (r = 0.64). RNA-Seq was superior in detecting low abundance transcripts and differentiating biologically critical isoforms. Additional independent validation of transcript expression was undertaken using microfluidic PCR for selected genes. PCR data showed 100% concordance (in expression pattern) with RNA-seq and microarrays data. These findings demonstrate that RNA-seq has advantages over microarrays for transcriptome profiling of ligament tissues when available and affordable. Furthermore, these findings are likely transferable to other musculoskeletal tissues where tissue collection is challenging and cells are in low abundance. © 2017 Orthopaedic Research Society. 
Published by Wiley Periodicals, Inc. J Orthop Res 36:484-497, 2018.

  13. Discovery of common sequences absent in the human reference genome using pooled samples from next generation sequencing.

    PubMed

    Liu, Yu; Koyutürk, Mehmet; Maxwell, Sean; Xiang, Min; Veigl, Martina; Cooper, Richard S; Tayo, Bamidele O; Li, Li; LaFramboise, Thomas; Wang, Zhenghe; Zhu, Xiaofeng; Chance, Mark R

    2014-08-16

    Sequences up to several megabases in length have been found to be present in individual genomes but absent in the human reference genome. These sequences may be common in populations, and their absence in the reference genome may indicate rare variants in the genomes of individuals who served as donors for the human genome project. As the reference genome is used in probe design for microarray technology and mapping short reads in next generation sequencing (NGS), this missing sequence could be a source of bias in functional genomic studies and variant analysis. One End Anchor (OEA) and/or orphan reads from paired-end sequencing have been used to identify novel sequences that are absent in the reference genome. However, no study has investigated the distribution, evolution and functionality of those sequences in human populations. To systematically identify and study the missing common sequences (micSeqs), we extended the previous method by pooling OEA reads from a large number of individuals and applying strict filtering methods to remove false sequences. The pipeline was applied to data from phase 1 of the 1000 Genomes Project. We identified 309 micSeqs that are present in at least 1% of the human population, but absent in the reference genome. We confirmed 76% of these 309 micSeqs by comparison to other primate genomes, individual human genomes, and gene expression data. Furthermore, we randomly selected fifteen micSeqs and confirmed their presence using PCR validation in 38 additional individuals. Functional analysis using published RNA-seq and ChIP-seq data showed that eleven micSeqs are highly expressed in human brain and three micSeqs contain transcription factor (TF) binding regions, suggesting they are functional elements. 
In addition, the identified micSeqs are absent in non-primates and show dynamic acquisition during primate evolution culminating with most micSeqs being present in Africans, suggesting some micSeqs may be important sources of human diversity. 76% of micSeqs were confirmed by a comparative genomics approach. Fourteen micSeqs are expressed in human brain or contain TF binding regions. Some micSeqs are primate-specific, conserved and may play a role in the evolution of primates.

  14. Transcriptome profiling of the floral buds and discovery of genes related to sex-differentiation in the dioecious cucurbit Coccinia grandis (L.) Voigt.

    PubMed

    Mohanty, Jatindra Nath; Nayak, Sanghamitra; Jha, Sumita; Joshi, Raj Kumar

    2017-08-30

    Dioecious species offer an inclusive structure to study the molecular basis of sexual dimorphism in angiosperms. Despite having a small genome and heteromorphic sex chromosomes, Coccinia grandis is a highly neglected dioecious species with little information available on its physical state, genetic orientation and key sex-defining elements. In the present study, we performed RNA-Seq and DGE analysis of male (MB) and female (FB) buds in C. grandis to gain insights into the molecular basis of sex determination in this plant. De novo assembly of 75 million clean reads resulted in 72,479 unigenes for the male library and 63,308 unigenes for the female library with a mean length of 736 bp. 61,458 (85.57%) unigenes displayed significant similarity with protein sequences from publicly available databases. Comparative transcriptome analyses revealed 1410 unigenes as differentially expressed (DEGs) between MB and FB samples. A consistent correlation between the expression levels of DEGs was observed for the RNA-Seq pattern and qRT-PCR validation. Functional annotation showed high enrichment of DEGs involved in phytohormone biosynthesis, hormone signaling and transduction, transcriptional regulation and methyltransferase activity. High induction of hormone responsive genes such as ARF6, ACC synthase1, SNRK2 and BRI1-associated receptor kinase 1 (BAK1) suggests that multiple phytohormones and their signaling crosstalk play a crucial role in sex determination in this species. In addition, transcription factors such as zinc fingers, homeodomain leucine zippers and MYBs were identified as major determinants of male specific expression. Moreover, the detection of multiple DEGs as the miRNA target site implies that a small RNA mediated gene silencing cascade may also be regulating gender differentiation in C. grandis. 
Overall, the present transcriptome resources provide us a large number of DEGs involved in sex expression and could form the groundwork for unravelling the molecular mechanism of sex determination in C. grandis.

  15. Convergence of hepcidin deficiency, systemic iron overloading, heme accumulation, and REV-ERBα/β activation in aryl hydrocarbon receptor-elicited hepatotoxicity

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Fader, Kelly A.; Nault, Rance

    Persistent aryl hydrocarbon receptor (AhR) agonists elicit dose-dependent hepatic lipid accumulation, oxidative stress, inflammation, and fibrosis in mice. Iron (Fe) promotes AhR-mediated oxidative stress by catalyzing reactive oxygen species (ROS) production. To further characterize the role of Fe in AhR-mediated hepatotoxicity, male C57BL/6 mice were orally gavaged with sesame oil vehicle or 0.01–30 μg/kg 2,3,7,8-tetrachlorodibenzo-p-dioxin (TCDD) every 4 days for 28 days. Duodenal epithelial and hepatic RNA-Seq data were integrated with hepatic AhR ChIP-Seq, capillary electrophoresis protein measurements, and clinical chemistry analyses. TCDD dose-dependently repressed hepatic expression of hepcidin (Hamp and Hamp2), the master regulator of systemic Fe homeostasis, resulting in a 2.6-fold increase in serum Fe with accumulating Fe spilling into urine. Total hepatic Fe levels were negligibly increased while transferrin saturation remained unchanged. Furthermore, TCDD elicited dose-dependent gene expression changes in heme biosynthesis including the induction of aminolevulinic acid synthase 1 (Alas1) and repression of uroporphyrinogen decarboxylase (Urod), leading to a 50% increase in hepatic hemin and a 13.2-fold increase in total urinary porphyrins. Consistent with this heme accumulation, differential gene expression suggests that heme activated BACH1 and REV-ERBα/β, causing induction of heme oxygenase 1 (Hmox1) and repression of fatty acid biosynthesis, respectively. Collectively, these results suggest that Hamp repression, Fe accumulation, and increased heme levels converge to promote oxidative stress and the progression of TCDD-elicited hepatotoxicity.

    Highlights:
    • TCDD represses hepatic hepcidin expression, leading to systemic iron overloading.
    • Dysregulation of heme biosynthesis is consistent with heme and porphyrin accumulation.
    • Heme-activated REV-ERBα/β repress circadian-regulated hepatic lipid metabolism.
    • Disruption of iron homeostasis promotes TCDD-elicited steatohepatitis with fibrosis.

  16. Characterization of three active transposable elements recently inserted in three independent DFR-A alleles and one high-copy DNA transposon isolated from the Pink allele of the ANS gene in onion (Allium cepa L.).

    PubMed

    Kim, Sunggil; Park, Jee Young; Yang, Tae-Jin

    2015-06-01

    An intact retrotransposon and DNA transposons inserted in a single gene were characterized in onion (Allium cepa), and their transcription and copy numbers were estimated in this study. While analyzing diverse onion germplasm, large insertions in the DFR-A gene encoding dihydroflavonol 4-reductase (DFR) involved in the anthocyanin biosynthesis pathway were found in two accessions. A 5,070-bp long terminal repeat (LTR) retrotransposon inserted in the active DFR-A (R4) allele was identified from one of the large insertions and designated AcCOPIA1. An intact ORF encoded typical domains of copia-like LTR retrotransposons. However, AcCOPIA1 contained atypical 'TG' and 'TA' dinucleotides at the ends of the LTRs. A 4,615-bp DNA transposon was identified in the other large insertion. This DNA transposon, designated AcCACTA1, contained an ORF coding for a transposase showing homology with the CACTA superfamily transposable elements (TEs). Another 5,073-bp DNA transposon was identified from the DFR-A (TRN) allele. This DNA transposon, designated AchAT1, belonged to the hAT superfamily with short 4-bp terminal inverted repeats (TIRs). Finally, a 6,258-bp non-autonomous DNA transposon, designated AcPINK, was identified in the ANS-p allele encoding anthocyanidin synthase, the next downstream enzyme to DFR in the anthocyanin biosynthesis pathway. AcPINK also possessed very short 3-bp TIRs. Active transcription of AcCOPIA1, AcCACTA1, and AchAT1 was observed through RNA-Seq analysis and RT-PCR. The copy numbers of AcPINK, estimated by mapping the genomic DNA reads produced by NextSeq 500, were considerably higher than those of the other TEs. Several lines of evidence indicated that these TEs might have transposed in these onion genes very recently, providing a stepping stone for elucidating the structure of the enormously large onion genome.

  17. Dissecting Tissue-Specific Transcriptomic Responses from Leaf and Roots under Salt Stress in Petunia hybrida Mitchell

    PubMed Central

    Villarino, Gonzalo H.; Hu, Qiwen; Scanlon, Michael J.; Mueller, Lukas; Mattson, Neil S.

    2017-01-01

    One of the primary objectives of plant biotechnology is to increase resistance to abiotic stresses, such as salinity. Salinity is a major abiotic stress, and increasing crop resistance to salt remains a major challenge. Salt stress disturbs the cellular environment, leading to protein misfolding, affecting normal plant growth and causing agricultural losses worldwide. The advent of state-of-the-art technologies such as high-throughput mRNA sequencing (RNA-seq) has revolutionized whole-transcriptome analysis by allowing changes in gene expression to be measured with high precision. In this work, we used tissue-specific RNA-seq to gain insight into the Petunia hybrida transcriptional responses under NaCl stress using a controlled hydroponic system. Root and leaf samples were taken over a continuum of 48 h of acute 150 mM NaCl. This analysis revealed a set of tissue- and time point-specific differentially expressed genes, such as genes related to transport, signal transduction, ion homeostasis as well as novel and undescribed genes, such as Peaxi162Scf00003g04130 and Peaxi162Scf00589g00323 expressed only in roots under salt stress. In this work, we identified early and late expressed genes in response to salt stress while providing a core of differentially expressed genes across all time points and tissues, including the trehalose-6-phosphate synthase 1 (TPS1), a glycosyltransferase reported in salt tolerance in other species. To test the function of the novel petunia TPS1 allele, we cloned it and showed that TPS1 is a functional plant gene capable of complementing the trehalose biosynthesis pathway in a yeast tps1 mutant. The list of candidate genes to enhance salt tolerance provided in this work constitutes a major effort to better understand the detrimental effects of salinity in petunia with direct implications for other economically important Solanaceous species. PMID:28771200

  18. Deciphering the Cryptic Genome: Genome-wide Analyses of the Rice Pathogen Fusarium fujikuroi Reveal Complex Regulation of Secondary Metabolism and Novel Metabolites

    PubMed Central

    Studt, Lena; Niehaus, Eva-Maria; Espino, Jose J.; Huß, Kathleen; Michielse, Caroline B.; Albermann, Sabine; Wagner, Dominik; Bergner, Sonja V.; Connolly, Lanelle R.; Fischer, Andreas; Reuter, Gunter; Kleigrewe, Karin; Bald, Till; Wingfield, Brenda D.; Ophir, Ron; Freeman, Stanley; Hippler, Michael; Smith, Kristina M.; Brown, Daren W.; Proctor, Robert H.; Münsterkötter, Martin; Freitag, Michael; Humpf, Hans-Ulrich; Güldener, Ulrich; Tudzynski, Bettina

    2013-01-01

    The fungus Fusarium fujikuroi causes “bakanae” disease of rice due to its ability to produce gibberellins (GAs), but it is also known for producing harmful mycotoxins. However, the genetic capacity for the whole arsenal of natural compounds and their role in the fungus' interaction with rice remained unknown. Here, we present a high-quality genome sequence of F. fujikuroi that was assembled into 12 scaffolds corresponding to the 12 chromosomes described for the fungus. We used the genome sequence along with ChIP-seq, transcriptome, proteome, and HPLC-FTMS-based metabolome analyses to identify the potential secondary metabolite biosynthetic gene clusters and to examine their regulation in response to nitrogen availability and plant signals. The results indicate that expression of most but not all gene clusters correlates with proteome and ChIP-seq data. Comparison of the F. fujikuroi genome to those of six other fusaria revealed that only a small number of gene clusters are conserved among these species, thus providing new insights into the divergence of secondary metabolism in the genus Fusarium. Notably, GA biosynthetic genes are present in some related species, but GA biosynthesis is limited to F. fujikuroi, suggesting that this provides a selective advantage during infection of the preferred host plant rice. Among the genome sequences analyzed, one cluster that includes a polyketide synthase gene (PKS19) and another that includes a non-ribosomal peptide synthetase gene (NRPS31) are unique to F. fujikuroi. The metabolites derived from these clusters were identified by HPLC-FTMS-based analyses of engineered F. fujikuroi strains overexpressing cluster genes. In planta expression studies suggest a specific role for the PKS19-derived product during rice infection. Thus, our results indicate that combined comparative genomics and genome-wide experimental analyses identified novel genes and secondary metabolites that contribute to the evolutionary success of F. fujikuroi as a rice pathogen. PMID:23825955

  19. IDENTIFICATION OF A NOVEL CLASS OF ANTI-INFLAMMATORY COMPOUNDS WITH ANTI-TUMOR ACTIVITY IN COLORECTAL AND LUNG CANCERS

    PubMed Central

    Chang, Hui-Hua; Song, Zuohe; Wisner, Lee; Tripp, Tina; Gokhale, Vijay

    2011-01-01

    Chronic inflammation is associated with 25% of all cancers. In the inflammation-cancer axis, prostaglandin E2 (PGE2) is one of the major players. PGE2 synthases (PGES) are the enzymes downstream of the cyclooxygenases (COXs) in the PGE2 biosynthesis pathway. Microsomal prostaglandin E2 synthase 1 (mPGES-1) is inducible by pro-inflammatory stimuli and constitutively expressed in a variety of cancers. The potential role for this enzyme in tumorigenesis has been reported and mPGES-1 represents a novel therapeutic target for cancers. In order to identify novel small molecule inhibitors of mPGES-1, we screened the ChemBridge library and identified 13 compounds as potential hits. These compounds were tested for their ability to bind directly to the enzyme using surface plasmon resonance spectroscopy and to decrease cytokine-stimulated PGE2 production in various cancer cell lines. We demonstrate that the compound PGE0001 (ChemBridge ID number 5654455) binds to human mPGES-1 recombinant protein with good affinity (KD = 21.3 ± 7.8 μM). PGE0001 reduces IL-1β-induced PGE2 release in human HCA-7 colon and A549 lung cancer cell lines with EC50 in the submicromolar range. Although PGE0001 may have alternative targets based on the results from in vitro assays, it shows promising effects in vivo. PGE0001 exhibits significant anti-tumor activity in SW837 rectum and A549 lung cancer xenografts in SCID mice. Single injection i.p. of PGE0001 at 100 mg/kg decreases serum PGE2 levels in mice within 5 h. In summary, our data suggest that the identified compound PGE0001 exerts anti-tumor activity via the inhibition of the PGE2 synthesis pathway. PMID:21931968

  20. RETRACTED: Association of the ACE I/D gene polymorphism with sepsis susceptibility and sepsis progression.

    PubMed

    Yang, Chun-Hua; Zhou, Tian-Biao

    2015-12-01

    This article has been included in a multiple retraction: Chun-Hua Yang and Tian-Biao Zhou Association of the ACE I/D gene polymorphism with sepsis susceptibility and sepsis progression Journal of Renin-Angiotensin-Aldosterone System 1470320314568521, first published on February 3, 2015 doi: 10.1177/1470320314568521 This article has been retracted at the request of the Editors and the Publisher. After conducting a thorough investigation, SAGE found that the submitting authors of a number of papers published in the Journal of the Renin-Angiotensin Aldosterone System (JRAAS) (listed below) had supplied fabricated contact details for their nominated reviewers. The Editors accepted these papers based on the reports supplied by the individuals using these fake reviewer email accounts. After concluding that the peer review process was therefore seriously compromised, SAGE and the journal Editors have decided to retract all affected articles. Online First articles (these articles will not be published in an issue) Wenzhuang Tang, Tian-Biao Zhou, and Zongpei Jiang Association of the angiotensinogen M235T gene polymorphism with risk of diabetes mellitus developing into diabetic nephropathy Journal of Renin-Angiotensin-Aldosterone System 1470320314563426, first published on December 18, 2014 doi: 10.1177/1470320314563426 Tian-Biao Zhou, Hong-Yan Li, Zong-Pei Jiang, Jia-Fan Zhou, Miao-Fang Huang, and Zhi-Yang Zhou Role of renin-angiotensin-aldosterone system inhibitors in radiation nephropathy Journal of Renin-Angiotensin-Aldosterone System 1470320314563424, first published on December 18, 2014 doi: 10.1177/1470320314563424 Weiqiang Zhong, Zongpei Jiang, and Tian-Biao Zhou Association between the ACE I/D gene polymorphism and T2DN susceptibility: The risk of T2DM developing into T2DN in the Asian population Journal of Renin-Angiotensin-Aldosterone System 1470320314566019, first published on January 26, 2015 doi: 10.1177/1470320314566019 Tian-Biao Zhou, Xue-Feng Guo, Zongpei Jiang, and Hong-Yan Li Relationship between the ACE I/D gene polymorphism and T1DN susceptibility/risk of T1DM developing into T1DN in the Caucasian population Journal of Renin-Angiotensin-Aldosterone System 1470320314563425, first published on February 1, 2015 doi: 10.1177/1470320314563425 Chun-Hua Yang and Tian-Biao Zhou Relationship between the angiotensinogen A1166C gene polymorphism and the risk of diabetes mellitus developing into diabetic nephropathy Journal of Renin-Angiotensin-Aldosterone System 1470320314566221, first published on February 1, 2015 doi: 10.1177/1470320314566221 Chun-Hua Yang and Tian-Biao Zhou Association of the ACE I/D gene polymorphism with sepsis susceptibility and sepsis progression Journal of Renin-Angiotensin-Aldosterone System 1470320314568521, first published on February 3, 2015 doi: 10.1177/1470320314568521 Articles published in an issue Guohui Liu, Tian-Biao Zhou, Zongpei Jiang, and Dongwen Zheng Association of ACE I/D gene polymorphism with T2DN susceptibility and the risk of T2DM developing into T2DN in a Caucasian population Journal of Renin-Angiotensin-Aldosterone System March 2015 16: 165-171, first published on November 14, 2014 doi: 10.1177/1470320314557849 Weiqiang Zhong, Zhongliang Huang, Yong Wu, Zongpei Jiang, and Tian-Biao Zhou Association of aldosterone synthase (CYP11B2) gene polymorphism with IgA nephropathy risk and progression of IgA nephropathy Journal of Renin-Angiotensin-Aldosterone System September 2015 16: 660-665, first published on August 20, 2014 doi: 10.1177/1470320314524011.

  1. RETRACTED: Relationship between the ACE I/D gene polymorphism and T1DN susceptibility/risk of T1DM developing into T1DN in the Caucasian population.

    PubMed

    Zhou, Tian-Biao; Guo, Xue-Feng; Jiang, Zongpei; Li, Hong-Yan

    2015-12-01

    The following article has been included in a multiple retraction: Tian-Biao Zhou, Xue-Feng Guo, Zongpei Jiang, and Hong-Yan Li Relationship between the ACE I/D gene polymorphism and T1DN susceptibility/risk of T1DM developing into T1DN in the Caucasian population Journal of Renin-Angiotensin-Aldosterone System 1470320314563425, first published on February 1, 2015 doi: 10.1177/1470320314563425 This article has been retracted at the request of the Editors and the Publisher. After conducting a thorough investigation, SAGE found that the submitting authors of a number of papers published in the Journal of the Renin-Angiotensin Aldosterone System (JRAAS) (listed below) had supplied fabricated contact details for their nominated reviewers. The Editors accepted these papers based on the reports supplied by the individuals using these fake reviewer email accounts. After concluding that the peer review process was therefore seriously compromised, SAGE and the journal Editors have decided to retract all affected articles. Online First articles (these articles will not be published in an issue) Wenzhuang Tang, Tian-Biao Zhou, and Zongpei Jiang Association of the angiotensinogen M235T gene polymorphism with risk of diabetes mellitus developing into diabetic nephropathy Journal of Renin-Angiotensin-Aldosterone System 1470320314563426, first published on December 18, 2014 doi: 10.1177/1470320314563426 Tian-Biao Zhou, Hong-Yan Li, Zong-Pei Jiang, Jia-Fan Zhou, Miao-Fang Huang, and Zhi-Yang Zhou Role of renin-angiotensin-aldosterone system inhibitors in radiation nephropathy Journal of Renin-Angiotensin-Aldosterone System 1470320314563424, first published on December 18, 2014 doi: 10.1177/1470320314563424 Weiqiang Zhong, Zongpei Jiang, and Tian-Biao Zhou Association between the ACE I/D gene polymorphism and T2DN susceptibility: The risk of T2DM developing into T2DN in the Asian population Journal of Renin-Angiotensin-Aldosterone System 1470320314566019, first published on January 26, 2015 doi: 10.1177/1470320314566019 Tian-Biao Zhou, Xue-Feng Guo, Zongpei Jiang, and Hong-Yan Li Relationship between the ACE I/D gene polymorphism and T1DN susceptibility/risk of T1DM developing into T1DN in the Caucasian population Journal of Renin-Angiotensin-Aldosterone System 1470320314563425, first published on February 1, 2015 doi: 10.1177/1470320314563425 Chun-Hua Yang and Tian-Biao Zhou Relationship between the angiotensinogen A1166C gene polymorphism and the risk of diabetes mellitus developing into diabetic nephropathy Journal of Renin-Angiotensin-Aldosterone System 1470320314566221, first published on February 1, 2015 doi: 10.1177/1470320314566221 Chun-Hua Yang and Tian-Biao Zhou Association of the ACE I/D gene polymorphism with sepsis susceptibility and sepsis progression Journal of Renin-Angiotensin-Aldosterone System 1470320314568521, first published on February 3, 2015 doi: 10.1177/1470320314568521 Articles published in an issue Guohui Liu, Tian-Biao Zhou, Zongpei Jiang, and Dongwen Zheng Association of ACE I/D gene polymorphism with T2DN susceptibility and the risk of T2DM developing into T2DN in a Caucasian population Journal of Renin-Angiotensin-Aldosterone System March 2015 16: 165-171, first published on November 14, 2014 doi: 10.1177/1470320314557849 Weiqiang Zhong, Zhongliang Huang, Yong Wu, Zongpei Jiang, and Tian-Biao Zhou Association of aldosterone synthase (CYP11B2) gene polymorphism with IgA nephropathy risk and progression of IgA nephropathy Journal of Renin-Angiotensin-Aldosterone System September 2015 16: 660-665, first published on August 20, 2014 doi: 10.1177/1470320314524011.

  2. RETRACTED: Association between the ACE I/D gene polymorphism and T2DN susceptibility: The risk of T2DM developing into T2DN in the Asian population.

    PubMed

    Zhong, Weiqiang; Jiang, Zongpei; Zhou, Tian-Biao

    2015-12-01

    This article has been included in a multiple retraction: Weiqiang Zhong, Zongpei Jiang, and Tian-Biao Zhou Association between the ACE I/D gene polymorphism and T2DN susceptibility: The risk of T2DM developing into T2DN in the Asian population Journal of Renin-Angiotensin-Aldosterone System 1470320314566019, first published on January 26, 2015 doi: 10.1177/1470320314566019 This article has been retracted at the request of the Editors and the Publisher. After conducting a thorough investigation, SAGE found that the submitting authors of a number of papers published in the Journal of the Renin-Angiotensin Aldosterone System (JRAAS) (listed below) had supplied fabricated contact details for their nominated reviewers. The Editors accepted these papers based on the reports supplied by the individuals using these fake reviewer email accounts. After concluding that the peer review process was therefore seriously compromised, SAGE and the journal Editors have decided to retract all affected articles. Online First articles (these articles will not be published in an issue) Wenzhuang Tang, Tian-Biao Zhou, and Zongpei Jiang Association of the angiotensinogen M235T gene polymorphism with risk of diabetes mellitus developing into diabetic nephropathy Journal of Renin-Angiotensin-Aldosterone System 1470320314563426, first published on December 18, 2014 doi: 10.1177/1470320314563426 Tian-Biao Zhou, Hong-Yan Li, Zong-Pei Jiang, Jia-Fan Zhou, Miao-Fang Huang, and Zhi-Yang Zhou Role of renin-angiotensin-aldosterone system inhibitors in radiation nephropathy Journal of Renin-Angiotensin-Aldosterone System 1470320314563424, first published on December 18, 2014 doi: 10.1177/1470320314563424 Weiqiang Zhong, Zongpei Jiang, and Tian-Biao Zhou Association between the ACE I/D gene polymorphism and T2DN susceptibility: The risk of T2DM developing into T2DN in the Asian population Journal of Renin-Angiotensin-Aldosterone System 1470320314566019, first published on January 26, 2015 doi: 10.1177/1470320314566019 Tian-Biao Zhou, Xue-Feng Guo, Zongpei Jiang, and Hong-Yan Li Relationship between the ACE I/D gene polymorphism and T1DN susceptibility/risk of T1DM developing into T1DN in the Caucasian population Journal of Renin-Angiotensin-Aldosterone System 1470320314563425, first published on February 1, 2015 doi: 10.1177/1470320314563425 Chun-Hua Yang and Tian-Biao Zhou Relationship between the angiotensinogen A1166C gene polymorphism and the risk of diabetes mellitus developing into diabetic nephropathy Journal of Renin-Angiotensin-Aldosterone System 1470320314566221, first published on February 1, 2015 doi: 10.1177/1470320314566221 Chun-Hua Yang and Tian-Biao Zhou Association of the ACE I/D gene polymorphism with sepsis susceptibility and sepsis progression Journal of Renin-Angiotensin-Aldosterone System 1470320314568521, first published on February 3, 2015 doi: 10.1177/1470320314568521 Articles published in an issue Guohui Liu, Tian-Biao Zhou, Zongpei Jiang, and Dongwen Zheng Association of ACE I/D gene polymorphism with T2DN susceptibility and the risk of T2DM developing into T2DN in a Caucasian population Journal of Renin-Angiotensin-Aldosterone System March 2015 16: 165-171, first published on November 14, 2014 doi: 10.1177/1470320314557849 Weiqiang Zhong, Zhongliang Huang, Yong Wu, Zongpei Jiang, and Tian-Biao Zhou Association of aldosterone synthase (CYP11B2) gene polymorphism with IgA nephropathy risk and progression of IgA nephropathy Journal of Renin-Angiotensin-Aldosterone System September 2015 16: 660-665, first published on August 20, 2014 doi: 10.1177/1470320314524011.

  3. Molecular Keys to the Janthinobacterium and Duganella spp. Interaction with the Plant Pathogen Fusarium graminearum

    PubMed Central

    Haack, Frederike S.; Poehlein, Anja; Kröger, Cathrin; Voigt, Christian A.; Piepenbring, Meike; Bode, Helge B.; Daniel, Rolf; Schäfer, Wilhelm; Streit, Wolfgang R.

    2016-01-01

    Janthinobacterium and Duganella are well-known for their antifungal effects. Surprisingly, almost nothing is known on molecular aspects involved in the close bacterium-fungus interaction. To better understand this interaction, we established the genomes of 11 Janthinobacterium and Duganella isolates in combination with phylogenetic and functional analyses of all publicly available genomes. Thereby, we identified a core and pan genome of 1058 and 23,628 genes. All strains encoded secondary metabolite gene clusters and chitinases, both possibly involved in fungal growth suppression. All but one strain carried a single gene cluster involved in the biosynthesis of alpha-hydroxyketone-like autoinducer molecules, designated JAI-1. Genome-wide RNA-seq studies employing the background of two isolates and the corresponding JAI-1 deficient strains identified a set of 45 QS-regulated genes in both isolates. Most regulated genes are characterized by a conserved sequence motif within the promoter region. Among the most strongly regulated genes were secondary metabolite and type VI secretion system gene clusters. Most intriguing, co-incubation studies of J. sp. HH102 or its corresponding JAI-1 synthase deletion mutant with the plant pathogen Fusarium graminearum provided first evidence of a QS-dependent interaction with this pathogen. PMID:27833590

  4. Transcriptome Analysis of Manganese-deficient Chlamydomonas reinhardtii Provides Insight on the Chlorophyll Biosynthesis Pathway

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Lockhart, Ainsley; Zvenigorodsky, Natasha; Pedraza, Mary Ann

    2011-08-11

    The biosynthesis of chlorophyll and other tetrapyrroles is a vital but poorly understood process. Recent genomic advances with the unicellular green algae Chlamydomonas reinhardtii have created opportunity to more closely examine the mechanisms of the chlorophyll biosynthesis pathway via transcriptome analysis. Manganese is a nutrient of interest for complex reactions because of its multiple stable oxidation states and role in molecular oxygen coordination. C. reinhardtii was cultured in Manganese-deplete Tris-acetate-phosphate (TAP) media for 24 hours and used to create cDNA libraries for sequencing using Illumina TruSeq technology. Transcriptome analysis provided intriguing insight on possible regulatory mechanisms in the pathway. Evidence supports similarities of GTR (Glutamyl-tRNA synthase) to its Chlorella vulgaris homolog in terms of Mn requirements. Data was also suggestive of Mn-related compensatory up-regulation for pathway proteins CHLH1 (Manganese Chelatase), GUN4 (Magnesium chelatase activating protein), and POR1 (Light-dependent protochlorophyllide reductase). Intriguingly, data suggests possible reciprocal expression of oxygen dependent CPX1 (coproporphyrinogen III oxidase) and oxygen independent CPX2. Further analysis using RT-PCR could provide compelling evidence for several novel regulatory mechanisms in the chlorophyll biosynthesis pathway.

  5. Stabilities and Dynamics of Protein Folding Nuclei by Molecular Dynamics Simulation

    NASA Astrophysics Data System (ADS)

    Song, Yong-Shun; Zhou, Xin; Zheng, Wei-Mou; Wang, Yan-Ting

    2017-07-01

    To understand how the stabilities of key nuclei fragments affect protein folding dynamics, we simulate by molecular dynamics (MD) simulation in aqueous solution four fragments cut out of a protein G, including one α-helix (seqB: KVFKQYAN), two β-turns (seqA: LNGKTLKG and seqC: YDDATKTF), and one β-strand (seqD: DGEWTYDD). The Markov State Model clustering method combined with the coarse-grained conformation letters method are employed to analyze the data sampled from 2-μs equilibrium MD simulation trajectories. We find that seqA and seqB have more stable structures than their native structures which become metastable when cut out of the protein structure. As expected, seqD alone is flexible and does not have a stable structure. Throughout our simulations, the native structure of seqC is stable but cannot be reached if starting from a structure other than the native one, implying a funnel-shape free energy landscape of seqC in aqueous solution. All the above results suggest that different nuclei have different formation dynamics during protein folding, which may have a major contribution to the hierarchy of protein folding dynamics. Supported by the National Basic Research Program of China under Grant No. 2013CB932804, the National Natural Science Foundation of China under Grant No. 11421063, and the CAS Biophysics Interdisciplinary Innovation Team Project

  6. 15 CFR 922.152 - Prohibited or otherwise regulated activities.

    Code of Federal Regulations, 2013 CFR

    2013-01-01

    ... Protection Act, as amended, (MMPA), 16 U.S.C. 1361 et seq., the Endangered Species Act, as amended, (ESA), 16 U.S.C. 1531 et seq., and the Migratory Bird Treaty Act, as amended, (MBTA), 16 U.S.C. 703 et seq... section 312 of the Federal Water Pollution Control Act, as amended, (FWPCA), 33 U.S.C. 1322 et seq.; (C...

  7. 15 CFR 922.152 - Prohibited or otherwise regulated activities.

    Code of Federal Regulations, 2012 CFR

    2012-01-01

    ... Protection Act, as amended, (MMPA), 16 U.S.C. 1361 et seq., the Endangered Species Act, as amended, (ESA), 16 U.S.C. 1531 et seq., and the Migratory Bird Treaty Act, as amended, (MBTA), 16 U.S.C. 703 et seq... section 312 of the Federal Water Pollution Control Act, as amended, (FWPCA), 33 U.S.C. 1322 et seq.; (C...

  8. 15 CFR 922.152 - Prohibited or otherwise regulated activities.

    Code of Federal Regulations, 2014 CFR

    2014-01-01

    ... Protection Act, as amended, (MMPA), 16 U.S.C. 1361 et seq., the Endangered Species Act, as amended, (ESA), 16 U.S.C. 1531 et seq., and the Migratory Bird Treaty Act, as amended, (MBTA), 16 U.S.C. 703 et seq... section 312 of the Federal Water Pollution Control Act, as amended, (FWPCA), 33 U.S.C. 1322 et seq.; (C...

  9. 48 CFR 1552.235-71 - Treatment of confidential business information.

    Code of Federal Regulations, 2013 CFR

    2013-10-01

    ... Control Act (33 U.S.C. 1251, et seq.), the Safe Drinking Water Act (42 U.S.C. 300f et seq.), the Federal Insecticide, Fungicide, and Rodenticide Act (7 U.S.C. 136 et seq.), the Federal Food, Drug, and Cosmetic Act... Toxic Substances Control Act (15 U.S.C. 2601 et seq.). EPA regulations on confidentiality of business...

  10. 48 CFR 1552.235-71 - Treatment of confidential business information.

    Code of Federal Regulations, 2011 CFR

    2011-10-01

    ... Control Act (33 U.S.C. 1251, et seq.), the Safe Drinking Water Act (42 U.S.C. 300f et seq.), the Federal Insecticide, Fungicide, and Rodenticide Act (7 U.S.C. 136 et seq.), the Federal Food, Drug, and Cosmetic Act... Toxic Substances Control Act (15 U.S.C. 2601 et seq.). EPA regulations on confidentiality of business...

  11. 48 CFR 1552.235-71 - Treatment of confidential business information.

    Code of Federal Regulations, 2014 CFR

    2014-10-01

    ... Control Act (33 U.S.C. 1251, et seq.), the Safe Drinking Water Act (42 U.S.C. 300f et seq.), the Federal Insecticide, Fungicide, and Rodenticide Act (7 U.S.C. 136 et seq.), the Federal Food, Drug, and Cosmetic Act... Toxic Substances Control Act (15 U.S.C. 2601 et seq.). EPA regulations on confidentiality of business...

  12. 48 CFR 1552.235-71 - Treatment of confidential business information.

    Code of Federal Regulations, 2012 CFR

    2012-10-01

    ... Control Act (33 U.S.C. 1251, et seq.), the Safe Drinking Water Act (42 U.S.C. 300f et seq.), the Federal Insecticide, Fungicide, and Rodenticide Act (7 U.S.C. 136 et seq.), the Federal Food, Drug, and Cosmetic Act... Toxic Substances Control Act (15 U.S.C. 2601 et seq.). EPA regulations on confidentiality of business...

  13. 33 CFR 148.737 - What environmental statutes must an applicant follow?

    Code of Federal Regulations, 2013 CFR

    2013-07-01

    ... U.S.C. 469; Archeological Resources Protection Act (AHPA), 16 U.S.C. 470 aa-ll, et. seq.... seq.; Clean Water Act of 1977 (CWA), Pub. L. 95-217, 33 U.S.C. 1251, et. seq.; Coastal Barrier Resources Act (CBRA), Pub. L. 97-348, 16 U.S.C. 3510, et. seq.; Coastal Zone Management Act (CZMA), Pub. L...

  14. The ChIP-Seq tools and web server: a resource for analyzing ChIP-seq and other types of genomic data.

    PubMed

    Ambrosini, Giovanna; Dreos, René; Kumar, Sunil; Bucher, Philipp

    2016-11-18

    ChIP-seq and related high-throughput chromatin profiling assays generate ever increasing volumes of highly valuable biological data. To make sense out of it, biologists need versatile, efficient and user-friendly tools for access, visualization and integrative analysis of such data. Here we present the ChIP-Seq command line tools and web server, implementing basic algorithms for ChIP-seq data analysis starting with a read alignment file. The tools are optimized for memory-efficiency and speed thus allowing for processing of large data volumes on inexpensive hardware. The web interface provides access to a large database of public data. The ChIP-Seq tools have a modular and interoperable design in that the output from one application can serve as input to another one. Complex and innovative tasks can thus be achieved by running several tools in a cascade. The various ChIP-Seq command line tools and web services either complement or compare favorably to related bioinformatics resources in terms of computational efficiency, ease of access to public data and interoperability with other web-based tools. The ChIP-Seq server is accessible at http://ccg.vital-it.ch/chipseq/ .

  15. Assessing advantages of sequential boron neutron capture therapy (BNCT) in an oral cancer model with normalized blood vessels.

    PubMed

    Molinari, Ana J; Thorp, Silvia I; Portu, Agustina M; Saint Martin, Gisela; Pozzi, Emiliano C C; Heber, Elisa M; Bortolussi, Silva; Itoiz, Maria E; Aromando, Romina F; Monti Hughes, Andrea; Garabalino, Marcela A; Altieri, Saverio; Trivillin, Verónica A; Schwint, Amanda E

    2015-01-01

    We previously demonstrated the therapeutic success of sequential boron neutron capture therapy (Seq-BNCT) in the hamster cheek pouch oral cancer model. It consists of BPA-BNCT followed by GB-10-BNCT 24 or 48 hours later. Additionally, we proved that tumor blood vessel normalization with thalidomide prior to BPA-BNCT improves tumor control. The aim of the present study was to evaluate the therapeutic efficacy and explore potential boron microdistribution changes in Seq-BNCT preceded by tumor blood vessel normalization. Tumor bearing animals were treated with thalidomide for tumor blood vessel normalization, followed by Seq-BNCT (Th+ Seq-BNCT) or Seq-Beam Only (Th+ Seq-BO) in the window of normalization. Boron microdistribution was assessed by neutron autoradiography. Th+ Seq-BNCT induced overall tumor response of 100%, with 87 (4)% complete tumor response. No cases of severe mucositis in dose-limiting precancerous tissue were observed. Differences in boron homogeneity between tumors pre-treated and not pre-treated with thalidomide were observed. Th+ Seq-BNCT achieved, for the first time, response in all treated tumors. Increased homogeneity in tumor boron microdistribution is associated with an improvement in tumor control.

  16. Falco: a quick and flexible single-cell RNA-seq processing framework on the cloud.

    PubMed

    Yang, Andrian; Troup, Michael; Lin, Peijie; Ho, Joshua W K

    2017-03-01

    Single-cell RNA-seq (scRNA-seq) is increasingly used in a range of biomedical studies. Nonetheless, current RNA-seq analysis tools are not specifically designed to efficiently process scRNA-seq data due to their limited scalability. Here we introduce Falco, a cloud-based framework to enable parallelization of existing RNA-seq processing pipelines using big data technologies of Apache Hadoop and Apache Spark for performing massively parallel analysis of large scale transcriptomic data. Using two public scRNA-seq datasets and two popular RNA-seq alignment/feature quantification pipelines, we show that the same processing pipeline runs 2.6-145.4 times faster using Falco than running on a highly optimized standalone computer. Falco also allows users to utilize low-cost spot instances of Amazon Web Services, providing a ∼65% reduction in cost of analysis. Falco is available via a GNU General Public License at https://github.com/VCCRI/Falco/.

  17. Characterization of Aspergillus sojae Isolated from Meju, Korean Traditional Fermented Soybean Brick.

    PubMed

    Kim, Kyung Min; Lim, Jaeho; Lee, Jae Jung; Hurh, Byung-Serk; Lee, Inhyung

    2017-02-28

    Initially, we screened 18 Aspergillus sojae-like strains from Aspergillus spp. isolated from meju (Korean traditional fermented soybean brick) according to their morphological characteristics. Because members of Aspergillus section Flavi are often incorrectly identified because of their phylogenetic similarity, we re-identified these strains at the morphological and molecular genetic levels. Fourteen strains were finally identified as A. sojae. The isolates produced protease and α-amylase with ranges of 2.66-10.64 and 21.53-106.73 unit/g-initial dry substrate (U/g-IDS), respectively, which were equivalent to those of the koji (starter mold) strains employed to produce Japanese soy sauce. Among the isolates and Japanese koji strains, strains SMF 127 and SMF 131 had the highest leucine aminopeptidase (LAP) activities at 6.00 and 6.06 U/g-IDS, respectively. LAP plays an important role in flavor development because of the production of low-molecular-weight peptides that affect the taste and decrease bitterness. SMF 127 and SMF 131 appeared to be non-aflatoxigenic because of a termination point mutation in aflR and the lack of the polyketide synthase gene found in other A. sojae strains. In addition, SMF 127 and SMF 131 were not cyclopiazonic acid (CPA) producers because of the deletion of maoA, dmaT, and pks/nrps, which are involved in CPA biosynthesis. Therefore, A. sojae strains such as SMF 127 and SMF 131, which have high protease and LAP activities and are free of safety issues, can be considered good starters for soybean fermentations, such as in the production of the Korean fermented soybean products meju, doenjang, and ganjang.

  18. Fatigue reduction during aggregated and distributed sequential stimulation.

    PubMed

    Bergquist, Austin J; Babbar, Vishvek; Ali, Saima; Popovic, Milos R; Masani, Kei

    2017-08-01

    Transcutaneous neuromuscular electrical stimulation (NMES) can generate muscle contractions for rehabilitation and exercise. However, NMES-evoked contractions are limited by fatigue when they are delivered "conventionally" (CONV) using a single active electrode. Researchers have developed "sequential" (SEQ) stimulation, involving rotation of pulses between multiple "aggregated" (AGGR-SEQ) or "distributed" (DISTR-SEQ) active electrodes, to reduce fatigue (torque-decline) by reducing motor unit discharge rates. The primary objective was to compare fatigue-related outcomes, "potentiation," "variability," and "efficiency" between CONV, AGGR-SEQ, and DISTR-SEQ stimulation of knee extensors in healthy participants. Torque and current were recorded during testing with fatiguing trains using each NMES type under isometric and isokinetic (180°/s) conditions. Compared with CONV stimulation, SEQ techniques reduced fatigue-related outcomes, increased potentiation, did not affect variability, and reduced efficiency. SEQ techniques hold promise for reducing fatigue during NMES-based rehabilitation and exercise; however, optimization is required to improve efficiency. Muscle Nerve 56: 271-281, 2017. © 2016 Wiley Periodicals, Inc.

  19. Gene expression profiling of human breast tissue samples using SAGE-Seq.

    PubMed

    Wu, Zhenhua Jeremy; Meyer, Clifford A; Choudhury, Sibgat; Shipitsin, Michail; Maruyama, Reo; Bessarabova, Marina; Nikolskaya, Tatiana; Sukumar, Saraswati; Schwartzman, Armin; Liu, Jun S; Polyak, Kornelia; Liu, X Shirley

    2010-12-01

    We present a powerful application of ultra high-throughput sequencing, SAGE-Seq, for the accurate quantification of normal and neoplastic mammary epithelial cell transcriptomes. We develop data analysis pipelines that allow the mapping of sense and antisense strands of mitochondrial and RefSeq genes, the normalization between libraries, and the identification of differentially expressed genes. We find that the diversity of cancer transcriptomes is significantly higher than that of normal cells. Our analysis indicates that transcript discovery plateaus at 10 million reads/sample, and suggests a minimum desired sequencing depth around five million reads. Comparison of SAGE-Seq and traditional SAGE on normal and cancerous breast tissues reveals higher sensitivity of SAGE-Seq to detect less-abundant genes, including those encoding for known breast cancer-related transcription factors and G protein-coupled receptors (GPCRs). SAGE-Seq is able to identify genes and pathways abnormally activated in breast cancer that traditional SAGE failed to call. SAGE-Seq is a powerful method for the identification of biomarkers and therapeutic targets in human disease.

  20. Evaluation of tools for highly variable gene discovery from single-cell RNA-seq data.

    PubMed

    Yip, Shun H; Sham, Pak Chung; Wang, Junwen

    2018-02-21

    Traditional RNA sequencing (RNA-seq) allows the detection of gene expression variations between two or more cell populations through differentially expressed gene (DEG) analysis. However, genes that contribute to cell-to-cell differences are not discoverable with RNA-seq because RNA-seq samples are obtained from a mixture of cells. Single-cell RNA-seq (scRNA-seq) allows the detection of gene expression in each cell. With scRNA-seq, highly variable gene (HVG) discovery allows the detection of genes that contribute strongly to cell-to-cell variation within a homogeneous cell population, such as a population of embryonic stem cells. This analysis is implemented in many software packages. In this study, we compare seven HVG methods from six software packages, including BASiCS, Brennecke, scLVM, scran, scVEGs and Seurat. Our results demonstrate that reproducibility in HVG analysis requires a larger sample size than DEG analysis. Discrepancies between methods and potential issues in these tools are discussed and recommendations are made.
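    As a rough illustration of what HVG discovery measures, the sketch below ranks genes by their squared coefficient of variation (variance divided by squared mean) across cells. This is a deliberately naive score and not the method of any of the packages compared in the study above; the function name `rank_by_cv2` and the input layout (gene name mapped to per-cell expression values) are assumptions made for the example.

```python
from statistics import mean, pvariance


def rank_by_cv2(counts: dict[str, list[float]]) -> list[tuple[str, float]]:
    """Rank genes by squared coefficient of variation across cells.

    `counts` maps a gene name to its expression values, one per cell.
    Genes with zero mean expression are skipped because CV^2 is undefined.
    """
    scores = {}
    for gene, values in counts.items():
        mu = mean(values)
        if mu > 0:
            scores[gene] = pvariance(values) / mu ** 2
    # Highest variability first.
    return sorted(scores.items(), key=lambda item: item[1], reverse=True)


if __name__ == "__main__":
    toy = {
        "geneA": [10, 10, 10, 10],  # constant across cells
        "geneB": [0, 20, 0, 20],    # varies strongly between cells
        "geneC": [0, 0, 0, 0],      # not expressed, skipped
    }
    for gene, score in rank_by_cv2(toy):
        print(gene, score)
```

    Published tools typically go further, for example by modelling technical noise or the dependence of variance on mean expression, which a raw CV² ranking ignores.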

  1. Introduction to Single-Cell RNA Sequencing.

    PubMed

    Olsen, Thale Kristin; Baryawno, Ninib

    2018-04-01

    During the last decade, high-throughput sequencing methods have revolutionized the entire field of biology. The opportunity to study entire transcriptomes in great detail using RNA sequencing (RNA-seq) has fueled many important discoveries and is now a routine method in biomedical research. However, RNA-seq is typically performed in "bulk," and the data represent an average of gene expression patterns across thousands to millions of cells; this might obscure biologically relevant differences between cells. Single-cell RNA-seq (scRNA-seq) represents an approach to overcome this problem. By isolating single cells, capturing their transcripts, and generating sequencing libraries in which the transcripts are mapped to individual cells, scRNA-seq allows assessment of fundamental biological properties of cell populations and biological systems at unprecedented resolution. Here, we present the most common scRNA-seq protocols in use today and the basics of data analysis and discuss factors that are important to consider before planning and designing an scRNA-seq project. © 2018 by John Wiley & Sons, Inc.

  2. BAsE-Seq: a method for obtaining long viral haplotypes from short sequence reads.

    PubMed

    Hong, Lewis Z; Hong, Shuzhen; Wong, Han Teng; Aw, Pauline P K; Cheng, Yan; Wilm, Andreas; de Sessions, Paola F; Lim, Seng Gee; Nagarajan, Niranjan; Hibberd, Martin L; Quake, Stephen R; Burkholder, William F

    2014-01-01

    We present a method for obtaining long haplotypes, of over 3 kb in length, using a short-read sequencer, Barcode-directed Assembly for Extra-long Sequences (BAsE-Seq). BAsE-Seq relies on transposing a template-specific barcode onto random segments of the template molecule and assembling the barcoded short reads into complete haplotypes. We applied BAsE-Seq on mixed clones of hepatitis B virus and accurately identified haplotypes occurring at frequencies greater than or equal to 0.4%, with >99.9% specificity. Applying BAsE-Seq to a clinical sample, we obtained over 9,000 viral haplotypes, which provided an unprecedented view of hepatitis B virus population structure during chronic infection. BAsE-Seq is readily applicable for monitoring quasispecies evolution in viral diseases.

  3. Functional characterization of nine Norway Spruce TPS genes and evolution of gymnosperm terpene synthases of the TPS-d subfamily.

    PubMed

    Martin, Diane M; Fäldt, Jenny; Bohlmann, Jörg

    2004-08-01

    Constitutive and induced terpenoids are important defense compounds for many plants against potential herbivores and pathogens. In Norway spruce (Picea abies L. Karst), treatment with methyl jasmonate induces complex chemical and biochemical terpenoid defense responses associated with traumatic resin duct development in stems and volatile terpenoid emissions in needles. The cloning of (+)-3-carene synthase was the first step in characterizing this system at the molecular genetic level. Here we report the isolation and functional characterization of nine additional terpene synthase (TPS) cDNAs from Norway spruce. These cDNAs encode four monoterpene synthases, myrcene synthase, (-)-limonene synthase, (-)-alpha/beta-pinene synthase, and (-)-linalool synthase; three sesquiterpene synthases, longifolene synthase, E,E-alpha-farnesene synthase, and E-alpha-bisabolene synthase; and two diterpene synthases, isopimara-7,15-diene synthase and levopimaradiene/abietadiene synthase, each with a unique product profile. To our knowledge, genes encoding isopimara-7,15-diene synthase and longifolene synthase have not been previously described, and this linalool synthase is the first described from a gymnosperm. These functionally diverse TPS account for much of the structural diversity of constitutive and methyl jasmonate-induced terpenoids in foliage, xylem, bark, and volatile emissions from needles of Norway spruce. Phylogenetic analyses based on the inclusion of these TPS into the TPS-d subfamily revealed that functional specialization of conifer TPS occurred before speciation of Pinaceae. Furthermore, based on TPS enclaves created by distinct branching patterns, the TPS-d subfamily is divided into three groups according to sequence similarities and functional assessment. Similarities of TPS evolution in angiosperms and modeling of TPS protein structures are discussed.

  4. Distribution of Callose Synthase, Cellulose Synthase, and Sucrose Synthase in Tobacco Pollen Tube Is Controlled in Dissimilar Ways by Actin Filaments and Microtubules

    PubMed Central

    Cai, Giampiero; Faleri, Claudia; Del Casino, Cecilia; Emons, Anne Mie C.; Cresti, Mauro

    2011-01-01

    Callose and cellulose are fundamental components of the cell wall of pollen tubes and are probably synthesized by distinct enzymes, callose synthase and cellulose synthase, respectively. We examined the distribution of callose synthase and cellulose synthase in tobacco (Nicotiana tabacum) pollen tubes in relation to the dynamics of actin filaments, microtubules, and the endomembrane system using specific antibodies to highly conserved peptide sequences. The role of the cytoskeleton and membrane flow was investigated using specific inhibitors (latrunculin B, 2,3-butanedione monoxime, taxol, oryzalin, and brefeldin A). Both enzymes are associated with the plasma membrane, but cellulose synthase is present along the entire length of pollen tubes (with a higher concentration at the apex) while callose synthase is located in the apex and in distal regions. In longer pollen tubes, callose synthase accumulates consistently around callose plugs, indicating its involvement in plug synthesis. Actin filaments and endomembrane dynamics are critical for the distribution of callose synthase and cellulose synthase, showing that enzymes are transported through Golgi bodies and/or vesicles moving along actin filaments. Conversely, microtubules appear to be critical in the positioning of callose synthase in distal regions and around callose plugs. In contrast, cellulose synthases are only partially coaligned with cortical microtubules and unrelated to callose plugs. Callose synthase also comigrates with tubulin by Blue Native-polyacrylamide gel electrophoresis. Membrane sucrose synthase, which expectedly provides UDP-glucose to callose synthase and cellulose synthase, binds to actin filaments depending on sucrose concentration; its distribution is dependent on the actin cytoskeleton and the endomembrane system but not on microtubules. PMID:21205616

  5. Comparison and evaluation of two exome capture kits and sequencing platforms for variant calling.

    PubMed

    Zhang, Guoqiang; Wang, Jianfeng; Yang, Jin; Li, Wenjie; Deng, Yutian; Li, Jing; Huang, Jun; Hu, Songnian; Zhang, Bing

    2015-08-05

    To promote the clinical application of next-generation sequencing, it is important to obtain accurate and consistent variants of target genomic regions at low cost. Ion Proton, the latest updated semiconductor-based sequencing instrument from Life Technologies, is designed to provide investigators with an inexpensive platform for human whole exome sequencing that achieves a rapid turnaround time. However, few studies have comprehensively compared and evaluated the accuracy of variant calling between Ion Proton and Illumina sequencing platforms such as HiSeq 2000, which is the most popular sequencing platform for the human genome. The Ion Proton sequencer combined with the Ion TargetSeq Exome Enrichment Kit together make up TargetSeq-Proton, whereas SureSelect-HiSeq is based on the Agilent SureSelect Human All Exon v4 Kit and the HiSeq 2000 sequencer. Here, we sequenced exonic DNA from four human blood samples using both TargetSeq-Proton and SureSelect-HiSeq. We then called variants in the exonic regions that overlapped between the two exome capture kits (33.6 Mb). The rates of shared variant loci called by the two sequencing platforms ranged from 68.0 to 75.3% in the four samples, whereas the concordance of co-detected variant loci reached 99%. Sanger sequencing validation revealed that the validated rate of concordant single nucleotide polymorphisms (SNPs) (91.5%) was higher than that of the SNPs specific to TargetSeq-Proton (60.0%) or specific to SureSelect-HiSeq (88.3%). With regard to 1-bp small insertions and deletions (InDels), the Sanger sequencing validated rates of concordant variants (100.0%) and SureSelect-HiSeq-specific variants (89.6%) were higher than that of TargetSeq-Proton-specific variants (15.8%). In the sequencing of exonic regions, combining the two sequencing strategies (SureSelect-HiSeq and TargetSeq-Proton) increased the variant calling specificity for concordant variant loci and the sensitivity for variant loci called by any one platform. However, for the sequencing of platform-specific variants, the accuracy of variant calling by HiSeq 2000 was higher than that of Ion Proton, specifically for the InDel detection. Moreover, the variant calling software also influences the detection of SNPs and, specifically, InDels in Ion Proton exome sequencing.
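    The two cross-platform summary figures in the abstract above (rate of shared variant loci and concordance of co-detected loci) can be sketched as follows. The abstract does not give the exact definitions, so this example assumes that the shared rate is the fraction of all called loci that both platforms report and that concordance is the fraction of shared loci with identical genotype calls. The function name `compare_calls` and the data layout (a locus tuple mapped to a genotype string) are hypothetical.

```python
Locus = tuple[str, int]  # (chromosome, position)


def compare_calls(a: dict[Locus, str], b: dict[Locus, str]) -> tuple[float, float]:
    """Return (shared_rate, concordance) for two call sets.

    shared_rate: loci called by both platforms / loci called by either.
    concordance: shared loci with identical genotypes / shared loci.
    Both definitions are assumptions made for this sketch.
    """
    shared = a.keys() & b.keys()
    union = a.keys() | b.keys()
    shared_rate = len(shared) / len(union) if union else 0.0
    same = sum(1 for locus in shared if a[locus] == b[locus])
    concordance = same / len(shared) if shared else 0.0
    return shared_rate, concordance


if __name__ == "__main__":
    platform_a = {("chr1", 100): "A/G", ("chr1", 200): "C/T", ("chr2", 50): "G/G"}
    platform_b = {("chr1", 100): "A/G", ("chr1", 200): "C/C", ("chr3", 10): "T/T"}
    print(compare_calls(platform_a, platform_b))
```

    Keeping the two numbers separate matters: platforms can disagree on which loci are variant at all while still agreeing closely on the genotypes of the loci they both call.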

  6. 50 CFR 12.2 - Scope of regulations.

    Code of Federal Regulations, 2012 CFR

    2012-10-01

    ... Act, 16 U.S.C. 1361 et seq.; (h) The Endangered Species Act, 16 U.S.C. 1531 et seq.; and (i) The Lacey... the following laws: (a) The Eagle Protection Act, 16 U.S.C. 668 et seq.; (b) The National Wildlife Refuge System Administration Act, 16 U.S.C. 668dd et seq.; (c) The Migratory Bird Treaty Act, 16 U.S.C...

  7. 50 CFR 12.2 - Scope of regulations.

    Code of Federal Regulations, 2014 CFR

    2014-10-01

    ... Act, 16 U.S.C. 1361 et seq.; (h) The Endangered Species Act, 16 U.S.C. 1531 et seq.; and (i) The Lacey... the following laws: (a) The Eagle Protection Act, 16 U.S.C. 668 et seq.; (b) The National Wildlife Refuge System Administration Act, 16 U.S.C. 668dd et seq.; (c) The Migratory Bird Treaty Act, 16 U.S.C...

  8. 50 CFR 12.2 - Scope of regulations.

    Code of Federal Regulations, 2011 CFR

    2011-10-01

    ... Act, 16 U.S.C. 1361 et seq.; (h) The Endangered Species Act, 16 U.S.C. 1531 et seq.; and (i) The Lacey... the following laws: (a) The Eagle Protection Act, 16 U.S.C. 668 et seq.; (b) The National Wildlife Refuge System Administration Act, 16 U.S.C. 668dd et seq.; (c) The Migratory Bird Treaty Act, 16 U.S.C...

  9. 50 CFR 12.2 - Scope of regulations.

    Code of Federal Regulations, 2013 CFR

    2013-10-01

    ... Act, 16 U.S.C. 1361 et seq.; (h) The Endangered Species Act, 16 U.S.C. 1531 et seq.; and (i) The Lacey... the following laws: (a) The Eagle Protection Act, 16 U.S.C. 668 et seq.; (b) The National Wildlife Refuge System Administration Act, 16 U.S.C. 668dd et seq.; (c) The Migratory Bird Treaty Act, 16 U.S.C...

  10. 50 CFR 12.2 - Scope of regulations.

    Code of Federal Regulations, 2010 CFR

    2010-10-01

    ... Act, 16 U.S.C. 1361 et seq.; (h) The Endangered Species Act, 16 U.S.C. 1531 et seq.; and (i) The Lacey... the following laws: (a) The Eagle Protection Act, 16 U.S.C. 668 et seq.; (b) The National Wildlife Refuge System Administration Act, 16 U.S.C. 668dd et seq.; (c) The Migratory Bird Treaty Act, 16 U.S.C...

  11. 49 CFR 1.66 - Delegations to Maritime Administrator.

    Code of Federal Regulations, 2011 CFR

    2011-10-01

    ..., 1916, as amended (46 App. U.S.C. 801 et seq.); (b) Carry out the Merchant Marine Act, 1920, as amended (46 App. U.S.C. 861 et seq.), including the Ship Mortgage Act, 1920, as amended (46 App. U.S.C. 921 et seq.); (c) Carry out the Merchant Marine Act, 1928, as amended (46 App. U.S.C. 891 et seq.); (d) Carry...

  12. 49 CFR 1.66 - Delegations to Maritime Administrator.

    Code of Federal Regulations, 2010 CFR

    2010-10-01

    ..., 1916, as amended (46 App. U.S.C. 801 et seq.); (b) Carry out the Merchant Marine Act, 1920, as amended (46 App. U.S.C. 861 et seq.), including the Ship Mortgage Act, 1920, as amended (46 App. U.S.C. 921 et seq.); (c) Carry out the Merchant Marine Act, 1928, as amended (46 App. U.S.C. 891 et seq.); (d) Carry...

  13. ReliefSeq: A Gene-Wise Adaptive-K Nearest-Neighbor Feature Selection Tool for Finding Gene-Gene Interactions and Main Effects in mRNA-Seq Gene Expression Data

    PubMed Central

    McKinney, Brett A.; White, Bill C.; Grill, Diane E.; Li, Peter W.; Kennedy, Richard B.; Poland, Gregory A.; Oberg, Ann L.

    2013-01-01

    Relief-F is a nonparametric, nearest-neighbor machine learning method that has been successfully used to identify relevant variables that may interact in complex multivariate models to explain phenotypic variation. While several tools have been developed for assessing differential expression in sequence-based transcriptomics, the detection of statistical interactions between transcripts has received less attention in the area of RNA-seq analysis. We describe a new extension and assessment of Relief-F for feature selection in RNA-seq data. The ReliefSeq implementation adapts the number of nearest neighbors (k) for each gene to optimize the Relief-F test statistics (importance scores) for finding both main effects and interactions. We compare this gene-wise adaptive-k (gwak) Relief-F method with standard RNA-seq feature selection tools, such as DESeq and edgeR, and with the popular machine learning method Random Forests. We demonstrate performance on a panel of simulated data that have a range of distributional properties reflected in real mRNA-seq data including multiple transcripts with varying sizes of main effects and interaction effects. For simulated main effects, gwak-Relief-F feature selection performs comparably to standard tools DESeq and edgeR for ranking relevant transcripts. For gene-gene interactions, gwak-Relief-F outperforms all comparison methods at ranking relevant genes in all but the highest fold change/highest signal situations where it performs similarly. The gwak-Relief-F algorithm outperforms Random Forests for detecting relevant genes in all simulation experiments. In addition, Relief-F is comparable to the other methods based on computational time. We also apply ReliefSeq to an RNA-Seq study of smallpox vaccine to identify gene expression changes between vaccinia virus-stimulated and unstimulated samples. 
    ReliefSeq is an attractive tool for inclusion in the suite of tools used for analysis of mRNA-Seq data; it has power to detect both main effects and interaction effects. Software Availability: http://insilico.utulsa.edu/ReliefSeq.php. PMID:24339943

  14. Phosphoproteomics reveals that glycogen synthase kinase-3 phosphorylates multiple splicing factors and is associated with alternative splicing

    PubMed Central

    Shinde, Mansi Y.; Sidoli, Simone; Kulej, Katarzyna; Mallory, Michael J.; Radens, Caleb M.; Reicherter, Amanda L.; Myers, Rebecca L.; Barash, Yoseph; Lynch, Kristen W.; Garcia, Benjamin A.; Klein, Peter S.

    2017-01-01

    Glycogen synthase kinase-3 (GSK-3) is a constitutively active, ubiquitously expressed protein kinase that regulates multiple signaling pathways. In vitro kinase assays and genetic and pharmacological manipulations of GSK-3 have identified more than 100 putative GSK-3 substrates in diverse cell types. Many more have been predicted on the basis of a recurrent GSK-3 consensus motif ((pS/pT)XXX(S/T)), but this prediction has not been tested by analyzing the GSK-3 phosphoproteome. Using stable isotope labeling of amino acids in culture (SILAC) and MS techniques to analyze the repertoire of GSK-3–dependent phosphorylation in mouse embryonic stem cells (ESCs), we found that ∼2.4% of (pS/pT)XXX(S/T) sites are phosphorylated in a GSK-3–dependent manner. A comparison of WT and Gsk3a;Gsk3b knock-out (Gsk3 DKO) ESCs revealed prominent GSK-3–dependent phosphorylation of multiple splicing factors and regulators of RNA biosynthesis as well as proteins that regulate transcription, translation, and cell division. Gsk3 DKO reduced phosphorylation of the splicing factors RBM8A, SRSF9, and PSF as well as the nucleolar proteins NPM1 and PHF6, and recombinant GSK-3β phosphorylated these proteins in vitro. RNA-Seq of WT and Gsk3 DKO ESCs identified ∼190 genes that are alternatively spliced in a GSK-3–dependent manner, supporting a broad role for GSK-3 in regulating alternative splicing. The MS data also identified posttranscriptional regulation of protein abundance by GSK-3, with ∼47 proteins (1.4%) whose levels increased and ∼78 (2.4%) whose levels decreased in the absence of GSK-3. This study provides the first unbiased analysis of the GSK-3 phosphoproteome and strong evidence that GSK-3 broadly regulates alternative splicing. PMID:28916722
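    The consensus motif mentioned above, (pS/pT)XXX(S/T), can be turned into a simple sequence scan. The sketch below reports every serine or threonine that has another serine or threonine four residues C-terminal to it. Phosphorylation state cannot be read from a plain amino acid sequence, so these are only candidate sites; the function name `candidate_gsk3_sites` is hypothetical and the scan is not the authors' analysis pipeline.

```python
import re

# A lookahead is used so that overlapping candidate sites are all reported.
# Pattern: S or T, any three residues, then S or T (the potential priming site).
MOTIF = re.compile(r"(?=[ST]...[ST])")


def candidate_gsk3_sites(protein: str) -> list[int]:
    """Return 1-based positions of S/T residues with an S/T at position +4."""
    return [match.start() + 1 for match in MOTIF.finditer(protein.upper())]


if __name__ == "__main__":
    # Toy sequence: S at position 2, T at position 6, S at position 10.
    print(candidate_gsk3_sites("ASAAATAAAS"))
```

    A scan like this typically returns far more sites than are actually used, which is consistent with the abstract's finding that only a small fraction of motif sites are phosphorylated in a GSK-3-dependent manner.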

  15. Effect of cerulenin on fatty acid composition and gene expression pattern of DHA-producing strain Colwellia psychrerythraea strain 34H.

    PubMed

    Wan, Xia; Peng, Yun-Feng; Zhou, Xue-Rong; Gong, Yang-Min; Huang, Feng-Hong; Moncalián, Gabriel

    2016-02-06

    Colwellia psychrerythraea 34H is a psychrophilic bacterium able to produce docosahexaenoic acid (DHA). The polyketide synthase pathway is assumed to be responsible for DHA production in marine bacteria. Five pfa genes from strain 34H were confirmed to be responsible for DHA formation by heterogeneous expression in Escherichia coli. The complexity of the fatty acid profile of this strain was revealed by GC and GC-MS. Treatment of cells with cerulenin resulted in significantly reduced levels of C16 monounsaturated fatty acids (C16:1(Δ9t), C16:1(Δ7)). In contrast, the amounts of saturated fatty acids (C10:0, C12:0, C14:0), hydroxyl fatty acids (3-OH C10:0 and 3-OH C12:0), as well as C20:4ω3, C20:5ω3 and C22:6ω3 were increased. RNA sequencing (RNA-Seq) revealed the altered gene expression pattern when C. psychrerythraea cells were treated with cerulenin. Genes involved in the polyketide synthase pathway and the fatty acid biosynthesis pathway were not obviously affected by cerulenin treatment. In contrast, several genes involved in the fatty acid degradation or β-oxidation pathway were dramatically reduced at the transcriptional level. Genes responsible for DHA formation in C. psychrerythraea were first cloned and characterized. We revealed the complexity of the fatty acid profile in this DHA-producing strain. Cerulenin could substantially change the fatty acid composition by affecting fatty acid degradation at the transcriptional level. The acyl-CoA dehydrogenase gene family involved in the first step of the β-oxidation pathway may be important to the selectivity of degraded fatty acids. In addition, inhibition of FabB protein by cerulenin may lead to the accumulation of malonyl-CoA, which is the substrate for DHA formation.

  16. A multitask clustering approach for single-cell RNA-seq analysis in Recessive Dystrophic Epidermolysis Bullosa

    PubMed Central

    Petegrosso, Raphael; Tolar, Jakub

    2018-01-01

    Single-cell RNA sequencing (scRNA-seq) has been widely applied to discover new cell types by detecting sub-populations in a heterogeneous group of cells. Since scRNA-seq experiments have lower read coverage/tag counts and introduce more technical biases compared to bulk RNA-seq experiments, the limited number of sampled cells combined with the experimental biases and other dataset specific variations presents a challenge to cross-dataset analysis and discovery of relevant biological variations across multiple cell populations. In this paper, we introduce a method of variance-driven multitask clustering of single-cell RNA-seq data (scVDMC) that utilizes multiple single-cell populations from biological replicates or different samples. scVDMC clusters single cells in multiple scRNA-seq experiments of similar cell types and markers but varying expression patterns such that the scRNA-seq data are better integrated than typical pooled analyses which only increase the sample size. By controlling the variance among the cell clusters within each dataset and across all the datasets, scVDMC detects cell sub-populations in each individual experiment with shared cell-type markers but varying cluster centers among all the experiments. Applied to two real scRNA-seq datasets with several replicates and one large-scale droplet-based dataset on three patient samples, scVDMC more accurately detected cell populations and known cell markers than pooled clustering and other recently proposed scRNA-seq clustering methods. In the case study applied to in-house Recessive Dystrophic Epidermolysis Bullosa (RDEB) scRNA-seq data, scVDMC revealed several new cell types and unknown markers validated by flow cytometry. MATLAB/Octave code available at https://github.com/kuanglab/scVDMC. PMID:29630593

  17. A comparative study of students' performance in preclinical physiology assessed by multiple choice and short essay questions.

    PubMed

    Oyebola, D D; Adewoye, O E; Iyaniwura, J O; Alada, A R; Fasanmade, A A; Raji, Y

    2000-01-01

    This study was designed to compare the performance of medical students in physiology when assessed by multiple choice questions (MCQs) and short essay questions (SEQs). The study also examined the influence of factors such as age, sex, O/level grades and JAMB scores on performance in the MCQs and SEQs. A structured questionnaire was administered to 264 medical students four months before the Part I MBBS examination. Apart from personal data of each student, the questionnaire sought information on the JAMB scores and GCE O/level grades of each student in English Language, Biology, Chemistry, Physics and Mathematics. The physiology syllabus was divided into five parts and the students were administered separate examinations (tests) on each part. Each test consisted of MCQs and SEQs. The performance in MCQs and SEQs was compared. Also, the effects of JAMB scores and GCE O/level grades on the performance in both the MCQs and SEQs were assessed. The results showed that the students performed better in all MCQ tests than in the SEQs. JAMB scores and O/level English Language grade had no significant effect on students' performance in MCQs and SEQs. However, O/level grades in Biology, Chemistry, Physics and Mathematics had significant effects on performance in MCQs and SEQs. Inadequate knowledge of physiology and inability to present information in a logical sequence are believed to be major factors contributing to the poorer performance in the SEQs compared with MCQs. In view of the finding of significant association between performance in MCQs and SEQs and GCE O/level grades in science subjects and mathematics, it was recommended that both JAMB results and the GCE results in the four O/level subjects above may be considered when selecting candidates for admission into the medical schools.

  18. ChIP-PIT: Enhancing the Analysis of ChIP-Seq Data Using Convex-Relaxed Pair-Wise Interaction Tensor Decomposition.

    PubMed

    Zhu, Lin; Guo, Wei-Li; Deng, Su-Ping; Huang, De-Shuang

    2016-01-01

    In recent years, thanks to the efforts of individual scientists and research consortiums, a huge amount of chromatin immunoprecipitation followed by high-throughput sequencing (ChIP-seq) experimental data has been accumulated. Instead of investigating them independently, several recent studies have convincingly demonstrated that a wealth of scientific insights can be gained by integrative analysis of these ChIP-seq data. However, when used for the purpose of integrative analysis, a serious drawback of the current ChIP-seq technique is that it is still expensive and time-consuming to generate ChIP-seq datasets of high standard. Most researchers are therefore unable to obtain complete ChIP-seq data for several TFs in a wide variety of cell lines, which considerably limits the understanding of transcriptional regulation patterns. In this paper, we propose a novel method called ChIP-PIT to overcome the aforementioned limitation. In ChIP-PIT, ChIP-seq data corresponding to a diverse collection of cell types, TFs and genes are fused together using the three-mode pair-wise interaction tensor (PIT) model, and the prediction of unperformed ChIP-seq experimental results is formulated as a tensor completion problem. Computationally, we propose an efficient first-order method based on extensions of the coordinate descent method to learn the optimal solution of ChIP-PIT, which makes it particularly suitable for the analysis of massive scale ChIP-seq data. Experimental evaluation on the ENCODE data illustrates the usefulness of the proposed model.

  19. Illuminating choices for library prep: a comparison of library preparation methods for whole genome sequencing of Cryptococcus neoformans using Illumina HiSeq.

    PubMed

    Rhodes, Johanna; Beale, Mathew A; Fisher, Matthew C

    2014-01-01

    The industry of next-generation sequencing is constantly evolving, with novel library preparation methods and new sequencing machines being released by the major sequencing technology companies annually. The Illumina TruSeq v2 library preparation method was the most widely used kit and the market leader; however, it has now been discontinued, and in 2013 was replaced by the TruSeq Nano and TruSeq PCR-free methods, leaving a gap in knowledge regarding which is the most appropriate library preparation method to use. Here, we used isolates from the pathogenic fungus Cryptococcus neoformans var. grubii and sequenced them using the existing TruSeq DNA v2 kit (Illumina), along with two new kits: the TruSeq Nano DNA kit (Illumina) and the NEBNext Ultra DNA kit (New England Biolabs) to provide a comparison. Compared to the original TruSeq DNA v2 kit, both newer kits gave equivalent or better sequencing data, with increased coverage. When comparing the two newer kits, we found little difference in cost and workflow, with the NEBNext Ultra both slightly cheaper and faster than the TruSeq Nano. However, the quality of data generated using the TruSeq Nano DNA kit was superior due to higher coverage at regions of low GC content, and more SNPs identified. Researchers should therefore evaluate their resources and the type of application (and hence data quality) being considered when ultimately deciding on which library prep method to use.

  20. 50 CFR 12.25 - Transfers in settlement of civil penalty claims.

    Code of Federal Regulations, 2010 CFR

    2010-10-01

    ... may be liable for civil penalty under the Endangered Species Act, 16 U.S.C. 1531 et seq.; Lacey Act, 18 U.S.C. 43; Lacey Act Amendments of 1981, 16 U.S.C. 3371 et seq.; Eagle Protection Act, 16 U.S.C. 668 et seq.; or Marine Mammal Protection Act, 16 U.S.C. 1361 et seq., may be given an opportunity to...

  1. 50 CFR 12.25 - Transfers in settlement of civil penalty claims.

    Code of Federal Regulations, 2011 CFR

    2011-10-01

    ... may be liable for civil penalty under the Endangered Species Act, 16 U.S.C. 1531 et seq.; Lacey Act, 18 U.S.C. 43; Lacey Act Amendments of 1981, 16 U.S.C. 3371 et seq.; Eagle Protection Act, 16 U.S.C. 668 et seq.; or Marine Mammal Protection Act, 16 U.S.C. 1361 et seq., may be given an opportunity to...

  2. 50 CFR 12.25 - Transfers in settlement of civil penalty claims.

    Code of Federal Regulations, 2014 CFR

    2014-10-01

    ... may be liable for civil penalty under the Endangered Species Act, 16 U.S.C. 1531 et seq.; Lacey Act, 18 U.S.C. 43; Lacey Act Amendments of 1981, 16 U.S.C. 3371 et seq.; Eagle Protection Act, 16 U.S.C. 668 et seq.; or Marine Mammal Protection Act, 16 U.S.C. 1361 et seq., may be given an opportunity to...

  3. 50 CFR 12.25 - Transfers in settlement of civil penalty claims.

    Code of Federal Regulations, 2012 CFR

    2012-10-01

    ... may be liable for civil penalty under the Endangered Species Act, 16 U.S.C. 1531 et seq.; Lacey Act, 18 U.S.C. 43; Lacey Act Amendments of 1981, 16 U.S.C. 3371 et seq.; Eagle Protection Act, 16 U.S.C. 668 et seq.; or Marine Mammal Protection Act, 16 U.S.C. 1361 et seq., may be given an opportunity to...

  4. 50 CFR 12.25 - Transfers in settlement of civil penalty claims.

    Code of Federal Regulations, 2013 CFR

    2013-10-01

    ... may be liable for civil penalty under the Endangered Species Act, 16 U.S.C. 1531 et seq.; Lacey Act, 18 U.S.C. 43; Lacey Act Amendments of 1981, 16 U.S.C. 3371 et seq.; Eagle Protection Act, 16 U.S.C. 668 et seq.; or Marine Mammal Protection Act, 16 U.S.C. 1361 et seq., may be given an opportunity to...

  5. Identification of Prostate Cancer-Specific microDNAs

    DTIC Science & Technology

    2014-12-01

    displacement amplification (MDA). 2 adopted multiple displacement amplification (MDA) with random primers for enriched circular DNA by rolling circle ... amplification (RCA) (Fig. 1) and then amplified DNA fragments were subject to deep sequencing. Sequence NO of Reads seq 1 184 seq 2 133 seq 3 2407 seq...prostate cancer cells through multiple displacement amplification .  Clone #7 is the top candidate which has been cloned in an expression vector and it

  6. GWIPS-viz: 2018 update

    PubMed Central

    Michel, Audrey M; Kiniry, Stephen J; O’Connor, Patrick B F; Mullan, James P

    2018-01-01

    Abstract The GWIPS-viz browser (http://gwips.ucc.ie/) is an on-line genome browser which is tailored for exploring ribosome profiling (Ribo-seq) data. Since its publication in 2014, GWIPS-viz provides Ribo-seq data for an additional 14 genomes bringing the current total to 23. The integration of new Ribo-seq data has been automated thereby increasing the number of available tracks to 1792, a 10-fold increase in the last three years. The increase is particularly substantial for data derived from human sources. Following user requests, we added the functionality to download these tracks in bigWig format. We also incorporated new types of data (e.g. TCP-seq) as well as auxiliary tracks from other sources that help with the interpretation of Ribo-seq data. Improvements in the visualization of the data have been carried out particularly for bacterial genomes where the Ribo-seq data are now shown in a strand specific manner. For higher eukaryotic datasets, we provide characteristics of individual datasets using the RUST program which includes the triplet periodicity, sequencing biases and relative inferred A-site dwell times. This information can be used for assessing the quality of Ribo-seq datasets. To improve the power of the signal, we aggregate Ribo-seq data from several studies into Global aggregate tracks for each genome. PMID:28977460

  7. seq-ImmuCC: Cell-Centric View of Tissue Transcriptome Measuring Cellular Compositions of Immune Microenvironment From Mouse RNA-Seq Data.

    PubMed

    Chen, Ziyi; Quan, Lijun; Huang, Anfei; Zhao, Qiang; Yuan, Yao; Yuan, Xuye; Shen, Qin; Shang, Jingzhe; Ben, Yinyin; Qin, F Xiao-Feng; Wu, Aiping

    2018-01-01

    The RNA sequencing approach has been broadly used to provide gene-, pathway-, and network-centric analyses for various cell and tissue samples. However, thus far, the rich cellular information carried in tissue samples has not been thoroughly characterized from RNA-Seq data. Therefore, incorporating a cell-centric view of the tissue transcriptome would expand our horizons and help us better understand the biological processes of the body. Here, a computational model named seq-ImmuCC was developed to infer the relative proportions of 10 major immune cells in mouse tissues from RNA-Seq data. The performance of seq-ImmuCC was evaluated among multiple computational algorithms, transcriptional platforms, and simulated and experimental datasets. The test results showed its stable performance and superb consistency with experimental observations under different conditions. With seq-ImmuCC, we generated the comprehensive landscape of immune cell compositions in 27 normal mouse tissues and extracted the distinct signatures of immune cell proportion among various tissue types. Furthermore, we quantitatively characterized and compared 18 different types of mouse tumor tissues of distinct cell origins with their immune cell compositions, which provided a comprehensive and informative measurement for the immune microenvironment inside tumor tissues. The online server of seq-ImmuCC is freely available at http://wap-lab.org:3200/immune/.

  8. RNA-Seq Technology and Its Application in Fish Transcriptomics

    PubMed Central

    Ba, Yi; Zhuang, Qianfeng

    2014-01-01

    Abstract High-throughput sequencing technologies, also known as next-generation sequencing (NGS) technologies, have revolutionized the way that genomic research is advancing. In addition to the static genome, these state-of-art technologies have been recently exploited to analyze the dynamic transcriptome, and the resulting technology is termed RNA sequencing (RNA-seq). RNA-seq is free from many limitations of other transcriptomic approaches, such as microarray and tag-based sequencing method. Although RNA-seq has only been available for a short time, studies using this method have completely changed our perspective of the breadth and depth of eukaryotic transcriptomes. In terms of the transcriptomics of teleost fishes, both model and non-model species have benefited from the RNA-seq approach and have undergone tremendous advances in the past several years. RNA-seq has helped not only in mapping and annotating fish transcriptome but also in our understanding of many biological processes in fish, such as development, adaptive evolution, host immune response, and stress response. In this review, we first provide an overview of each step of RNA-seq from library construction to the bioinformatic analysis of the data. We then summarize and discuss the recent biological insights obtained from the RNA-seq studies in a variety of fish species. PMID:24380445

  9. Granatum: a graphical single-cell RNA-Seq analysis pipeline for genomics scientists.

    PubMed

    Zhu, Xun; Wolfgruber, Thomas K; Tasato, Austin; Arisdakessian, Cédric; Garmire, David G; Garmire, Lana X

    2017-12-05

    Single-cell RNA sequencing (scRNA-Seq) is an increasingly popular platform to study heterogeneity at the single-cell level. Computational methods to process scRNA-Seq data are not very accessible to bench scientists as they require a significant amount of bioinformatic skills. We have developed Granatum, a web-based scRNA-Seq analysis pipeline to make analysis more broadly accessible to researchers. Without a single line of programming code, users can click through the pipeline, setting parameters and visualizing results via the interactive graphical interface. Granatum conveniently walks users through various steps of scRNA-Seq analysis. It has a comprehensive list of modules, including plate merging and batch-effect removal, outlier-sample removal, gene-expression normalization, imputation, gene filtering, cell clustering, differential gene expression analysis, pathway/ontology enrichment analysis, protein network interaction visualization, and pseudo-time cell series construction. Granatum enables broad adoption of scRNA-Seq technology by empowering bench scientists with an easy-to-use graphical interface for scRNA-Seq data analysis. The package is freely available for research use at http://garmiregroup.org/granatum/app.

  10. Simultaneous measurement of chromatin accessibility, DNA methylation, and nucleosome phasing in single cells

    PubMed Central

    Pott, Sebastian

    2017-01-01

    Gaining insights into the regulatory mechanisms that underlie the transcriptional variation observed between individual cells necessitates the development of methods that measure chromatin organization in single cells. Here I adapted Nucleosome Occupancy and Methylome-sequencing (NOMe-seq) to measure chromatin accessibility and endogenous DNA methylation in single cells (scNOMe-seq). scNOMe-seq recovered characteristic accessibility and DNA methylation patterns at DNase hypersensitive sites (DHSs). An advantage of scNOMe-seq is that sequencing reads are sampled independently of the accessibility measurement. scNOMe-seq therefore controlled for fragment loss, which enabled direct estimation of the fraction of accessible DHSs within individual cells. In addition, scNOMe-seq provided high resolution of chromatin accessibility within individual loci which was exploited to detect footprints of CTCF binding events and to estimate the average nucleosome phasing distances in single cells. scNOMe-seq is therefore well-suited to characterize the chromatin organization of single cells in heterogeneous cellular mixtures. DOI: http://dx.doi.org/10.7554/eLife.23203.001 PMID:28653622

  11. BrAD-seq: Breath Adapter Directional sequencing: a streamlined, ultra-simple and fast library preparation protocol for strand specific mRNA library construction.

    PubMed

    Townsley, Brad T; Covington, Michael F; Ichihashi, Yasunori; Zumstein, Kristina; Sinha, Neelima R

    2015-01-01

    Next Generation Sequencing (NGS) is driving rapid advancement in biological understanding and RNA-sequencing (RNA-seq) has become an indispensable tool for biology and medicine. There is a growing need for access to these technologies although preparation of NGS libraries remains a bottleneck to wider adoption. Here we report a novel method for the production of strand specific RNA-seq libraries utilizing the terminal breathing of double-stranded cDNA to capture and incorporate a sequencing adapter. Breath Adapter Directional sequencing (BrAD-seq) reduces sample handling and requires far fewer enzymatic steps than most available methods to produce high quality strand-specific RNA-seq libraries. The method we present is optimized for 3-prime Digital Gene Expression (DGE) libraries and can easily extend to full transcript coverage shotgun (SHO) type strand-specific libraries and is modularized to accommodate a diversity of RNA and DNA input materials. BrAD-seq offers a highly streamlined and inexpensive option for RNA-seq libraries.

  12. The Antibody Response of Pregnant Cameroonian Women to VAR2CSA ID1-ID2a, a Small Recombinant Protein Containing the CSA-Binding Site

    PubMed Central

    Babakhanyan, Anna; Leke, Rose G. F.; Salanti, Ali; Bobbili, Naveen; Gwanmesia, Philomina; Leke, Robert J. I.; Quakyi, Isabella A.; Chen, John J.; Taylor, Diane Wallace

    2014-01-01

    In pregnant women, Plasmodium falciparum-infected erythrocytes expressing the VAR2CSA antigen bind to chondroitin sulfate A in the placenta causing placental malaria. The binding site of VAR2CSA is present in the ID1-ID2a region. This study sought to determine if pregnant Cameroonian women naturally acquire antibodies to ID1-ID2a and if antibodies to ID1-ID2a correlate with absence of placental malaria at delivery. Antibody levels to full-length VAR2CSA and ID1-ID2a were measured in plasma samples from 745 pregnant Cameroonian women, 144 Cameroonian men, and 66 US subjects. IgM levels and IgG avidity to ID1-ID2a were also determined. As expected, antibodies to ID1-ID2a were absent in US controls. Although pregnant Cameroonian women developed increasing levels of antibodies to full-length VAR2CSA during pregnancy, no increase in either IgM or IgG to ID1-ID2a was observed. Surprisingly, no differences in antibody levels to ID1-ID2a were detected between Cameroonian men and pregnant women. For example, in rural settings only 8–9% of males had antibodies to full-length VAR2CSA, but 90–96% had antibodies to ID1-ID2a. In addition, no significant difference in the avidity of IgG to ID1-ID2a was found between pregnant women and Cameroonian men, and no correlation between antibody levels at delivery and absence of placental malaria was found. Thus, the response to ID1-ID2a was not pregnancy specific, but predominantly against cross-reactivity epitopes, which may have been induced by other PfEMP1 antigens, malarial antigens, or microbes. Currently, ID1-ID2a is a leading vaccine candidate, since it binds to the CSA with the same affinity as the full-length molecule and elicits binding-inhibitory antibodies in animals. Further studies are needed to determine if the presence of naturally acquired cross-reactive antibodies in women living in malaria endemic countries will alter the response to ID1-ID2a following vaccination with ID1-ID2a. PMID:24505415

  13. cChIP-seq: a robust small-scale method for investigation of histone modifications.

    PubMed

    Valensisi, Cristina; Liao, Jo Ling; Andrus, Colin; Battle, Stephanie L; Hawkins, R David

    2015-12-21

    ChIP-seq is highly utilized for mapping histone modifications that are informative about gene regulation and genome annotations. For example, applying ChIP-seq to histone modifications such as H3K4me1 has facilitated generating epigenomic maps of putative enhancers. This powerful technology, however, is limited in its application by the large number of cells required. ChIP-seq involves extensive manipulation of sample material and multiple reactions with limited quality control at each step; therefore, scaling down the number of cells required has proven challenging. Recently, several methods have been proposed to overcome this limit, but most of these methods require extensive optimization to tailor the protocol to the specific antibody used or number of cells being profiled. Here we describe a robust yet facile method, which we named carrier ChIP-seq (cChIP-seq), for use on limited cell amounts. cChIP-seq employs a DNA-free histone carrier in order to maintain the working ChIP reaction scale, removing the need to tailor reactions to specific amounts of cells or histone modifications to be assayed. We have applied our method to three different histone modifications, H3K4me3, H3K4me1 and H3K27me3, in the K562 cell line, and H3K4me1 in H1 hESCs. We successfully obtained epigenomic maps for these histone modifications starting with as few as 10,000 cells. We compared cChIP-seq data to data generated as part of the ENCODE project. ENCODE data are the reference standard in the field and have been generated starting from tens of millions of cells. Our results show that cChIP-seq successfully recapitulates bulk data. Furthermore, we showed that the differences observed between small-scale ChIP-seq data and ENCODE data are largely due to lab-to-lab variability rather than operating on a reduced scale. Data generated using cChIP-seq are equivalent to reference epigenomic maps from three orders of magnitude more cells. Our method offers a robust and straightforward approach to scale down ChIP-seq to as low as 10,000 cells. The underlying principle of our strategy makes it suitable for being applied to a vast range of chromatin modifications without requiring expensive optimization. Furthermore, our strategy of a DNA-free carrier can be adapted to most ChIP-seq protocols.

  14. Isoprene synthase genes form a monophyletic clade of acyclic terpene synthases in the TPS-B terpene synthase family.

    PubMed

    Sharkey, Thomas D; Gray, Dennis W; Pell, Heather K; Breneman, Steven R; Topper, Lauren

    2013-04-01

    Many plants emit significant amounts of isoprene, which is hypothesized to help leaves tolerate short episodes of high temperature. Isoprene emission is found in all major groups of land plants including mosses, ferns, gymnosperms, and angiosperms; however, within these groups isoprene emission is variable. The patchy distribution of isoprene emission implies an evolutionary pattern characterized by many origins or many losses. To better understand the evolution of isoprene emission, we examine the phylogenetic relationships among isoprene synthase and monoterpene synthase genes in the angiosperms. In this study we identify nine new isoprene synthases within the rosid angiosperms. We also document the capacity of a myrcene synthase in Humulus lupulus to produce isoprene. Isoprene synthases and (E)-β-ocimene synthases form a monophyletic group within the Tps-b clade of terpene synthases. No asterid genes fall within this clade. The chemistry of isoprene synthase and ocimene synthase is similar and likely affects the apparent relationships among Tps-b enzymes. The chronology of rosid evolution suggests a Cretaceous origin followed by many losses of isoprene synthase over the course of evolutionary history. The phylogenetic pattern of Tps-b genes indicates that isoprene emission from non-rosid angiosperms likely arose independently. © 2012 The Author(s). Evolution© 2012 The Society for the Study of Evolution.

  15. Quartz-Seq2: a high-throughput single-cell RNA-sequencing method that effectively uses limited sequence reads.

    PubMed

    Sasagawa, Yohei; Danno, Hiroki; Takada, Hitomi; Ebisawa, Masashi; Tanaka, Kaori; Hayashi, Tetsutaro; Kurisaki, Akira; Nikaido, Itoshi

    2018-03-09

    High-throughput single-cell RNA-seq methods assign limited unique molecular identifier (UMI) counts as gene expression values to single cells from shallow sequence reads and detect limited gene counts. We thus developed a high-throughput single-cell RNA-seq method, Quartz-Seq2, to overcome these issues. Our improvements in the reaction steps make it possible to effectively convert initial reads to UMI counts, at a rate of 30-50%, and detect more genes. To demonstrate the power of Quartz-Seq2, we analyzed approximately 10,000 transcriptomes from in vitro embryonic stem cells and an in vivo stromal vascular fraction with a limited number of reads.
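    The read-to-UMI conversion mentioned in the abstract above can be illustrated with a minimal Python sketch. It assumes that each read has already been assigned a cell barcode, a gene, and a UMI, and that the conversion rate is simply the number of distinct (cell, gene, UMI) combinations divided by the number of reads; the function names and the toy reads are hypothetical and are not taken from the Quartz-Seq2 software.

```python
from collections import defaultdict


def umi_counts(reads):
    """Collapse reads into UMI counts per (cell, gene).

    `reads` is an iterable of (cell_barcode, gene, umi) tuples.
    Reads sharing the same cell, gene and UMI are counted once.
    """
    seen = defaultdict(set)
    for cell, gene, umi in reads:
        seen[(cell, gene)].add(umi)
    return {key: len(umis) for key, umis in seen.items()}


def conversion_rate(reads):
    """Fraction of reads that survive as distinct UMI counts (assumed definition)."""
    reads = list(reads)
    if not reads:
        return 0.0
    return sum(umi_counts(reads).values()) / len(reads)


# Toy data (hypothetical): six reads, two of which are duplicates.
example_reads = [
    ("cellA", "Pou5f1", "AACG"),
    ("cellA", "Pou5f1", "AACG"),  # duplicate of the read above
    ("cellA", "Pou5f1", "TTGC"),
    ("cellA", "Nanog", "GGAT"),
    ("cellB", "Pou5f1", "AACG"),
    ("cellB", "Pou5f1", "AACG"),  # duplicate of the read above
]

if __name__ == "__main__":
    print(umi_counts(example_reads))
    print(conversion_rate(example_reads))
```

    Under this assumed definition, a higher conversion rate means that fewer reads are spent re-sequencing molecules that have already been counted.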

  16. eQTL Mapping Using RNA-seq Data

    PubMed Central

    Hu, Yijuan

    2012-01-01

    As RNA-seq is replacing gene expression microarrays to assess genome-wide transcription abundance, gene expression Quantitative Trait Locus (eQTL) studies using RNA-seq have emerged. RNA-seq delivers two novel features that are important for eQTL studies. First, it provides information on allele-specific expression (ASE), which is not available from gene expression microarrays. Second, it generates unprecedentedly rich data to study RNA-isoform expression. In this paper, we review current methods for eQTL mapping using ASE and discuss some future directions. We also review existing works that use RNA-seq data to study RNA-isoform expression and we discuss the gaps between these works and isoform-specific eQTL mapping. PMID:23667399

  17. SeqMule: automated pipeline for analysis of human exome/genome sequencing data.

    PubMed

    Guo, Yunfei; Ding, Xiaolei; Shen, Yufeng; Lyon, Gholson J; Wang, Kai

    2015-09-18

    Next-generation sequencing (NGS) technology has greatly helped us identify disease-contributory variants for Mendelian diseases. However, users are often faced with issues such as software compatibility, complicated configuration, and no access to high-performance computing facility. Discrepancies exist among aligners and variant callers. We developed a computational pipeline, SeqMule, to perform automated variant calling from NGS data on human genomes and exomes. SeqMule integrates computational-cluster-free parallelization capability built on top of the variant callers, and facilitates normalization/intersection of variant calls to generate consensus set with high confidence. SeqMule integrates 5 alignment tools, 5 variant calling algorithms and accepts various combinations all by one-line command, therefore allowing highly flexible yet fully automated variant calling. In a modern machine (2 Intel Xeon X5650 CPUs, 48 GB memory), when fast turn-around is needed, SeqMule generates annotated VCF files in a day from a 30X whole-genome sequencing data set; when more accurate calling is needed, SeqMule generates consensus call set that improves over single callers, as measured by both Mendelian error rate and consistency. SeqMule supports Sun Grid Engine for parallel processing, offers turn-key solution for deployment on Amazon Web Services, allows quality check, Mendelian error check, consistency evaluation, HTML-based reports. SeqMule is available at http://seqmule.openbioinformatics.org.

  18. TopHat: discovering splice junctions with RNA-Seq

    PubMed Central

    Trapnell, Cole; Pachter, Lior; Salzberg, Steven L.

    2009-01-01

    Motivation: A new protocol for sequencing the messenger RNA in a cell, known as RNA-Seq, generates millions of short sequence fragments in a single run. These fragments, or ‘reads’, can be used to measure levels of gene expression and to identify novel splice variants of genes. However, current software for aligning RNA-Seq data to a genome relies on known splice junctions and cannot identify novel ones. TopHat is an efficient read-mapping algorithm designed to align reads from an RNA-Seq experiment to a reference genome without relying on known splice sites. Results: We mapped the RNA-Seq reads from a recent mammalian RNA-Seq experiment and recovered more than 72% of the splice junctions reported by the annotation-based software from that study, along with nearly 20 000 previously unreported junctions. The TopHat pipeline is much faster than previous systems, mapping nearly 2.2 million reads per CPU hour, which is sufficient to process an entire RNA-Seq experiment in less than a day on a standard desktop computer. We describe several challenges unique to ab initio splice site discovery from RNA-Seq reads that will require further algorithm development. Availability: TopHat is free, open-source software available from http://tophat.cbcb.umd.edu Contact: cole@cs.umd.edu Supplementary information: Supplementary data are available at Bioinformatics online. PMID:19289445
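    The throughput figure quoted above (nearly 2.2 million reads per CPU hour) lends itself to a quick back-of-the-envelope estimate. The Python sketch below assumes the rate is constant and scales linearly with the number of reads; the function name and the 44-million-read experiment size are hypothetical illustrations, not values from the paper.

```python
# Rate taken from the abstract ("nearly 2.2 million reads per CPU hour").
READS_PER_CPU_HOUR = 2.2e6


def estimated_cpu_hours(n_reads, reads_per_cpu_hour=READS_PER_CPU_HOUR):
    """Estimate CPU hours needed to map `n_reads`, assuming a constant rate."""
    if n_reads < 0 or reads_per_cpu_hour <= 0:
        raise ValueError("n_reads must be >= 0 and the rate must be > 0")
    return n_reads / reads_per_cpu_hour


if __name__ == "__main__":
    # Hypothetical experiment size of 44 million reads.
    print(estimated_cpu_hours(44e6))
```

    At this rate, any experiment of up to roughly 52 million reads would fit within 24 CPU hours, which is consistent with the "less than a day" claim for a single desktop processor.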

  19. A quantitative and qualitative comparison of illumina MiSeq and 454 amplicon sequencing for genotyping the highly polymorphic major histocompatibility complex (MHC) in a non-model species.

    PubMed

    Razali, Haslina; O'Connor, Emily; Drews, Anna; Burke, Terry; Westerdahl, Helena

    2017-07-28

    High-throughput sequencing enables high-resolution genotyping of extremely duplicated genes. 454 amplicon sequencing (454) has become the standard technique for genotyping the major histocompatibility complex (MHC) genes in non-model organisms. However, illumina MiSeq amplicon sequencing (MiSeq), which offers a much higher read depth, is now superseding 454. The aim of this study was to quantitatively and qualitatively evaluate the performance of MiSeq in relation to 454 for genotyping MHC class I alleles using a house sparrow (Passer domesticus) dataset with pedigree information. House sparrows provide a good study system for this comparison as their MHC class I genes have been studied previously and, consequently, we had prior expectations concerning the number of alleles per individual. We found that 454 and MiSeq performed equally well in genotyping amplicons with low diversity, i.e. amplicons from individuals that had fewer than 6 alleles. Although there was a higher rate of failure in the 454 dataset in resolving amplicons with higher diversity (6-9 alleles), the same genotypes were identified by both 454 and MiSeq in 98% of cases. We conclude that low diversity amplicons are equally well genotyped using either 454 or MiSeq, but the higher coverage afforded by MiSeq can lead to this approach outperforming 454 in amplicons with higher diversity.

  20. Discovery of Azurin-Like Anticancer Bacteriocins from Human Gut Microbiome through Homology Modeling and Molecular Docking against the Tumor Suppressor p53.

    PubMed

    Nguyen, Chuong; Nguyen, Van Duy

    2016-01-01

    Azurin from Pseudomonas aeruginosa is a known anticancer bacteriocin that can specifically penetrate human cancer cells and induce apoptosis. We hypothesized that pathogenic and commensal bacteria with long-term residence in the human body can produce azurin-like bacteriocins as a weapon against the invasion of cancers. In our previous work, putative bacteriocins were screened from the complete genomes of 66 dominant bacterial species in the human gut microbiota and subsequently characterized by subjecting them to functional annotation algorithms with azurin as a control. We qualitatively predicted 14 putative bacteriocins that possessed functional properties very similar to those of azurin. In this work, we perform a number of quantitative and structure-based analyses, including hydrophobic percentage calculation, structural modeling, and a molecular docking study of the bacteriocins of interest against protein p53, a cancer target. Finally, we identified 8 putative bacteriocins that bind p53 in the same manner as p28-azurin and azurin, of which 3 peptides (p1seq16, p2seq20, and p3seq24) are shared with our previous study and 5 novel ones (p1seq09, p2seq05, p2seq08, p3seq02, and p3seq17) are reported for the first time. These bacteriocins are suggested for further in vitro tests in different neoplastic cell lines.

  1. Discovery of Azurin-Like Anticancer Bacteriocins from Human Gut Microbiome through Homology Modeling and Molecular Docking against the Tumor Suppressor p53

    PubMed Central

    Nguyen, Chuong; Nguyen, Van Duy

    2016-01-01

    Azurin from Pseudomonas aeruginosa is a known anticancer bacteriocin that can specifically penetrate human cancer cells and induce apoptosis. We hypothesized that pathogenic and commensal bacteria with long-term residence in the human body can produce azurin-like bacteriocins as a weapon against the invasion of cancers. In our previous work, putative bacteriocins were screened from the complete genomes of 66 dominant bacterial species in the human gut microbiota and subsequently characterized by subjecting them to functional annotation algorithms with azurin as a control. We qualitatively predicted 14 putative bacteriocins that possessed functional properties very similar to those of azurin. In this work, we perform a number of quantitative and structure-based analyses, including hydrophobic percentage calculation, structural modeling, and a molecular docking study of the bacteriocins of interest against protein p53, a cancer target. Finally, we identified 8 putative bacteriocins that bind p53 in the same manner as p28-azurin and azurin, of which 3 peptides (p1seq16, p2seq20, and p3seq24) are shared with our previous study and 5 novel ones (p1seq09, p2seq05, p2seq08, p3seq02, and p3seq17) are reported for the first time. These bacteriocins are suggested for further in vitro tests in different neoplastic cell lines. PMID:27239476
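    The hydrophobic percentage calculation named in the abstract above can be sketched in a few lines of Python. This is only an illustration under stated assumptions: the set of residues treated as hydrophobic (A, V, I, L, M, F, W) is one common convention and may differ from the one used by the authors, and the example peptide is a made-up sequence, not one of the reported bacteriocins.

```python
# Assumed hydrophobic residue set (one common convention; the authors'
# exact definition is not given in the abstract).
HYDROPHOBIC = frozenset("AVILMFW")


def hydrophobic_percentage(peptide, hydrophobic=HYDROPHOBIC):
    """Return the percentage of residues in `peptide` that are hydrophobic."""
    sequence = peptide.strip().upper()
    if not sequence:
        raise ValueError("peptide sequence is empty")
    n_hydrophobic = sum(1 for residue in sequence if residue in hydrophobic)
    return 100.0 * n_hydrophobic / len(sequence)


if __name__ == "__main__":
    # Hypothetical 8-residue peptide: A, V, I, L are hydrophobic; K, D, E, G are not.
    print(hydrophobic_percentage("AVILKDEG"))
```

    Changing the residue set (for example, adding proline or cysteine) shifts the percentages, so any comparison between peptides should use a single fixed definition.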

  2. Single-nucleus RNA-seq of differentiating human myoblasts reveals the extent of fate heterogeneity

    PubMed Central

    Zeng, Weihua; Jiang, Shan; Kong, Xiangduo; El-Ali, Nicole; Ball, Alexander R.; Ma, Christopher I-Hsing; Hashimoto, Naohiro; Yokomori, Kyoko; Mortazavi, Ali

    2016-01-01

    Myoblasts are precursor skeletal muscle cells that differentiate into fused, multinucleated myotubes. Current single-cell microfluidic methods are not optimized for capturing very large, multinucleated cells such as myotubes. To circumvent the problem, we performed single-nucleus transcriptome analysis. Using immortalized human myoblasts, we performed RNA-seq analysis of single cells (scRNA-seq) and single nuclei (snRNA-seq) and found them comparable, with a distinct enrichment for long non-coding RNAs (lncRNAs) in snRNA-seq. We then compared snRNA-seq of myoblasts before and after differentiation. We observed the presence of mononucleated cells (MNCs) that remained unfused and analyzed separately from multi-nucleated myotubes. We found that while the transcriptome profiles of myoblast and myotube nuclei are relatively homogeneous, MNC nuclei exhibited significant heterogeneity, with the majority of them adopting a distinct mesenchymal state. Primary transcripts for microRNAs (miRNAs) that participate in skeletal muscle differentiation were among the most differentially expressed lncRNAs, which we validated using NanoString. Our study demonstrates that snRNA-seq provides reliable transcriptome quantification for cells that are otherwise not amenable to current single-cell platforms. Our results further indicate that snRNA-seq has unique advantage in capturing nucleus-enriched lncRNAs and miRNA precursors that are useful in mapping and monitoring differential miRNA expression during cellular differentiation. PMID:27566152

  3. Structure-seq2: sensitive and accurate genome-wide profiling of RNA structure in vivo

    PubMed Central

    Ritchey, Laura E.; Su, Zhao; Tang, Yin; Tack, David C.

    2017-01-01

    Abstract RNA serves many functions in biology such as splicing, temperature sensing, and innate immunity. These functions are often determined by the structure of RNA. There is thus a pressing need to understand RNA structure and how it changes during diverse biological processes both in vivo and genome-wide. Here, we present Structure-seq2, which provides nucleotide-resolution RNA structural information in vivo and genome-wide. This optimized version of our original Structure-seq method increases sensitivity by at least 4-fold and improves data quality by minimizing formation of a deleterious by-product, reducing ligation bias, and improving read coverage. We also present a variation of Structure-seq2 in which a biotinylated nucleotide is incorporated during reverse transcription, which greatly facilitates the protocol by eliminating two PAGE purification steps. We benchmark Structure-seq2 on both mRNA and rRNA structure in rice (Oryza sativa). We demonstrate that Structure-seq2 can lead to new biological insights. Our Structure-seq2 datasets uncover hidden breaks in chloroplast rRNA and identify a previously unreported N1-methyladenosine (m1A) in a nuclear-encoded Oryza sativa rRNA. Overall, Structure-seq2 is a rapid, sensitive, and unbiased method to probe RNA in vivo and genome-wide that facilitates new insights into RNA biology. PMID:28637286

  4. Andrographis paniculata transcriptome provides molecular insights into tissue-specific accumulation of medicinal diterpenes.

    PubMed

    Garg, Anchal; Agrawal, Lalit; Misra, Rajesh Chandra; Sharma, Shubha; Ghosh, Sumit

    2015-09-02

    Kalmegh (Andrographis paniculata) has been widely exploited in traditional medicine for the treatment of infectious diseases and health disorders. Ent-labdane-related diterpene (ent-LRD) specialized (i.e., secondary) metabolites of kalmegh, such as andrographolide, neoandrographolide and 14-deoxy-11,12-didehydroandrographolide, are known for a variety of pharmacological activities. However, due to the lack of genomic and transcriptomic information, the underlying molecular basis of ent-LRD biosynthesis has remained largely unknown. To identify candidate genes of the ent-LRD biosynthetic pathway, we performed comparative transcriptome analysis using leaf and root tissues that differentially accumulate ent-LRDs. De novo assembly of Illumina HiSeq2000 platform-generated paired-end sequencing reads resulted in 69,011 leaf and 64,244 root transcripts, which were assembled into a total of 84,628 unique transcripts. Annotation of these transcripts to the Uniprot, Kyoto Encyclopedia of Genes and Genomes (KEGG) and Carbohydrate-Active Enzymes (CAZy) databases identified candidate transcripts of the ent-LRD biosynthetic pathway. These included transcripts that encode enzymes of the plastidial 2C-methyl-D-erythritol-4-phosphate pathway, which provides C5 isoprenoid precursors for ent-LRD biosynthesis, geranylgeranyl diphosphate synthase, class II diterpene synthase (diTPS), cytochrome P450 monooxygenase and glycosyltransferase. Three class II diTPSs (ApCPS1, ApCPS2 and ApCPS3) that showed distinct tissue-specific expression profiles and are phylogenetically related to the dicotyledon ent-copalyl diphosphate synthases were identified. ApCPS1, ApCPS2 and ApCPS3 encode proteins of 832, 817 and 797 amino acids, respectively, with 55-63% identity. Spatio-temporal patterns of transcript and ent-LRD accumulation are consistent with the involvement of ApCPS1 in general (i.e., primary) metabolism for the biosynthesis of the phytohormone gibberellin, ApCPS2 in leaf specialized ent-LRD biosynthesis and ApCPS3 in root diterpene biosynthesis. Moreover, simple sequence repeats (SSRs) that might assist in genotyping and developing specific chemotypes were identified in transcripts of the specialized metabolic pathways, including ent-LRDs. Comparative analysis of root and leaf transcriptomes disclosed novel genes of the ent-LRD biosynthetic pathway, including three class II diTPSs that showed discrete spatio-temporal expression patterns, thus suggesting their participation in distinct diterpene metabolic pathways of kalmegh. Overall, these results will be useful in understanding the molecular basis of medicinal ent-LRD biosynthesis and developing breeding strategies for improving their yields.

  5. Rewiring a secondary metabolite pathway towards itaconic acid production in Aspergillus niger.

    PubMed

    Hossain, Abeer H; Li, An; Brickwedde, Anja; Wilms, Lars; Caspers, Martien; Overkamp, Karin; Punt, Peter J

    2016-07-28

    The industrially relevant filamentous fungus Aspergillus niger is widely used in industry for its capacity to secrete enzymes and organic acids. Biotechnologically produced organic acids promise to be an attractive alternative for the chemical industry to replace petrochemicals. Itaconic acid (IA) has been identified as one of the top twelve building block chemicals which have high potential to be produced by biotechnological means. The IA biosynthesis cluster (cadA, mttA and mfsA) has been elucidated in its natural producer Aspergillus terreus and transferred to A. niger to enable IA production. Here we report the rewiring of a secondary metabolite pathway towards further improved IA production through the overexpression of a putative cytosolic citrate synthase citB in an A. niger strain carrying the IA biosynthesis cluster. We have previously shown that expression of cadA from A. terreus results in itaconic acid production in A. niger AB1.13, albeit at low levels. This low-level production is boosted fivefold by the overexpression of mttA and mfsA in itaconic acid producing AB1.13 CAD background strains. Controlled batch cultivations with AB1.13 CAD + MFS + MTT strains showed increased production of itaconic acid compared with the AB1.13 CAD strain. Moreover, preliminary RNA-Seq analysis of an itaconic acid producing AB1.13 CAD strain has led to the identification of the putative cytosolic citrate synthase citB, which was induced in an IA producing strain. We have overexpressed citB in an AB1.13 CAD + MFS + MTT strain and by doing so hypothesize to have targeted itaconic acid production to the cytosolic compartment. By overexpressing citB in AB1.13 CAD + MFS + MTT strains in controlled batch cultivations we have achieved highly increased titers of up to 26.2 g/L IA with a productivity of 0.35 g/L/h while no citric acid (CA) was produced. Expression of the IA biosynthesis cluster in the Aspergillus niger AB1.13 strain enables IA production. Moreover, in the AB1.13 CAD strain IA production resulted in overexpression of a putative cytosolic citrate synthase citB. Upon overexpression of citB we have achieved titers of up to 26.2 g/L IA with a productivity of 0.35 g/L/h in controlled batch cultivations. By overexpressing citB we have also diminished side product formation and optimized the production pathway towards IA.

  6. TSSAR: TSS annotation regime for dRNA-seq data.

    PubMed

    Amman, Fabian; Wolfinger, Michael T; Lorenz, Ronny; Hofacker, Ivo L; Stadler, Peter F; Findeiß, Sven

    2014-03-27

    Differential RNA sequencing (dRNA-seq) is a high-throughput screening technique designed to examine the architecture of bacterial operons in general and the precise position of transcription start sites (TSS) in particular. Hitherto, dRNA-seq data were analyzed by visualizing the sequencing reads mapped to the reference genome and manually annotating reliable positions. This is very labor intensive and, due to the subjectivity, biased. Here, we present TSSAR, a tool for automated de novo TSS annotation from dRNA-seq data that respects the statistics of dRNA-seq libraries. TSSAR uses the premise that the number of sequencing reads starting at a certain genomic position within a transcriptionally active region follows a Poisson distribution with a parameter that depends on the local strength of expression. The differences of two dRNA-seq library counts thus follow a Skellam distribution. This provides a statistical basis to identify significantly enriched primary transcripts. We assessed the performance by analyzing a publicly available dRNA-seq data set using TSSAR and two simple approaches that utilize user-defined score cutoffs. We evaluated the power of reproducing the manual TSS annotation. Furthermore, the same data set was used to reproduce 74 experimentally validated TSS in H. pylori from reliable techniques such as RACE or primer extension. Both analyses showed that TSSAR outperforms the static cutoff-dependent approaches. Having an automated and efficient tool for analyzing dRNA-seq data facilitates the use of the dRNA-seq technique and promotes its application to more sophisticated analysis. For instance, monitoring the plasticity and dynamics of the transcriptomal architecture triggered by different stimuli and growth conditions becomes possible. The main asset of a novel tool for dRNA-seq analysis that reaches out to a broad user community is usability. As such, we provide TSSAR both as an intuitive RESTful Web service (http://rna.tbi.univie.ac.at/TSSAR), together with a set of post-processing and analysis tools, and as a stand-alone version for use in high-throughput dRNA-seq data analysis pipelines.
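
    The Poisson and Skellam premise in the abstract above can be illustrated with a minimal Python sketch. It is not TSSAR's implementation: the function names, the truncation parameter `terms`, and the idea of passing the two background rates `mu1` and `mu2` directly are all assumptions made for illustration. The abstract does not describe how TSSAR estimates the local rates, and the truncated sums are only adequate for small to moderate rates.

```python
import math

def poisson_pmf(k, mu):
    """P(X = k) for X ~ Poisson(mu), evaluated in log space for stability."""
    if k < 0:
        return 0.0
    if mu == 0:
        return 1.0 if k == 0 else 0.0
    return math.exp(k * math.log(mu) - mu - math.lgamma(k + 1))

def skellam_pmf(d, mu1, mu2, terms=200):
    """P(X1 - X2 = d) for independent X1 ~ Poisson(mu1), X2 ~ Poisson(mu2).

    Sums over X2 = j and X1 = d + j; `terms` truncates the infinite sum
    (an assumption that is fine for small to moderate rates).
    """
    start = max(0, -d)  # keep d + j non-negative
    return sum(poisson_pmf(d + j, mu1) * poisson_pmf(j, mu2)
               for j in range(start, terms))

def skellam_upper_tail(d, mu1, mu2, terms=200):
    """P(X1 - X2 >= d): how surprising an observed count difference d is
    under the (hypothetical) background rates mu1 and mu2."""
    return min(1.0, sum(skellam_pmf(x, mu1, mu2, terms)
                        for x in range(d, d + terms)))

# Hypothetical example: 25 read starts in the enriched library versus 5 in
# the other library, with an assumed background rate of 5 in both.
p_value = skellam_upper_tail(25 - 5, mu1=5.0, mu2=5.0)
print(f"upper-tail probability: {p_value:.3g}")
```

    A position would then be reported as enriched when this tail probability falls below a chosen significance threshold; the threshold and any multiple-testing correction are left open here.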

  7. Guidelines for whole genome bisulphite sequencing of intact and FFPET DNA on the Illumina HiSeq X Ten.

    PubMed

    Nair, Shalima S; Luu, Phuc-Loi; Qu, Wenjia; Maddugoda, Madhavi; Huschtscha, Lily; Reddel, Roger; Chenevix-Trench, Georgia; Toso, Martina; Kench, James G; Horvath, Lisa G; Hayes, Vanessa M; Stricker, Phillip D; Hughes, Timothy P; White, Deborah L; Rasko, John E J; Wong, Justin J-L; Clark, Susan J

    2018-05-28

    Comprehensive genome-wide DNA methylation profiling is critical to gain insights into epigenetic reprogramming during development and disease processes. Among the different genome-wide DNA methylation technologies, whole genome bisulphite sequencing (WGBS) is considered the gold standard for assaying genome-wide DNA methylation at single base resolution. However, the high sequencing cost to achieve the optimal depth of coverage limits its application in both basic and clinical research. To achieve 15× coverage of the human methylome, using WGBS, requires approximately three lanes of 100-bp-paired-end Illumina HiSeq 2500 sequencing. It is important, therefore, for advances in sequencing technologies to be developed to enable cost-effective high-coverage sequencing. In this study, we provide an optimised WGBS methodology, from library preparation to sequencing and data processing, to enable 16-20× genome-wide coverage per single lane of HiSeq X Ten, HCS 3.3.76. To process and analyse the data, we developed a WGBS pipeline (METH10X) that is fast and can call SNPs. We performed WGBS on both high-quality intact DNA and degraded DNA from formalin-fixed paraffin-embedded tissue. First, we compared different library preparation methods on the HiSeq 2500 platform to identify the best method for sequencing on the HiSeq X Ten. Second, we optimised the PhiX and genome spike-ins to achieve higher quality and coverage of WGBS data on the HiSeq X Ten. Third, we performed integrated whole genome sequencing (WGS) and WGBS of the same DNA sample in a single lane of HiSeq X Ten to improve data output. Finally, we compared methylation data from the HiSeq 2500 and HiSeq X Ten and found high concordance (Pearson r > 0.9×). Together we provide a systematic, efficient and complete approach to perform and analyse WGBS on the HiSeq X Ten. Our protocol allows for large-scale WGBS studies at reasonable processing time and cost on the HiSeq X Ten platform.

  8. Classifying next-generation sequencing data using a zero-inflated Poisson model.

    PubMed

    Zhou, Yan; Wan, Xiang; Zhang, Baoxue; Tong, Tiejun

    2018-04-15

    With the development of high-throughput techniques, RNA-sequencing (RNA-seq) is becoming increasingly popular as an alternative for gene expression analysis, such as RNA profiling and classification. Identifying which disease class a new patient belongs to from RNA-seq data has been recognized as a vital problem in medical research. As RNA-seq data are discrete, statistical methods developed for classifying microarray data cannot be readily applied to RNA-seq data classification. Witten proposed a Poisson linear discriminant analysis (PLDA) to classify RNA-seq data in 2011. Note, however, that count datasets are frequently characterized by excess zeros in real RNA-seq or microRNA sequence data (e.g. when the sequencing depth is insufficient, or for small RNAs 18-30 nucleotides in length). Therefore, it is desirable to develop a new model to analyze RNA-seq data with an excess of zeros. In this paper, we propose a Zero-Inflated Poisson Logistic Discriminant Analysis (ZIPLDA) for RNA-seq data with an excess of zeros. The new method assumes that the data are from a mixture of two distributions: one is a point mass at zero, and the other follows a Poisson distribution. We then consider a logistic relation between the probability of observing zeros and the mean of the genes and the sequencing depth in the model. Simulation studies show that the proposed method performs better than, or at least as well as, the existing methods in a wide range of settings. Two real datasets, a breast cancer RNA-seq dataset and a microRNA-seq dataset, are also analyzed, and the results agree with the simulations in that our proposed method outperforms the existing competitors. The software is available at http://www.math.hkbu.edu.hk/~tongt. Contact: xwan@comp.hkbu.edu.hk or tongt@hkbu.edu.hk. Supplementary data are available at Bioinformatics online.
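
    The two-component mixture described in the abstract above can be written down in a few lines of Python. This is a minimal sketch, not the authors' ZIPLDA software. The function names are hypothetical, and the particular logistic form in `zero_prob` (linear in log mean and log depth, with made-up coefficients `b0`, `b1`, `b2`) is an assumption; the abstract only states that the relation is logistic.

```python
import math

def zip_pmf(k, lam, pi_zero):
    """P(X = k) under a zero-inflated Poisson model.

    With probability pi_zero the count is a structural zero (point mass at 0);
    otherwise it is drawn from Poisson(lam), lam > 0.
    """
    poisson = math.exp(k * math.log(lam) - lam - math.lgamma(k + 1))
    if k == 0:
        return pi_zero + (1.0 - pi_zero) * poisson
    return (1.0 - pi_zero) * poisson

def zero_prob(gene_mean, depth, b0=0.0, b1=-1.0, b2=-0.5):
    """Hypothetical logistic link between the structural-zero probability and
    the gene mean and sequencing depth (coefficients are illustrative only)."""
    z = b0 + b1 * math.log(gene_mean) + b2 * math.log(depth)
    return 1.0 / (1.0 + math.exp(-z))

# Illustrative use: a lowly expressed gene gets a larger structural-zero
# probability than a highly expressed one at the same (relative) depth.
for mean in (1.0, 100.0):
    pi0 = zero_prob(mean, depth=1.0)
    print(mean, round(pi0, 4), round(zip_pmf(0, mean, pi0), 4))
```

    A classifier in this spirit would evaluate such a likelihood per gene under each class and assign a sample to the class with the highest total log-likelihood; that step is omitted here.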

  9. Biological classification with RNA-Seq data: Can alternatively spliced transcript expression enhance machine learning classifier?

    PubMed

    Johnson, Nathan T; Dhroso, Andi; Hughes, Katelyn J; Korkin, Dmitry

    2018-06-25

    The extent to which genes are expressed in the cell can be simplistically defined as a function of one or more factors of the environment, lifestyle, and genetics. RNA sequencing (RNA-Seq) is becoming a prevalent approach to quantify gene expression, and is expected to provide better insights into a number of biological and biomedical questions than DNA microarrays. Most importantly, RNA-Seq allows expression to be quantified at both the gene and alternative splicing isoform levels. However, leveraging RNA-Seq data requires the development of new data mining and analytics methods. Supervised machine learning methods are commonly used for biological data analysis, and have recently gained attention for their applications to RNA-Seq data. In this work, we assess the utility of supervised learning methods trained on RNA-Seq data for a diverse range of biological classification tasks. We hypothesize that isoform-level expression data are more informative for biological classification tasks than gene-level expression data. Our large-scale assessment utilizes multiple datasets, organisms, lab groups, and RNA-Seq analysis pipelines. Overall, we performed and assessed 61 biological classification problems that leverage three independent RNA-Seq datasets and include over 2,000 samples that come from multiple organisms, lab groups, and RNA-Seq analyses. These 61 problems include predictions of the tissue type, sex, or age of the sample, healthy or cancerous phenotypes, and the pathological tumor stage for samples from cancerous tissue. For each classification problem, the performance of three normalization techniques and six machine learning classifiers was explored. We find that for every single classification problem, the isoform-based classifiers outperform or are comparable with gene expression based methods. The top-performing supervised learning techniques reached near-perfect classification accuracy, demonstrating the utility of supervised learning for RNA-Seq based data analysis. Published by Cold Spring Harbor Laboratory Press for the RNA Society.

  10. Transformation and model choice for RNA-seq co-expression analysis.

    PubMed

    Rau, Andrea; Maugis-Rabusseau, Cathy

    2018-05-01

    Although a large number of clustering algorithms have been proposed to identify groups of co-expressed genes from microarray data, the question of if and how such methods may be applied to RNA sequencing (RNA-seq) data remains unaddressed. In this work, we investigate the use of data transformations in conjunction with Gaussian mixture models for RNA-seq co-expression analyses, as well as a penalized model selection criterion to select both an appropriate transformation and number of clusters present in the data. This approach has the advantage of accounting for per-cluster correlation structures among samples, which can be strong in RNA-seq data. In addition, it provides a rigorous statistical framework for parameter estimation, an objective assessment of data transformations and number of clusters and the possibility of performing diagnostic checks on the quality and homogeneity of the identified clusters. We analyze four varied RNA-seq data sets to illustrate the use of transformations and model selection in conjunction with Gaussian mixture models. Finally, we propose a Bioconductor package coseq (co-expression of RNA-seq data) to facilitate implementation and visualization of the recommended RNA-seq co-expression analyses.

  11. Exploring the single-cell RNA-seq analysis landscape with the scRNA-tools database.

    PubMed

    Zappia, Luke; Phipson, Belinda; Oshlack, Alicia

    2018-06-25

    As single-cell RNA-sequencing (scRNA-seq) datasets have become more widespread, the number of tools designed to analyse these data has dramatically increased. Navigating the vast sea of tools now available is becoming increasingly challenging for researchers. In order to better facilitate selection of appropriate analysis tools we have created the scRNA-tools database (www.scRNA-tools.org) to catalogue and curate analysis tools as they become available. Our database collects a range of information on each scRNA-seq analysis tool and categorises them according to the analysis tasks they perform. Exploration of this database gives insights into the areas of rapid development of analysis methods for scRNA-seq data. We see that many tools perform tasks specific to scRNA-seq analysis, particularly clustering and ordering of cells. We also find that the scRNA-seq community embraces an open-source and open-science approach, with most tools available under open-source licenses and preprints being extensively used as a means to describe methods. The scRNA-tools database provides a valuable resource for researchers embarking on scRNA-seq analysis and records the growth of the field over time.

  12. Biochemical Characterization and Homology Modeling of Methylbutenol Synthase and Implications for Understanding Hemiterpene Synthase Evolution in Plants*

    PubMed Central

    Gray, Dennis W.; Breneman, Steven R.; Topper, Lauren A.; Sharkey, Thomas D.

    2011-01-01

    2-Methyl-3-buten-2-ol (MBO) is a five-carbon alcohol produced and emitted in large quantities by many species of pine native to western North America. MBO is structurally and biosynthetically related to isoprene and can have an important impact on regional atmospheric chemistry. The gene for MBO synthase was identified from Pinus sabiniana, and the protein encoded was functionally characterized. MBO synthase is a bifunctional enzyme that produces both MBO and isoprene in a ratio of ~90:1. Divalent cations are required for activity, whereas monovalent cations are not. MBO production is enhanced by K+, whereas isoprene production is inhibited by K+ such that, at physiologically relevant [K+], little or no isoprene emission should be detected from MBO-emitting trees. The Km of MBO synthase for dimethylallyl diphosphate (20 mM) is comparable with that observed for angiosperm isoprene synthases and 3 orders of magnitude higher than that observed for monoterpene and sesquiterpene synthases. Phylogenetic analysis showed that MBO synthase falls into the TPS-d1 group (gymnosperm monoterpene synthases) and is most closely related to linalool synthase from Picea abies. Structural modeling showed that up to three phenylalanine residues restrict the size of the active site and may be responsible for making this a hemiterpene synthase rather than a monoterpene synthase. One of these residues is homologous to a Phe residue found in the active site of isoprene synthases. The remaining two Phe residues do not have homologs in isoprene synthases but occupy the same space as a second Phe residue that closes off the isoprene synthase active site. PMID:21504898

  13. 40 CFR 1502.25 - Environmental review and consultation requirements.

    Code of Federal Regulations, 2014 CFR

    2014-07-01

    ... Coordination Act (16 U.S.C. 661 et seq.), the National Historic Preservation Act of 1966 (16 U.S.C. 470 et seq.), the Endangered Species Act of 1973 (16 U.S.C. 1531 et seq.), and other environmental review laws and...

  14. 40 CFR 1502.25 - Environmental review and consultation requirements.

    Code of Federal Regulations, 2011 CFR

    2011-07-01

    ... Coordination Act (16 U.S.C. 661 et seq.), the National Historic Preservation Act of 1966 (16 U.S.C. 470 et seq.), the Endangered Species Act of 1973 (16 U.S.C. 1531 et seq.), and other environmental review laws and...

  15. 40 CFR 1502.25 - Environmental review and consultation requirements.

    Code of Federal Regulations, 2012 CFR

    2012-07-01

    ... Coordination Act (16 U.S.C. 661 et seq.), the National Historic Preservation Act of 1966 (16 U.S.C. 470 et seq.), the Endangered Species Act of 1973 (16 U.S.C. 1531 et seq.), and other environmental review laws and...

  16. 40 CFR 1502.25 - Environmental review and consultation requirements.

    Code of Federal Regulations, 2010 CFR

    2010-07-01

    ... Coordination Act (16 U.S.C. 661 et seq.), the National Historic Preservation Act of 1966 (16 U.S.C. 470 et seq.), the Endangered Species Act of 1973 (16 U.S.C. 1531 et seq.), and other environmental review laws and...

  17. 40 CFR 1502.25 - Environmental review and consultation requirements.

    Code of Federal Regulations, 2013 CFR

    2013-07-01

    ... Coordination Act (16 U.S.C. 661 et seq.), the National Historic Preservation Act of 1966 (16 U.S.C. 470 et seq.), the Endangered Species Act of 1973 (16 U.S.C. 1531 et seq.), and other environmental review laws and...

  18. The complete mitochondrial genome of Haliotis laevigata (Gastropoda: Haliotidae) using MiSeq and HiSeq sequencing.

    PubMed

    Robinson, Nick A; Hall, Nathan E; Ross, Elizabeth M; Cooke, Ira R; Shiel, Brett P; Robinson, Andrew J; Strugnell, Jan M

    2016-01-01

    The mitochondrial genome of greenlip abalone, Haliotis laevigata, is reported. MiSeq and HiSeq sequence data from one individual were assembled to yield a single 16,545 bp contig. The sequence shares 92% identity with the mitochondrial genome of H. rubra, a closely related species that hybridizes with H. laevigata in the wild. The sequence will be useful for determining the maternal contribution to hybrid populations and for investigating population structure and stock-enhancement effectiveness.

  19. Methylation Sensitive Amplification Polymorphism Sequencing (MSAP-Seq)-A Method for High-Throughput Analysis of Differentially Methylated CCGG Sites in Plants with Large Genomes.

    PubMed

    Chwialkowska, Karolina; Korotko, Urszula; Kosinska, Joanna; Szarejko, Iwona; Kwasniewski, Miroslaw

    2017-01-01

    Epigenetic mechanisms, including histone modifications and DNA methylation, mutually regulate chromatin structure, maintain genome integrity, and affect gene expression and transposon mobility. Variations in DNA methylation within plant populations, as well as methylation in response to internal and external factors, are of increasing interest, especially in the crop research field. Methylation Sensitive Amplification Polymorphism (MSAP) is one of the most commonly used methods for assessing DNA methylation changes in plants. This method involves gel-based visualization of PCR fragments from selectively amplified DNA that are cleaved using methylation-sensitive restriction enzymes. In this study, we developed and validated a new method based on the conventional MSAP approach called Methylation Sensitive Amplification Polymorphism Sequencing (MSAP-Seq). We improved the MSAP-based approach by replacing the conventional separation of amplicons on polyacrylamide gels with direct, high-throughput sequencing using Next Generation Sequencing (NGS) and automated data analysis. MSAP-Seq allows for global sequence-based identification of changes in DNA methylation. This technique was validated in Hordeum vulgare. However, MSAP-Seq can be straightforwardly implemented in different plant species, including crops with large, complex and highly repetitive genomes. The incorporation of high-throughput sequencing into MSAP-Seq enables parallel and direct analysis of DNA methylation in hundreds of thousands of sites across the genome. MSAP-Seq provides direct genomic localization of changes and enables quantitative evaluation. We have shown that the MSAP-Seq method specifically targets gene-containing regions and that a single analysis can cover three-quarters of all genes in large genomes. Moreover, MSAP-Seq's simplicity, cost effectiveness, and high-multiplexing capability make this method highly affordable. Therefore, MSAP-Seq can be used for DNA methylation analysis in crop plants with large and complex genomes.

  20. Methylation Sensitive Amplification Polymorphism Sequencing (MSAP-Seq)—A Method for High-Throughput Analysis of Differentially Methylated CCGG Sites in Plants with Large Genomes

    PubMed Central

    Chwialkowska, Karolina; Korotko, Urszula; Kosinska, Joanna; Szarejko, Iwona; Kwasniewski, Miroslaw

    2017-01-01

    Epigenetic mechanisms, including histone modifications and DNA methylation, mutually regulate chromatin structure, maintain genome integrity, and affect gene expression and transposon mobility. Variations in DNA methylation within plant populations, as well as methylation in response to internal and external factors, are of increasing interest, especially in the crop research field. Methylation Sensitive Amplification Polymorphism (MSAP) is one of the most commonly used methods for assessing DNA methylation changes in plants. This method involves gel-based visualization of PCR fragments from selectively amplified DNA that are cleaved using methylation-sensitive restriction enzymes. In this study, we developed and validated a new method based on the conventional MSAP approach called Methylation Sensitive Amplification Polymorphism Sequencing (MSAP-Seq). We improved the MSAP-based approach by replacing the conventional separation of amplicons on polyacrylamide gels with direct, high-throughput sequencing using Next Generation Sequencing (NGS) and automated data analysis. MSAP-Seq allows for global sequence-based identification of changes in DNA methylation. This technique was validated in Hordeum vulgare. However, MSAP-Seq can be straightforwardly implemented in different plant species, including crops with large, complex and highly repetitive genomes. The incorporation of high-throughput sequencing into MSAP-Seq enables parallel and direct analysis of DNA methylation in hundreds of thousands of sites across the genome. MSAP-Seq provides direct genomic localization of changes and enables quantitative evaluation. We have shown that the MSAP-Seq method specifically targets gene-containing regions and that a single analysis can cover three-quarters of all genes in large genomes. Moreover, MSAP-Seq's simplicity, cost effectiveness, and high-multiplexing capability make this method highly affordable. 
Therefore, MSAP-Seq can be used for DNA methylation analysis in crop plants with large and complex genomes. PMID:29250096

  1. ChIP-seq: advantages and challenges of a maturing technology.

    PubMed

    Park, Peter J

    2009-10-01

    Chromatin immunoprecipitation followed by sequencing (ChIP-seq) is a technique for genome-wide profiling of DNA-binding proteins, histone modifications or nucleosomes. Owing to the tremendous progress in next-generation sequencing technology, ChIP-seq offers higher resolution, less noise and greater coverage than its array-based predecessor ChIP-chip. With the decreasing cost of sequencing, ChIP-seq has become an indispensable tool for studying gene regulation and epigenetic mechanisms. In this Review, I describe the benefits and challenges in harnessing this technique with an emphasis on issues related to experimental design and data analysis. ChIP-seq experiments generate large quantities of data, and effective computational analysis will be crucial for uncovering biological mechanisms.

  2. An Annotation Agnostic Algorithm for Detecting Nascent RNA Transcripts in GRO-Seq.

    PubMed

    Azofeifa, Joseph G; Allen, Mary A; Lladser, Manuel E; Dowell, Robin D

    2017-01-01

    We present a fast and simple algorithm to detect nascent RNA transcription in global nuclear run-on sequencing (GRO-seq). GRO-seq is a relatively new protocol that captures nascent transcripts from actively engaged polymerase, providing a direct read-out on bona fide transcription. Most traditional assays, such as RNA-seq, measure steady state RNA levels, which are affected by transcription, post-transcriptional processing, and RNA stability. GRO-seq data, however, present unique analysis challenges that are only beginning to be addressed. Here, we describe a new algorithm, Fast Read Stitcher (FStitch), that takes advantage of two popular machine-learning techniques, hidden Markov models and logistic regression, to classify which regions of the genome are transcribed. Given a small user-defined training set, our algorithm is accurate, robust to varying read depth, annotation agnostic, and fast. Analysis of GRO-seq data without a priori need for annotation uncovers surprising new insights into several aspects of the transcription process.
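
    The pairing of a per-region classifier with a hidden Markov model, as named in the abstract above, can be sketched as follows. This is an illustration only and not FStitch's code: the function name, the fixed `stay` probability, and the assumption that a classifier (for example a logistic regression) has already produced a per-bin probability of transcription are all hypothetical. The sketch runs Viterbi decoding over two states (0 = inactive, 1 = transcribed) so that isolated noisy bins are smoothed into contiguous regions.

```python
import math

def viterbi_two_state(p_active, stay=0.9):
    """Most likely 0/1 state path given per-bin probabilities of transcription.

    p_active: list of probabilities in [0, 1], one per genomic bin
              (assumed to come from an upstream classifier).
    stay:     assumed probability of remaining in the same state between bins.
    """
    eps = 1e-12  # guards against log(0)
    log_stay, log_switch = math.log(stay), math.log(1.0 - stay)
    # Emission log-scores: state 0 is scored with 1 - p, state 1 with p.
    emit = [(math.log(max(1.0 - p, eps)), math.log(max(p, eps)))
            for p in p_active]
    score = [emit[0][0] + math.log(0.5), emit[0][1] + math.log(0.5)]
    backpointers = []
    for e in emit[1:]:
        new_score, pointers = [], []
        for s in (0, 1):
            candidates = [score[r] + (log_stay if r == s else log_switch)
                          for r in (0, 1)]
            best = 0 if candidates[0] >= candidates[1] else 1
            new_score.append(candidates[best] + e[s])
            pointers.append(best)
        score = new_score
        backpointers.append(pointers)
    # Trace back from the best final state.
    state = 0 if score[0] >= score[1] else 1
    path = [state]
    for pointers in reversed(backpointers):
        state = pointers[state]
        path.append(state)
    path.reverse()
    return path

# The weak bin (0.4) inside a strongly transcribed stretch is kept as active.
print(viterbi_two_state([0.1, 0.1, 0.9, 0.4, 0.9, 0.9, 0.1, 0.1]))
```

    In a trained system the transition and emission parameters would be learned from the user-defined training set mentioned in the abstract rather than fixed by hand as they are here.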

  3. Peregrine: A rapid and unbiased method to produce strand-specific RNA-Seq libraries from small quantities of starting material.

    PubMed

    Langevin, Stanley A; Bent, Zachary W; Solberg, Owen D; Curtis, Deanna J; Lane, Pamela D; Williams, Kelly P; Schoeniger, Joseph S; Sinha, Anupama; Lane, Todd W; Branda, Steven S

    2013-04-01

    Use of second generation sequencing (SGS) technologies for transcriptional profiling (RNA-Seq) has revolutionized transcriptomics, enabling measurement of RNA abundances with unprecedented specificity and sensitivity and the discovery of novel RNA species. Preparation of RNA-Seq libraries requires conversion of the RNA starting material into cDNA flanked by platform-specific adaptor sequences. Each of the published methods and commercial kits currently available for RNA-Seq library preparation suffers from at least one major drawback, including long processing times, large starting material requirements, uneven coverage, loss of strand information and high cost. We report the development of a new RNA-Seq library preparation technique that produces representative, strand-specific RNA-Seq libraries from small amounts of starting material in a fast, simple and cost-effective manner. Additionally, we have developed a new quantitative PCR-based assay for precisely determining the number of PCR cycles to perform for optimal enrichment of the final library, a key step in all SGS library preparation workflows.

  4. Polyester: simulating RNA-seq datasets with differential transcript expression.

    PubMed

    Frazee, Alyssa C; Jaffe, Andrew E; Langmead, Ben; Leek, Jeffrey T

    2015-09-01

    Statistical methods development for differential expression analysis of RNA sequencing (RNA-seq) requires software tools to assess accuracy and error rate control. Since true differential expression status is often unknown in experimental datasets, artificially constructed datasets must be utilized, either by generating costly spike-in experiments or by simulating RNA-seq data. Polyester is an R package designed to simulate RNA-seq data, beginning with an experimental design and ending with collections of RNA-seq reads. Its main advantage is the ability to simulate reads indicating isoform-level differential expression across biological replicates for a variety of experimental designs. Data generated by Polyester are a reasonable approximation to real RNA-seq data, and standard differential expression workflows can recover the differential expression set in the simulation by the user. Polyester is freely available from Bioconductor (http://bioconductor.org/). Contact: jtleek@gmail.com. Supplementary data are available at Bioinformatics online. © The Author 2015. Published by Oxford University Press.

  5. Reference sequence (RefSeq) database at NCBI: current status, taxonomic expansion, and functional annotation.

    PubMed

    O'Leary, Nuala A; Wright, Mathew W; Brister, J Rodney; Ciufo, Stacy; Haddad, Diana; McVeigh, Rich; Rajput, Bhanu; Robbertse, Barbara; Smith-White, Brian; Ako-Adjei, Danso; Astashyn, Alexander; Badretdin, Azat; Bao, Yiming; Blinkova, Olga; Brover, Vyacheslav; Chetvernin, Vyacheslav; Choi, Jinna; Cox, Eric; Ermolaeva, Olga; Farrell, Catherine M; Goldfarb, Tamara; Gupta, Tripti; Haft, Daniel; Hatcher, Eneida; Hlavina, Wratko; Joardar, Vinita S; Kodali, Vamsi K; Li, Wenjun; Maglott, Donna; Masterson, Patrick; McGarvey, Kelly M; Murphy, Michael R; O'Neill, Kathleen; Pujar, Shashikant; Rangwala, Sanjida H; Rausch, Daniel; Riddick, Lillian D; Schoch, Conrad; Shkeda, Andrei; Storz, Susan S; Sun, Hanzhen; Thibaud-Nissen, Francoise; Tolstoy, Igor; Tully, Raymond E; Vatsan, Anjana R; Wallin, Craig; Webb, David; Wu, Wendy; Landrum, Melissa J; Kimchi, Avi; Tatusova, Tatiana; DiCuccio, Michael; Kitts, Paul; Murphy, Terence D; Pruitt, Kim D

    2016-01-04

    The RefSeq project at the National Center for Biotechnology Information (NCBI) maintains and curates a publicly available database of annotated genomic, transcript, and protein sequence records (http://www.ncbi.nlm.nih.gov/refseq/). The RefSeq project leverages the data submitted to the International Nucleotide Sequence Database Collaboration (INSDC) against a combination of computation, manual curation, and collaboration to produce a standard set of stable, non-redundant reference sequences. The RefSeq project augments these reference sequences with current knowledge including publications, functional features and informative nomenclature. The database currently represents sequences from more than 55,000 organisms (>4800 viruses, >40,000 prokaryotes and >10,000 eukaryotes; RefSeq release 71), ranging from a single record to complete genomes. This paper summarizes the current status of the viral, prokaryotic, and eukaryotic branches of the RefSeq project, reports on improvements to data access and details efforts to further expand the taxonomic representation of the collection. We also highlight diverse functional curation initiatives that support multiple uses of RefSeq data including taxonomic validation, genome annotation, comparative genomics, and clinical testing. We summarize our approach to utilizing available RNA-Seq and other data types in our manual curation process for vertebrate, plant, and other species, and describe a new direction for prokaryotic genomes and protein name management. Published by Oxford University Press on behalf of Nucleic Acids Research 2015. This work is written by (a) US Government employee(s) and is in the public domain in the US.

  6. RnaSeqSampleSize: real data based sample size estimation for RNA sequencing.

    PubMed

    Zhao, Shilin; Li, Chung-I; Guo, Yan; Sheng, Quanhu; Shyr, Yu

    2018-05-30

    One of the most important and often neglected components of a successful RNA sequencing (RNA-Seq) experiment is sample size estimation. A few negative binomial model-based methods have been developed to estimate sample size based on the parameters of a single gene. However, thousands of genes are quantified and tested for differential expression simultaneously in RNA-Seq experiments. Thus, additional issues should be carefully addressed, including the false discovery rate for multiple statistical tests and the widely distributed read counts and dispersions of different genes. To solve these issues, we developed a sample size and power estimation method named RnaSeqSampleSize, based on the distributions of gene average read counts and dispersions estimated from real RNA-seq data. Datasets from previous, similar experiments such as the Cancer Genome Atlas (TCGA) can be used as a point of reference. Read counts and their dispersions were estimated from the reference's distribution; using that information, we estimated and summarized the power and sample size. RnaSeqSampleSize is implemented in the R language and can be installed from the Bioconductor website. A user-friendly web graphic interface is provided at http://cqs.mc.vanderbilt.edu/shiny/RnaSeqSampleSize/. RnaSeqSampleSize provides a convenient and powerful way to estimate power and sample size for an RNA-Seq experiment. It is also equipped with several unique features, including estimation for genes or pathways of interest, power curve visualization, and parameter optimization.

  7. Analysis of Strand-Specific RNA-Seq Data Using Machine Learning Reveals the Structures of Transcription Units in Clostridium thermocellum

    DOE PAGES

    Chou, Wen-Chi; Ma, Qin; Yang, Shihui; ...

    2015-03-12

    The identification of transcription units (TUs) encoded in a bacterial genome is essential to elucidation of transcriptional regulation of the organism. To gain a detailed understanding of the dynamically composed TU structures, we have used four strand-specific RNA-seq (ssRNA-seq) datasets collected under two experimental conditions to derive the genomic TU organization of Clostridium thermocellum using a machine-learning approach. Our method accurately predicted the genomic boundaries of individual TUs based on two sets of parameters measuring the RNA-seq expression patterns across the genome: expression-level continuity and variance. A total of 2590 distinct TUs are predicted based on the four RNA-seq datasets. Moreover, among the predicted TUs, 44% have multiple genes. We assessed our prediction method on an independent set of RNA-seq data with longer reads. The evaluation confirmed the high quality of the predicted TUs. Functional enrichment analyses on a selected subset of the predicted TUs revealed interesting biology. To demonstrate the generality of the prediction method, we have also applied the method to RNA-seq data collected on Escherichia coli and achieved high prediction accuracies. The TU prediction program named SeqTU is publicly available at https://code.google.com/p/seqtu/. We expect that the predicted TUs can serve as the baseline information for studying transcriptional and post-transcriptional regulation in C. thermocellum and other bacteria.

  8. Geranyl diphosphate synthase large subunit, and methods of use

    DOEpatents

    Croteau, Rodney B.; Burke, Charles C.; Wildung, Mark R.

    2001-10-16

    A cDNA encoding geranyl diphosphate synthase large subunit from peppermint has been isolated and sequenced, and the corresponding amino acid sequence has been determined. Replicable recombinant cloning vehicles are provided which code for geranyl diphosphate synthase large subunit. In another aspect, modified host cells are provided that have been transformed, transfected, infected and/or injected with a recombinant cloning vehicle and/or DNA sequence encoding geranyl diphosphate synthase large subunit. In yet another aspect, the present invention provides isolated, recombinant geranyl diphosphate synthase protein comprising an isolated, recombinant geranyl diphosphate synthase large subunit protein and an isolated, recombinant geranyl diphosphate synthase small subunit protein. Thus, systems and methods are provided for the recombinant expression of geranyl diphosphate synthase.

  9. Molecular architectures of benzoic acid-specific type III polyketide synthases

    PubMed Central

    Stewart, Charles; Woods, Kate; Macias, Greg; Allan, Andrew C.; Noel, Joseph P.

    2017-01-01

    Biphenyl synthase and benzophenone synthase constitute an evolutionarily distinct clade of type III polyketide synthases (PKSs) that use benzoic acid-derived substrates to produce defense metabolites in plants. The use of benzoyl-CoA as an endogenous substrate is unusual for type III PKSs. Moreover, sequence analyses indicate that the residues responsible for the functional diversification of type III PKSs are mutated in benzoic acid-specific type III PKSs. In order to gain a better understanding of structure–function relationships within the type III PKS family, the crystal structures of biphenyl synthase from Malus × domestica and benzophenone synthase from Hypericum androsaemum were compared with the structure of an archetypal type III PKS: chalcone synthase from Malus × domestica. Both biphenyl synthase and benzophenone synthase contain mutations that reshape their active-site cavities to prevent the binding of 4-coumaroyl-CoA and to favor the binding of small hydrophobic substrates. The active-site cavities of biphenyl synthase and benzophenone synthase also contain a novel pocket associated with their chain-elongation and cyclization reactions. Collectively, these results illuminate structural determinants of benzoic acid-specific type III PKSs and expand the understanding of the evolution of specialized metabolic pathways in plants. PMID:29199980

  10. Suites of Terpene Synthases Explain Differential Terpenoid Production in Ginger and Turmeric Tissues

    PubMed Central

    Koo, Hyun Jo; Gang, David R.

    2012-01-01

    The essential oils of ginger (Zingiber officinale) and turmeric (Curcuma longa) contain a large variety of terpenoids, some of which possess anticancer, antiulcer, and antioxidant properties. Despite their importance, only four terpene synthases have been identified from the Zingiberaceae family: (+)-germacrene D synthase and (S)-β-bisabolene synthase from ginger rhizome, and α-humulene synthase and β-eudesmol synthase from shampoo ginger (Zingiber zerumbet) rhizome. We report the identification of 25 mono- and 18 sesquiterpene synthases from ginger and turmeric, with 13 and 11, respectively, being functionally characterized. Novel terpene synthases, (−)-caryolan-1-ol synthase and α-zingiberene/β-sesquiphellandrene synthase, which is responsible for formation of the major sesquiterpenoids in ginger and turmeric rhizomes, were also discovered. These suites of enzymes are responsible for formation of the majority of the terpenoids present in these two plants. Structures of several were modeled, and a comparison of sets of paralogs suggests how the terpene synthases in ginger and turmeric evolved. The most abundant and most important sesquiterpenoids in turmeric rhizomes, (+)-α-turmerone and (+)-β-turmerone, are produced from (−)-α-zingiberene and (−)-β-sesquiphellandrene, respectively, via α-zingiberene/β-sesquiphellandrene oxidase and a still unidentified dehydrogenase. PMID:23272109

  11. Sequence Alignment to Predict Across Species Susceptibility (SeqAPASS) Version 3.0 User Guide

    EPA Science Inventory

    User Guide to describe the complete functionality of the Sequence Alignment to Predict Across Species Susceptibility (SeqAPASS) Version 3.0 online tool. The US Environmental Protection Agency Sequence Alignment to Predict Across Species Susceptibility tool (SeqAPASS; https://seqa...

  12. Iron Deficiency (ID) at Both Birth and 9 Months Predicts Right Frontal EEG Asymmetry in Infancy

    PubMed Central

    Armony-Sivan, Rinat; Zhu, Bingquan; Clark, Katy M.; Richards, Blair; Ji, Chai; Kaciroti, Niko; Shao, Jie

    2016-01-01

    This study considered effects of timing and duration of iron deficiency (ID) on frontal EEG asymmetry in infancy. In healthy term Chinese infants, EEG was recorded at 9 months in three experimental conditions: baseline, peek-a-boo, and stranger approach. Eighty infants provided data for all conditions. Prenatal ID was defined as low cord ferritin or high ZPP/H. Postnatal ID was defined as ≥ two abnormal iron measures at 9 months. Study groups were pre- and postnatal ID, prenatal ID only, postnatal ID only, and not ID. GLM repeated measure analysis showed a main effect for iron group. The pre- and postnatal ID group had negative asymmetry scores, reflecting right frontal EEG asymmetry (mean ±SE: −.18 ±.07) versus prenatal ID only (.00 ±.04), postnatal ID only (.03 ±.04), and not ID (.02 ±.04). Thus, ID at both birth and 9 months was associated with right frontal EEG asymmetry, a neural correlate of behavioral withdrawal and negative emotions. PMID:26668100

  13. Federal Reserve System Semiannual Regulatory Agenda

    Federal Register 2010, 2011, 2012, 2013, 2014

    2010-12-20

    ... 1601 et seq Abstract: In August 2009 the Board issued a proposed rule amending Regulation Z's... 1601 et seq Abstract: On May 22, 2009, the Credit Card Accountability Responsibility and Disclosure Act... Authority: 15 USC 1601 et seq Abstract: The Board proposes to amend Regulation Z, which implements the Truth...

  14. 30 CFR 905.816 - Performance standards-Surface mining activities.

    Code of Federal Regulations, 2013 CFR

    2013-07-01

    ... Quality Control Act, Cal. Pub. Res. Code section 13000 et seq.; the California Water Code section 1200 et seq.; the California Air Pollution Control Laws, Cal. Health & Safety Code section 39000 et seq.; the..., DEPARTMENT OF THE INTERIOR PROGRAMS FOR THE CONDUCT OF SURFACE MINING OPERATIONS WITHIN EACH STATE CALIFORNIA...

  15. 30 CFR 905.817 - Performance standards-Underground mining activities.

    Code of Federal Regulations, 2014 CFR

    2014-07-01

    ... Quality Control Act, Cal. Pub. Res. Code section 13000 et seq.; the California Water Code section 1200 et seq.; the California Air Pollution Control Laws, Cal. Health & Safety Code section 39000 et seq.; the..., DEPARTMENT OF THE INTERIOR PROGRAMS FOR THE CONDUCT OF SURFACE MINING OPERATIONS WITHIN EACH STATE CALIFORNIA...

  16. 30 CFR 905.817 - Performance standards-Underground mining activities.

    Code of Federal Regulations, 2012 CFR

    2012-07-01

    ... Quality Control Act, Cal. Pub. Res. Code section 13000 et seq.; the California Water Code section 1200 et seq.; the California Air Pollution Control Laws, Cal. Health & Safety Code section 39000 et seq.; the..., DEPARTMENT OF THE INTERIOR PROGRAMS FOR THE CONDUCT OF SURFACE MINING OPERATIONS WITHIN EACH STATE CALIFORNIA...

  17. 30 CFR 905.816 - Performance standards-Surface mining activities.

    Code of Federal Regulations, 2011 CFR

    2011-07-01

    ... Quality Control Act, Cal. Pub. Res. Code section 13000 et seq.; the California Water Code section 1200 et seq.; the California Air Pollution Control Laws, Cal. Health & Safety Code section 39000 et seq.; the..., DEPARTMENT OF THE INTERIOR PROGRAMS FOR THE CONDUCT OF SURFACE MINING OPERATIONS WITHIN EACH STATE CALIFORNIA...

  18. 30 CFR 905.816 - Performance standards-Surface mining activities.

    Code of Federal Regulations, 2012 CFR

    2012-07-01

    ... Quality Control Act, Cal. Pub. Res. Code section 13000 et seq.; the California Water Code section 1200 et seq.; the California Air Pollution Control Laws, Cal. Health & Safety Code section 39000 et seq.; the..., DEPARTMENT OF THE INTERIOR PROGRAMS FOR THE CONDUCT OF SURFACE MINING OPERATIONS WITHIN EACH STATE CALIFORNIA...

  19. 30 CFR 905.817 - Performance standards-Underground mining activities.

    Code of Federal Regulations, 2013 CFR

    2013-07-01

    ... Quality Control Act, Cal. Pub. Res. Code section 13000 et seq.; the California Water Code section 1200 et seq.; the California Air Pollution Control Laws, Cal. Health & Safety Code section 39000 et seq.; the..., DEPARTMENT OF THE INTERIOR PROGRAMS FOR THE CONDUCT OF SURFACE MINING OPERATIONS WITHIN EACH STATE CALIFORNIA...

  20. 30 CFR 905.816 - Performance standards-Surface mining activities.

    Code of Federal Regulations, 2014 CFR

    2014-07-01

    ... Quality Control Act, Cal. Pub. Res. Code section 13000 et seq.; the California Water Code section 1200 et seq.; the California Air Pollution Control Laws, Cal. Health & Safety Code section 39000 et seq.; the..., DEPARTMENT OF THE INTERIOR PROGRAMS FOR THE CONDUCT OF SURFACE MINING OPERATIONS WITHIN EACH STATE CALIFORNIA...

  1. 30 CFR 905.817 - Performance standards-Underground mining activities.

    Code of Federal Regulations, 2011 CFR

    2011-07-01

    ... Quality Control Act, Cal. Pub. Res. Code section 13000 et seq.; the California Water Code section 1200 et seq.; the California Air Pollution Control Laws, Cal. Health & Safety Code section 39000 et seq.; the..., DEPARTMENT OF THE INTERIOR PROGRAMS FOR THE CONDUCT OF SURFACE MINING OPERATIONS WITHIN EACH STATE CALIFORNIA...

  2. Additional annotation of the pig transcriptome using integrated Iso-seq and Illumina RNA-seq analysis

    USDA-ARS's Scientific Manuscript database

    Alternative splicing is a well-known phenomenon that dramatically increases eukaryotic transcriptome diversity. The extent of mRNA isoform diversity among porcine tissues was assessed using Pacific Biosciences single-molecule long-read isoform sequencing (Iso-Seq) and Illumina short read sequencing ...

  3. Experimental Design and Power Calculation for RNA-seq Experiments.

    PubMed

    Wu, Zhijin; Wu, Hao

    2016-01-01

    Power calculation is a critical component of RNA-seq experimental design. The flexibility of RNA-seq experiments and the wide dynamic range of transcription they measure make RNA-seq an attractive technology for whole transcriptome analysis. These features, in addition to the high dimensionality of RNA-seq data, add complexity to experimental design, making an analytical power calculation no longer realistic. In this chapter we review the major factors that influence the statistical power of detecting differential expression, and give examples of power assessment using the R package PROPER.
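
    Because the abstract argues that analytical power formulas break down for RNA-seq, a simulation-based estimate may help make the idea concrete. The sketch below is not PROPER and does not reproduce its method: it simulates a single gene with negative binomial counts and applies a normal-approximation test on log2 counts, with no false discovery rate control. The function names (sample_nb, estimate_power) and all parameter values are assumptions chosen for illustration.

```python
import math
import random
from statistics import NormalDist, mean, variance


def sample_nb(rng, mu, dispersion):
    """Draw one negative binomial count as a gamma-Poisson mixture.

    The resulting count has mean mu and variance mu + dispersion * mu**2.
    """
    # Gamma with shape 1/dispersion and scale mu*dispersion has mean mu.
    lam = rng.gammavariate(1.0 / dispersion, mu * dispersion)
    # Knuth's Poisson sampler; adequate for the modest means used here.
    limit, k, p = math.exp(-lam), 0, 1.0
    while True:
        p *= rng.random()
        if p <= limit:
            return k
        k += 1


def estimate_power(n_per_group, mu, fold_change, dispersion,
                   alpha=0.05, n_sim=300, seed=0):
    """Fraction of simulated experiments in which one gene is called different.

    Simplification (assumption): a two-sided z-test on log2(count + 1) with
    unpooled variances, instead of a negative binomial test.
    """
    rng = random.Random(seed)
    z_crit = NormalDist().inv_cdf(1 - alpha / 2)
    hits = 0
    for _ in range(n_sim):
        a = [math.log2(sample_nb(rng, mu, dispersion) + 1)
             for _ in range(n_per_group)]
        b = [math.log2(sample_nb(rng, mu * fold_change, dispersion) + 1)
             for _ in range(n_per_group)]
        se = math.sqrt(variance(a) / n_per_group + variance(b) / n_per_group)
        if se > 0 and abs(mean(a) - mean(b)) / se > z_crit:
            hits += 1
    return hits / n_sim


if __name__ == "__main__":
    for n in (3, 6, 12):
        print(n, estimate_power(n, mu=50, fold_change=2.0, dispersion=0.1))
```

    Running the loop for several group sizes gives a rough power curve for one gene; a realistic assessment would repeat this over the whole distribution of gene means and dispersions and account for multiple testing.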

  4. Single-cell Transcriptome Study as Big Data

    PubMed Central

    Yu, Pingjian; Lin, Wei

    2016-01-01

    The rapid growth of single-cell RNA-seq studies (scRNA-seq) demands efficient data storage, processing, and analysis. Big-data technology provides a framework that facilitates the comprehensive discovery of biological signals from inter-institutional scRNA-seq datasets. The strategies to solve the stochastic and heterogeneous single-cell transcriptome signal are discussed in this article. After extensively reviewing the available big-data applications of next-generation sequencing (NGS)-based studies, we propose a workflow that accounts for the unique characteristics of scRNA-seq data and primary objectives of single-cell studies. PMID:26876720

  5. NGScloud: RNA-seq analysis of non-model species using cloud computing.

    PubMed

    Mora-Márquez, Fernando; Vázquez-Poletti, José Luis; López de Heredia, Unai

    2018-05-03

    RNA-seq analysis usually requires large computing infrastructures. NGScloud is a bioinformatic system developed to analyze RNA-seq data using the cloud computing services of Amazon, which permit access to ad hoc computing infrastructure scaled according to the complexity of the experiment, so that its costs and times can be optimized. The application provides a user-friendly front-end to operate Amazon's hardware resources, and to control a workflow of RNA-seq analysis oriented to non-model species, incorporating the cluster concept, which allows parallel runs of common RNA-seq analysis programs in several virtual machines for faster analysis. NGScloud is freely available at https://github.com/GGFHF/NGScloud/. A manual detailing installation and how-to-use instructions is available with the distribution. Contact: unai.lopezdeheredia@upm.es.

  6. SeqTU: A web server for identification of bacterial transcription units

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Chen, Xin; Chou, Wen-Chi; Ma, Qin

    A transcription unit (TU) consists of K ≥ 1 consecutive genes on the same strand of a bacterial genome that are transcribed into a single mRNA molecule under certain conditions. Their identification is an essential step in elucidation of transcriptional regulatory networks. We have recently developed a machine-learning method to accurately identify TUs from RNA-seq data, based on two features of the assembled RNA reads: the continuity and stability of RNA-seq coverage across a genomic region. While good performance was achieved by the method on Escherichia coli and Clostridium thermocellum, substantial work is needed to make the program generally applicable to all bacteria, knowing that the program requires organism specific information. A web server, named SeqTU, was developed to automatically identify TUs with given RNA-seq data of any bacterium using a machine-learning approach. The server consists of a number of utility tools, in addition to TU identification, such as data preparation, data quality check and RNA-read mapping. SeqTU provides a user-friendly interface and automated prediction of TUs from given RNA-seq data. Furthermore, the predicted TUs are displayed intuitively using HTML format along with a graphic visualization of the prediction.

  7. SeqTU: A web server for identification of bacterial transcription units

    DOE PAGES

    Chen, Xin; Chou, Wen-Chi; Ma, Qin; ...

    2017-03-07

    A transcription unit (TU) consists of K ≥ 1 consecutive genes on the same strand of a bacterial genome that are transcribed into a single mRNA molecule under certain conditions. Their identification is an essential step in elucidation of transcriptional regulatory networks. We have recently developed a machine-learning method to accurately identify TUs from RNA-seq data, based on two features of the assembled RNA reads: the continuity and stability of RNA-seq coverage across a genomic region. While good performance was achieved by the method on Escherichia coli and Clostridium thermocellum, substantial work is needed to make the program generally applicable to all bacteria, knowing that the program requires organism specific information. A web server, named SeqTU, was developed to automatically identify TUs with given RNA-seq data of any bacterium using a machine-learning approach. The server consists of a number of utility tools, in addition to TU identification, such as data preparation, data quality check and RNA-read mapping. SeqTU provides a user-friendly interface and automated prediction of TUs from given RNA-seq data. Furthermore, the predicted TUs are displayed intuitively using HTML format along with a graphic visualization of the prediction.
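
    The TU definition above (consecutive same-strand genes covered by one transcript) can be illustrated with a toy grouping rule. This is not SeqTU's machine-learning classifier: the sketch simply joins neighbouring genes when they share a strand and every base between them has coverage at or above a threshold. The Gene class, the group_into_units function and the min_cov parameter are hypothetical names introduced only for this example.

```python
from dataclasses import dataclass


@dataclass
class Gene:
    name: str
    start: int   # 0-based, inclusive
    end: int     # 0-based, exclusive
    strand: str  # "+" or "-"


def group_into_units(genes, coverage, min_cov=1):
    """Group genes (sorted by start) into candidate transcription units.

    coverage maps a strand to a list of per-base read depths for that strand.
    Two adjacent genes are joined when they lie on the same strand and the
    gap between them is covered everywhere by at least min_cov reads
    (a crude stand-in for the "continuity" feature in the abstract).
    """
    units = []
    for gene in genes:
        prev = units[-1][-1] if units else None
        if (prev is not None
                and prev.strand == gene.strand
                and all(depth >= min_cov
                        for depth in coverage[gene.strand][prev.end:gene.start])):
            units[-1].append(gene)   # extend the current unit
        else:
            units.append([gene])     # start a new unit
    return units


if __name__ == "__main__":
    genes = [Gene("a", 0, 10, "+"), Gene("b", 15, 25, "+"),
             Gene("c", 30, 40, "+"), Gene("d", 45, 55, "-")]
    plus = [5] * 27 + [0] * 3 + [5] * 30   # coverage drops to zero before gene c
    units = group_into_units(genes, {"+": plus, "-": [5] * 60})
    print([[g.name for g in unit] for unit in units])
```

    A single hard threshold is fragile on real data, which is presumably why the authors train a classifier on both the continuity and the variability of coverage rather than using a fixed cutoff.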

  8. Cardinality enhancement utilizing Sequential Algorithm (SeQ) code in OCDMA system

    NASA Astrophysics Data System (ADS)

    Fazlina, C. A. S.; Rashidi, C. B. M.; Rahman, A. K.; Aljunid, S. A.

    2017-11-01

    Optical Code Division Multiple Access (OCDMA) has become important with the increasing demand for high capacity and speed in optical network communication, because the OCDMA technique can achieve high efficiency and therefore make full use of the fibre bandwidth. In this paper we focus on the Sequential Algorithm (SeQ) code with the AND detection technique, using the Optisystem design tool. The results revealed that SeQ code is capable of eliminating Multiple Access Interference (MAI) and improving Bit Error Rate (BER), Phase Induced Intensity Noise (PIIN) and orthogonality between users in the system. SeQ shows good BER performance and can accommodate 190 simultaneous users, in contrast with existing codes. Thus, SeQ code enhanced the system by about 36% and 111% compared with FCC and DCS code. In addition, SeQ has good BER performance of 10^-25 at 155 Mbps in comparison with the 622 Mbps, 1 Gbps and 2 Gbps bit rates. From the plotted graph, a 155 Mbps bit rate is a suitable speed for FTTH and LAN networks. A conclusion can be drawn based on the superior performance of SeQ code: these codes offer an opportunity for better quality of service in OCDMA systems in optical access networks for future generations' usage.

  9. High-throughput sequencing of human plasma RNA by using thermostable group II intron reverse transcriptases

    PubMed Central

    Qin, Yidan; Yao, Jun; Wu, Douglas C.; Nottingham, Ryan M.; Mohr, Sabine; Hunicke-Smith, Scott; Lambowitz, Alan M.

    2016-01-01

    Next-generation RNA-sequencing (RNA-seq) has revolutionized transcriptome profiling, gene expression analysis, and RNA-based diagnostics. Here, we developed a new RNA-seq method that exploits thermostable group II intron reverse transcriptases (TGIRTs) and used it to profile human plasma RNAs. TGIRTs have higher thermostability, processivity, and fidelity than conventional reverse transcriptases, plus a novel template-switching activity that can efficiently attach RNA-seq adapters to target RNA sequences without RNA ligation. The new TGIRT-seq method enabled construction of RNA-seq libraries from <1 ng of plasma RNA in <5 h. TGIRT-seq of RNA in 1-mL plasma samples from a healthy individual revealed RNA fragments mapping to a diverse population of protein-coding gene and long ncRNAs, which are enriched in intron and antisense sequences, as well as nearly all known classes of small ncRNAs, some of which have never before been seen in plasma. Surprisingly, many of the small ncRNA species were present as full-length transcripts, suggesting that they are protected from plasma RNases in ribonucleoprotein (RNP) complexes and/or exosomes. This TGIRT-seq method is readily adaptable for profiling of whole-cell, exosomal, and miRNAs, and for related procedures, such as HITS-CLIP and ribosome profiling. PMID:26554030

  10. QuickRNASeq lifts large-scale RNA-seq data analyses to the next level of automation and interactive visualization.

    PubMed

    Zhao, Shanrong; Xi, Li; Quan, Jie; Xi, Hualin; Zhang, Ying; von Schack, David; Vincent, Michael; Zhang, Baohong

    2016-01-08

    RNA sequencing (RNA-seq), a next-generation sequencing technique for transcriptome profiling, is being increasingly used, in part driven by the decreasing cost of sequencing. Nevertheless, the analysis of the massive amounts of data generated by large-scale RNA-seq remains a challenge. Multiple algorithms pertinent to basic analyses have been developed, and there is an increasing need to automate the use of these tools so as to obtain results in an efficient and user-friendly manner. Increased automation and improved visualization of the results will help make the results and findings of the analyses readily available to experimental scientists. By combining the best open source tools developed for RNA-seq data analyses and the most advanced web 2.0 technologies, we have implemented QuickRNASeq, a pipeline for large-scale RNA-seq data analyses and visualization. The QuickRNASeq workflow consists of three main steps. In Step #1, each individual sample is processed, including mapping RNA-seq reads to a reference genome, counting the numbers of mapped reads, quality control of the aligned reads, and SNP (single nucleotide polymorphism) calling. Step #1 is computationally intensive, and can be processed in parallel. In Step #2, the results from individual samples are merged, and an integrated and interactive project report is generated. All analysis results in the report are accessible via a single HTML entry webpage. Step #3 is the data interpretation and presentation step. The rich visualization features implemented here allow end users to interactively explore the results of RNA-seq data analyses, and to gain more insights into RNA-seq datasets. In addition, we used a real world dataset to demonstrate the simplicity and efficiency of QuickRNASeq in RNA-seq data analyses and interactive visualizations. The seamless integration of automated capabilities with interactive visualizations in QuickRNASeq is not available in other published RNA-seq pipelines. The high degree of automation and interactivity in QuickRNASeq leads to a substantial reduction in the time and effort required prior to further downstream analyses and interpretation of the analyses findings. QuickRNASeq advances primary RNA-seq data analyses to the next level of automation, and is mature for public release and adoption.

  11. Decreasing patient identification band errors by standardizing processes.

    PubMed

    Walley, Susan Chu; Berger, Stephanie; Harris, Yolanda; Gallizzi, Gina; Hayes, Leslie

    2013-04-01

    Patient identification (ID) bands are an essential component in patient ID. Quality improvement methodology has been applied as a model to reduce ID band errors although previous studies have not addressed standardization of ID bands. Our specific aim was to decrease ID band errors by 50% in a 12-month period. The Six Sigma DMAIC (define, measure, analyze, improve, and control) quality improvement model was the framework for this study. ID bands at a tertiary care pediatric hospital were audited from January 2011 to January 2012 with continued audits to June 2012 to confirm the new process was in control. After analysis, the major improvement strategy implemented was standardization of styles of ID bands and labels. Additional interventions included educational initiatives regarding the new ID band processes and disseminating institutional and nursing unit data. A total of 4556 ID bands were audited with a preimprovement ID band error average rate of 9.2%. Significant variation in the ID band process was observed, including styles of ID bands. Interventions were focused on standardization of the ID band and labels. The ID band error rate improved to 5.2% in 9 months (95% confidence interval: 2.5-5.5; P < .001) and was maintained for 8 months. Standardization of ID bands and labels in conjunction with other interventions resulted in a statistical decrease in ID band error rates. This decrease in ID band error rates was maintained over the subsequent 8 months.

  12. Inhibition of glycogen-synthase kinase 3 stimulates glycogen synthase and glucose transport by distinct mechanisms in 3T3-L1 adipocytes.

    PubMed

    Oreña, S J; Torchia, A J; Garofalo, R S

    2000-05-26

    The role of glycogen-synthase kinase 3 (GSK3) in insulin-stimulated glucose transport and glycogen synthase activation was investigated in 3T3-L1 adipocytes. GSK3 protein was clearly present in adipocytes and was found to be more abundant than in muscle and liver cell lines. The selective GSK3 inhibitor, LiCl, stimulated glucose transport and glycogen synthase activity (20 and 65%, respectively, of the maximal (1 µM) insulin response) and potentiated the responses to a submaximal concentration (1 nM) of insulin. LiCl- and insulin-stimulated glucose transport were abolished by the phosphatidylinositol 3-kinase (PI3-kinase) inhibitor, wortmannin; however, LiCl stimulation of glycogen synthase was not. In contrast to the rapid stimulation of glucose transport by insulin, transport stimulated by LiCl increased gradually over 3-5 h reaching 40% of the maximal insulin-stimulated level. Both LiCl- and insulin-stimulated glycogen synthase activity were maximal at 25 min. However, insulin-stimulated glycogen synthase activity returned to basal after 2 h, coincident with reactivation of GSK3. After a 2-h exposure to insulin, glycogen synthase was refractory to restimulation with insulin, indicating selective desensitization of this pathway. However, LiCl could partially stimulate glycogen synthase in desensitized cells. Furthermore, coincubation with LiCl during the 2 h exposure to insulin completely blocked desensitization of glycogen synthase activity. In summary, inhibition of GSK3 by LiCl: 1) stimulated glycogen synthase activity directly and independently of PI3-kinase, 2) stimulated glucose transport at a point upstream of PI3-kinase, 3) stimulated glycogen synthase activity in desensitized cells, and 4) prevented desensitization of glycogen synthase due to chronic insulin treatment. These data are consistent with GSK3 playing a central role in the regulation of glycogen synthase activity and a contributing factor in the regulation of glucose transport in 3T3-L1 adipocytes.

  13. 75 FR 44030 - Agency Forms Submitted for OMB Review, Request for Comments

    Federal Register 2010, 2011, 2012, 2013, 2014

    2010-07-27

    ..., ID-4X, Advising of Service/Earnings Requirements for Sickness Benefits, ID-20-1, Advising that Normal..., ID-4K, ID-4Y, ID-20-1, ID-20-2, and ID-204 is required to obtain or retain benefits. One response is... determine (1) the practical utility of the collection; (2) the accuracy of the estimated burden of the...

  14. Sandalwood Fragrance Biosynthesis Involves Sesquiterpene Synthases of Both the Terpene Synthase (TPS)-a and TPS-b Subfamilies, including Santalene Synthases*

    PubMed Central

    Jones, Christopher G.; Moniodis, Jessie; Zulak, Katherine G.; Scaffidi, Adrian; Plummer, Julie A.; Ghisalberti, Emilio L.; Barbour, Elizabeth L.; Bohlmann, Jörg

    2011-01-01

    Sandalwood oil is one of the world's most highly prized fragrances. To identify the genes and encoded enzymes responsible for santalene biosynthesis, we cloned and characterized three orthologous terpene synthase (TPS) genes, SaSSy, SauSSy, and SspiSSy, from three divergent sandalwood species: Santalum album, S. austrocaledonicum, and S. spicatum, respectively. The encoded enzymes catalyze the formation of α-, β-, epi-β-santalene, and α-exo-bergamotene from (E,E)-farnesyl diphosphate (E,E-FPP). Recombinant SaSSy was additionally tested with (Z,Z)-farnesyl diphosphate (Z,Z-FPP) and, remarkably, found to produce a mixture of α-endo-bergamotene, α-santalene, (Z)-β-farnesene, epi-β-santalene, and β-santalene. Additional cDNAs that encode bisabolene/bisabolol synthases were also cloned and functionally characterized from these three species. Both the santalene synthases and the bisabolene/bisabolol synthases reside in the TPS-b phylogenetic clade, which is more commonly associated with angiosperm monoterpene synthases. An orthologous set of TPS-a synthases responsible for formation of macrocyclic and bicyclic sesquiterpenes was characterized. Strict functionality and limited sequence divergence in the santalene and bisabolene synthases are in contrast to the TPS-a synthases, suggesting these compounds have played a significant role in the evolution of the Santalum genus. PMID:21454632

  15. [BIOINFORMATIC SEARCH AND PHYLOGENETIC ANALYSIS OF THE CELLULOSE SYNTHASE GENES OF FLAX (LINUM USITATISSIMUM)].

    PubMed

    Pydiura, N A; Bayer, G Ya; Galinousky, D V; Yemets, A I; Pirko, Ya V; Podvitski, T A; Anisimova, N V; Khotyleva, L V; Kilchevsky, A V; Blume, Ya B

    2015-01-01

    A bioinformatic search of sequences encoding cellulose synthase genes in the flax genome, and their comparison to dicot orthologs, was carried out. The analysis revealed 32 cellulose synthase gene candidates, 16 of which are highly likely to encode cellulose synthases, while the remaining 16 encode cellulose synthase-like proteins (Csl). Phylogenetic analysis of the gene products of cellulose synthase genes allowed distinguishing 6 groups of cellulose synthase genes of different classes: CesA1/10, CesA3, CesA4, CesA5/6/2/9, CesA7 and CesA8. Paralogous sequences within classes CesA1/10 and CesA5/6/2/9, which are associated with primary cell wall formation, are characterized by a greater similarity within these classes than orthologous sequences, whereas the genes controlling the biosynthesis of secondary cell wall cellulose form distinct clades: CesA4, CesA7, and CesA8. The analysis of the 16 identified flax cellulose synthase gene candidates shows the presence of at least 12 different cellulose synthase gene variants in the flax genome, which are represented in all six clades of cellulose synthase genes. Thus, at this point genes of all ten known cellulose synthase classes are identified in the flax genome, but their correct classification requires additional research.

  16. Missing data and technical variability in single-cell RNA-sequencing experiments.

    PubMed

    Hicks, Stephanie C; Townes, F William; Teng, Mingxiang; Irizarry, Rafael A

    2017-11-06

    Until recently, high-throughput gene expression technology, such as RNA-Sequencing (RNA-seq) required hundreds of thousands of cells to produce reliable measurements. Recent technical advances permit genome-wide gene expression measurement at the single-cell level. Single-cell RNA-Seq (scRNA-seq) is the most widely used and numerous publications are based on data produced with this technology. However, RNA-seq and scRNA-seq data are markedly different. In particular, unlike RNA-seq, the majority of reported expression levels in scRNA-seq are zeros, which could be either biologically-driven, genes not expressing RNA at the time of measurement, or technically-driven, genes expressing RNA, but not at a sufficient level to be detected by sequencing technology. Another difference is that the proportion of genes reporting the expression level to be zero varies substantially across single cells compared to RNA-seq samples. However, it remains unclear to what extent this cell-to-cell variation is being driven by technical rather than biological variation. Furthermore, while systematic errors, including batch effects, have been widely reported as a major challenge in high-throughput technologies, these issues have received minimal attention in published studies based on scRNA-seq technology. Here, we use an assessment experiment to examine data from published studies and demonstrate that systematic errors can explain a substantial percentage of observed cell-to-cell expression variability. Specifically, we present evidence that some of these reported zeros are driven by technical variation by demonstrating that scRNA-seq produces more zeros than expected and that this bias is greater for lower expressed genes. In addition, this missing data problem is exacerbated by the fact that this technical variation varies cell-to-cell. Then, we show how this technical cell-to-cell variability can be confused with novel biological results. 
Finally, we demonstrate and discuss how batch-effects and confounded experiments can intensify the problem. © The Author 2017. Published by Oxford University Press. All rights reserved. For permissions, please e-mail: journals.permissions@oup.com.
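    The comparison of zero proportions across cells described in this abstract can be illustrated with a minimal Python sketch. The count matrix and the function name `zero_fraction_per_cell` are hypothetical and are not taken from the study; the sketch only shows how a per-cell fraction of zero counts might be computed.

```python
from typing import List


def zero_fraction_per_cell(counts: List[List[int]]) -> List[float]:
    """For each cell (column), return the fraction of genes (rows) with a zero count."""
    n_genes = len(counts)
    n_cells = len(counts[0])
    return [
        sum(1 for g in range(n_genes) if counts[g][c] == 0) / n_genes
        for c in range(n_cells)
    ]


# Hypothetical genes-by-cells count matrix (4 genes x 3 cells).
toy_counts = [
    [0, 5, 0],
    [3, 0, 0],
    [0, 2, 1],
    [7, 4, 0],
]

fractions = zero_fraction_per_cell(toy_counts)
print(fractions)
```

    A wide spread in these per-cell fractions is the kind of cell-to-cell variation the abstract refers to; the sketch by itself cannot tell technical zeros from biological ones.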

  17. Timing, duration, and severity of iron deficiency in early development and motor outcomes at 9 months

    PubMed Central

    Santos, Denise CC; Angulo-Barroso, Rosa M; Li, Ming; Bian, Yang; Sturza, Julie; Richards, Blair; Lozoff, Betsy

    2017-01-01

    BACKGROUND/OBJECTIVES Poorer motor development is reported in infants with iron deficiency (ID). The role of timing, duration and severity is unclear. We assessed relations between ID timing, duration, and severity and gross motor scores, neurological integrity, and motor behavior quality at 9 months. METHODS Iron status was determined at birth and 9 months in otherwise healthy term Chinese infants. The 9-month motor evaluation included the Peabody Developmental Motor Scale (PDMS-2), Infant Neurological International Battery (INFANIB), and motor quality factor. Motor outcomes were analyzed by ID timing (fetal-neonatal, infancy), duration, and severity. For severity, we also considered maternal iron status. RESULTS Data were available for 1194 infants. Iron status was classified as fetal-neonatal and infancy ID (n=253), fetal-neonatal ID (n=256), infancy ID (n=288), and not ID (n=397). Compared with not ID, infants with fetal-neonatal or infancy ID had lower locomotion scores (effect size ds=0.19, 0.18) and those with ID in both periods (longer duration) had lower locomotion and overall PDMS-2 gross motor scores (ds=0.20, 0.18); ID groups did not differ. More severe ID in late pregnancy was associated with lower INFANIB Vestibular function (p=0.01), and total score (p=0.03). More severe ID in infancy was associated with lower scores for locomotion (p=0.03), overall gross motor (p=0.05). CONCLUSIONS Fetal-neonatal and/or infancy ID was associated with lower overall gross motor development and locomotion test scores at 9 months. Associations with ID severity varied by ID timing: more severe ID in late pregnancy, poorer neurological integrity; more severe ID in infancy, poorer gross motor development. PMID:29235557

  18. ASAP: a web-based platform for the analysis and interactive visualization of single-cell RNA-seq data

    PubMed Central

    Gardeux, Vincent; David, Fabrice P. A.; Shajkofci, Adrian; Schwalie, Petra C.; Deplancke, Bart

    2017-01-01

    Motivation: Single-cell RNA-sequencing (scRNA-seq) allows whole transcriptome profiling of thousands of individual cells, enabling the molecular exploration of tissues at the cellular level. Such analytical capacity is of great interest to many research groups in the world, yet these groups often lack the expertise to handle complex scRNA-seq datasets. Results: We developed a fully integrated, web-based platform aimed at the complete analysis of scRNA-seq data post genome alignment: from the parsing, filtering and normalization of the input count data files, to the visual representation of the data, identification of cell clusters, differentially expressed genes (including cluster-specific marker genes), and functional gene set enrichment. This Automated Single-cell Analysis Pipeline (ASAP) combines a wide range of commonly used algorithms with sophisticated visualization tools. Compared with existing scRNA-seq analysis platforms, researchers (including those lacking computational expertise) are able to interact with the data in a straightforward fashion and in real time. Furthermore, given the overlap between scRNA-seq and bulk RNA-seq analysis workflows, ASAP should conceptually be broadly applicable to any RNA-seq dataset. As a validation, we demonstrate how we can use ASAP to simply reproduce the results from a single-cell study of 91 mouse cells involving five distinct cell types. Availability and implementation: The tool is freely available at asap.epfl.ch and R/Python scripts are available at github.com/DeplanckeLab/ASAP. Contact: bart.deplancke@epfl.ch. Supplementary information: Supplementary data are available at Bioinformatics online. PMID:28541377

  19. ASAP: a web-based platform for the analysis and interactive visualization of single-cell RNA-seq data.

    PubMed

    Gardeux, Vincent; David, Fabrice P A; Shajkofci, Adrian; Schwalie, Petra C; Deplancke, Bart

    2017-10-01

    Single-cell RNA-sequencing (scRNA-seq) allows whole transcriptome profiling of thousands of individual cells, enabling the molecular exploration of tissues at the cellular level. Such analytical capacity is of great interest to many research groups in the world, yet these groups often lack the expertise to handle complex scRNA-seq datasets. We developed a fully integrated, web-based platform aimed at the complete analysis of scRNA-seq data post genome alignment: from the parsing, filtering and normalization of the input count data files, to the visual representation of the data, identification of cell clusters, differentially expressed genes (including cluster-specific marker genes), and functional gene set enrichment. This Automated Single-cell Analysis Pipeline (ASAP) combines a wide range of commonly used algorithms with sophisticated visualization tools. Compared with existing scRNA-seq analysis platforms, researchers (including those lacking computational expertise) are able to interact with the data in a straightforward fashion and in real time. Furthermore, given the overlap between scRNA-seq and bulk RNA-seq analysis workflows, ASAP should conceptually be broadly applicable to any RNA-seq dataset. As a validation, we demonstrate how we can use ASAP to simply reproduce the results from a single-cell study of 91 mouse cells involving five distinct cell types. The tool is freely available at asap.epfl.ch and R/Python scripts are available at github.com/DeplanckeLab/ASAP. Contact: bart.deplancke@epfl.ch. Supplementary data are available at Bioinformatics online. © The Author(s) 2017. Published by Oxford University Press.

  20. Deep sequencing of the prothoracic gland transcriptome reveals new players in insect ecdysteroidogenesis

    PubMed Central

    Nakaoka, Takayoshi; Iga, Masatoshi; Yamada, Tetsuya; Koujima, Ikumi; Takeshima, Mika; Zhou, Xiangying; Suzuki, Yutaka; Ogihara, Mari H.; Kataoka, Hiroshi

    2017-01-01

    Ecdysteroids are steroid hormones that induce molting and determine developmental timing in arthropods. In insect larva, the prothoracic gland (PG) is a major organ for ecdysone synthesis and release. Released ecdysone is converted into the active form, 20-hydroxyecdysone (20E) in the peripheral tissues. All processes from ecdysone synthesis and release from the PG to its conversion to 20E are called ecdysteroidogenesis and are under the regulation of numerous factors expressed in the PG and peripheral tissues. Classical genetic approaches and recent transcriptomic screening in the PG identified several genes responsible for ecdysone synthesis and release, whereas the regulatory mechanism remains largely unknown. We analyzed RNA-seq data of the silkworm Bombyx mori PG and employed the fruit fly Drosophila melanogaster GAL4/UAS binary RNAi system to comprehensively screen for genes involved in ecdysone synthesis and/or release. We found that the genes encoding δ-aminolevulinic acid synthase (CG3017/alas) and putative NAD kinase (CG33156) were highly expressed in the PG of both B. mori and D. melanogaster. Neither alas nor CG33156 RNAi-induced larvae could enter into the pupal stage, and they had a lower abundance of the active form ecdysteroids in their prolonged larval stage. These results demonstrated that alas and CG33156 are indispensable for ecdysteroidogenesis. PMID:28257485

  1. RNA-Seq analysis of global transcriptomic changes suggests a role for the MAPK pathway and carbon metabolism in cell wall maintenance in a Saccharomyces cerevisiae FKS1 mutant.

    PubMed

    Huang, Cong; Zhao, Fengguang; Lin, Ying; Zheng, Suiping; Liang, Shuli; Han, Shuangyan

    2018-06-07

    FKS1 encodes a β-1,3-glucan synthase, which is a key player in cell wall assembly in Saccharomyces cerevisiae. Here we analyzed the global transcriptomic changes in the FKS1 mutant to establish a correlation between the changes in the cell wall of the FKS1 mutant and the molecular mechanism of cell wall maintenance. These transcriptomic profiles showed that there are 1151 differentially expressed genes (DEGs) in the FKS1 mutant. Through KEGG pathway analysis of the DEGs, the MAPK pathway and seven pathways involved in carbon metabolism were significantly enriched. We found that the MAPK pathway is activated for FKS1 mutant survival and that the synthesis of cell wall components is reinforced in the FKS1 mutant. Our results confirm that the FKS1 mutant has a β-1,3-glucan defect that affects the cell wall and partly elucidate the molecular mechanism responsible for cell wall synthesis. Our greater understanding of these mechanisms helps to explain how the FKS1 mutant survives, has useful implications for the study of similar pathways in other fungi, and strengthens the theoretical foundation for the regulation of the cell wall in S. cerevisiae. Copyright © 2018 Elsevier Inc. All rights reserved.

  2. Intelligent Design versus Evolution

    PubMed Central

    Aviezer, Nathan

    2010-01-01

    Intelligent Design (ID) burst onto the scene in 1996, with the publication of Darwin’s Black Box by Michael Behe. Since then, there has been a plethora of articles written about ID, both pro and con. However, most of the articles critical of ID deal with peripheral issues, such as whether ID is just another form of creationism or whether ID qualifies as science or whether ID should be taught in public schools. It is our view that the central issue is whether the basic claim of ID is correct. Our goal is fourfold: (I) to show that most of the proposed refutations of ID are unconvincing and/or incorrect, (II) to describe the single fundamental error of ID, (III) to discuss the historic tradition surrounding the ID controversy, showing that ID is an example of a “god-of-the-gaps” argument, and (IV) to place the ID controversy in the larger context of proposed proofs for the existence of God, with the emphasis on Jewish tradition. PMID:23908779

  3. Intelligent Design versus Evolution.

    PubMed

    Aviezer, Nathan

    2010-07-01

    Intelligent Design (ID) burst onto the scene in 1996, with the publication of Darwin's Black Box by Michael Behe. Since then, there has been a plethora of articles written about ID, both pro and con. However, most of the articles critical of ID deal with peripheral issues, such as whether ID is just another form of creationism or whether ID qualifies as science or whether ID should be taught in public schools. It is our view that the central issue is whether the basic claim of ID is correct. Our goal is fourfold: (I) to show that most of the proposed refutations of ID are unconvincing and/or incorrect, (II) to describe the single fundamental error of ID, (III) to discuss the historic tradition surrounding the ID controversy, showing that ID is an example of a "god-of-the-gaps" argument, and (IV) to place the ID controversy in the larger context of proposed proofs for the existence of God, with the emphasis on Jewish tradition.

  4. Molecular cloning and functional expression of geranylgeranyl pyrophosphate synthase from Coleus forskohlii Briq

    PubMed Central

    Engprasert, Surang; Taura, Futoshi; Kawamukai, Makoto; Shoyama, Yukihiro

    2004-01-01

    Background Isopentenyl diphosphate (IPP), a common biosynthetic precursor to the labdane diterpene forskolin, has been biosynthesised via a non-mevalonate pathway. Geranylgeranyl diphosphate (GGPP) synthase is an important branch point enzyme in terpenoid biosynthesis. Therefore, GGPP synthase is thought to be a key enzyme in biosynthesis of forskolin. Herein we report the first confirmation of the GGPP synthase gene in Coleus forskohlii Briq. Results The open reading frame for full-length GGPP synthase is 1,077 nucleotides long and encodes a protein of 359 amino acids with a calculated molecular mass of 39.3 kDa. Alignments of C. forskohlii GGPP synthase amino acid sequences revealed high homology with other plant GGPP synthases. Several highly conserved regions, including two aspartate-rich motifs, were identified. Transient expression of the N-terminal region of C. forskohlii GGPP synthase-GFP fusion protein in tobacco cells demonstrated subcellular localization in the chloroplast. Carotenoid production was observed in Escherichia coli harboring pACCAR25ΔcrtE from Erwinia uredovora and a plasmid carrying C. forskohlii GGPP synthase. These results suggested that the cDNA encoded a functional GGPP synthase. Furthermore, C. forskohlii GGPP synthase expression was strong in leaves, lower in stems, and very low in roots. Conclusion This investigation proposed that forskolin was synthesised via a non-mevalonate pathway. GGPP synthase is thought to be involved in the biosynthesis of forskolin, which is primarily synthesised in the leaves and subsequently accumulates in the stems and roots. PMID:15550168
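    The sequence figures quoted in this abstract can be cross-checked with simple arithmetic, sketched below in Python. The nucleotide count, residue count, and molecular mass come from the abstract; the average residue mass of about 110 Da is a common rule-of-thumb assumption, not a value from the paper. The abstract does not say whether the 1,077 nucleotides include a stop codon.

```python
# Figures quoted in the abstract.
ORF_NUCLEOTIDES = 1077
REPORTED_RESIDUES = 359
REPORTED_MASS_KDA = 39.3

# Assumption: rule-of-thumb average mass of one amino acid residue, in daltons.
AVG_RESIDUE_MASS_DA = 110.0

# Three nucleotides per codon, so 1,077 nt corresponds to 359 codons.
codons, leftover = divmod(ORF_NUCLEOTIDES, 3)

# Rough mass estimate from residue count alone (ignores actual composition).
approx_mass_kda = REPORTED_RESIDUES * AVG_RESIDUE_MASS_DA / 1000.0

print(f"codons: {codons}, leftover nucleotides: {leftover}")
print(f"approximate mass: {approx_mass_kda:.2f} kDa (reported: {REPORTED_MASS_KDA} kDa)")
```

    The codon count matches the reported 359 residues, and the rough estimate lands within about 0.2 kDa of the reported 39.3 kDa, so the three figures are mutually consistent.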

  5. Safety and immunogenicity of a quadrivalent intradermal influenza vaccine in adults.

    PubMed

    Gorse, Geoffrey J; Falsey, Ann R; Ozol-Godfrey, Ayca; Landolfi, Victoria; Tsang, Peter H

    2015-02-25

    An intradermal (ID) trivalent split-virion influenza vaccine (IIV3-ID) (Fluzone® Intradermal, Sanofi Pasteur, Swiftwater, PA) has been available in the US since the 2011/2012 influenza season for adults aged 18-64 years. This study examined whether adding a second B-lineage strain affects immunogenicity and safety. This randomized, double-blind, multicentre trial evaluated the immunogenicity and safety of an intradermal quadrivalent split-virion influenza vaccine (IIV4-ID) in adults 18-64 years of age in the US during the 2012-2013 influenza season. Participants were randomized 2:1:1 to receive a single injection of IIV4-ID, licensed IIV3-ID, or an investigational IIV3-ID containing the alternate B-lineage strain. Haemagglutination inhibition antibody titres were assessed in two-thirds of participants before vaccination and 28 days after vaccination. 1672 participants were vaccinated with IIV4-ID, 837 with licensed IIV3-ID, and 846 with an investigational IIV3-ID. For all four vaccine strains, antibody responses to IIV4-ID were statistically non-inferior to the response to the IIV3-ID vaccines containing the matched strains. For both B strains, post-vaccination antibody responses to IIV4-ID were statistically superior to the responses to IIV3-ID lacking the corresponding B strain. Adverse events were similar for IIV4-ID and IIV3-ID. The most commonly reported solicited reactions were pain, pruritus, myalgia, headache, and malaise; and most were grade 1 or 2 and appeared and resolved within 3 days of vaccination. IIV4-ID was statistically non-inferior to the two pooled IIV3-ID vaccines for the proportions of participants with at least one grade 2 or 3 systemic reaction. Antibody responses to the IIV4-ID were non-inferior to IIV3-ID for the A and matched B strains and superior for the unmatched B strains. IIV4-ID was well tolerated without any safety concerns. IIV4-ID may help address an unmet need due to mismatched B strains in previous influenza vaccines. Copyright © 2015 Elsevier Ltd. All rights reserved.
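    The reported arm sizes can be compared against the stated 2:1:1 randomization with a short Python sketch. The participant counts and the ratio are taken from the abstract; the variable names are illustrative only.

```python
# Vaccinated participants per arm, as reported in the abstract.
arm_sizes = {
    "IIV4-ID": 1672,
    "licensed IIV3-ID": 837,
    "investigational IIV3-ID": 846,
}

# Planned allocation ratio (2:1:1), as stated in the abstract.
planned_ratio = {
    "IIV4-ID": 2,
    "licensed IIV3-ID": 1,
    "investigational IIV3-ID": 1,
}

total = sum(arm_sizes.values())
ratio_sum = sum(planned_ratio.values())

for arm, n in arm_sizes.items():
    observed = n / total
    expected = planned_ratio[arm] / ratio_sum
    print(f"{arm}: observed {observed:.3f}, planned {expected:.3f}")
```

    The observed fractions sit within about one percentage point of the planned 0.50, 0.25, and 0.25.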

  6. 7 CFR 1794.2 - Authority.

    Code of Federal Regulations, 2012 CFR

    2012-01-01

    ... review procedures required by law, or by RUS practice including but not limited to: (1) Endangered Species Act of 1973 (16 U.S.C. 1531 et seq.); (2) The National Historic Preservation Act (16 U.S.C. 470 et seq.); (3) Farmland Protection Policy Act (7 U.S.C. 4201 et seq.); (4) E.O. 11593, Protection and...

  7. EMSAR: estimation of transcript abundance from RNA-seq data by mappability-based segmentation and reclustering.

    PubMed

    Lee, Soohyun; Seo, Chae Hwa; Alver, Burak Han; Lee, Sanghyuk; Park, Peter J

    2015-09-03

    RNA-seq has been widely used for genome-wide expression profiling. RNA-seq data typically consists of tens of millions of short sequenced reads from different transcripts. However, due to sequence similarity among genes and among isoforms, the source of a given read is often ambiguous. Existing approaches for estimating expression levels from RNA-seq reads tend to compromise between accuracy and computational cost. We introduce a new approach for quantifying transcript abundance from RNA-seq data. EMSAR (Estimation by Mappability-based Segmentation And Reclustering) groups reads according to the set of transcripts to which they are mapped and finds maximum likelihood estimates using a joint Poisson model for each optimal set of segments of transcripts. The method uses nearly all mapped reads, including those mapped to multiple genes. With an efficient transcriptome indexing based on modified suffix arrays, EMSAR minimizes the use of CPU time and memory while achieving accuracy comparable to the best existing methods. EMSAR is a method for quantifying transcripts from RNA-seq data with high accuracy and low computational cost. EMSAR is available at https://github.com/parklab/emsar.
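    The first step named in this abstract, grouping reads according to the set of transcripts to which they map, can be sketched in Python as follows. This is not EMSAR's implementation: the function name, the input format, and the toy alignments are assumptions made for illustration, and the segmentation, joint Poisson model, and suffix-array index mentioned in the abstract are not shown.

```python
from collections import Counter
from typing import Dict, FrozenSet, Iterable, Set, Tuple


def group_reads_by_transcript_set(
    alignments: Iterable[Tuple[str, str]],
) -> Counter:
    """Count reads per set of transcripts they align to.

    `alignments` is an iterable of (read_id, transcript_id) pairs; a read that
    maps to several transcripts appears once per transcript.
    """
    per_read: Dict[str, Set[str]] = {}
    for read_id, transcript_id in alignments:
        per_read.setdefault(read_id, set()).add(transcript_id)
    # Reads sharing the same transcript set fall into the same group.
    return Counter(frozenset(tx_set) for tx_set in per_read.values())


# Hypothetical alignments: r1 and r3 are ambiguous between txA and txB.
toy_alignments = [
    ("r1", "txA"), ("r1", "txB"),
    ("r2", "txA"),
    ("r3", "txA"), ("r3", "txB"),
    ("r4", "txC"),
]

groups = group_reads_by_transcript_set(toy_alignments)
for tx_set, n_reads in groups.items():
    print(sorted(tx_set), n_reads)
```

    Keeping ambiguous reads in their own groups, rather than discarding them, is what lets a downstream likelihood model use nearly all mapped reads.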

  8. GUIDE-Seq enables genome-wide profiling of off-target cleavage by CRISPR-Cas nucleases

    PubMed Central

    Nguyen, Nhu T.; Liebers, Matthew; Topkar, Ved V.; Thapar, Vishal; Wyvekens, Nicolas; Khayter, Cyd; Iafrate, A. John; Le, Long P.; Aryee, Martin J.; Joung, J. Keith

    2014-01-01

    CRISPR RNA-guided nucleases (RGNs) are widely used genome-editing reagents, but methods to delineate their genome-wide off-target cleavage activities have been lacking. Here we describe an approach for global detection of DNA double-stranded breaks (DSBs) introduced by RGNs and potentially other nucleases. This method, called Genome-wide Unbiased Identification of DSBs Enabled by Sequencing (GUIDE-Seq), relies on capture of double-stranded oligodeoxynucleotides into breaks. Application of GUIDE-Seq to thirteen RGNs in two human cell lines revealed wide variability in RGN off-target activities and unappreciated characteristics of off-target sequences. The majority of identified sites were not detected by existing computational methods or ChIP-Seq. GUIDE-Seq also identified RGN-independent genomic breakpoint ‘hotspots’. Finally, GUIDE-Seq revealed that truncated guide RNAs exhibit substantially reduced RGN-induced off-target DSBs. Our experiments define the most rigorous framework for genome-wide identification of RGN off-target effects to date and provide a method for evaluating the safety of these nucleases prior to clinical use. PMID:25513782

  9. A practical guide to single-cell RNA-sequencing for biomedical research and clinical applications.

    PubMed

    Haque, Ashraful; Engel, Jessica; Teichmann, Sarah A; Lönnberg, Tapio

    2017-08-18

    RNA sequencing (RNA-seq) is a genomic approach for the detection and quantitative analysis of messenger RNA molecules in a biological sample and is useful for studying cellular responses. RNA-seq has fueled much discovery and innovation in medicine over recent years. For practical reasons, the technique is usually conducted on samples comprising thousands to millions of cells. However, this has hindered direct assessment of the fundamental unit of biology-the cell. Since the first single-cell RNA-sequencing (scRNA-seq) study was published in 2009, many more have been conducted, mostly by specialist laboratories with unique skills in wet-lab single-cell genomics, bioinformatics, and computation. However, with the increasing commercial availability of scRNA-seq platforms, and the rapid ongoing maturation of bioinformatics approaches, a point has been reached where any biomedical researcher or clinician can use scRNA-seq to make exciting discoveries. In this review, we present a practical guide to help researchers design their first scRNA-seq studies, including introductory information on experimental hardware, protocol choice, quality control, data analysis and biological interpretation.

  10. Network embedding-based representation learning for single cell RNA-seq data.

    PubMed

    Li, Xiangyu; Chen, Weizheng; Chen, Yang; Zhang, Xuegong; Gu, Jin; Zhang, Michael Q

    2017-11-02

    Single cell RNA-seq (scRNA-seq) techniques can reveal valuable insights into cell-to-cell heterogeneity. Projection of high-dimensional data into a low-dimensional subspace is a powerful strategy in general for mining such big data. However, scRNA-seq suffers from higher noise and lower coverage than traditional bulk RNA-seq, hence bringing new computational difficulties. One major challenge is how to deal with the frequent drop-out events. These events, usually caused by the stochastic burst effect in gene transcription and the technical failure of RNA transcript capture, often make traditional dimension reduction methods work inefficiently. To overcome this problem, we have developed a novel Single Cell Representation Learning (SCRL) method based on network embedding. This method can efficiently implement data-driven non-linear projection and incorporate prior biological knowledge (such as pathway information) to learn more meaningful low-dimensional representations for both cells and genes. Benchmark results show that SCRL outperforms other dimensional reduction methods on several recent scRNA-seq datasets. © The Author(s) 2017. Published by Oxford University Press on behalf of Nucleic Acids Research.

  11. Integrative analysis with ChIP-seq advances the limits of transcript quantification from RNA-seq

    PubMed Central

    Liu, Peng; Sanalkumar, Rajendran; Bresnick, Emery H.; Keleş, Sündüz; Dewey, Colin N.

    2016-01-01

    RNA-seq is currently the technology of choice for global measurement of transcript abundances in cells. Despite its successes, isoform-level quantification remains difficult because short RNA-seq reads are often compatible with multiple alternatively spliced isoforms. Existing methods rely heavily on uniquely mapping reads, which are not available for numerous isoforms that lack regions of unique sequence. To improve quantification accuracy in such difficult cases, we developed a novel computational method, prior-enhanced RSEM (pRSEM), which uses a complementary data type in addition to RNA-seq data. We found that ChIP-seq data of RNA polymerase II and histone modifications were particularly informative in this approach. In qRT-PCR validations, pRSEM was shown to be superior to competing methods in estimating relative isoform abundances within or across conditions. Data-driven simulations suggested that pRSEM has a greatly decreased false-positive rate at the expense of a small increase in false-negative rate. In aggregate, our study demonstrates that pRSEM transforms existing capacity to precisely estimate transcript abundances, especially at the isoform level. PMID:27405803

  12. Single-cell full-length total RNA sequencing uncovers dynamics of recursive splicing and enhancer RNAs.

    PubMed

    Hayashi, Tetsutaro; Ozaki, Haruka; Sasagawa, Yohei; Umeda, Mana; Danno, Hiroki; Nikaido, Itoshi

    2018-02-12

    Total RNA sequencing has been used to reveal poly(A) and non-poly(A) RNA expression, RNA processing and enhancer activity. To date, no method for full-length total RNA sequencing of single cells has been developed despite the potential of this technology for single-cell biology. Here we describe random displacement amplification sequencing (RamDA-seq), the first full-length total RNA-sequencing method for single cells. Compared with other methods, RamDA-seq shows high sensitivity to non-poly(A) RNA and near-complete full-length transcript coverage. Using RamDA-seq with differentiation time course samples of mouse embryonic stem cells, we reveal hundreds of dynamically regulated non-poly(A) transcripts, including histone transcripts and long noncoding RNA Neat1. Moreover, RamDA-seq profiles recursive splicing in >300-kb introns. RamDA-seq also detects enhancer RNAs and their cell type-specific activity in single cells. Taken together, we demonstrate that RamDA-seq could help investigate the dynamics of gene expression, RNA-processing events and transcriptional regulation in single cells.

  13. [Advances in isoprene synthase research].

    PubMed

    Gou, Yan; Liu, Zhongchuan; Wang, Ganggang

    2017-11-25

    Isoprene emission can have significant consequences for atmospheric chemistry. In addition, isoprene is a chemical compound with various industrial applications. In organisms, isoprene is produced by isoprene synthase, which eliminates the pyrophosphate from dimethylallyl diphosphate. As a key enzyme of isoprene formation, isoprene synthase plays an important role in both the natural emission and the artificial synthesis of isoprene. So far, isoprene synthase has been found in various plants. Isoprene synthases from different sources have conserved structures and similar biochemical properties. In this review, the biochemical and structural characteristics of isoprene synthases from different sources are compared, the catalytic mechanism of isoprene synthase is discussed, and prospective applications of the enzyme in bioengineering are proposed.

  14. ESR studies on reactivity of protein-derived tyrosyl radicals formed by prostaglandin H synthase and ribonucleotide reductase.

    PubMed

    Lassmann, G; Curtis, J; Liermann, B; Mason, R P; Eling, T E

    1993-01-01

    Using ESR spectroscopy, the ability of enzyme inhibitors to quench protein-derived tyrosyl radicals was studied in two different enzymes, prostaglandin H synthase and ribonucleotide reductase. The prostaglandin H synthase inhibitors indomethacin, eugenol, and MK-410 effectively prevent the formation of tyrosyl radicals during the oxidation of arachidonic acid by prostaglandin H synthase from ram seminal vesicles. A direct reaction with preformed tyrosyl radicals was observed only with eugenol. The other prostaglandin H synthase inhibitors were ineffective. The ribonucleotide reductase inhibitors hydroxyurea and 4-hydroxyanisole, which effectively inactivate the tyrosyl radical in the active site of ribonucleotide reductase present in tumor cells, exhibit a different reactivity with tyrosyl radicals formed by prostaglandin H synthase. Hydroxyurea quenches preformed tyrosyl radicals in prostaglandin H synthase weakly, whereas 4-hydroxyanisole does not quench tyrosyl radicals in prostaglandin H synthase at all. Eugenol, which quenches preformed prostaglandin H synthase-derived tyrosyl radicals, also quenches the tyrosyl radical in ribonucleotide reductase. The results suggest that the reactivity of protein-linked tyrosyl radicals in ribonucleotide reductase and those formed during prostaglandin H synthase catalysis are very different and have unrelated roles in enzyme catalysis.

  15. SPARTA: Simple Program for Automated reference-based bacterial RNA-seq Transcriptome Analysis.

    PubMed

    Johnson, Benjamin K; Scholz, Matthew B; Teal, Tracy K; Abramovitch, Robert B

    2016-02-04

    Many tools exist in the analysis of bacterial RNA sequencing (RNA-seq) transcriptional profiling experiments to identify differentially expressed genes between experimental conditions. Generally, the workflow includes quality control of reads, mapping to a reference, counting transcript abundance, and statistical tests for differentially expressed genes. In spite of the numerous tools developed for each component of an RNA-seq analysis workflow, easy-to-use bacterially oriented workflow applications to combine multiple tools and automate the process are lacking. With many tools to choose from for each step, the task of identifying a specific tool, adapting the input/output options to the specific use-case, and integrating the tools into a coherent analysis pipeline is not a trivial endeavor, particularly for microbiologists with limited bioinformatics experience. To make bacterial RNA-seq data analysis more accessible, we developed a Simple Program for Automated reference-based bacterial RNA-seq Transcriptome Analysis (SPARTA). SPARTA is a reference-based bacterial RNA-seq analysis workflow application for single-end Illumina reads. SPARTA is turnkey software that simplifies the process of analyzing RNA-seq data sets, making bacterial RNA-seq analysis a routine process that can be undertaken on a personal computer or in the classroom. The easy-to-install, complete workflow processes whole transcriptome shotgun sequencing data files by trimming reads and removing adapters, mapping reads to a reference, counting gene features, calculating differential gene expression, and, importantly, checking for potential batch effects within the data set. SPARTA outputs quality analysis reports, gene feature counts and differential gene expression tables and scatterplots. SPARTA provides an easy-to-use bacterial RNA-seq transcriptional profiling workflow to identify differentially expressed genes between experimental conditions. 
This software will enable microbiologists with limited bioinformatics experience to analyze their data and integrate next generation sequencing (NGS) technologies into the classroom. The SPARTA software and tutorial are available at sparta.readthedocs.org.

  16. High-throughput full-length single-cell mRNA-seq of rare cells.

    PubMed

    Ooi, Chin Chun; Mantalas, Gary L; Koh, Winston; Neff, Norma F; Fuchigami, Teruaki; Wong, Dawson J; Wilson, Robert J; Park, Seung-Min; Gambhir, Sanjiv S; Quake, Stephen R; Wang, Shan X

    2017-01-01

    Single-cell characterization techniques, such as mRNA-seq, have been applied to a diverse range of applications in cancer biology, yielding great insight into mechanisms leading to therapy resistance and tumor clonality. While single-cell techniques can yield a wealth of information, a common bottleneck is the lack of throughput, with many current processing methods being limited to the analysis of small volumes of single cell suspensions with cell densities on the order of 10^7 per mL. In this work, we present a high-throughput full-length mRNA-seq protocol incorporating a magnetic sifter and magnetic nanoparticle-antibody conjugates for rare cell enrichment, and Smart-seq2 chemistry for sequencing. We evaluate the efficiency and quality of this protocol with a simulated circulating tumor cell system, whereby non-small-cell lung cancer cell lines (NCI-H1650 and NCI-H1975) are spiked into whole blood, before being enriched for single-cell mRNA-seq by EpCAM-functionalized magnetic nanoparticles and the magnetic sifter. We obtain high efficiency (> 90%) capture and release of these simulated rare cells via the magnetic sifter, with reproducible transcriptome data. In addition, while mRNA-seq data is typically only used for gene expression analysis of transcriptomic data, we demonstrate the use of full-length mRNA-seq chemistries like Smart-seq2 to facilitate variant analysis of expressed genes. This enables the use of mRNA-seq data for differentiating cells in a heterogeneous population by both their phenotypic and variant profile. In a simulated heterogeneous mixture of circulating tumor cells in whole blood, we utilize this high-throughput protocol to differentiate these heterogeneous cells by both their phenotype (lung cancer versus white blood cells), and mutational profile (H1650 versus H1975 cells), in a single sequencing run. This high-throughput method can help facilitate single-cell analysis of rare cell populations, such as circulating tumor or endothelial cells, with demonstrably high-quality transcriptomic data.

  17. 4C-ker: A Method to Reproducibly Identify Genome-Wide Interactions Captured by 4C-Seq Experiments.

    PubMed

    Raviram, Ramya; Rocha, Pedro P; Müller, Christian L; Miraldi, Emily R; Badri, Sana; Fu, Yi; Swanzey, Emily; Proudhon, Charlotte; Snetkova, Valentina; Bonneau, Richard; Skok, Jane A

    2016-03-01

    4C-Seq has proven to be a powerful technique to identify genome-wide interactions with a single locus of interest (or "bait") that can be important for gene regulation. However, analysis of 4C-Seq data is complicated by the many biases inherent to the technique. An important consideration when dealing with 4C-Seq data is the differences in resolution of signal across the genome that result from differences in 3D distance separation from the bait. This leads to the highest signal in the region immediately surrounding the bait and increasingly lower signals in far-cis and trans. Another important aspect of 4C-Seq experiments is the resolution, which is greatly influenced by the choice of restriction enzyme and the frequency at which it can cut the genome. Thus, it is important that a 4C-Seq analysis method is flexible enough to analyze data generated using different enzymes and to identify interactions across the entire genome. Current methods for 4C-Seq analysis only identify interactions in regions near the bait or in regions located in far-cis and trans, but no method comprehensively analyzes 4C signals of different length scales. In addition, some methods also fail in experiments where chromatin fragments are generated using frequent cutter restriction enzymes. Here, we describe 4C-ker, a Hidden-Markov Model based pipeline that identifies regions throughout the genome that interact with the 4C bait locus. In addition, we incorporate methods for the identification of differential interactions in multiple 4C-seq datasets collected from different genotypes or experimental conditions. Adaptive window sizes are used to correct for differences in signal coverage in near-bait regions, far-cis and trans chromosomes. Using several datasets, we demonstrate that 4C-ker outperforms all existing 4C-Seq pipelines in its ability to reproducibly identify interaction domains at all genomic ranges with different resolution enzymes.
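
    The adaptive-window idea in the abstract above can be sketched as follows. This is only an illustration under an assumption the abstract does not state: that each window is built from a fixed number k of consecutive observed fragments, so that windows come out narrow where coverage is dense (near the bait) and wide where it is sparse (far-cis and trans). The function name `adaptive_windows`, the value of k, and all coordinates are hypothetical, and the hidden Markov model step is not shown.

```python
def adaptive_windows(positions, k):
    """Group sorted fragment positions into windows of k consecutive fragments.

    Returns a list of (start, end, n_fragments) tuples. The last window may
    hold fewer than k fragments if the total is not a multiple of k.
    """
    positions = sorted(positions)
    windows = []
    for i in range(0, len(positions), k):
        chunk = positions[i:i + k]
        windows.append((chunk[0], chunk[-1], len(chunk)))
    return windows


# Hypothetical fragment coordinates: dense near a bait at ~1,000,000 bp,
# increasingly sparse farther away on the same chromosome.
observed = [
    1_000_000, 1_000_400, 1_000_900,
    1_001_500, 1_010_000, 1_050_000,
    1_200_000, 1_900_000, 4_000_000,
]

windows = adaptive_windows(observed, k=3)
widths = [end - start for start, end, _ in windows]

for (start, end, n), width in zip(windows, widths):
    print(f"{start:>9}-{end:<9} fragments={n} width={width}")
```

    With these made-up positions every window holds the same number of fragments while the window width grows with distance from the bait, which is the behaviour the abstract describes qualitatively.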

  18. 4C-ker: A Method to Reproducibly Identify Genome-Wide Interactions Captured by 4C-Seq Experiments

    PubMed Central

    Raviram, Ramya; Rocha, Pedro P.; Müller, Christian L.; Miraldi, Emily R.; Badri, Sana; Fu, Yi; Swanzey, Emily; Proudhon, Charlotte; Snetkova, Valentina

    2016-01-01

    4C-Seq has proven to be a powerful technique to identify genome-wide interactions with a single locus of interest (or “bait”) that can be important for gene regulation. However, analysis of 4C-Seq data is complicated by the many biases inherent to the technique. An important consideration when dealing with 4C-Seq data is the differences in resolution of signal across the genome that result from differences in 3D distance separation from the bait. This leads to the highest signal in the region immediately surrounding the bait and increasingly lower signals in far-cis and trans. Another important aspect of 4C-Seq experiments is the resolution, which is greatly influenced by the choice of restriction enzyme and the frequency at which it can cut the genome. Thus, it is important that a 4C-Seq analysis method is flexible enough to analyze data generated using different enzymes and to identify interactions across the entire genome. Current methods for 4C-Seq analysis only identify interactions in regions near the bait or in regions located in far-cis and trans, but no method comprehensively analyzes 4C signals of different length scales. In addition, some methods also fail in experiments where chromatin fragments are generated using frequent cutter restriction enzymes. Here, we describe 4C-ker, a Hidden-Markov Model based pipeline that identifies regions throughout the genome that interact with the 4C bait locus. In addition, we incorporate methods for the identification of differential interactions in multiple 4C-seq datasets collected from different genotypes or experimental conditions. Adaptive window sizes are used to correct for differences in signal coverage in near-bait regions, far-cis and trans chromosomes. Using several datasets, we demonstrate that 4C-ker outperforms all existing 4C-Seq pipelines in its ability to reproducibly identify interaction domains at all genomic ranges with different resolution enzymes. PMID:26938081

  19. A randomized phase II/III study of adverse events between sequential (SEQ) versus simultaneous integrated boost (SIB) intensity modulated radiation therapy (IMRT) in nasopharyngeal carcinoma; preliminary result on acute adverse events.

    PubMed

    Songthong, Anussara P; Kannarunimit, Danita; Chakkabat, Chakkapong; Lertbutsayanukul, Chawalit

    2015-08-08

    To investigate acute and late toxicities comparing sequential (SEQ-IMRT) versus simultaneous integrated boost intensity modulated radiotherapy (SIB-IMRT) in nasopharyngeal carcinoma (NPC) patients. Newly diagnosed stage I-IVB NPC patients were randomized to receive SEQ-IMRT or SIB-IMRT, with or without chemotherapy. SEQ-IMRT consisted of two sequential radiation treatment plans: 2 Gy x 25 fractions to low-risk planning target volume (PTV-LR) followed by 2 Gy x 10 fractions to high-risk planning target volume (PTV-HR). In contrast, SIB-IMRT consisted of only one treatment plan: 2.12 Gy and 1.7 Gy x 33 fractions to PTV-HR and PTV-LR, respectively. Toxicities were evaluated according to CTCAE version 4.0. Between October 2010 and November 2013, 122 eligible patients were randomized between SEQ-IMRT (54 patients) and SIB-IMRT (68 patients). With median follow-up time of 16.8 months, there was no significant difference in toxicities between the two IMRT techniques. During chemoradiation, the most common grade 3-5 acute toxicities were mucositis (15.4% vs 13.6%, SEQ vs SIB, p = 0.788) followed by dysphagia (9.6% vs 9.1%, p = 1.000) and xerostomia (9.6% vs 7.6%, p = 0.748). During the adjuvant chemotherapy period, 25.6% and 32.7% experienced grade 3 weight loss in SEQ-IMRT and SIB-IMRT (p = 0.459). One-year overall survival (OS) and progression-free survival (PFS) were 95.8% and 95.5% in SEQ-IMRT and 98% and 90.2% in SIB-IMRT, respectively (p = 0.472 for OS and 0.069 for PFS). This randomized, phase II/III trial comparing SIB-IMRT versus SEQ-IMRT in NPC showed no statistically significant difference between both IMRT techniques in terms of acute adverse events. Short-term tumor control and survival outcome were promising.
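
    The fractionation schedules quoted above can be turned into cumulative prescribed doses with simple arithmetic. The sketch below uses only the figures in the abstract; it assumes, as is not stated explicitly there, that the high-risk volume in SEQ-IMRT receives both the initial 25 fractions and the 10-fraction boost. The helper name `total_dose` is hypothetical.

```python
def total_dose(dose_per_fraction_gy, fractions):
    """Cumulative dose in Gy for a plan delivered in equal fractions."""
    return dose_per_fraction_gy * fractions


# SEQ-IMRT: 2 Gy x 25 fractions to PTV-LR, then 2 Gy x 10 fractions to PTV-HR.
seq_ptv_lr = total_dose(2.0, 25)
# Assumption: PTV-HR lies inside PTV-LR and therefore receives both phases.
seq_ptv_hr = seq_ptv_lr + total_dose(2.0, 10)

# SIB-IMRT: one plan of 33 fractions, 2.12 Gy to PTV-HR and 1.7 Gy to PTV-LR.
sib_ptv_hr = round(total_dose(2.12, 33), 2)
sib_ptv_lr = round(total_dose(1.7, 33), 2)

print(f"SEQ-IMRT: PTV-HR {seq_ptv_hr} Gy, PTV-LR {seq_ptv_lr} Gy, 35 fractions")
print(f"SIB-IMRT: PTV-HR {sib_ptv_hr} Gy, PTV-LR {sib_ptv_lr} Gy, 33 fractions")
```

    Under that assumption the two arms prescribe nearly the same dose to the high-risk volume (70 Gy versus 69.96 Gy), while SIB-IMRT finishes two fractions sooner and gives a somewhat higher dose to the low-risk volume.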

  20. 17 CFR Appendix B to Part 20 - Explanatory Guidance on Data Record Layouts

    Code of Federal Regulations, 2013 CFR

    2013-04-01

    ... reference price Data record 1 CCO_ID_1 CM_ID_2 CP_04 9/27/2010 C Nov-10 NYMEX NY Harbor No.2. Data record 2 CCO_ID_1 CM_ID_2 CP_04 9/27/2010 C Oct-10 NYMEX NY Harbor No.2. Data record 3 CCO_ID_1 CM_ID_2 CP_02 9/27/2010 C Nov-10 NYMEX Henry Hub. Data record 4 CCO_ID_1 CM_ID_2 CP_02 9/27/2010 C Oct-10 NYMEX Henry...

  1. 17 CFR Appendix B to Part 20 - Explanatory Guidance on Data Record Layouts

    Code of Federal Regulations, 2012 CFR

    2012-04-01

    ... reference price Data record 1 CCO_ID_1 CM_ID_2 CP_04 9/27/2010 C Nov-10 NYMEX NY Harbor No.2. Data record 2 CCO_ID_1 CM_ID_2 CP_04 9/27/2010 C Oct-10 NYMEX NY Harbor No.2. Data record 3 CCO_ID_1 CM_ID_2 CP_02 9/27/2010 C Nov-10 NYMEX Henry Hub. Data record 4 CCO_ID_1 CM_ID_2 CP_02 9/27/2010 C Oct-10 NYMEX Henry...

  2. 17 CFR Appendix B to Part 20 - Explanatory Guidance on Data Record Layouts

    Code of Federal Regulations, 2014 CFR

    2014-04-01

    ... reference price Data record 1 CCO_ID_1 CM_ID_2 CP_04 9/27/2010 C Nov-10 NYMEX NY Harbor No.2. Data record 2 CCO_ID_1 CM_ID_2 CP_04 9/27/2010 C Oct-10 NYMEX NY Harbor No.2. Data record 3 CCO_ID_1 CM_ID_2 CP_02 9/27/2010 C Nov-10 NYMEX Henry Hub. Data record 4 CCO_ID_1 CM_ID_2 CP_02 9/27/2010 C Oct-10 NYMEX Henry...

  3. Inhibition of muscle-specific gene expression by Id3: requirement of the C-terminal region of the protein for stable expression and function.

    PubMed

    Chen, B; Han, B H; Sun, X H; Lim, R W

    1997-01-15

    We have examined the role of an Id-like protein, Id3 (also known as HLH462), in the regulation of muscle-specific gene expression. Id proteins are believed to block expression of muscle-specific genes by preventing the dimerization between ubiquitous bHLH proteins (E proteins) and myogenic bHLH proteins such as MyoD. Consistent with its putative role as an inhibitor of differentiation, Id3 mRNA was detected in proliferating skeletal muscle cells, was further induced by basic fibroblast growth factor (bFGF) and was down-regulated in differentiated muscle cultures. Overexpression of Id3 efficiently inhibited the MyoD-mediated activation of the muscle-specific creatine kinase (MCK) reporter gene. Deletion analysis indicated that the C-terminal 15 amino acids of Id3 are critical for the full inhibitory activity while deleting up to 42 residues from the C-terminus of the related protein, Id2, did not affect its ability to inhibit the MCK reporter gene. Chimeric protein containing the N-terminal region of Id3 and the C-terminus of Id2 was also non-functional in transfected cells. In contrast, wild-type Id3, the C-terminal mutants, and the Id3/Id2 chimera could all interact with the E-protein E47 in vitro. Additional studies indicated that truncation of the Id3 C-terminus might have adversely affected the expression level of the mutant proteins but the Id3/Id2 chimera was stably expressed. Taken together, our results revealed a more complex requirement for the expression and proper function of the Id family proteins than was hitherto expected.

  4. Inhibition of muscle-specific gene expression by Id3: requirement of the C-terminal region of the protein for stable expression and function.

    PubMed Central

    Chen, B; Han, B H; Sun, X H; Lim, R W

    1997-01-01

    We have examined the role of an Id-like protein, Id3 (also known as HLH462), in the regulation of muscle-specific gene expression. Id proteins are believed to block expression of muscle-specific genes by preventing the dimerization between ubiquitous bHLH proteins (E proteins) and myogenic bHLH proteins such as MyoD. Consistent with its putative role as an inhibitor of differentiation, Id3 mRNA was detected in proliferating skeletal muscle cells, was further induced by basic fibroblast growth factor (bFGF) and was down-regulated in differentiated muscle cultures. Overexpression of Id3 efficiently inhibited the MyoD-mediated activation of the muscle-specific creatine kinase (MCK) reporter gene. Deletion analysis indicated that the C-terminal 15 amino acids of Id3 are critical for the full inhibitory activity while deleting up to 42 residues from the C-terminus of the related protein, Id2, did not affect its ability to inhibit the MCK reporter gene. Chimeric protein containing the N-terminal region of Id3 and the C-terminus of Id2 was also non-functional in transfected cells. In contrast, wild-type Id3, the C-terminal mutants, and the Id3/Id2 chimera could all interact with the E-protein E47 in vitro. Additional studies indicated that truncation of the Id3 C-terminus might have adversely affected the expression level of the mutant proteins but the Id3/Id2 chimera was stably expressed. Taken together, our results revealed a more complex requirement for the expression and proper function of the Id family proteins than was hitherto expected. PMID:9016574

  5. The state of infectious diseases clinical trials: a systematic review of ClinicalTrials.gov.

    PubMed

    Goswami, Neela D; Pfeiffer, Christopher D; Horton, John R; Chiswell, Karen; Tasneem, Asba; Tsalik, Ephraim L

    2013-01-01

    There is a paucity of clinical trials informing specific questions faced by infectious diseases (ID) specialists. The ClinicalTrials.gov registry offers an opportunity to evaluate the ID clinical trials portfolio. We examined 40,970 interventional trials registered with ClinicalTrials.gov from 2007-2010, focusing on study conditions and interventions to identify ID-related trials. Relevance to ID was manually confirmed for each programmatically identified trial, yielding 3570 ID trials and 37,400 non-ID trials for analysis. The number of ID trials was similar to the number of trials identified as belonging to cardiovascular medicine (n = 3437) or mental health (n = 3695) specialties. Slightly over half of ID trials were treatment-oriented trials (53%, vs. 77% for non-ID trials) followed by prevention (38%, vs. 8% in non-ID trials). ID trials tended to be larger than those of other specialties, with a median enrollment of 125 subjects (interquartile range [IQR], 45-400) vs. 60 (IQR, 30-160) for non-ID trials. Most ID studies are randomized (73%) but nonblinded (56%). Industry was the funding source in 51% of ID trials vs. 10% that were primarily NIH-funded. HIV-AIDS trials constitute the largest subset of ID trials (n = 815 [23%]), followed by influenza vaccine (n = 375 [11%]), and hepatitis C (n = 339 [9%]) trials. Relative to U.S. and global mortality rates, HIV-AIDS and hepatitis C virus trials are over-represented, whereas lower respiratory tract infection trials are under-represented in this large sample of ID clinical trials. This work is the first to characterize ID clinical trials registered in ClinicalTrials.gov, providing a framework to discuss prioritization, methodology, and policy.
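
    The counts reported above are internally consistent, which a few lines of arithmetic can confirm. The sketch below uses only numbers quoted in the abstract; the variable names are illustrative, and rounding to whole percentages is an assumption about how the published figures were derived.

```python
total_trials = 40970
id_trials = 3570
non_id_trials = 37400

# The ID and non-ID groups should partition the full set of trials.
partition_ok = (id_trials + non_id_trials == total_trials)

# Largest ID subsets and the percentages reported for them in the abstract.
subsets = {"HIV-AIDS": 815, "influenza vaccine": 375, "hepatitis C": 339}
reported_pct = {"HIV-AIDS": 23, "influenza vaccine": 11, "hepatitis C": 9}

# Share of ID trials in each subset, rounded to a whole percentage.
computed_pct = {name: round(100 * n / id_trials) for name, n in subsets.items()}

# Share of all registered interventional trials that were ID-related.
id_share_pct = round(100 * id_trials / total_trials, 1)

print("partition consistent:", partition_ok)
for name in subsets:
    print(f"{name}: computed {computed_pct[name]}%, reported {reported_pct[name]}%")
print(f"ID trials as a share of all trials: {id_share_pct}%")
```

    The last figure, the share of all registered trials that were ID-related, is not given in the abstract; it follows directly from the two totals that are.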

  6. Sandalwood fragrance biosynthesis involves sesquiterpene synthases of both the terpene synthase (TPS)-a and TPS-b subfamilies, including santalene synthases.

    PubMed

    Jones, Christopher G; Moniodis, Jessie; Zulak, Katherine G; Scaffidi, Adrian; Plummer, Julie A; Ghisalberti, Emilio L; Barbour, Elizabeth L; Bohlmann, Jörg

    2011-05-20

    Sandalwood oil is one of the world's most highly prized fragrances. To identify the genes and encoded enzymes responsible for santalene biosynthesis, we cloned and characterized three orthologous terpene synthase (TPS) genes SaSSy, SauSSy, and SspiSSy from three divergent sandalwood species; Santalum album, S. austrocaledonicum, and S. spicatum, respectively. The encoded enzymes catalyze the formation of α-, β-, epi-β-santalene, and α-exo-bergamotene from (E,E)-farnesyl diphosphate (E,E-FPP). Recombinant SaSSy was additionally tested with (Z,Z)-farnesyl diphosphate (Z,Z-FPP) and remarkably, found to produce a mixture of α-endo-bergamotene, α-santalene, (Z)-β-farnesene, epi-β-santalene, and β-santalene. Additional cDNAs that encode bisabolene/bisabolol synthases were also cloned and functionally characterized from these three species. Both the santalene synthases and the bisabolene/bisabolol synthases reside in the TPS-b phylogenetic clade, which is more commonly associated with angiosperm monoterpene synthases. An orthologous set of TPS-a synthases responsible for formation of macrocyclic and bicyclic sesquiterpenes were characterized. Strict functionality and limited sequence divergence in the santalene and bisabolene synthases are in contrast to the TPS-a synthases, suggesting these compounds have played a significant role in the evolution of the Santalum genus. © 2011 by The American Society for Biochemistry and Molecular Biology, Inc.

  7. VIPER: Visualization Pipeline for RNA-seq, a Snakemake workflow for efficient and complete RNA-seq analysis.

    PubMed

    Cornwell, MacIntosh; Vangala, Mahesh; Taing, Len; Herbert, Zachary; Köster, Johannes; Li, Bo; Sun, Hanfei; Li, Taiwen; Zhang, Jian; Qiu, Xintao; Pun, Matthew; Jeselsohn, Rinath; Brown, Myles; Liu, X Shirley; Long, Henry W

    2018-04-12

    RNA sequencing has become a ubiquitous technology used throughout life sciences as an effective method of measuring RNA abundance quantitatively in tissues and cells. The increase in use of RNA-seq technology has led to the continuous development of new tools for every step of analysis from alignment to downstream pathway analysis. However, effectively using these analysis tools in a scalable and reproducible way can be challenging, especially for non-experts. Using the workflow management system Snakemake we have developed a user friendly, fast, efficient, and comprehensive pipeline for RNA-seq analysis. VIPER (Visualization Pipeline for RNA-seq analysis) is an analysis workflow that combines some of the most popular tools to take RNA-seq analysis from raw sequencing data, through alignment and quality control, into downstream differential expression and pathway analysis. VIPER has been created in a modular fashion to allow for the rapid incorporation of new tools to expand the capabilities. This capacity has already been exploited to include very recently developed tools that explore immune infiltrate and T-cell CDR (Complementarity-Determining Regions) reconstruction abilities. The pipeline has been conveniently packaged such that minimal computational skills are required to download and install the dozens of software packages that VIPER uses. VIPER is a comprehensive solution that performs most standard RNA-seq analyses quickly and effectively with a built-in capacity for customization and expansion.

  8. A Guide for Designing and Analyzing RNA-Seq Data.

    PubMed

    Chatterjee, Aniruddha; Ahn, Antonio; Rodger, Euan J; Stockwell, Peter A; Eccles, Michael R

    2018-01-01

    The identity of a cell or an organism is at least in part defined by its gene expression and therefore analyzing gene expression remains one of the most frequently performed experimental techniques in molecular biology. The development of the RNA-Sequencing (RNA-Seq) method allows an unprecedented opportunity to analyze expression of protein-coding, noncoding RNA and also de novo transcript assembly of a new species or organism. However, the planning and design of RNA-Seq experiments has important implications for addressing the desired biological question and maximizing the value of the data obtained. In addition, RNA-Seq generates a huge volume of data and accurate analysis of this data involves several different steps and choices of tools. This can be challenging and overwhelming, especially for bench scientists. In this chapter, we describe an entire workflow for performing RNA-Seq experiments. We describe critical aspects of wet lab experiments such as RNA isolation, library preparation and the initial design of an experiment. Further, we provide a step-by-step description of the bioinformatics workflow for different steps involved in RNA-Seq data analysis. This includes power calculations, setting up a computational environment, acquisition and processing of publicly available data if desired, quality control measures, preprocessing steps for the raw data, differential expression analysis, and data visualization. We particularly mention important considerations for each step to provide a guide for designing and analyzing RNA-Seq data.
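
    One step that the abstract lists, differential expression analysis, rests on comparing counts after correcting for sequencing depth. The sketch below is not the chapter's workflow; it is a minimal illustration, with hypothetical gene names and read counts, of counts-per-million (CPM) normalization followed by a log2 fold change. Real analyses rely on dedicated statistical tools and replicate samples, neither of which is modeled here.

```python
import math


def cpm(counts):
    """Scale raw read counts to counts per million for one sample."""
    total = sum(counts.values())
    return {gene: 1e6 * c / total for gene, c in counts.items()}


def log2_fold_change(cpm_a, cpm_b, pseudocount=1.0):
    """log2(b / a) per gene; the pseudocount avoids division by zero."""
    return {
        gene: math.log2((cpm_b[gene] + pseudocount) / (cpm_a[gene] + pseudocount))
        for gene in cpm_a
    }


# Hypothetical raw counts; the treated library is sequenced twice as deeply.
control = {"geneA": 500, "geneB": 1500, "geneC": 8000}
treated = {"geneA": 2000, "geneB": 1500, "geneC": 16500}

lfc = log2_fold_change(cpm(control), cpm(treated))

for gene, value in lfc.items():
    print(f"{gene}: log2 fold change = {value:+.2f}")
```

    In this made-up example geneB has identical raw counts in both samples, yet its normalized expression is halved because the treated library is twice as deep. That is the kind of artifact depth normalization is meant to expose.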

  9. Advances in single-cell RNA sequencing and its applications in cancer research.

    PubMed

    Zhu, Sibo; Qing, Tao; Zheng, Yuanting; Jin, Li; Shi, Leming

    2017-08-08

    Unlike population-level approaches, single-cell RNA sequencing enables transcriptomic analysis of an individual cell. Through the combination of high-throughput sequencing and bioinformatic tools, single-cell RNA-seq can detect more than 10,000 transcripts in one cell to distinguish cell subsets and dynamic cellular changes. After several years' development, single-cell RNA-seq can now achieve massively parallel, full-length mRNA sequencing as well as in situ sequencing and even has potential for multi-omic detection. One appealing area of single-cell RNA-seq is cancer research, and it is regarded as a promising way to enhance prognosis and provide more precise target therapy by identifying druggable subclones. Indeed, progresses have been made regarding solid tumor analysis to reveal intratumoral heterogeneity, correlations between signaling pathways, stemness, drug resistance, and tumor architecture shaping the microenvironment. Furthermore, through investigation into circulating tumor cells, many genes have been shown to promote a propensity toward stemness and the epithelial-mesenchymal transition, to enhance anchoring and adhesion, and to be involved in mechanisms of anoikis resistance and drug resistance. This review focuses on advances and progresses of single-cell RNA-seq with regard to the following aspects: 1. Methodologies of single-cell RNA-seq 2. Single-cell isolation techniques 3. Single-cell RNA-seq in solid tumor research 4. Single-cell RNA-seq in circulating tumor cell research 5. Perspectives

  10. Advances in single-cell RNA sequencing and its applications in cancer research

    PubMed Central

    Zhu, Sibo; Qing, Tao; Zheng, Yuanting; Jin, Li; Shi, Leming

    2017-01-01

    Unlike population-level approaches, single-cell RNA sequencing enables transcriptomic analysis of an individual cell. Through the combination of high-throughput sequencing and bioinformatic tools, single-cell RNA-seq can detect more than 10,000 transcripts in one cell to distinguish cell subsets and dynamic cellular changes. After several years’ development, single-cell RNA-seq can now achieve massively parallel, full-length mRNA sequencing as well as in situ sequencing and even has potential for multi-omic detection. One appealing area of single-cell RNA-seq is cancer research, and it is regarded as a promising way to enhance prognosis and provide more precise target therapy by identifying druggable subclones. Indeed, progresses have been made regarding solid tumor analysis to reveal intratumoral heterogeneity, correlations between signaling pathways, stemness, drug resistance, and tumor architecture shaping the microenvironment. Furthermore, through investigation into circulating tumor cells, many genes have been shown to promote a propensity toward stemness and the epithelial-mesenchymal transition, to enhance anchoring and adhesion, and to be involved in mechanisms of anoikis resistance and drug resistance. This review focuses on advances and progresses of single-cell RNA-seq with regard to the following aspects: 1. Methodologies of single-cell RNA-seq 2. Single-cell isolation techniques 3. Single-cell RNA-seq in solid tumor research 4. Single-cell RNA-seq in circulating tumor cell research 5. Perspectives PMID:28881849

  11. Cross-Cultural Register Differences in Infant-Directed Speech: An Initial Study.

    PubMed

    Farran, Lama K; Lee, Chia-Cheng; Yoo, Hyunjoo; Oller, D Kimbrough

    2016-01-01

    Infant-directed speech (IDS) provides an environment that appears to play a significant role in the origins of language in the human infant. Differences have been reported in the use of IDS across cultures, suggesting different styles of infant language-learning. Importantly, both cross-cultural and intra-cultural research suggest there may be a positive relationship between the use of IDS and rates of language development, underscoring the need to investigate cultural differences more deeply. The majority of studies, however, have conceptualized IDS monolithically, granting little attention to a potentially key distinction in how IDS manifests across cultures during the first two years. This study examines and quantifies for the first time differences within IDS in the use of baby register (IDS/BR), an acoustically identifiable type of IDS that includes features such as high pitch, long duration, and smooth intonation (the register that is usually assumed to occur in IDS), and adult register (IDS/AR), the type of IDS that does not include such features and thus sounds as if it could have been addressed to an adult. We studied IDS across 19 American and 19 Lebanese mother-infant dyads, with particular focus on the differential use of registers within IDS as mothers interacted with their infants ages 0-24 months. Our results showed considerable usage of IDS/AR (>30% of utterances) and a tendency for Lebanese mothers to use more IDS than American mothers. Implications for future research on IDS and its role in elucidating how language evolves across cultures are explored.

  12. Cross-Cultural Register Differences in Infant-Directed Speech: An Initial Study

    PubMed Central

    Farran, Lama K.; Lee, Chia-Cheng; Yoo, Hyunjoo; Oller, D. Kimbrough

    2016-01-01

    Infant-directed speech (IDS) provides an environment that appears to play a significant role in the origins of language in the human infant. Differences have been reported in the use of IDS across cultures, suggesting different styles of infant language-learning. Importantly, both cross-cultural and intra-cultural research suggest there may be a positive relationship between the use of IDS and rates of language development, underscoring the need to investigate cultural differences more deeply. The majority of studies, however, have conceptualized IDS monolithically, granting little attention to a potentially key distinction in how IDS manifests across cultures during the first two years. This study examines and quantifies for the first time differences within IDS in the use of baby register (IDS/BR), an acoustically identifiable type of IDS that includes features such as high pitch, long duration, and smooth intonation (the register that is usually assumed to occur in IDS), and adult register (IDS/AR), the type of IDS that does not include such features and thus sounds as if it could have been addressed to an adult. We studied IDS across 19 American and 19 Lebanese mother-infant dyads, with particular focus on the differential use of registers within IDS as mothers interacted with their infants ages 0–24 months. Our results showed considerable usage of IDS/AR (>30% of utterances) and a tendency for Lebanese mothers to use more IDS than American mothers. Implications for future research on IDS and its role in elucidating how language evolves across cultures are explored. PMID:26981626

  13. [Expression of Id1 and Id3 in endometrial carcinoma and their roles in regulating biological behaviors of endometrial carcinoma cells in vitro].

    PubMed

    Sun, Lili; Li, Xuenong; Liu, Guobing

    2013-06-01

    To investigate the expression of inhibitor of DNA differentiation/DNA binding 1 (Id1) and Id3 in endometrial carcinoma and explore their roles in regulating the proliferation, invasion, migration and adhesion of endometrial carcinoma cells in vitro. Id1 and Id3 expression in 4 fresh endometrial cancer tissue specimens and matched adjacent tissues were detected using Western blotting. Two endometrial cancer cell lines, HEC-1-B and RL-952, were both divided into 4 groups, namely the untreated group, blank virus group, promoter group and Id1/Id3 double-knockdown group, and their expressions of MMP2, CXCR4 and P21 were detected by qRT-PCR and Western blotting. The proliferation, invasion, migration and adhesion of the cells were evaluated with MTT, Transwell, wound-healing, and adhesion assays. Endometrial carcinoma tissues showed significantly higher Id1 and Id3 expression than the adjacent tissues (P<0.05). In the two endometrial carcinoma cell lines, Id1/Id3 double-knockdown significantly decreased MMP2 and CXCR4 expression and increased P21 expression at both mRNA and protein levels (P<0.05), and resulted in suppressed cell proliferation, invasion, migration and adhesion. Id1 and Id3 expressions are up-regulated in endometrial carcinoma to promote the proliferation, invasion, migration and adhesion of the tumor cells by increasing MMP2 and CXCR4 expression and reducing P21 expression. Therapies targeting Id1/Id3 can be a novel strategy for treatment of endometrial carcinoma.

  14. Intra-tumoral delivery of functional ID4 protein via PCL/maltodextrin nano-particle inhibits prostate cancer growth

    PubMed Central

    Morton, Derrick; Sharma, Pankaj; Gorantla, Yamini; Joshi, Jugal; Nagappan, Perri; Pallaniappan, Ravi; Chaudhary, Jaideep

    2016-01-01

    ID4, a helix loop helix transcriptional regulator has emerged as a tumor suppressor in prostate cancer. Epigenetic silencing of ID4 promotes prostate cancer whereas ectopic expression in prostate cancer cell lines blocks cancer phenotype. To directly investigate the anti-tumor property, full length human recombinant ID4 encapsulated in biodegradable Polycaprolactone/Maltodextrin (PCL-MD) nano-carrier was delivered to LNCaP cells in which the native ID4 was stably silenced (LNCaP(-)ID4). The cellular uptake of ID4 resulted in increased apoptosis, decreased proliferation and colony formation. Intratumoral delivery of PCL-MD ID4 into growing LNCaP(-)ID4 tumors in SCID mice significantly reduced the tumor volume compared to the tumors treated with chemotherapeutic Docetaxel. The study supports the feasibility of using nano-carrier encapsulated ID4 protein as a therapeutic. Mechanistically, ID4 may assimilate multiple regulatory pathways for example epigenetic re-programming, integration of multiple AR co-regulators or signaling pathways resulting in tumor suppressor activity of ID4. PMID:27487149

  15. Aromatic Polyketide Synthases (Purification, Characterization, and Antibody Development to Benzalacetone Synthase from Raspberry Fruits).

    PubMed Central

    Borejsza-Wysocki, W.; Hrazdina, G.

    1996-01-01

    p-Hydroxyphenylbutan-2-one, the characteristic aroma compound of raspberries (Rubus idaeus L.), is synthesized from p-coumaryl-coenzyme A and malonyl-coenzyme A in a two-step reaction sequence that is catalyzed by benzalacetone synthase and benzalacetone reductase (W. Borejsza-Wysocki and G. Hrazdina [1994] Phytochemistry 35: 623-628). Benzalacetone synthase condenses one malonate with p-coumarate to form the pathway intermediate p-hydroxyphenylbut-3-ene-2-one (p-hydroxybenzalacetone) in a reaction that is similar to those catalyzed by chalcone and stilbene synthases. We have obtained an enzyme preparation from ripe raspberries that was preferentially enriched in benzalacetone synthase (approximately 170-fold) over chalcone synthase (approximately 14-fold) activity. This preparation was used to characterize benzalacetone synthase and to develop polyclonal antibodies in rabbits. Benzalacetone synthase showed similarity in its molecular properties to chalcone synthase but differed distinctly in its substrate specificity, response to 2-mercaptoethanol and ethylene glycol, and induction in cell-suspension cultures. The product of the enzyme, p-hydroxybenzalacetone, inhibited mycelial growth of the raspberry pathogen Phytophthora fragariae var rubi at 250 µM. We do not know whether the dual activity in the benzalacetone synthase preparation is the result of a bifunctional enzyme or is caused by contamination with chalcone synthase that was also present. The rapid induction of the enzyme in cell-suspension cultures upon addition of yeast extract and the toxicity of its product, p-hydroxybenzalacetone, to phytopathogenic fungi also suggest that the pathway may be part of a plant defense response. PMID:12226219

  16. Mitochondrial F1Fo-ATP synthase translocates to cell surface in hepatocytes and has high activity in tumor-like acidic and hypoxic environment.

    PubMed

    Ma, Zhan; Cao, Manlin; Liu, Yiwen; He, Yiqing; Wang, Yingzhi; Yang, Cuixia; Wang, Wenjuan; Du, Yan; Zhou, Muqing; Gao, Feng

    2010-08-01

    F1Fo-ATP synthase was originally thought to exclusively locate in the inner membrane of the mitochondria. However, recent studies prove the existence of ectopic F1Fo-ATP synthase on the outside of the cell membrane. Ectopic ATP synthase was proposed as a marker for tumor target therapy. Nevertheless, the protein transport mechanism of the ectopic ATP synthase is still unclear. The specificity of the ectopic ATP synthase, with regard to tumors, is questioned because of its widespread expression. In the current study, we constructed green fluorescent protein-ATP5B fusion protein and introduced it into HepG2 cells to study the localization of the ATP synthase. The expression of ATP5B was analyzed in six cell lines with different 'malignancies'. These cells were cultured in both normal and tumor-like acidic and hypoxic conditions. The results suggested that the ectopic expression of ATP synthase is a consequence of translocation from the mitochondria. The expression and catalytic activity of ectopic ATP synthase were similar on the surface of malignant cells as on the surface of less malignant cells. Interestingly, the expression of ectopic ATP synthase was not up-regulated in tumor-like acidic and hypoxic microenvironments. However, the catalytic activity of ectopic ATP synthase was up-regulated in tumor-like microenvironments. Therefore, the specificity of ectopic ATP synthase for tumor target therapy relies on the high level of catalytic activity that is observed in acidic and hypoxic microenvironments in tumor tissues.

  17. RNA-seq reveals transcriptome changes in goats following myostatin gene knockout

    PubMed Central

    Cai, Bei; Zhou, Shiwei; Zhu, Haijing; Qu, Lei; Wang, Xiaolong

    2017-01-01

    Myostatin (MSTN) is a powerful negative regulator of skeletal muscle mass in mammalian species that is primarily expressed in skeletal muscles, and mutations of its encoding gene can result in the double-muscling trait. In this study, the CRISPR/Cas9 technique was used to edit MSTN in Shaanbei Cashmere goats and generate knockout animals. RNA sequencing was used to determine and compare the transcriptome profiles of the muscles from three wild-type (WT) goats, three fibroblast growth factor 5 (FGF5) knockout goats (FGF5+/- group) and three goats with disrupted expression of both the FGF5 and MSTN genes (FM+/- group). The sequence reads were obtained using the Illumina HiSeq 2000 system and mapped to the Capra hircus reference genome using TopHat (v2.0.9). In total, 68.93, 62.04 and 66.26 million clean sequencing reads were obtained from the WT, FM+/- and FGF5+/- groups, respectively. There were 201 differentially expressed genes (DEGs) between the WT and FGF5+/- groups, with 86 down- and 115 up-regulated genes in the FGF5+/- group. Between the WT and FM+/- groups, 121 DEGs were identified, including 81 down- and 40 up-regulated genes in the FM+/- group. A total of 198 DEGs were detected between the FGF5+/- group and FM+/- group, with 128 down- and 70 up-regulated genes in the FM+/- group. At the transcriptome level, we found substantial changes in genes involved in fatty acid metabolism and the biosynthesis of unsaturated fatty acids, such as stearoyl-CoA dehydrogenase, 3-hydroxyacyl-CoA dehydratase 2, ELOVL fatty acid elongase 6 and fatty acid synthase, suggesting that the expression levels of these genes may be directly regulated by MSTN and that these genes are likely downstream targets of MSTN with potential roles in lipid metabolism in goats. Moreover, five randomly selected DEGs were further validated with qRT-PCR, and the results were consistent with the transcriptome analysis. The present study provides insight into the unique transcriptome profile of the MSTN knockout goat, which is a valuable resource for studying goat genomics. PMID:29228005
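
    The DEG counts quoted in this abstract can be cross-checked with simple arithmetic. The short Python sketch below is illustrative only: the variable and function names are hypothetical, and the numbers are taken directly from the abstract text rather than from the underlying dataset.

```python
# DEG counts as quoted in the abstract:
# (comparison, down-regulated, up-regulated, reported total)
deg_counts = [
    ("WT vs FGF5+/-", 86, 115, 201),
    ("WT vs FM+/-", 81, 40, 121),
    ("FGF5+/- vs FM+/-", 128, 70, 198),
]


def check_totals(rows):
    """Map each comparison to True when down + up equals the reported total."""
    return {name: down + up == total for name, down, up, total in rows}


results = check_totals(deg_counts)
for name, consistent in results.items():
    print(f"{name}: {'consistent' if consistent else 'inconsistent'}")
```

    Each of the three comparisons is internally consistent: the down- and up-regulated counts sum to the reported total.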

  18. 50 CFR 230.1 - Purpose and scope.

    Code of Federal Regulations, 2010 CFR

    2010-10-01

    .... Provisions of the Marine Mammal Protection Act of 1972 (16 U.S.C. 1361 et seq.) and the Endangered Species Act of 1973 (16 U.S.C. 1531 et seq.) also pertain to human interactions with whales. Rules elsewhere... in this part is to implement the Whaling Convention Act (16 U.S.C. 916 et seq.) by prohibiting...

  19. 50 CFR 230.1 - Purpose and scope.

    Code of Federal Regulations, 2011 CFR

    2011-10-01

    .... Provisions of the Marine Mammal Protection Act of 1972 (16 U.S.C. 1361 et seq.) and the Endangered Species Act of 1973 (16 U.S.C. 1531 et seq.) also pertain to human interactions with whales. Rules elsewhere... in this part is to implement the Whaling Convention Act (16 U.S.C. 916 et seq.) by prohibiting...

  20. 50 CFR 230.1 - Purpose and scope.

    Code of Federal Regulations, 2012 CFR

    2012-10-01

    .... Provisions of the Marine Mammal Protection Act of 1972 (16 U.S.C. 1361 et seq.) and the Endangered Species Act of 1973 (16 U.S.C. 1531 et seq.) also pertain to human interactions with whales. Rules elsewhere... in this part is to implement the Whaling Convention Act (16 U.S.C. 916 et seq.) by prohibiting...

  1. 50 CFR 230.1 - Purpose and scope.

    Code of Federal Regulations, 2014 CFR

    2014-10-01

    .... Provisions of the Marine Mammal Protection Act of 1972 (16 U.S.C. 1361 et seq.) and the Endangered Species Act of 1973 (16 U.S.C. 1531 et seq.) also pertain to human interactions with whales. Rules elsewhere... in this part is to implement the Whaling Convention Act (16 U.S.C. 916 et seq.) by prohibiting...

  2. 50 CFR 230.1 - Purpose and scope.

    Code of Federal Regulations, 2013 CFR

    2013-10-01

    .... Provisions of the Marine Mammal Protection Act of 1972 (16 U.S.C. 1361 et seq.) and the Endangered Species Act of 1973 (16 U.S.C. 1531 et seq.) also pertain to human interactions with whales. Rules elsewhere... in this part is to implement the Whaling Convention Act (16 U.S.C. 916 et seq.) by prohibiting...

  3. 15 CFR 922.163 - Prohibited activities-Sanctuary-wide.

    Code of Federal Regulations, 2013 CFR

    2013-01-01

    ..., (MMPA), 16 U.S.C. 1361 et seq., the Endangered Species Act, as amended, (ESA), 16 U.S.C. 1531 et seq... Florida, pursuant to applicable State law. See § 370.027, Florida Statutes and implementing regulations... Pollution Control Act (FWPCA), as amended, 33 U.S.C. 1322 et seq.; (C) Those authorized under Monroe County...

  4. 15 CFR 922.163 - Prohibited activities-Sanctuary-wide.

    Code of Federal Regulations, 2014 CFR

    2014-01-01

    ..., (MMPA), 16 U.S.C. 1361 et seq., the Endangered Species Act, as amended, (ESA), 16 U.S.C. 1531 et seq... Florida, pursuant to applicable State law. See § 370.027, Florida Statutes and implementing regulations... Pollution Control Act (FWPCA), as amended, 33 U.S.C. 1322 et seq.; (C) Those authorized under Monroe County...

  5. 50 CFR 12.24 - Petition for remission of forfeiture.

    Code of Federal Regulations, 2013 CFR

    2013-10-01

    ... in unlawful taking, subject to forfeiture under the Marine Mammal Protection Act, 16 U.S.C. 1361 et seq., or any person who has an interest in any property subject to forfeiture under the Endangered Species Act, 16 U.S.C. 1531 et seq.; Eagle Protection Act, 16 U.S.C. 668 et seq.; Airborne Hunting Act, 16...

  6. 15 CFR 922.163 - Prohibited activities-Sanctuary-wide.

    Code of Federal Regulations, 2012 CFR

    2012-01-01

    ..., (MMPA), 16 U.S.C. 1361 et seq., the Endangered Species Act, as amended, (ESA), 16 U.S.C. 1531 et seq... Florida, pursuant to applicable State law. See § 370.027, Florida Statutes and implementing regulations... Pollution Control Act (FWPCA), as amended, 33 U.S.C. 1322 et seq.; (C) Those authorized under Monroe County...

  7. 50 CFR 12.24 - Petition for remission of forfeiture.

    Code of Federal Regulations, 2012 CFR

    2012-10-01

    ... in unlawful taking, subject to forfeiture under the Marine Mammal Protection Act, 16 U.S.C. 1361 et seq., or any person who has an interest in any property subject to forfeiture under the Endangered Species Act, 16 U.S.C. 1531 et seq.; Eagle Protection Act, 16 U.S.C. 668 et seq.; Airborne Hunting Act, 16...

  8. 50 CFR 12.24 - Petition for remission of forfeiture.

    Code of Federal Regulations, 2011 CFR

    2011-10-01

    ... in unlawful taking, subject to forfeiture under the Marine Mammal Protection Act, 16 U.S.C. 1361 et seq., or any person who has an interest in any property subject to forfeiture under the Endangered Species Act, 16 U.S.C. 1531 et seq.; Eagle Protection Act, 16 U.S.C. 668 et seq.; Airborne Hunting Act, 16...

  9. 50 CFR 12.24 - Petition for remission of forfeiture.

    Code of Federal Regulations, 2014 CFR

    2014-10-01

    ... in unlawful taking, subject to forfeiture under the Marine Mammal Protection Act, 16 U.S.C. 1361 et seq., or any person who has an interest in any property subject to forfeiture under the Endangered Species Act, 16 U.S.C. 1531 et seq.; Eagle Protection Act, 16 U.S.C. 668 et seq.; Airborne Hunting Act, 16...

  10. 50 CFR 12.24 - Petition for remission of forfeiture.

    Code of Federal Regulations, 2010 CFR

    2010-10-01

    ... in unlawful taking, subject to forfeiture under the Marine Mammal Protection Act, 16 U.S.C. 1361 et seq., or any person who has an interest in any property subject to forfeiture under the Endangered Species Act, 16 U.S.C. 1531 et seq.; Eagle Protection Act, 16 U.S.C. 668 et seq.; Airborne Hunting Act, 16...

  11. Evaluation of the utility of the new rainbow trout genome assembly for analyzing RNA-seq data from stress response experiments

    USDA-ARS's Scientific Manuscript database

    The newly released rainbow trout genome assembly in NCBI RefSeq has greatly expanded our abilities for analyzing rainbow trout sequencing data. In this poster, we evaluate the utility of this genome assembly for analyzing RNA sequencing (RNA-seq) data of rainbow trout responses to various stressors,...

  12. Applications of Redwood Genotyping by Using Microsatellite Markers

    Treesearch

    Chris Brinegar; Dan Bruno; Ryan Kirkbride; Steven Glavas; Ingrid Udranszky

    2007-01-01

    A panel of polymorphic microsatellite markers has been developed in coast redwood (Sequoia sempervirens). Two loci in particular (Seq18D7-3 and Seq21E5) demonstrate the potential of microsatellite genotyping in the assessment of genetic diversity and inheritance in redwoods. The highly polymorphic Seq18D7-3 marker provided evidence for the planting...

  13. 40 CFR 123.27 - Requirements for enforcement authority.

    Code of Federal Regulations, 2013 CFR

    2013-07-01

    ... established under § 123.34. (Clean Water Act (33 U.S.C. 1251 et seq.), Safe Drinking Water Act (42 U.S.C. 300f et seq.), Clean Air Act (42 U.S.C. 7401 et seq.), Resource Conservation and Recovery Act (42 U.S.C.... 123.27 Section 123.27 Protection of Environment ENVIRONMENTAL PROTECTION AGENCY (CONTINUED) WATER...

  14. 40 CFR 123.27 - Requirements for enforcement authority.

    Code of Federal Regulations, 2011 CFR

    2011-07-01

    ... established under § 123.34. (Clean Water Act (33 U.S.C. 1251 et seq.), Safe Drinking Water Act (42 U.S.C. 300f et seq.), Clean Air Act (42 U.S.C. 7401 et seq.), Resource Conservation and Recovery Act (42 U.S.C.... 123.27 Section 123.27 Protection of Environment ENVIRONMENTAL PROTECTION AGENCY (CONTINUED) WATER...

  15. 40 CFR 123.27 - Requirements for enforcement authority.

    Code of Federal Regulations, 2014 CFR

    2014-07-01

    ... established under § 123.34. (Clean Water Act (33 U.S.C. 1251 et seq.), Safe Drinking Water Act (42 U.S.C. 300f et seq.), Clean Air Act (42 U.S.C. 7401 et seq.), Resource Conservation and Recovery Act (42 U.S.C.... 123.27 Section 123.27 Protection of Environment ENVIRONMENTAL PROTECTION AGENCY (CONTINUED) WATER...

  16. 40 CFR 123.27 - Requirements for enforcement authority.

    Code of Federal Regulations, 2012 CFR

    2012-07-01

    ... established under § 123.34. (Clean Water Act (33 U.S.C. 1251 et seq.), Safe Drinking Water Act (42 U.S.C. 300f et seq.), Clean Air Act (42 U.S.C. 7401 et seq.), Resource Conservation and Recovery Act (42 U.S.C.... 123.27 Section 123.27 Protection of Environment ENVIRONMENTAL PROTECTION AGENCY (CONTINUED) WATER...

  17. 40 CFR 720.3 - Definitions.

    Code of Federal Regulations, 2011 CFR

    2011-07-01

    ..., Drug, and Cosmetic Act, 21 U.S.C. 321 et seq., and the regulations issued under it. In addition, the..., 21 U.S.C. 453 et seq.; meats and meat food products, as defined in the Federal Meat Inspection Act, 21 U.S.C. 60 et seq.; and eggs and egg products, as defined in the Egg Products Inspection Act, 21 U...

  18. 40 CFR 720.3 - Definitions.

    Code of Federal Regulations, 2010 CFR

    2010-07-01

    ..., Drug, and Cosmetic Act, 21 U.S.C. 321 et seq., and the regulations issued under it. In addition, the..., 21 U.S.C. 453 et seq.; meats and meat food products, as defined in the Federal Meat Inspection Act, 21 U.S.C. 60 et seq.; and eggs and egg products, as defined in the Egg Products Inspection Act, 21 U...

  19. 33 CFR 151.1018 - Withdrawal of a conditional permit.

    Code of Federal Regulations, 2011 CFR

    2011-07-01

    ... COMMERCIAL WASTE, AND BALLAST WATER Transportation of Municipal and Commercial Waste § 151.1018 Withdrawal of... 1988 (33 U.S.C. 2601 et seq.); (2) The Solid Waste Disposal Act (42 U.S.C. 6901 et seq.); (3) The Marine Protection, Research, and Sanctuaries Act of 1972 (33 U.S.C. 1401 et seq.); (4) The Rivers and...

  20. In silico site-directed mutagenesis informs species-specific predictions of chemical susceptibility derived from the Sequence Alignment to Predict Across Species Susceptibility (SeqAPASS) tool

    EPA Science Inventory

    The Sequence Alignment to Predict Across Species Susceptibility (SeqAPASS) tool was developed to address needs for rapid, cost effective methods of species extrapolation of chemical susceptibility. Specifically, the SeqAPASS tool compares the primary sequence (Level 1), functiona...

  1. Characterization of a monoterpene synthase from Paeonia lactiflora producing α-pinene as its single product.

    PubMed

    Ma, Xiaohui; Guo, Juan; Ma, Ying; Jin, Baolong; Zhan, Zhilai; Yuan, Yuan; Huang, Luqi

    2016-07-01

    The aim of this study was to identify a terpene synthase that catalyzes the conversion of geranyl pyrophosphate (GPP) to α-pinene and is involved in the biosynthesis of paeoniflorin. Two new terpene synthase genes were isolated from the transcriptome data of Paeonia lactiflora. Phylogenetic analysis and sequence characterization revealed that one gene, named PlPIN, encoded a monoterpene synthase that might be involved in the biosynthesis of paeoniflorin. An in vitro enzyme assay showed that, in contrast to most monoterpene synthases, PlPIN encoded an α-pinene synthase that converted GPP into α-pinene as a single product. This newly identified α-pinene synthase could be used for improving paeoniflorin accumulation by metabolic engineering or for producing α-pinene via synthetic biology.

  2. Id-1 promotes osteosarcoma cell growth and inhibits cell apoptosis via PI3K/AKT signaling pathway

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Hao, Liang; Liao, Qi; Tang, Qiang

    2016-02-12

    Accumulating evidence reveals that Id-1 is upregulated and functions as a potential tumor promoter in several human cancer types. However, the role of Id-1 in osteosarcoma (OS) is unknown. In the present study, we found that Id-1 expression was higher in OS tissues than in adjacent normal bone tissues. More importantly, we demonstrated that overexpression of Id-1 is significantly correlated with tumor progression and poor survival in OS patients. Furthermore, increased expression of Id-1 was observed in OS cell lines, and ectopic expression of Id-1 significantly enhanced in vitro cell proliferation and promoted in vivo tumor growth, whereas knockdown of Id-1 suppressed OS cell growth. Moreover, our experimental data revealed that Id-1 promotes cell proliferation by facilitating cell cycle progression and inhibits cell apoptosis. Mechanistically, the effects of Id-1 in OS cells are at least partly mediated through activation of the PI3K/Akt signaling pathway. Therefore, we identified a tumorigenic role of Id-1 in OS and suggested a potential therapeutic target for OS patients. - Highlights: • Id-1 expression is positively correlated with poor prognosis in OS patients. • Overexpression of Id-1 promotes OS cell growth in vitro and in vivo. • Id-1 induces cell cycle progression and inhibits cell apoptosis. • The PI3K/Akt signaling pathway contributed to the oncogenic effects of Id-1 in OS cells.

  3. The inhibitor of differentiation-1 (Id1) enables lung cancer liver colonization through activation of an EMT program in tumor cells and establishment of the pre-metastatic niche.

    PubMed

    Castañón, Eduardo; Soltermann, Alex; López, Inés; Román, Marta; Ecay, Margarita; Collantes, María; Redrado, Miriam; Baraibar, Iosune; López-Picazo, José María; Rolfo, Christian; Vidal-Vanaclocha, Fernando; Raez, Luis; Weder, Walter; Calvo, Alfonso; Gil-Bazo, Ignacio

    2017-08-28

    Id1 promotes carcinogenesis and metastasis, and predicts prognosis of non-small cell lung cancer (NSCLC)-adenocarcinoma patients. We hypothesized that Id1 may play a critical role in lung cancer colonization of the liver by affecting both tumor cells and the microenvironment. Depleted levels of Id1 in LLC (Lewis lung carcinoma cells, LLC shId1) significantly reduced cell proliferation and migration in vitro. Genetic loss of Id1 in the host tissue (Id1 -/- mice) impaired liver colonization and increased survival of Id1 -/- animals. Histologically, the presence of Id1 in tumor cells of liver metastasis was responsible for liver colonization. Microarray analysis comparing liver tumor nodules from Id1 +/+ mice and Id1 -/- mice injected with LLC control cells revealed that Id1 loss reduces the levels of EMT-related proteins, such as vimentin. In tissue microarrays containing 532 NSCLC patients' samples, we found that Id1 significantly correlated with vimentin and other EMT-related proteins. Id1 loss decreased the levels of vimentin, integrin β1, TGFβ1 and snail, both in vitro and in vivo. Therefore, Id1 enables both LLC and the host microenvironment for effective liver colonization, and may represent a novel therapeutic target to avoid NSCLC liver metastasis. Copyright © 2017 Elsevier B.V. All rights reserved.

  4. The Self-Identity Protein IdsD Is Communicated between Cells in Swarming Proteus mirabilis Colonies.

    PubMed

    Saak, Christina C; Gibbs, Karine A

    2016-12-15

    Proteus mirabilis is a social bacterium that is capable of self (kin) versus nonself recognition. Swarming colonies of this bacterium expand outward on surfaces to centimeter-scale distances due to the collective motility of individual cells. Colonies of genetically distinct populations remain separate, while those of identical populations merge. Ids proteins are essential for this recognition behavior. Two of these proteins, IdsD and IdsE, encode identity information for each strain. These two proteins bind in vitro in an allele-restrictive manner. IdsD-IdsE binding is correlated with the merging of populations, whereas a lack of binding is correlated with the separation of populations. Key questions remained about the in vivo interactions of IdsD and IdsE, specifically, whether IdsD and IdsE bind within single cells or whether IdsD-IdsE interactions occur across neighboring cells and, if so, which of the two proteins is exchanged. Here we demonstrate that IdsD must originate from another cell to communicate identity and that this nonresident IdsD interacts with IdsE resident in the recipient cell. Furthermore, we show that unbound IdsD in recipient cells does not cause cell death and instead appears to contribute to a restriction in the expansion radius of the swarming colony. We conclude that P. mirabilis communicates IdsD between neighboring cells for nonlethal kin recognition, which suggests that the Ids proteins constitute a type of cell-cell communication. We demonstrate that self (kin) versus nonself recognition in P. mirabilis entails the cell-cell communication of an identity-encoding protein that is exported from one cell and received by another. We further show that this intercellular exchange affects swarm colony expansion in a nonlethal manner, which adds social communication to the list of potential swarm-related regulatory factors. Copyright © 2016, American Society for Microbiology. All Rights Reserved.

  5. Role of ID Proteins in BMP4 Inhibition of Profibrotic Effects of TGF-β2 in Human TM Cells.

    PubMed

    Mody, Avani A; Wordinger, Robert J; Clark, Abbot F

    2017-02-01

    Increased expression of TGF-β2 in primary open-angle glaucoma (POAG) aqueous humor (AH) and trabecular meshwork (TM) causes deposition of extracellular matrix (ECM) in the TM and elevated IOP. Bone morphogenetic proteins (BMPs) regulate TGF-β2-induced ECM production. The underlying mechanism for BMP4 inhibition of TGF-β2-induced fibrosis remains undetermined. Bone morphogenic protein 4 induces inhibitor of DNA binding proteins (ID1, ID3), which suppress transcription factor activities to regulate gene expression. Our study will determine whether ID1 and ID3 proteins are downstream targets of BMP4, which attenuates TGF-β2 induction of ECM proteins in TM cells. Primary human TM cells were treated with BMP4, and ID1 and ID3 mRNA and protein expression was determined by quantitative PCR (Q-PCR) and Western immunoblotting. Intracellular ID1 and ID3 protein localization was studied by immunocytochemistry. Transformed human TM cells (GTM3 cells) were transfected with ID1 or ID3 expression vectors to determine their potential inhibitory effects on TGF-β2-induced fibronectin and plasminogen activator inhibitor-1 (PAI-1) protein expression. Basal expression of ID1-3 was detected in primary human TM cells. Bone morphogenic protein 4 significantly induced early expression of ID1 and ID3 mRNA (P < 0.05) and protein in primary TM cells, and a BMP receptor inhibitor blocked this induction. Overexpression of ID1 and ID3 significantly inhibited TGF-β2-induced expression of fibronectin and PAI-1 in TM cells (P < 0.01). Bone morphogenic protein 4-induced ID1 and ID3 expression suppresses TGF-β2 profibrotic activity in human TM cells. In the future, targeting specific regulators may control the TGF-β2 profibrotic effects on the TM, leading to disease-modifying IOP-lowering therapies.

  6. The Self-Identity Protein IdsD Is Communicated between Cells in Swarming Proteus mirabilis Colonies

    PubMed Central

    Saak, Christina C.

    2016-01-01

    ABSTRACT Proteus mirabilis is a social bacterium that is capable of self (kin) versus nonself recognition. Swarming colonies of this bacterium expand outward on surfaces to centimeter-scale distances due to the collective motility of individual cells. Colonies of genetically distinct populations remain separate, while those of identical populations merge. Ids proteins are essential for this recognition behavior. Two of these proteins, IdsD and IdsE, encode identity information for each strain. These two proteins bind in vitro in an allele-restrictive manner. IdsD-IdsE binding is correlated with the merging of populations, whereas a lack of binding is correlated with the separation of populations. Key questions remained about the in vivo interactions of IdsD and IdsE, specifically, whether IdsD and IdsE bind within single cells or whether IdsD-IdsE interactions occur across neighboring cells and, if so, which of the two proteins is exchanged. Here we demonstrate that IdsD must originate from another cell to communicate identity and that this nonresident IdsD interacts with IdsE resident in the recipient cell. Furthermore, we show that unbound IdsD in recipient cells does not cause cell death and instead appears to contribute to a restriction in the expansion radius of the swarming colony. We conclude that P. mirabilis communicates IdsD between neighboring cells for nonlethal kin recognition, which suggests that the Ids proteins constitute a type of cell-cell communication. IMPORTANCE We demonstrate that self (kin) versus nonself recognition in P. mirabilis entails the cell-cell communication of an identity-encoding protein that is exported from one cell and received by another. We further show that this intercellular exchange affects swarm colony expansion in a nonlethal manner, which adds social communication to the list of potential swarm-related regulatory factors. PMID:27672195

  7. Predicting gene regulatory networks of soybean nodulation from RNA-Seq transcriptome data.

    PubMed

    Zhu, Mingzhu; Dahmen, Jeremy L; Stacey, Gary; Cheng, Jianlin

    2013-09-22

    High-throughput RNA sequencing (RNA-Seq) is a revolutionary technique to study the transcriptome of a cell under various conditions at a systems level. Despite the wide application of RNA-Seq techniques to generate experimental data in the last few years, few computational methods are available to analyze this huge amount of transcription data. The computational methods for constructing gene regulatory networks from RNA-Seq expression data of hundreds or even thousands of genes are particularly lacking and urgently needed. We developed an automated bioinformatics method to predict gene regulatory networks from the quantitative expression values of differentially expressed genes based on RNA-Seq transcriptome data of a cell in different stages and conditions, integrating transcriptional, genomic and gene function data. We applied the method to the RNA-Seq transcriptome data generated for soybean root hair cells in three different development stages of nodulation after rhizobium infection. The method predicted a soybean nodulation-related gene regulatory network consisting of 10 regulatory modules common for all three stages, and 24, 49 and 70 modules separately for the first, second and third stage, each containing both a group of co-expressed genes and several transcription factors collaboratively controlling their expression under different conditions. 8 of 10 common regulatory modules were validated by at least two kinds of validations, such as independent DNA binding motif analysis, gene function enrichment test, and previous experimental data in the literature. We developed a computational method to reliably reconstruct gene regulatory networks from RNA-Seq transcriptome data. The method can generate valuable hypotheses for interpreting biological data and designing biological experiments such as ChIP-Seq, RNA interference, and yeast two hybrid experiments.

  8. ORMAN: optimal resolution of ambiguous RNA-Seq multimappings in the presence of novel isoforms.

    PubMed

    Dao, Phuong; Numanagić, Ibrahim; Lin, Yen-Yi; Hach, Faraz; Karakoc, Emre; Donmez, Nilgun; Collins, Colin; Eichler, Evan E; Sahinalp, S Cenk

    2014-03-01

    RNA-Seq technology is promising to uncover many novel alternative splicing events, gene fusions and other variations in RNA transcripts. For an accurate detection and quantification of transcripts, it is important to resolve the mapping ambiguity for those RNA-Seq reads that can be mapped to multiple loci: >17% of the reads from mouse RNA-Seq data and 50% of the reads from some plant RNA-Seq data have multiple mapping loci. In this study, we show how to resolve the mapping ambiguity in the presence of novel transcriptomic events such as exon skipping and novel indels towards accurate downstream analysis. We introduce ORMAN (Optimal Resolution of Multimapping Ambiguity of RNA-Seq Reads), which aims to compute the minimum number of potential transcript products for each gene and to assign each multimapping read to one of these transcripts based on the estimated distribution of the region covering the read. ORMAN achieves this objective through a combinatorial optimization formulation, which is solved through well-known approximation algorithms, integer linear programs and heuristics. On a simulated RNA-Seq dataset including a random subset of transcripts from the UCSC database, the performance of several state-of-the-art methods for identifying and quantifying novel transcripts, such as Cufflinks, IsoLasso and CLIIQ, is significantly improved through the use of ORMAN. Furthermore, in an experiment using real RNA-Seq reads, we show that ORMAN is able to resolve multimapping to produce coverage values that are similar to the original distribution, even in genes with highly non-uniform coverage. ORMAN is available at http://orman.sf.net

  9. The power and promise of RNA-seq in ecology and evolution.

    PubMed

    Todd, Erica V; Black, Michael A; Gemmell, Neil J

    2016-03-01

    Reference is regularly made to the power of new genomic sequencing approaches. Using powerful technology, however, is not the same as having the necessary power to address a research question with statistical robustness. In the rush to adopt new and improved genomic research methods, limitations of technology and experimental design may be initially neglected. Here, we review these issues with regard to RNA sequencing (RNA-seq). RNA-seq adds large-scale transcriptomics to the toolkit of ecological and evolutionary biologists, enabling differential gene expression (DE) studies in nonmodel species without the need for prior genomic resources. High biological variance is typical of field-based gene expression studies and means that larger sample sizes are often needed to achieve the same degree of statistical power as clinical studies based on data from cell lines or inbred animal models. Sequencing costs have plummeted, yet RNA-seq studies still underutilize biological replication. Finite research budgets force a trade-off between sequencing effort and replication in RNA-seq experimental design. However, clear guidelines for negotiating this trade-off, while taking into account study-specific factors affecting power, are currently lacking. Study designs that prioritize sequencing depth over replication fail to capitalize on the power of RNA-seq technology for DE inference. Significant recent research effort has gone into developing statistical frameworks and software tools for power analysis and sample size calculation in the context of RNA-seq DE analysis. We synthesize progress in this area and derive an accessible rule-of-thumb guide for designing powerful RNA-seq experiments relevant in eco-evolutionary and clinical settings alike. © 2016 John Wiley & Sons Ltd.

  10. An empirical strategy to detect bacterial transcript structure from directional RNA-seq transcriptome data.

    PubMed

    Wang, Yejun; MacKenzie, Keith D; White, Aaron P

    2015-05-07

    As sequencing costs are being lowered continuously, RNA-seq has gradually been adopted as the first choice for comparative transcriptome studies with bacteria. Unlike microarrays, RNA-seq can directly detect cDNA derived from mRNA transcripts at a single nucleotide resolution. Not only does this allow researchers to determine the absolute expression level of genes, but it also conveys information about transcript structure. Few automatic software tools have yet been established to investigate large-scale RNA-seq data for bacterial transcript structure analysis. In this study, 54 directional RNA-seq libraries from Salmonella serovar Typhimurium (S. Typhimurium) 14028s were examined for potential relationships between read mapping patterns and transcript structure. We developed an empirical method, combined with statistical tests, to automatically detect key transcript features, including transcriptional start sites (TSSs), transcriptional termination sites (TTSs) and operon organization. Using our method, we obtained 2,764 TSSs and 1,467 TTSs for 1,331 and 844 different genes, respectively. Identification of TSSs facilitated further discrimination of 215 putative sigma 38 regulons and 863 potential sigma 70 regulons. Combining the TSSs and TTSs with intergenic distance and co-expression information, we comprehensively annotated the operon organization in S. Typhimurium 14028s. Our results show that directional RNA-seq can be used to detect transcriptional borders at an acceptable resolution of ±10-20 nucleotides. Technical limitations of the RNA-seq procedure may prevent single nucleotide resolution. The automatic transcript border detection methods, statistical models and operon organization pipeline that we have described could be widely applied to RNA-seq studies in other bacteria. Furthermore, the TSSs, TTSs, operons, promoters and untranslated regions that we have defined for S. Typhimurium 14028s may constitute valuable resources that can be used for comparative analyses with other Salmonella serotypes.

  11. Landscape of DNA Virus Associations across Human Malignant Cancers: Analysis of 3,775 Cases Using RNA-Seq

    PubMed Central

    Tannir, Nizar M.; Williams, Michelle D.; Chen, Yunxin; Yao, Hui; Zhang, Jianping; Thompson, Erika J.; Meric-Bernstam, Funda; Medeiros, L. Jeffrey; Weinstein, John N.

    2013-01-01

    Elucidation of tumor-DNA virus associations in many cancer types has enhanced our knowledge of fundamental oncogenesis mechanisms and provided a basis for cancer prevention initiatives. RNA-Seq is a novel tool to comprehensively assess such associations. We interrogated RNA-Seq data from 3,775 malignant neoplasms in The Cancer Genome Atlas database for the presence of viral sequences. Viral integration sites were also detected in expressed transcripts using a novel approach. The detection capacity of RNA-Seq was compared to available clinical laboratory data. Human papillomavirus (HPV) transcripts were detected using RNA-Seq analysis in head-and-neck squamous cell carcinoma, uterine endometrioid carcinoma, and squamous cell carcinoma of the lung. Detection of HPV by RNA-Seq correlated with detection by in situ hybridization and immunohistochemistry in squamous cell carcinoma tumors of the head and neck. Hepatitis B virus and Epstein-Barr virus (EBV) were detected using RNA-Seq in hepatocellular carcinoma and gastric carcinoma tumors, respectively. Integration sites of viral genes and oncogenes were detected in cancers harboring HPV or hepatitis B virus but not in EBV-positive gastric carcinoma. Integration sites of expressed viral transcripts frequently involved known coding areas of the host genome. No DNA virus transcripts were detected in acute myeloid leukemia, cutaneous melanoma, low- and high-grade gliomas of the brain, and adenocarcinomas of the breast, colon and rectum, lung, prostate, ovary, kidney, and thyroid. In conclusion, this study provides a large-scale overview of the landscape of DNA viruses in human malignant cancers. While further validation is necessary for specific cancer types, our findings highlight the utility of RNA-Seq in detecting tumor-associated DNA viruses and identifying viral integration sites that may unravel novel mechanisms of cancer pathogenesis. PMID:23740984

  12. Peregrine

    PubMed Central

    Langevin, Stanley A.; Bent, Zachary W.; Solberg, Owen D.; Curtis, Deanna J.; Lane, Pamela D.; Williams, Kelly P.; Schoeniger, Joseph S.; Sinha, Anupama; Lane, Todd W.; Branda, Steven S.

    2013-01-01

    Use of second generation sequencing (SGS) technologies for transcriptional profiling (RNA-Seq) has revolutionized transcriptomics, enabling measurement of RNA abundances with unprecedented specificity and sensitivity and the discovery of novel RNA species. Preparation of RNA-Seq libraries requires conversion of the RNA starting material into cDNA flanked by platform-specific adaptor sequences. Each of the published methods and commercial kits currently available for RNA-Seq library preparation suffers from at least one major drawback, including long processing times, large starting material requirements, uneven coverage, loss of strand information and high cost. We report the development of a new RNA-Seq library preparation technique that produces representative, strand-specific RNA-Seq libraries from small amounts of starting material in a fast, simple and cost-effective manner. Additionally, we have developed a new quantitative PCR-based assay for precisely determining the number of PCR cycles to perform for optimal enrichment of the final library, a key step in all SGS library preparation workflows. PMID:23558773

  13. Measuring Sister Chromatid Cohesion Protein Genome Occupancy in Drosophila melanogaster by ChIP-seq.

    PubMed

    Dorsett, Dale; Misulovin, Ziva

    2017-01-01

    This chapter presents methods to conduct and analyze genome-wide chromatin immunoprecipitation of the cohesin complex and the Nipped-B cohesin loading factor in Drosophila cells using high-throughput DNA sequencing (ChIP-seq). Procedures for isolation of chromatin, immunoprecipitation, and construction of sequencing libraries for the Ion Torrent Proton high-throughput sequencer are detailed, and computational methods to calculate occupancy as input-normalized fold-enrichment are described. The results obtained by ChIP-seq are compared to those obtained by ChIP-chip (genomic ChIP using tiling microarrays), and the effects of sequencing depth on accuracy are analyzed. ChIP-seq provides sensitivity and reproducibility similar to those of ChIP-chip, and identifies the same broad regions of occupancy. The locations of enrichment peaks, however, can differ between ChIP-chip and ChIP-seq, and low sequencing depth can splinter broad regions of occupancy into distinct peaks.
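
    The abstract above mentions calculating occupancy as input-normalized fold-enrichment without giving the formula. A minimal Python sketch of one common way to do this follows; the function name, the per-bin layout, the reads-per-million scaling and the pseudocount are all assumptions made for illustration, not the chapter's actual procedure.

```python
import numpy as np

def fold_enrichment(ip_counts, input_counts, pseudocount=1.0):
    """Input-normalized fold-enrichment per genomic bin (illustrative only).

    Both libraries are scaled to reads-per-million before the ratio is
    taken, so a deeper IP library does not look like higher occupancy.
    The pseudocount (an assumption) keeps empty bins from dividing by zero.
    """
    ip = np.asarray(ip_counts, dtype=float)
    inp = np.asarray(input_counts, dtype=float)
    # Depth-normalize each library, including the pseudocounts in the total.
    ip_rpm = (ip + pseudocount) / (ip.sum() + pseudocount * ip.size) * 1e6
    inp_rpm = (inp + pseudocount) / (inp.sum() + pseudocount * inp.size) * 1e6
    return ip_rpm / inp_rpm
```

    With this definition, a bin whose share of IP reads equals its share of input reads has an enrichment of 1, regardless of how deeply each library was sequenced.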

  14. TRAPR: R Package for Statistical Analysis and Visualization of RNA-Seq Data.

    PubMed

    Lim, Jae Hyun; Lee, Soo Youn; Kim, Ju Han

    2017-03-01

    High-throughput transcriptome sequencing, also known as RNA sequencing (RNA-Seq), is a standard technology for measuring gene expression with unprecedented accuracy. Numerous Bioconductor packages have been developed for the statistical analysis of RNA-Seq data. However, these tools focus on specific aspects of the data analysis pipeline, and are difficult to integrate appropriately with one another because of their disparate data structures and processing methods. They also lack visualization methods to confirm the integrity of the data and the process. In this paper, we propose an R-based RNA-Seq analysis pipeline called TRAPR, an integrated tool that facilitates the statistical analysis and visualization of RNA-Seq expression data. TRAPR provides various functions for data management, the filtering of low-quality data, normalization, transformation, statistical analysis, data visualization, and result visualization that allow researchers to build customized analysis pipelines.

  15. A Methodological Framework for Instructional Design Model Development: Critical Dimensions and Synthesized Procedures

    ERIC Educational Resources Information Center

    Lee, Jihyun; Jang, Seonyoung

    2014-01-01

    Instructional design (ID) models have been developed to promote understandings of ID reality and guide ID performance. As the number and diversity of ID practices grows, implicit doubts regarding the reliability, validity, and usefulness of ID models suggest the need for methodological guidance that would help to generate ID models that are…

  16. Id-1 and Id-2 genes and products as therapeutic targets for treatment of breast cancer and other types of carcinoma

    DOEpatents

    Desprez, Pierre-Yves; Campisi, Judith

    2014-09-30

    A method for treatment and amelioration of breast, cervical, ovarian, endometrial, squamous cells, prostate cancer and melanoma in a patient comprising targeting Id-1 or Id-2 gene expression with a delivery vehicle comprising a product which modulates Id-1 or Id-2 expression.

  17. Molecular Diversity of Terpene Synthases in the Liverwort Marchantia polymorpha

    PubMed Central

    Zhuang, Xun; Jiang, Zuodong; Jia, Qidong; Babbitt, Patricia C.

    2016-01-01

    Marchantia polymorpha is a basal terrestrial land plant, which like most liverworts accumulates structurally diverse terpenes believed to serve in deterring disease and herbivory. Previous studies have suggested that the mevalonate and methylerythritol phosphate pathways, present in evolutionarily diverged plants, are also operative in liverworts. However, the genes and enzymes responsible for the chemical diversity of terpenes have yet to be described. In this study, we resorted to a HMMER search tool to identify 17 putative terpene synthase genes from M. polymorpha transcriptomes. Functional characterization identified four diterpene synthase genes phylogenetically related to those found in diverged plants and nine rather unusual monoterpene and sesquiterpene synthase-like genes. The presence of separate monofunctional diterpene synthases for ent-copalyl diphosphate and ent-kaurene biosynthesis is similar to orthologs found in vascular plants, pushing the date of the underlying gene duplication and neofunctionalization of the ancestral diterpene synthase gene family to >400 million years ago. By contrast, the mono- and sesquiterpene synthases represent a distinct class of enzymes, not related to previously described plant terpene synthases and only distantly so to microbial-type terpene synthases. The absence of a Mg2+ binding, aspartate-rich, DDXXD motif places these enzymes in a noncanonical family of terpene synthases. PMID:27650333

  18. Interactions of citrate synthases from osmoconforming and osmoregulating animals with salt: possible signs of molecular eco-adaptation?

    PubMed

    Sarkissian, I V

    1977-01-01

    This study considers differential sensitivity of citrate synthase (citrate oxaloacetate-lyase [CoA acetylating]) EC 4.1.3.7. from an osmoconforming animal (sea anemone) and an osmoregulating animal (the pig) to salt. Attention is drawn to the fact that the osmoconforming sea anemone is in essence a sessile creature while the pig is readily mobile and able to change its ionic environment at will. It had been shown earlier that citrate synthase from another osmoconformer (oyster) is also not sensitive to ionic strength while citrate synthase from osmoregulating white shrimp is sensitive to increasing levels of salt. However, these enzymes are characteristically regulated by ATP and alpha-ketoglutarate. Both forms of citrate synthase are denatured by 6 M guanidine hydrochloride and are aided by salt levels in their refolding but the rate and extent of refolding of the osmoconformer citrate synthase are greater than those of the osmoregulator citrate synthase. Catalytic activity of both forms of citrate synthase is inhibited by incubation in distilled water; osmoconformer citrate synthase was inhibited completely in 7 h while osmoregulator citrate synthase was inhibited only 60% in this time and 80% after 22 h in distilled water. The eco-adaptive and evolutionary implications of these findings are discussed.

  19. Microbe-ID: an open source toolbox for microbial genotyping and species identification.

    PubMed

    Tabima, Javier F; Everhart, Sydney E; Larsen, Meredith M; Weisberg, Alexandra J; Kamvar, Zhian N; Tancos, Matthew A; Smart, Christine D; Chang, Jeff H; Grünwald, Niklaus J

    2016-01-01

    Development of tools to identify species, genotypes, or novel strains of invasive organisms is critical for monitoring emergence and implementing rapid response measures. Molecular markers, although critical to identifying species or genotypes, require bioinformatic tools for analysis. However, user-friendly analytical tools for fast identification are not readily available. To address this need, we created a web-based set of applications called Microbe-ID that allow for customizing a toolbox for rapid species identification and strain genotyping using any genetic markers of choice. Two components of Microbe-ID, named Sequence-ID and Genotype-ID, implement species and genotype identification, respectively. Sequence-ID allows identification of species by using BLAST to query sequences for any locus of interest against a custom reference sequence database. Genotype-ID allows placement of an unknown multilocus marker in either a minimum spanning network or dendrogram with bootstrap support from a user-created reference database. Microbe-ID can be used for identification of any organism based on nucleotide sequences or any molecular marker type and several examples are provided. We created a public website for demonstration purposes called Microbe-ID (microbe-id.org) and provided a working implementation for the genus Phytophthora (phytophthora-id.org). In Phytophthora-ID, the Sequence-ID application allows identification based on ITS or cox spacer sequences. Genotype-ID groups individuals into clonal lineages based on simple sequence repeat (SSR) markers for the two invasive plant pathogen species P. infestans and P. ramorum. All code is open source and available on github and CRAN. Instructions for installation and use are provided at https://github.com/grunwaldlab/Microbe-ID.

  20. 30 CFR 773.5 - Regulatory coordination with requirements under other laws.

    Code of Federal Regulations, 2013 CFR

    2013-07-01

    ... operations with applicable requirements of the Endangered Species Act of 1973, as amended (16 U.S.C. 1531 et seq.); the Fish and Wildlife Coordination Act, as amended (16 U.S.C. 661 et seq.); the Migratory Bird Treaty Act of 1918, as amended (16 U.S.C. 703 et seq.); The National Historic Preservation Act of 1966...

  1. 30 CFR 773.5 - Regulatory coordination with requirements under other laws.

    Code of Federal Regulations, 2011 CFR

    2011-07-01

    ... operations with applicable requirements of the Endangered Species Act of 1973, as amended (16 U.S.C. 1531 et seq.); the Fish and Wildlife Coordination Act, as amended (16 U.S.C. 661 et seq.); the Migratory Bird Treaty Act of 1918, as amended (16 U.S.C. 703 et seq.); The National Historic Preservation Act of 1966...

  2. 30 CFR 773.5 - Regulatory coordination with requirements under other laws.

    Code of Federal Regulations, 2014 CFR

    2014-07-01

    ... operations with applicable requirements of the Endangered Species Act of 1973, as amended (16 U.S.C. 1531 et seq.); the Fish and Wildlife Coordination Act, as amended (16 U.S.C. 661 et seq.); the Migratory Bird Treaty Act of 1918, as amended (16 U.S.C. 703 et seq.); The National Historic Preservation Act of 1966...

  3. 30 CFR 773.5 - Regulatory coordination with requirements under other laws.

    Code of Federal Regulations, 2012 CFR

    2012-07-01

    ... operations with applicable requirements of the Endangered Species Act of 1973, as amended (16 U.S.C. 1531 et seq.); the Fish and Wildlife Coordination Act, as amended (16 U.S.C. 661 et seq.); the Migratory Bird Treaty Act of 1918, as amended (16 U.S.C. 703 et seq.); The National Historic Preservation Act of 1966...

  4. 30 CFR 773.5 - Regulatory coordination with requirements under other laws.

    Code of Federal Regulations, 2010 CFR

    2010-07-01

    ... operations with applicable requirements of the Endangered Species Act of 1973, as amended (16 U.S.C. 1531 et seq.); the Fish and Wildlife Coordination Act, as amended (16 U.S.C. 661 et seq.); the Migratory Bird Treaty Act of 1918, as amended (16 U.S.C. 703 et seq.); The National Historic Preservation Act of 1966...

  5. 50 CFR 12.23 - Administrative forfeiture proceedings.

    Code of Federal Regulations, 2011 CFR

    2011-10-01

    ... Protection Act, 16 U.S.C. 668 et seq., or Airborne Hunting Act, 16 U.S.C. 742j-1, or any wildlife or plant subject to forfeiture under the Endangered Species Act, 16 U.S.C. 1531 et seq., or any fish, wildlife or plant subject to forfeiture under the Lacey Act Amendments of 1981, 16 U.S.C. 3371 et seq., is...

  6. 50 CFR 12.23 - Administrative forfeiture proceedings.

    Code of Federal Regulations, 2014 CFR

    2014-10-01

    ... Protection Act, 16 U.S.C. 668 et seq., or Airborne Hunting Act, 16 U.S.C. 742j-1, or any wildlife or plant subject to forfeiture under the Endangered Species Act, 16 U.S.C. 1531 et seq., or any fish, wildlife or plant subject to forfeiture under the Lacey Act Amendments of 1981, 16 U.S.C. 3371 et seq., is...

  7. 15 CFR 922.72 - Prohibited or otherwise regulated activities-Sanctuary-wide.

    Code of Federal Regulations, 2014 CFR

    2014-01-01

    ... Mammal Protection Act, as amended, (MMPA), 16 U.S.C. 1361 et seq., Endangered Species Act, as amended, (ESA), 16 U.S.C. 1531 et seq., Migratory Bird Treaty Act, as amended, (MBTA), 16 U.S.C. 703 et seq., or... classification) approved in accordance with section 312 of the Federal Water Pollution Control Act, as amended...

  8. 50 CFR 12.23 - Administrative forfeiture proceedings.

    Code of Federal Regulations, 2013 CFR

    2013-10-01

    ... Protection Act, 16 U.S.C. 668 et seq., or Airborne Hunting Act, 16 U.S.C. 742j-1, or any wildlife or plant subject to forfeiture under the Endangered Species Act, 16 U.S.C. 1531 et seq., or any fish, wildlife or plant subject to forfeiture under the Lacey Act Amendments of 1981, 16 U.S.C. 3371 et seq., is...

  9. 50 CFR 12.23 - Administrative forfeiture proceedings.

    Code of Federal Regulations, 2012 CFR

    2012-10-01

    ... Protection Act, 16 U.S.C. 668 et seq., or Airborne Hunting Act, 16 U.S.C. 742j-1, or any wildlife or plant subject to forfeiture under the Endangered Species Act, 16 U.S.C. 1531 et seq., or any fish, wildlife or plant subject to forfeiture under the Lacey Act Amendments of 1981, 16 U.S.C. 3371 et seq., is...

  10. 50 CFR 12.23 - Administrative forfeiture proceedings.

    Code of Federal Regulations, 2010 CFR

    2010-10-01

    ... Protection Act, 16 U.S.C. 668 et seq., or Airborne Hunting Act, 16 U.S.C. 742j-1, or any wildlife or plant subject to forfeiture under the Endangered Species Act, 16 U.S.C. 1531 et seq., or any fish, wildlife or plant subject to forfeiture under the Lacey Act Amendments of 1981, 16 U.S.C. 3371 et seq., is...

  11. 15 CFR 922.72 - Prohibited or otherwise regulated activities-Sanctuary-wide.

    Code of Federal Regulations, 2013 CFR

    2013-01-01

    ... Mammal Protection Act, as amended, (MMPA), 16 U.S.C. 1361 et seq., Endangered Species Act, as amended, (ESA), 16 U.S.C. 1531 et seq., Migratory Bird Treaty Act, as amended, (MBTA), 16 U.S.C. 703 et seq., or... classification) approved in accordance with section 312 of the Federal Water Pollution Control Act, as amended...

  12. 29 CFR 1910.1200 - Hazard communication.

    Code of Federal Regulations, 2011 CFR

    2011-07-01

    ..., Fungicide, and Rodenticide Act (7 U.S.C. 136 et seq.), when subject to the labeling requirements of that Act... chemical substance or mixture as such terms are defined in the Toxic Substances Control Act (15 U.S.C. 2601..., and Cosmetic Act (21 U.S.C. 301 et seq.) or the Virus-Serum-Toxin Act of 1913 (21 U.S.C. 151 et seq...

  13. 30 CFR 905.773 - Requirements for permits and permit processing.

    Code of Federal Regulations, 2011 CFR

    2011-07-01

    ...) National Historic Preservation Act, 16 U.S.C. 470 et seq CEQA. (7) Coastal Zone Management Act, 16 U.S.C. 1451, 1453-1464 California Coastal Act of 1976, Cal. Pub. Res. Code section 30000 et seq. (West 1986... section 46000 et seq. (West Supp. 1986). (12) Bald Eagle Protection Act, 16 U.S.C. 668-668(d) (c) Where...

  14. 30 CFR 875.16 - Exclusion of certain noncoal reclamation sites.

    Code of Federal Regulations, 2010 CFR

    2010-07-01

    ... Radiation Control Act of 1978 (42 U.S.C. 7901 et seq.) or that have been listed for remedial action under the Comprehensive Environmental Response Compensation and Liability Act of 1980 (42 U.S.C. 9601 et seq... Uranium Mill Tailings Radiation Control Act of 1978 (42 U.S.C. 7901 et seq.) or that have been listed for...

  15. The role of prostacyclin synthase and thromboxane synthase signaling in the development and progression of cancer.

    PubMed

    Cathcart, Mary-Clare; Reynolds, John V; O'Byrne, Kenneth J; Pidgeon, Graham P

    2010-04-01

    Prostacyclin synthase and thromboxane synthase signaling via arachidonic acid metabolism affects a number of tumor cell survival pathways such as cell proliferation, apoptosis, tumor cell invasion and metastasis, and angiogenesis. However, the effects of these respective synthases differ considerably with respect to the pathways described. While prostacyclin synthase is generally believed to be anti-tumor, a pro-carcinogenic role for thromboxane synthase has been demonstrated in a variety of cancers. The balance of oppositely-acting COX-derived prostanoids influences many processes throughout the body, such as blood pressure regulation, clotting, and inflammation. The PGI(2)/TXA(2) ratio is of particular interest in-vivo, with the corresponding synthases shown to be differentially regulated in a variety of disease states. Pharmacological inhibition of thromboxane synthase has been shown to significantly inhibit tumor cell growth, invasion, metastasis and angiogenesis in a range of experimental models. In direct contrast, prostacyclin synthase overexpression has been shown to be chemopreventive in a murine model of the disease, suggesting that the expression and activity of this enzyme may protect against tumor development. In this review, we discuss the aberrant expression and known functions of both prostacyclin synthase and thromboxane synthase in cancer. We discuss the effects of these enzymes on a range of tumor cell survival pathways, such as tumor cell proliferation, induction of apoptosis, invasion and metastasis, and tumor cell angiogenesis. As downstream signaling pathways of these enzymes have also been implicated in cancer states, we examine the role of downstream effectors of PGIS and TXS activity in tumor growth and progression. Finally, we discuss current therapeutic strategies aimed at targeting these enzymes for the prevention/treatment of cancer.

  16. voom: precision weights unlock linear model analysis tools for RNA-seq read counts

    PubMed Central

    2014-01-01

    New normal linear modeling strategies are presented for analyzing read counts from RNA-seq experiments. The voom method estimates the mean-variance relationship of the log-counts, generates a precision weight for each observation and enters these into the limma empirical Bayes analysis pipeline. This opens access for RNA-seq analysts to a large body of methodology developed for microarrays. Simulation studies show that voom performs as well as or better than count-based RNA-seq methods even when the data are generated according to the assumptions of the earlier methods. Two case studies illustrate the use of linear modeling and gene set testing methods. PMID:24485249

  17. voom: Precision weights unlock linear model analysis tools for RNA-seq read counts.

    PubMed

    Law, Charity W; Chen, Yunshun; Shi, Wei; Smyth, Gordon K

    2014-02-03

    New normal linear modeling strategies are presented for analyzing read counts from RNA-seq experiments. The voom method estimates the mean-variance relationship of the log-counts, generates a precision weight for each observation and enters these into the limma empirical Bayes analysis pipeline. This opens access for RNA-seq analysts to a large body of methodology developed for microarrays. Simulation studies show that voom performs as well as or better than count-based RNA-seq methods even when the data are generated according to the assumptions of the earlier methods. Two case studies illustrate the use of linear modeling and gene set testing methods.

  18. Rcorrector: efficient and accurate error correction for Illumina RNA-seq reads.

    PubMed

    Song, Li; Florea, Liliana

    2015-01-01

    Next-generation sequencing of cellular RNA (RNA-seq) is rapidly becoming the cornerstone of transcriptomic analysis. However, sequencing errors in the already short RNA-seq reads complicate bioinformatics analyses, in particular alignment and assembly. Error correction methods have been highly effective for whole-genome sequencing (WGS) reads, but are unsuitable for RNA-seq reads, owing to the variation in gene expression levels and alternative splicing. We developed a k-mer based method, Rcorrector, to correct random sequencing errors in Illumina RNA-seq reads. Rcorrector uses a De Bruijn graph to compactly represent all trusted k-mers in the input reads. Unlike WGS read correctors, which use a global threshold to determine trusted k-mers, Rcorrector computes a local threshold at every position in a read. Rcorrector has an accuracy higher than or comparable to existing methods, including the only other method (SEECER) designed for RNA-seq reads, and is more time and memory efficient. With a 5 GB memory footprint for 100 million reads, it can be run on virtually any desktop or server. The software is available free of charge under the GNU General Public License from https://github.com/mourisl/Rcorrector/.
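
    The abstract above contrasts a single global k-mer threshold with a threshold that adapts to the read being corrected. The Python sketch below illustrates that contrast in a deliberately simplified form: it counts k-mers with a plain dictionary rather than a De Bruijn graph, and it derives one threshold per read (a fraction of the read's best-supported k-mer) rather than one per position. The function names and the `fraction` parameter are hypothetical, and this is not Rcorrector's actual algorithm.

```python
from collections import Counter

def count_kmers(reads, k):
    """Count every k-mer occurring in a collection of reads."""
    counts = Counter()
    for read in reads:
        for i in range(len(read) - k + 1):
            counts[read[i:i + k]] += 1
    return counts

def weak_positions(read, counts, k, fraction=0.5):
    """Start positions of k-mers that look untrusted within this read.

    The threshold is relative to the read's own best-supported k-mer, so
    a lowly expressed transcript is not penalized by a global cutoff.
    `fraction` is an illustrative assumption.
    """
    kmers = [read[i:i + k] for i in range(len(read) - k + 1)]
    if not kmers:
        return []
    threshold = fraction * max(counts[km] for km in kmers)
    return [i for i, km in enumerate(kmers) if counts[km] < threshold]

# Five copies of a correct read and one copy with a single-base error.
reads = ["ACGTAC"] * 5 + ["ACGTTC"]
counts = count_kmers(reads, k=4)
print(weak_positions("ACGTTC", counts, k=4))  # k-mers overlapping the error
print(weak_positions("ACGTAC", counts, k=4))  # no weak k-mers expected
```

    Because the cutoff scales with local coverage, the same code flags the error in a read from a highly expressed gene and leaves a consistently low-coverage read alone.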

  19. Integrative analysis with ChIP-seq advances the limits of transcript quantification from RNA-seq.

    PubMed

    Liu, Peng; Sanalkumar, Rajendran; Bresnick, Emery H; Keleş, Sündüz; Dewey, Colin N

    2016-08-01

    RNA-seq is currently the technology of choice for global measurement of transcript abundances in cells. Despite its successes, isoform-level quantification remains difficult because short RNA-seq reads are often compatible with multiple alternatively spliced isoforms. Existing methods rely heavily on uniquely mapping reads, which are not available for numerous isoforms that lack regions of unique sequence. To improve quantification accuracy in such difficult cases, we developed a novel computational method, prior-enhanced RSEM (pRSEM), which uses a complementary data type in addition to RNA-seq data. We found that ChIP-seq data of RNA polymerase II and histone modifications were particularly informative in this approach. In qRT-PCR validations, pRSEM was shown to be superior to competing methods in estimating relative isoform abundances within or across conditions. Data-driven simulations suggested that pRSEM has a greatly decreased false-positive rate at the expense of a small increase in false-negative rate. In aggregate, our study demonstrates that pRSEM transforms existing capacity to precisely estimate transcript abundances, especially at the isoform level. © 2016 Liu et al.; Published by Cold Spring Harbor Laboratory Press.

  20. sequoia controls the type I>0 daughter proliferation switch in the developing Drosophila nervous system.

    PubMed

    Gunnar, Erika; Bivik, Caroline; Starkenberg, Annika; Thor, Stefan

    2016-10-15

    Neural progenitors typically divide asymmetrically to renew themselves, while producing daughters with more limited potential. In the Drosophila embryonic ventral nerve cord, neuroblasts initially produce daughters that divide once to generate two neurons/glia (type I proliferation mode). Subsequently, many neuroblasts switch to generating daughters that differentiate directly (type 0). This programmed type I>0 switch is controlled by Notch signaling, triggered at a distinct point of lineage progression in each neuroblast. However, how Notch signaling onset is gated was unclear. We recently identified Sequoia (Seq), a C2H2 zinc-finger transcription factor with homology to Drosophila Tramtrack (Ttk) and the positive regulatory domain (PRDM) family, as important for lineage progression. Here, we find that seq mutants fail to execute the type I>0 daughter proliferation switch and also display increased neuroblast proliferation. Genetic interaction studies reveal that seq interacts with the Notch pathway, and seq furthermore affects expression of a Notch pathway reporter. These findings suggest that seq may act as a context-dependent regulator of Notch signaling, and underscore the growing connection between Seq, Ttk, the PRDM family and Notch signaling. © 2016. Published by The Company of Biologists Ltd.

  1. Combining multiple ChIP-seq peak detection systems using combinatorial fusion.

    PubMed

    Schweikert, Christina; Brown, Stuart; Tang, Zuojian; Smith, Phillip R; Hsu, D Frank

    2012-01-01

    Due to the recent rapid development in ChIP-seq technologies, which use high-throughput next-generation DNA sequencing to identify the targets of chromatin immunoprecipitation, there is an increasing amount of sequencing data being generated that provides us with greater opportunity to analyze genome-wide protein-DNA interactions. In particular, we are interested in evaluating and enhancing computational and statistical techniques for locating protein binding sites. Many peak detection systems have been developed; in this study, we utilize the following six: CisGenome, MACS, PeakSeq, QuEST, SISSRs, and TRLocator. We define two methods to merge and rescore the regions of two peak detection systems and analyze the performance based on average precision and coverage of transcription start sites. The results indicate that ChIP-seq peak detection can be improved by fusion using score or rank combination. Our method of combination and fusion analysis would provide a means for generic assessment of available technologies and systems and assist researchers in choosing an appropriate system (or fusion method) for analyzing ChIP-seq data. This analysis offers an alternate approach for increasing true positive rates, while decreasing false positive rates and hence improving the ChIP-seq peak identification process.
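
    Score combination and rank combination, as named in the abstract above, can be illustrated with a short Python sketch. It assumes the peaks from two callers have already been merged into shared region identifiers and that a higher score means a stronger peak; the min-max normalization, the simple averaging and all function names are assumptions for illustration, not the authors' exact rescoring method.

```python
def rank_of(scores):
    """Map each region to its rank, where 1 is the highest score."""
    ordered = sorted(scores, key=scores.get, reverse=True)
    return {region: i + 1 for i, region in enumerate(ordered)}

def normalize(scores):
    """Min-max scale scores to [0, 1] so two callers become comparable."""
    lo, hi = min(scores.values()), max(scores.values())
    span = (hi - lo) or 1.0  # avoid dividing by zero when all scores match
    return {r: (s - lo) / span for r, s in scores.items()}

def score_combination(a, b):
    """Average the normalized scores of regions reported by both callers."""
    na, nb = normalize(a), normalize(b)
    return {r: (na[r] + nb[r]) / 2 for r in a.keys() & b.keys()}

def rank_combination(a, b):
    """Average the ranks of regions reported by both callers."""
    ra, rb = rank_of(a), rank_of(b)
    return {r: (ra[r] + rb[r]) / 2 for r in a.keys() & b.keys()}

# Hypothetical scores for three merged regions from two peak callers.
caller_a = {"r1": 10.0, "r2": 5.0, "r3": 1.0}
caller_b = {"r1": 0.2, "r2": 0.9, "r3": 0.1}
print(score_combination(caller_a, caller_b))
print(rank_combination(caller_a, caller_b))
```

    Rank combination discards the scale of each caller's scores, which helps when the two scoring schemes are not comparable; score combination keeps the size of the differences between peaks.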

  2. Tissue-aware RNA-Seq processing and normalization for heterogeneous and sparse data.

    PubMed

    Paulson, Joseph N; Chen, Cho-Yi; Lopes-Ramos, Camila M; Kuijjer, Marieke L; Platig, John; Sonawane, Abhijeet R; Fagny, Maud; Glass, Kimberly; Quackenbush, John

    2017-10-03

    Although ultrahigh-throughput RNA-Sequencing has become the dominant technology for genome-wide transcriptional profiling, the vast majority of RNA-Seq studies typically profile only tens of samples, and most analytical pipelines are optimized for these smaller studies. However, projects are generating ever-larger data sets comprising RNA-Seq data from hundreds or thousands of samples, often collected at multiple centers and from diverse tissues. These complex data sets present significant analytical challenges due to batch and tissue effects, but provide the opportunity to revisit the assumptions and methods that we use to preprocess, normalize, and filter RNA-Seq data, which are critical first steps for any subsequent analysis. We find that analysis of large RNA-Seq data sets requires both careful quality control and accounting for sparsity due to the heterogeneity intrinsic in multi-group studies. We developed the Yet Another RNA Normalization software pipeline (YARN), which includes quality control and preprocessing, gene filtering, and normalization steps designed to facilitate downstream analysis of large, heterogeneous RNA-Seq data sets, and we demonstrate its use with data from the Genotype-Tissue Expression (GTEx) project. An R package instantiating YARN is available at http://bioconductor.org/packages/yarn.

  3. Selective amplification and sequencing of cyclic phosphate-containing RNAs by the cP-RNA-seq method.

    PubMed

    Honda, Shozo; Morichika, Keisuke; Kirino, Yohei

    2016-03-01

    RNA digestions catalyzed by many ribonucleases generate RNA fragments that contain a 2',3'-cyclic phosphate (cP) at their 3' termini. However, standard RNA-seq methods are unable to accurately capture cP-containing RNAs because the cP inhibits the adapter ligation reaction. We recently developed a method named cP-RNA-seq that is able to selectively amplify and sequence cP-containing RNAs. Here we describe the cP-RNA-seq protocol in which the 3' termini of all RNAs, except those containing a cP, are cleaved through a periodate treatment after phosphatase treatment; hence, subsequent adapter ligation and cDNA amplification steps are exclusively applied to cP-containing RNAs. cP-RNA-seq takes ∼6 d, excluding the time required for sequencing and bioinformatics analyses, which are not covered in detail in this protocol. Biochemical validation of the existence of cP in the identified RNAs takes ∼3 d. Even though the cP-RNA-seq method was developed to identify angiogenin-generating 5'-tRNA halves as a proof of principle, the method should be applicable to global identification of cP-containing RNA repertoires in various transcriptomes.

  4. Gene expression and splicing alterations analyzed by high throughput RNA sequencing of chronic lymphocytic leukemia specimens.

    PubMed

    Liao, Wei; Jordaan, Gwen; Nham, Phillipp; Phan, Ryan T; Pelegrini, Matteo; Sharma, Sanjai

    2015-10-16

    To determine differentially expressed and spliced RNA transcripts in chronic lymphocytic leukemia (CLL) specimens, a high-throughput RNA-sequencing (HTS RNA-seq) analysis was performed. Ten CLL specimens and five normal peripheral blood CD19+ B cell specimens were analyzed by HTS RNA-seq. The library preparation was performed with the Illumina TruSeq RNA kit, and libraries were sequenced on the Illumina HiSeq 2000 sequencing system. An average of 48.5 million reads for B cells and 50.6 million reads for CLL specimens were obtained, with 10396 and 10448 assembled transcripts for normal B cells and primary CLL specimens, respectively. With the Cuffdiff analysis, 2091 differentially expressed genes (DEG) between B cells and CLL specimens were identified based on FPKM (fragments per kilobase of transcript per million reads), false discovery rate (FDR q < 0.05) and fold change (>2). Expression of selected DEGs (n = 32) that were up-regulated or down-regulated in CLL according to the RNA-seq data was also analyzed by qRT-PCR in a test cohort of CLL specimens. Even though the fold expression of DEGs varied between RNA-seq and qRT-PCR, more than 90% of the analyzed genes were validated by qRT-PCR analysis. Analysis of RNA-seq data for splicing alterations in CLL and B cells was performed by Multivariate Analysis of Transcript Splicing (MATS analysis). Skipped exon was the most frequent splicing alteration in CLL specimens, with 128 significant events (P-value <0.05, minimum inclusion level difference >0.1). The RNA-seq analysis of CLL specimens identifies novel DEGs and alternatively spliced genes that are potential prognostic markers and therapeutic targets. The high level of validation by qRT-PCR for a number of DEGs supports the accuracy of this analysis. Global comparison of transcriptomes of B cells, IGVH non-mutated CLL (U-CLL) and mutated CLL specimens (M-CLL) with multidimensional scaling analysis was able to segregate CLL and B cell transcriptomes, but the M-CLL and U-CLL transcriptomes were indistinguishable. The analysis of HTS RNA-seq data to identify alternative splicing events and other genetic abnormalities specific to CLL is an added advantage of RNA-seq that is not feasible with other genome-wide analyses.
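
    The FPKM measure and the thresholds quoted in the abstract above (FDR q < 0.05, fold change > 2) can be made concrete with a small Python sketch. The function names are hypothetical, and treating a zero FPKM as "fold change undefined" is an assumption of this sketch; Cuffdiff's own statistical model is more involved and is not reproduced here.

```python
def fpkm(fragments, length_bp, total_mapped_fragments):
    """Fragments per kilobase of transcript per million mapped fragments."""
    # fragments / (length_bp / 1e3) / (total / 1e6) simplifies to the below.
    return fragments * 1e9 / (length_bp * total_mapped_fragments)

def is_differentially_expressed(fpkm_a, fpkm_b, q_value,
                                min_fold=2.0, max_q=0.05):
    """Apply the abstract's cutoffs: q < 0.05 and fold change > 2."""
    if fpkm_a <= 0 or fpkm_b <= 0:
        return False  # assumption: skip genes with undefined fold change
    fold = max(fpkm_a, fpkm_b) / min(fpkm_a, fpkm_b)
    return q_value < max_q and fold > min_fold

# 500 fragments on a 2 kb transcript in a library of 50 million fragments.
print(fpkm(500, 2000, 50_000_000))
```

    Dividing by transcript length and library size is what lets expression values be compared between genes of different lengths and between samples sequenced to different depths.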

  5. Domain analysis of 3 Keto Acyl-CoA synthase for structural variations in Vitis vinifera and Oryza brachyantha using comparative modelling.

    PubMed

    Sagar, Mamta; Pandey, Neetesh; Qamar, Naseha; Singh, Brijendra; Shukla, Akanksha

    2015-03-01

    The long-chain fatty acids incorporated into plant lipids are derived from the iterative addition of C2 units, provided by malonyl-CoA, to an acyl-CoA after interaction with 3-ketoacyl-CoA synthase (KCS), which is found in several plants. This study provides a functional characterization of three 3-ketoacyl-CoA synthase-like proteins, one in Vitis vinifera and two in Oryza brachyantha. Sequence analysis reveals that a protein of Oryza brachyantha shows 96% similarity to a hypothetical protein in Sorghum bicolor; in total, 11 homologs were predicted in Sorghum bicolor. Conserved domain prediction confirms the presence of FAE1/Type III polyketide synthase-like protein; Thiolase-like, subgroup; Thiolase-like; 3-Oxoacyl-ACP synthase III, C-terminal; and chalcone synthase-like domains, but the very long chain 3-ketoacyl-CoA domain is absent. All three proteins were found to have the chalcone and stilbene synthases C-terminal domain, which is similar to the domains of thiolase and β-ketoacyl synthase. The N-terminal domain is absent in the J3M9Z7 protein of Oryza brachyantha and the F6HH63 protein of Vitis vinifera. Differences in the N-terminal domain are responsible for distinct activities. The J3MF16 protein of Oryza brachyantha contains both the N-terminal and C-terminal domains and was characterized using the annotation of these domains. The Gcs domain (Streptomyces coelicolor) and the chalcone-stilbene synthase (KAS) domains of 2-pyrone synthase (Gerbera hybrida) and chalcone synthase 2 (Medicago sativa) were found to be present in the three proteins. This similarity points toward the anthocyanin biosynthetic process. Similarity to chalcone synthase 2 suggests a possible role in naringenin and chalcone synthase-like activity. In the 3-ketoacyl-CoA synthase of Oryza brachyantha, the active-site residues C-240, H-407 and N-447 are present in the J3MF16 protein; these residues are common to all three proteins, at different positions. 
Structural variations in the dimer interface, product binding site and malonyl-CoA binding sites were predicted in localized combinations of conserved residues.

  6. Isolated polypeptide having arabinofuranosidase activity

    DOEpatents

    Foreman, Pamela; Van Solingen, Pieter; Goedegebuur, Frits; Ward, Michael

    2010-02-23

    Described herein are novel gene sequences isolated from Trichoderma reesei. Two genes encoding proteins comprising a cellulose binding domain, one encoding an arabinofuranosidase and one encoding an acetylxylanesterase, are described. The sequences, CIP1 and CIP2, contain a cellulose binding domain. These proteins are especially useful in the textile and detergent industry and in the pulp and paper industry.

    cip1 cDNA sequence (SEQ ID NO: 1)
    GACTAGTTCA TAATACAGTA GTTGAGTTCA TAGCAACTTC 50
    ACTCTCTAGC TGAACAAATT ATCTGCGCAA ACATGGTTCG CCGGACTGCT 100
    CTGCTGGCCC TTGGGGCTCT CTCAACGCTC TCTATGGCCC AAATCTCAGA 150
    CGACTTCGAG TCGGGCTGGG ATCAGACTAA ATGGCCCATT TCGGCACCAG 200
    ACTGTAACCA GGGCGGCACC GTCAGCCTCG ACACCACAGT AGCCCACAGC 250
    GGCAGCAACT CCATGAAGGT CGTTGGTGGC CCCAATGGCT ACTGTGGACA 300
    CATCTTCTTC GGCACTACCC AGGTGCCAAC TGGGGATGTA TATGTCAGAG 350
    CTTGGATTCG GCTTCAGACT GCTCTCGGCA GCAACCACGT CACATTCATC 400
    ATCATGCCAG ACACCGCTCA GGGAGGGAAG CACCTCCGAA TTGGTGGCCA 450
    AAGCCAAGTT CTCGACTACA ACCGCGAGTC CGACGATGCC ACTCTTCCGG 500
    ACCTGTCTCC CAACGGCATT GCCTCCACCG TCACTCTGCC TACCGGCGCG 550
    TTCCAGTGCT TCGAGTACCA CCTGGGCACT GACGGAACCA TCGAGACGTG 600
    GCTCAACGGC AGCCTCATCC CGGGCATGAC CGTGGGCCCT GGCGTCGACA 650
    ATCCAAACGA CGCTGGCTGG ACGAGGGCCA GCTATATTCC GGAGATCACC 700
    GGTGTCAACT TTGGCTGGGA GGCCTACAGC GGAGACGTCA ACACCGTCTG 750
    GTTCGACGAC ATCTCGATTG CGTCGACCCG CGTGGGATGC GGCCCCGGCA 800
    GCCCCGGCGG TCCTGGAAGC TCGACGACTG GGCGTAGCAG CACCTCGGGC 850
    CCGACGAGCA CTTCGAGGCC AAGCACCACC ATTCCGCCAC CGACTTCCAG 900
    GACAACGACC GCCACGGGTC CGACTCAGAC ACACTATGGC CAGTGCGGAG 1000
    GGATTGGTTA CAGCGGGCCT ACGGTCTGCG CGAGCGGCAC GACCTGCCAG 1050
    GTCCTGAACC CATACTACTC CCAGTGCTTA TAAGGGGATG AGCATGGAGT 1100
    GAAGTGAAGT GAAGTGGAGA GAGTTGAAGT GGCATTGCGC TCGGCTGGGT 1150
    AGATAAAAGT CAGCAGCTAT GAATACTCTA TGTGATGCTC ATTGGCGTGT 1200
    ACGTTTTAAA AAAAAAAAAA AAAAAAAAAA AAAAAAAAAA AAAAAAAAAA 1250
    AAAAAAAAAA AAAAAAAAAG GGGGCGGCCG C 1271

  7. CRISPR-FOCUS: A web server for designing focused CRISPR screening experiments.

    PubMed

    Cao, Qingyi; Ma, Jian; Chen, Chen-Hao; Xu, Han; Chen, Zhi; Li, Wei; Liu, X Shirley

    2017-01-01

    The recently developed CRISPR screen technology, based on the CRISPR/Cas9 genome editing system, enables genome-wide interrogation of gene functions in an efficient and cost-effective manner. Although many computational algorithms and web servers have been developed to design single-guide RNAs (sgRNAs) with high specificity and efficiency, algorithms specifically designed for conducting CRISPR screens are still lacking. Here we present CRISPR-FOCUS, a web-based platform to search and prioritize sgRNAs for CRISPR screen experiments. With official gene symbols or RefSeq IDs as the only mandatory input, CRISPR-FOCUS filters and prioritizes sgRNAs based on multiple criteria, including efficiency, specificity, sequence conservation, isoform structure, as well as genomic variations including Single Nucleotide Polymorphisms and cancer somatic mutations. CRISPR-FOCUS also provides pre-defined positive and negative control sgRNAs, as well as other necessary sequences in the construct (e.g., U6 promoters to drive sgRNA transcription and RNA scaffolds of the CRISPR/Cas9). These features allow users to synthesize oligonucleotides directly based on the output of CRISPR-FOCUS. Overall, CRISPR-FOCUS provides a rational and high-throughput approach for sgRNA library design that enables users to efficiently conduct a focused screen experiment targeting up to thousands of genes. (CRISPR-FOCUS is freely available at http://cistrome.org/crispr-focus/).

  8. Halvade-RNA: Parallel variant calling from transcriptomic data using MapReduce.

    PubMed

    Decap, Dries; Reumers, Joke; Herzeel, Charlotte; Costanza, Pascal; Fostier, Jan

    2017-01-01

    Given the current cost-effectiveness of next-generation sequencing, the amount of DNA-seq and RNA-seq data generated is ever increasing. One of the primary objectives of NGS experiments is calling genetic variants. While highly accurate, most variant calling pipelines are not optimized to run efficiently on large data sets. However, as variant calling in genomic data has become common practice, several methods have been proposed to reduce runtime for DNA-seq analysis through the use of parallel computing. Determining the effectively expressed variants from transcriptomics (RNA-seq) data has only recently become possible, and as such does not yet benefit from efficiently parallelized workflows. We introduce Halvade-RNA, a parallel, multi-node RNA-seq variant calling pipeline based on the GATK Best Practices recommendations. Halvade-RNA makes use of the MapReduce programming model to create and manage parallel data streams on which multiple instances of existing tools such as STAR and GATK operate concurrently. Whereas the single-threaded processing of a typical RNA-seq sample requires ∼28h, Halvade-RNA reduces this runtime to ∼2h using a small cluster with two 20-core machines. Even on a single, multi-core workstation, Halvade-RNA can significantly reduce runtime compared to using multi-threading, thus providing for a more cost-effective processing of RNA-seq data. Halvade-RNA is written in Java and uses the Hadoop MapReduce 2.0 API. It supports a wide range of distributions of Hadoop, including Cloudera and Amazon EMR.
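
    The MapReduce model mentioned in this abstract can be sketched in a few lines of Python. This is a generic illustration of the map, shuffle and reduce steps, not Halvade-RNA's implementation: the region size, the read tuple layout and all function names are hypothetical, and the reduce step merely counts reads where a real pipeline would run a variant caller on each region.

```python
from collections import defaultdict

REGION_BP = 1_000_000  # hypothetical region size used to partition the genome


def map_read(read):
    """Map step: key an aligned read (chrom, pos, seq) by its genomic region."""
    chrom, pos, _seq = read
    return (chrom, pos // REGION_BP), read


def shuffle(pairs):
    """Group mapped values by key, as the framework does between map and reduce."""
    grouped = defaultdict(list)
    for key, value in pairs:
        grouped[key].append(value)
    return grouped


def reduce_region(key, reads):
    """Reduce step: placeholder for per-region processing; here it counts reads."""
    return key, len(reads)


def run(reads):
    """Run map, shuffle and reduce sequentially on a list of reads."""
    grouped = shuffle(map_read(read) for read in reads)
    return dict(reduce_region(key, values) for key, values in grouped.items())
```

    Because each region is reduced independently of the others, the reduce calls could be distributed over separate workers, which is the property a MapReduce framework exploits.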

  9. iReceptor: A platform for querying and analyzing antibody/B-cell and T-cell receptor repertoire data across federated repositories.

    PubMed

    Corrie, Brian D; Marthandan, Nishanth; Zimonja, Bojan; Jaglale, Jerome; Zhou, Yang; Barr, Emily; Knoetze, Nicole; Breden, Frances M W; Christley, Scott; Scott, Jamie K; Cowell, Lindsay G; Breden, Felix

    2018-07-01

    Next-generation sequencing allows the characterization of the adaptive immune receptor repertoire (AIRR) in exquisite detail. These large-scale AIRR-seq data sets have rapidly become critical to vaccine development, understanding the immune response in autoimmune and infectious disease, and monitoring novel therapeutics against cancer. However, at present there is no easy way to compare these AIRR-seq data sets across studies and institutions. The ability to combine and compare information for different disease conditions will greatly enhance the value of AIRR-seq data for improving biomedical research and patient care. The iReceptor Data Integration Platform (gateway.ireceptor.org) provides one implementation of the AIRR Data Commons envisioned by the AIRR Community (airr-community.org), an initiative that is developing protocols to facilitate sharing and comparing AIRR-seq data. The iReceptor Scientific Gateway links distributed (federated) AIRR-seq repositories, allowing sequence searches or metadata queries across multiple studies at multiple institutions, returning sets of sequences fulfilling specific criteria. We present a review of the development of iReceptor, and how it fits in with the general trend toward sharing genomic and health data, and the development of standards for describing and reporting AIRR-seq data. Researchers interested in integrating their repositories of AIRR-seq data into the iReceptor Platform are invited to contact support@ireceptor.org. © 2018 John Wiley & Sons A/S. Published by John Wiley & Sons Ltd.

  10. Construction of a High-Density Genetic Map from RNA-Seq Data for an Arabidopsis Bay-0 × Shahdara RIL Population

    PubMed Central

    Serin, Elise A. R.; Snoek, L. B.; Nijveen, Harm; Willems, Leo A. J.; Jiménez-Gómez, Jose M.; Hilhorst, Henk W. M.; Ligterink, Wilco

    2017-01-01

    High-density genetic maps are essential for high resolution mapping of quantitative traits. Here, we present a new genetic map for an Arabidopsis Bayreuth × Shahdara recombinant inbred line (RIL) population, built on RNA-seq data. RNA-seq analysis on 160 RILs of this population identified 30,049 single-nucleotide polymorphisms (SNPs) covering the whole genome. Based on a 100-kbp window SNP binning method, 1059 bin-markers were identified, physically anchored on the genome. The total length of the RNA-seq genetic map spans 471.70 centimorgans (cM) with an average marker distance of 0.45 cM and a maximum marker distance of 4.81 cM. This high resolution genotyping revealed new recombination breakpoints in the population. To highlight the advantages of such a high-density map, we compared it to two publicly available genetic maps for the same population, comprising 69 PCR-based markers and 497 gene expression markers derived from microarray data, respectively. In this study, we show that SNP markers can effectively be derived from RNA-seq data. The new RNA-seq map closes many existing gaps in marker coverage, saturating the previously available genetic maps. Quantitative trait locus (QTL) analysis for published phenotypes using the available genetic maps showed increased QTL mapping resolution and reduced QTL confidence interval using the RNA-seq map. The new high-density map is a valuable resource that facilitates the identification of candidate genes and map-based cloning approaches. PMID:29259624
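
    The 100-kbp window SNP binning described in this abstract can be sketched as follows. The abstract does not say how a genotype is assigned to each bin, so the majority vote used here is an assumption of this sketch, and the function name, the tuple layout and the allele labels are hypothetical.

```python
from collections import Counter, defaultdict

WINDOW_BP = 100_000  # window size stated in the abstract


def bin_snps(snps, window_bp=WINDOW_BP):
    """Collapse the SNP calls of one RIL into one genotype per fixed window.

    `snps` is an iterable of (chrom, pos, parental_allele) with 1-based
    positions. Returns {(chrom, bin_index): genotype}. The genotype is the
    most frequent parental allele in the window (majority vote, an assumption
    of this sketch); exact ties are reported as "ambiguous".
    """
    votes = defaultdict(Counter)
    for chrom, pos, allele in snps:
        votes[(chrom, (pos - 1) // window_bp)][allele] += 1

    calls = {}
    for key, counter in votes.items():
        ranked = counter.most_common(2)
        if len(ranked) == 2 and ranked[0][1] == ranked[1][1]:
            calls[key] = "ambiguous"
        else:
            calls[key] = ranked[0][0]
    return calls
```

    A change of genotype between two adjacent bins of the same chromosome would then mark a candidate recombination breakpoint for that line.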

  11. An Alternative Approach to ChIP-Seq Normalization Enables Detection of Genome-Wide Changes in Histone H3 Lysine 27 Trimethylation upon EZH2 Inhibition

    PubMed Central

    Yuan, Chih-Chi; Craske, Madeleine Lisa; Labhart, Paul; Guler, Gulfem D.; Arnott, David; Maile, Tobias M.; Busby, Jennifer; Henry, Chisato; Kelly, Theresa K.; Tindell, Charles A.; Jhunjhunwala, Suchit; Zhao, Feng; Hatton, Charlie; Bryant, Barbara M.

    2016-01-01

    Chromatin immunoprecipitation and DNA sequencing (ChIP-seq) has been instrumental in inferring the roles of histone post-translational modifications in the regulation of transcription, chromatin compaction and other cellular processes that require modulation of chromatin structure. However, analysis of ChIP-seq data is challenging when the manipulation of a chromatin-modifying enzyme significantly affects global levels of histone post-translational modifications. For example, small molecule inhibition of the methyltransferase EZH2 reduces global levels of histone H3 lysine 27 trimethylation (H3K27me3). However, standard ChIP-seq normalization and analysis methods fail to detect a decrease upon EZH2 inhibitor treatment. We overcome this challenge by employing an alternative normalization approach that is based on the addition of Drosophila melanogaster chromatin and a D. melanogaster-specific antibody into standard ChIP reactions. Specifically, the use of an antibody that exclusively recognizes the D. melanogaster histone variant H2Av enables precipitation of D. melanogaster chromatin as a minor fraction of the total ChIP DNA. The D. melanogaster ChIP-seq tags are used to normalize the human ChIP-seq data from DMSO and EZH2 inhibitor-treated samples. Employing this strategy, a substantial reduction in H3K27me3 signal is now observed in ChIP-seq data from EZH2 inhibitor treated samples. PMID:27875550
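
    The spike-in idea in this abstract can be illustrated with a short Python sketch. The abstract does not give the exact formula, so the scaling used here (equalizing the D. melanogaster tag counts across samples) is one common way to apply a spike-in reference and should be read as an assumption; the function names and all numbers are hypothetical.

```python
def spike_in_factors(drosophila_tags):
    """Per-sample scaling factors that equalize spike-in tag counts.

    The sample with the fewest D. melanogaster tags gets factor 1.0 and the
    other samples are scaled down in proportion.
    """
    reference = min(drosophila_tags.values())
    if reference <= 0:
        raise ValueError("every sample needs at least one spike-in tag")
    return {sample: reference / count for sample, count in drosophila_tags.items()}


def normalize(human_signal, factors):
    """Scale each sample's human ChIP-seq tag count by its spike-in factor."""
    return {sample: human_signal[sample] * factors[sample] for sample in human_signal}


# Hypothetical counts: the inhibitor-treated sample has less human H3K27me3
# chromatin to pull down, so the spike-in takes a larger share of its reads.
factors = spike_in_factors({"DMSO": 100_000, "EZH2i": 400_000})
scaled = normalize({"DMSO": 1_200, "EZH2i": 1_000}, factors)
```

    In this made-up example the raw human counts differ only slightly (1,200 versus 1,000), while the scaled values (1,200 versus 250) expose the global reduction that read-depth normalization alone would hide.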

  12. An Alternative Approach to ChIP-Seq Normalization Enables Detection of Genome-Wide Changes in Histone H3 Lysine 27 Trimethylation upon EZH2 Inhibition.

    PubMed

    Egan, Brian; Yuan, Chih-Chi; Craske, Madeleine Lisa; Labhart, Paul; Guler, Gulfem D; Arnott, David; Maile, Tobias M; Busby, Jennifer; Henry, Chisato; Kelly, Theresa K; Tindell, Charles A; Jhunjhunwala, Suchit; Zhao, Feng; Hatton, Charlie; Bryant, Barbara M; Classon, Marie; Trojer, Patrick

    2016-01-01

    Chromatin immunoprecipitation and DNA sequencing (ChIP-seq) has been instrumental in inferring the roles of histone post-translational modifications in the regulation of transcription, chromatin compaction and other cellular processes that require modulation of chromatin structure. However, analysis of ChIP-seq data is challenging when the manipulation of a chromatin-modifying enzyme significantly affects global levels of histone post-translational modifications. For example, small molecule inhibition of the methyltransferase EZH2 reduces global levels of histone H3 lysine 27 trimethylation (H3K27me3). However, standard ChIP-seq normalization and analysis methods fail to detect a decrease upon EZH2 inhibitor treatment. We overcome this challenge by employing an alternative normalization approach that is based on the addition of Drosophila melanogaster chromatin and a D. melanogaster-specific antibody into standard ChIP reactions. Specifically, the use of an antibody that exclusively recognizes the D. melanogaster histone variant H2Av enables precipitation of D. melanogaster chromatin as a minor fraction of the total ChIP DNA. The D. melanogaster ChIP-seq tags are used to normalize the human ChIP-seq data from DMSO and EZH2 inhibitor-treated samples. Employing this strategy, a substantial reduction in H3K27me3 signal is now observed in ChIP-seq data from EZH2 inhibitor treated samples.

  13. Effects of plasma-induced charging damage on random telegraph noise in metal-oxide-semiconductor field-effect transistors with SiO2 and high-k gate dielectrics

    NASA Astrophysics Data System (ADS)

    Kamei, Masayuki; Takao, Yoshinori; Eriguchi, Koji; Ono, Kouichi

    2014-01-01

    We clarified in this study how plasma-induced charging damage (PCD) affects the so-called “random telegraph noise (RTN)” — a principal concern in designing ultimately scaled large-scale integrated circuits (LSIs). Metal-oxide-semiconductor field-effect transistors (MOSFETs) with SiO2 and high-k gate dielectric were exposed to an inductively coupled plasma (ICP) with Ar gas. Drain current vs gate voltage (Ids-Vg) characteristics were obtained before and after the ICP plasma exposure for the same device. Then, the time evolution of Ids fluctuation defined as Ids/μIds was measured, where μIds is the mean Ids. This value corresponds to an RTN feature, and RTN was obtained under various gate voltages (Vg) by a customized measurement technique. We focused on the statistical distribution width of (Ids/μIds), δ(Ids/μIds), in order to clarify the effects of PCD on RTN. δ(Ids/μIds) was increased by PCD for both MOSFETs with the SiO2 and high-k gate dielectrics, suggesting that RTN can be used as a measure of PCD, i.e., a distribution width increase directly indicates the presence of PCD. The dependence of δ(Ids/μIds) on the overdrive voltage Vg-Vth, where Vth is the threshold voltage, was investigated by the present technique. It was confirmed that δ(Ids/μIds) increased with a decrease in the overdrive voltage for MOSFETs with the SiO2 and high-k gate dielectrics. The presence of created carrier trap sites with PCD was characterized by the time constants for carrier capture and emission. The threshold voltage shift (ΔVth) induced by PCD was also evaluated and compared with the RTN change, to correlate the RTN increase with ΔVth induced by PCD. Although the estimated time constants exhibited complex behaviors due to the nature of trap sites created by PCD, δ(Ids/μIds) showed a straightforward tendency in accordance with the amount of PCD. These findings provide an in-depth understanding of plasma-induced RTN characteristic changes in future MOSFETs.
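
    The quantity δ(Ids/μIds) used in this abstract can be sketched numerically. The abstract does not define how the distribution width is measured, so the population standard deviation used here is an assumption, and the function name and sample currents are hypothetical.

```python
import statistics


def rtn_distribution_width(ids_samples):
    """Width of the distribution of Ids / mean(Ids) for one current trace.

    The width is taken as the population standard deviation of the
    normalized current (an assumption of this sketch).
    """
    mean_ids = statistics.fmean(ids_samples)
    normalized = [ids / mean_ids for ids in ids_samples]
    return statistics.pstdev(normalized)


# Hypothetical two-level traces (arbitrary units) before and after plasma exposure.
before = rtn_distribution_width([0.99, 1.01, 0.99, 1.01])
after = rtn_distribution_width([0.90, 1.10, 0.90, 1.10])
```

    A larger value after exposure than before, as in this made-up pair of traces, corresponds to the widening that the abstract attributes to plasma-induced charging damage.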

  14. Mutant DD genotype of NFKB1 gene is associated with the susceptibility and severity of coronary artery disease.

    PubMed

    Luo, Jun-Yi; Li, Xiao-Mei; Zhou, Yun; Zhao, Qiang; Chen, Bang-Dang; Liu, Fen; Chen, Xiao-Cui; Zheng, Hong; Ma, Yi-Tong; Gao, Xiao-Ming; Yang, Yi-Ning

    2017-02-01

    Nuclear factor kappa B (NF-κB) is an important transcription factor in the development and progression of coronary artery disease (CAD). Recent evidence suggests that the -94 ATTG ins/del mutant in the promoter of the NFKB1 gene is an essential functional mutant. The present study demonstrated that the frequencies of the del/del (DD) genotype and del (D) allele were significantly higher in CAD patients than in controls. CAD patients carrying the mutant DD genotype had worse stenosis of diseased coronary arteries than those carrying the ins/ins (II) or ins/del (ID) genotype. Plasma levels of endothelial nitric oxide synthase (eNOS) were lower, while levels of the inflammatory cytokine interleukin-6 (IL-6) were higher, in CAD patients with the DD genotype than in those with the II or ID genotype (both P<0.05). An in vitro study showed that mutant human umbilical vein endothelial cells (DD genotype HUVECs) were more susceptible to H2O2-induced apoptosis, which was accompanied by decreased Bcl-2 expression. Further, mutant HUVECs had lower eNOS but higher IL-6 mRNA levels and decreased phosphorylation of eNOS under H2O2 stimulation (both P<0.05). Compared to wild type cells (II genotype), significantly downregulated protein expression of the total NF-κB p50 subunit was observed in mutant HUVECs with or without oxidative stress, and a lower expression of nuclear p50 was associated with decreased p50 nuclear translocation in mutant HUVECs versus wild type cells under H2O2 stimulation (both P<0.05). In conclusion, the mutant DD genotype of the NFKB1 gene is associated with the risk and severity of CAD. Downregulation of the NF-κB p50 subunit leads to exacerbated endothelial dysfunction and apoptosis and an enhanced inflammatory response, which is the potential underlying mechanism. Copyright © 2017 Elsevier Ltd. All rights reserved.

  15. Angiotensin converting enzyme gene polymorphism is associated with severity of coronary artery disease in men with high total cholesterol levels.

    PubMed

    Borzyszkowska, Joanna; Stanislawska-Sachadyn, Anna; Wirtwein, Marcin; Sobiczewski, Wojciech; Ciecwierz, Dariusz; Targonski, Radoslaw; Gruchala, Marcin; Rynkiewicz, Andrzej; Limon, Janusz

    2012-05-01

    This study examines whether renin-angiotensin-aldosterone system gene polymorphisms: ACE (encoding for angiotensin converting enzyme) c.2306-117_404 I/D, AGTR1 (encoding for angiotensin II type-1 receptor) c.1080*86A>C and CYP11B2 (encoding for aldosterone synthase) c.-344C>T are associated with the extension of coronary atherosclerosis in a group of 647 patients who underwent elective coronary angiography. The extension of CAD was evaluated using the Gensini score. The polymorphisms were determined by PCR and RFLP assays. The associations between genotypes and the extent of coronary atherosclerosis were tested by the Kruskal-Wallis test, followed by pairwise comparisons using Wilcoxon test. The population has been divided into groups defined by: sex, smoking habit, past myocardial infarction, BMI (>, ≤ 25), age (>, ≤ 55), diabetes mellitus, level of total cholesterol (>, ≤ 200 mg/dl), LDL cholesterol (>, ≤ 130 mg/dl), HDL cholesterol (>, ≤ 40 mg/dl), triglycerides (>, ≤ 150 mg/dl). Significant associations between the ACE c.2306-117_404 I/D polymorphism and the Gensini score in men with high total cholesterol levels (P(Kruskal-Wallis) = 0.008; P(adjusted) = 0.009), high level of LDL cholesterol (P(Kruskal-Wallis) = 0.016; P(adjusted) = 0.028) and low level of HDL cholesterol (P(Kruskal-Wallis) = 0.04; P(adjusted) = 0.055) have been found. No association between the AGTR1 c.1080*86A>C and CYP11B2 c.-344C>T and the Gensini score has been found. These results suggest that men who carry ACE c.2306-117_404 DD genotype and have high total cholesterol, high LDL cholesterol and low HDL cholesterol levels may be predisposed to the development of more severe CAD.

  16. CTP synthase forms cytoophidia in the cytoplasm and nucleus

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Gou, Ke-Mian; State Key Laboratory for Agrobiotechnology, College of Biological Sciences, China Agricultural University, Beijing 100193; Chang, Chia-Chun

    2014-04-15

    CTP synthase is an essential metabolic enzyme responsible for the de novo synthesis of CTP. Multiple studies have recently shown that CTP synthase protein molecules form filamentous structures termed cytoophidia or CTP synthase filaments in the cytoplasm of eukaryotic cells, as well as in bacteria. Here we report that CTP synthase can form cytoophidia not only in the cytoplasm, but also in the nucleus of eukaryotic cells. Both glutamine deprivation and glutamine analog treatment promote formation of cytoplasmic cytoophidia (C-cytoophidia) and nuclear cytoophidia (N-cytoophidia). N-cytoophidia are generally shorter and thinner than their cytoplasmic counterparts. In mammalian cells, both CTP synthase 1 and CTP synthase 2 can form cytoophidia. Using live imaging, we have observed that both C-cytoophidia and N-cytoophidia undergo multiple rounds of fusion upon glutamine analog treatment. Our study reveals the coexistence of cytoophidia in the cytoplasm and nucleus, therefore providing a good opportunity to investigate the intracellular compartmentation of CTP synthase.

    Highlights:
    • CTP synthase forms cytoophidia not only in the cytoplasm but also in the nucleus.
    • Glutamine deprivation and glutamine analogs promote cytoophidium formation.
    • N-cytoophidia exhibit distinct morphology when compared to C-cytoophidia.
    • Both CTP synthase 1 and CTP synthase 2 form cytoophidia in mammalian cells.
    • Fusions of cytoophidia occur in the cytoplasm and nucleus.

  17. Evolution of glutamine amidotransferase genes. Nucleotide sequences of the pabA genes from Salmonella typhimurium, Klebsiella aerogenes and Serratia marcescens.

    PubMed

    Kaplan, J B; Merkel, W K; Nichols, B P

    1985-06-05

    The amide group of glutamine is a source of nitrogen in the biosynthesis of a variety of compounds. These reactions are catalyzed by a group of enzymes known as glutamine amidotransferases; two of these, the glutamine amidotransferase subunits of p-aminobenzoate synthase and anthranilate synthase have been studied in detail and have been shown to be structurally and functionally related. In some micro-organisms, p-aminobenzoate synthase and anthranilate synthase share a common glutamine amidotransferase subunit. We report here the primary DNA and deduced amino acid sequences of the p-aminobenzoate synthase glutamine amidotransferase subunits from Salmonella typhimurium, Klebsiella aerogenes and Serratia marcescens. A comparison of these glutamine amidotransferase sequences to the sequences of ten others, including some that function specifically in either the p-aminobenzoate synthase or anthranilate synthase complexes and some that are shared by both synthase complexes, has revealed several interesting features of the structure and organization of these genes, and has allowed us to speculate as to the evolutionary history of this family of enzymes. We propose a model for the evolution of the p-aminobenzoate synthase and anthranilate synthase glutamine amidotransferase subunits in which the duplication and subsequent divergence of the genetic information encoding a shared glutamine amidotransferase subunit led to the evolution of two new pathway-specific enzymes.

  18. Training on intellectual disability in health sciences: the European perspective.

    PubMed

    Salvador-Carulla, Luis; Martínez-Leal, Rafael; Heyler, Carla; Alvarez-Galvez, Javier; Veenstra, Marja Y; García-Ibáñez, Jose; Carpenter, Sylvia; Bertelli, Marco; Munir, Kerim; Torr, Jennifer; Van Schrojenstein Lantman-de Valk, Henny M J

    2015-01-01

    Intellectual disability (ID) has consequences at all stages of life, requires high service provision and leads to high health and societal costs. However, ID is largely disregarded as a health issue by national and international organisations, as are training in ID and in the health aspects of ID at every level of the education system. This paper aims (1) to update the current information about the availability of training and education in ID and related health issues in Europe, with a particular focus on mental health; and (2) to identify opportunities arising from the initial process of educational harmonization in Europe to include ID content in health sciences curricula and professional training. We carried out a systematic search of scientific databases and websites, as well as policy and research reports from the European Commission, European Council and WHO. Furthermore, we contacted key international organisations related to health education and/or ID in Europe, as well as other regional institutions. ID modules and content are minimal in the revised health sciences curricula, and publications on ID training in Europe are equally scarce. European countries report few undergraduate and graduate training modules in ID, even in key specialties such as paediatrics. Within the health sector, ID programmes focus mainly on psychiatry and psychology. The poor availability of ID training in health sciences is a matter of concern. However, the current European policy on training provides an opportunity to promote ID in the curricula of programmes at all levels. This strategy should address all professionals working in ID and it should increase the focus on ID relative to other developmental disorders at all stages of life.

  19. Microbe-ID: an open source toolbox for microbial genotyping and species identification

    PubMed Central

    Tabima, Javier F.; Everhart, Sydney E.; Larsen, Meredith M.; Weisberg, Alexandra J.; Kamvar, Zhian N.; Tancos, Matthew A.; Smart, Christine D.; Chang, Jeff H.

    2016-01-01

    Development of tools to identify species, genotypes, or novel strains of invasive organisms is critical for monitoring emergence and implementing rapid response measures. Molecular markers, although critical to identifying species or genotypes, require bioinformatic tools for analysis. However, user-friendly analytical tools for fast identification are not readily available. To address this need, we created a web-based set of applications called Microbe-ID that allows users to customize a toolbox for rapid species identification and strain genotyping using any genetic markers of choice. Two components of Microbe-ID, named Sequence-ID and Genotype-ID, implement species and genotype identification, respectively. Sequence-ID allows identification of species by using BLAST to query sequences for any locus of interest against a custom reference sequence database. Genotype-ID allows placement of an unknown multilocus marker in either a minimum spanning network or dendrogram with bootstrap support from a user-created reference database. Microbe-ID can be used for identification of any organism based on nucleotide sequences or any molecular marker type, and several examples are provided. We created a public website for demonstration purposes called Microbe-ID (microbe-id.org) and provided a working implementation for the genus Phytophthora (phytophthora-id.org). In Phytophthora-ID, the Sequence-ID application allows identification based on ITS or cox spacer sequences. Genotype-ID groups individuals into clonal lineages based on simple sequence repeat (SSR) markers for the two invasive plant pathogen species P. infestans and P. ramorum. All code is open source and available on GitHub and CRAN. Instructions for installation and use are provided at https://github.com/grunwaldlab/Microbe-ID. PMID:27602267

  20. ID4 promotes AR expression and blocks tumorigenicity of PC3 prostate cancer cells

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Komaragiri, Shravan Kumar; Bostanthirige, Dhanushka H.; Morton, Derrick J.

    Deregulation of tumor suppressor genes is associated with tumorigenesis and the development of cancer. In prostate cancer, ID4 is epigenetically silenced and acts as a tumor suppressor. In normal prostate epithelial cells, ID4 collaborates with androgen receptor (AR) and p53 to exert its tumor suppressor activity. Previous studies have shown that ID4 promotes the tumor suppressive function of AR, whereas loss of ID4 results in tumor promoter activity of AR. A previous study from our lab showed that ectopic ID4 expression in DU145 attenuates proliferation and promotes AR expression, suggesting that ID4-dependent AR activity is tumor suppressive. In this study, we examined the effect of ectopic expression of ID4 on the highly malignant prostate cancer cell line PC3. Here we show that stable overexpression of ID4 in PC3 cells leads to increased apoptosis and decreased cell proliferation and migration. In addition, in vivo studies showed a decrease in tumor size and volume of ID4-overexpressing PC3 cells in nude mice. At the molecular level, these changes were associated with increased androgen receptor (AR), p21, and AR-dependent FKBP51 expression. At the mechanistic level, ID4 may regulate the expression or function of AR through specific but as yet unknown AR co-regulators that may determine the final outcome of AR function.

    Highlights:
    • ID4 expression induces AR expression in PC3 cells, which generally lack AR.
    • ID4 expression increased apoptosis and decreased cell proliferation and invasion.
    • Overexpression of ID4 reduces tumor growth of subcutaneous xenografts in vivo.
    • ID4 induces p21 and FKBP51 expression, co-factors of AR tumor suppressor activity.

  1. Participation in daytime activities among people with mild or moderate intellectual disability.

    PubMed

    Dusseljee, J C E; Rijken, P M; Cardol, M; Curfs, L M G; Groenewegen, P P

    2011-01-01

    Community participation has been defined as performing daytime activities by people while interacting with others. Previous studies on community participation among people with intellectual disability (ID) have mainly focused on the domestic life aspect. This study investigates the variation in community participation in the domains work, social contacts and leisure activities among people with ID in the Netherlands. A number of categories of people with ID were distinguished by: (1) gender; (2) age; (3) type of education; (4) severity of ID; and (5) accommodation type. Data were gathered on 653 people with mild or moderate ID, of whom 513 by oral interviews and 140 by structured questionnaires filled in by representatives of those who could not be interviewed. Pearson chi-square tests were used to test differences between categories of people with ID in the distributions of the participation variables. Additional logistic regression analyses were conducted to correct for differences between the categories in other variables. Most people with mild or moderate ID in the Netherlands have work or other daytime activities, have social contacts and have leisure activities. However, people aged 50 years and over and people with moderate ID participate less in these domains than those under 50 years and people with mild ID. Moreover, people with ID hardly participate in activities with people without ID. High participation among people with a mild or moderate ID within the domains of work, social contact and leisure activities does not necessarily indicate a high level of interaction with the community, because the majority hardly interact with people without ID. Furthermore, older people with ID and people with a more severe level of ID seem to be more at risk for social exclusion. © 2010 The Authors. Journal of Intellectual Disability Research © 2010 Blackwell Publishing Ltd.

  2. Training on intellectual disability in health sciences: the European perspective

    PubMed Central

    Salvador-Carulla, Luis; Martínez-Leal, Rafael; Heyler, Carla; Alvarez-Galvez, Javier; Veenstra, Marja Y.; García-Ibáñez, Jose; Carpenter, Sylvia; Bertelli, Marco; Munir, Kerim; Torr, Jennifer; Van Schrojenstein Lantman-de Valk, Henny M. J.

    2015-01-01

    Background: Intellectual disability (ID) has consequences at all stages of life, requires high service provision and leads to high health and societal costs. However, ID is largely disregarded as a health issue by national and international organisations, as are training in ID and in the health aspects of ID at every level of the education system. Specific aim: This paper aims (1) to update the current information about the availability of training and education in ID and related health issues in Europe, with a particular focus on mental health; and (2) to identify opportunities arising from the initial process of educational harmonization in Europe to include ID contents in health sciences curricula and professional training. Method: We carried out a systematic search of scientific databases and websites, as well as policy and research reports from the European Commission, European Council and WHO. Furthermore, we contacted key international organisations related to health education and/or ID in Europe, as well as other regional institutions. Results: ID modules and contents are minimal in the revised health sciences curricula, and publications on ID training in Europe are equally scarce. European countries report few undergraduate and graduate training modules in ID, even in key specialties such as paediatrics. Within the health sector, ID programmes focus mainly on psychiatry and psychology. Conclusion: The poor availability of ID training in health sciences is a matter of concern. However, the current European policy on training provides an opportunity to promote ID in the curricula of programmes at all levels. This strategy should address all professionals working in ID and it should increase the focus on ID relative to other developmental disorders at all stages of life. PMID:25705375

  3. Detecting 'infant-directedness' in face and voice.

    PubMed

    Kim, Hojin I; Johnson, Scott P

    2014-07-01

    Five- and 3-month-old infants' perception of infant-directed (ID) faces and the role of speech in perceiving faces were examined. Infants' eye movements were recorded as they viewed a series of two side-by-side talking faces, one infant-directed and one adult-directed (AD), while listening to ID speech, AD speech, or in silence. Infants showed consistently greater dwell time on ID faces vs. AD faces, and this ID face preference was consistent across all three sound conditions. ID speech resulted in higher looking overall, but it did not increase looking at the ID face per se. Together, these findings demonstrate that infants' preferences for ID speech extend to ID faces. © 2014 John Wiley & Sons Ltd.

  4. Molecular docking studies to map the binding site of squalene synthase inhibitors on dehydrosqualene synthase of Staphylococcus aureus.

    PubMed

    Kahlon, Amandeep Kaur; Roy, Sudeep; Sharma, Ashok

    2010-10-01

    Dehydrosqualene synthase of Staphylococcus aureus is involved in the synthesis of the golden carotenoid pigment staphyloxanthin. This pigment gives S. aureus the antioxidant capacity it needs to survive inside the host cell. Dehydrosqualene synthase (CrtM) is structurally similar to human squalene synthase, an enzyme of the cholesterol synthesis pathway in humans (Liu et al., 2008). Cholesterol-lowering drugs were found to have an inhibitory effect on the dehydrosqualene synthase enzyme of S. aureus. The present study focuses on the squalene synthase inhibitors lapaquistat acetate and squalestatins, which have been reported as cholesterol-lowering agents in vitro and in vivo but have not been studied in the context of dehydrosqualene synthase of S. aureus. The mode of binding of lapaquistat acetate and squalestatin analogs on the dehydrosqualene synthase (CrtM) enzyme of S. aureus was identified by docking analysis with Scigress Explorer Ultra 7.7 docking software. The molecular docking analysis showed that the residues His18, Arg45, Asp48, Asp52, Tyr129, Gln165, Asn168 and Asp172 interacted with the inhibitors studied at comparatively high frequency. A comparative docking study with Discovery Studio 2.0 also confirmed the involvement of these residues of the dehydrosqualene synthase enzyme in binding the inhibitors studied. This further confirms the importance of these residues in the enzyme function. In silico ADMET analysis was done to predict the ADMET properties of the standard drugs and test compounds. These results might provide insights for developing new drugs that target the virulence factor dehydrosqualene synthase of S. aureus.

  5. In Planta Recapitulation of Isoprene Synthase Evolution from Ocimene Synthases

    PubMed Central

    Li, Mingai; Xu, Jia; Algarra Alarcon, Alberto; Carlin, Silvia; Barbaro, Enrico; Cappellin, Luca; Velikova, Violeta; Vrhovsek, Urska; Loreto, Francesco; Varotto, Claudio

    2017-01-01

    Isoprene is the most abundant biogenic volatile hydrocarbon compound naturally emitted by plants and plays a major role in atmospheric chemistry. It has been proposed that isoprene synthases (IspS) may readily evolve from other terpene synthases, but this hypothesis has not been experimentally investigated. We isolated and functionally validated in Arabidopsis the first isoprene synthase gene, AdoIspS, from a monocotyledonous species (Arundo donax L., Poaceae). Phylogenetic reconstruction indicates that AdoIspS and dicot isoprene synthases most likely originated by parallel evolution from TPS-b monoterpene synthases. Site-directed mutagenesis demonstrated in vivo the functional and evolutionary relevance of the residues considered diagnostic for IspS function. One of these positions was identified by saturating mutagenesis as a major determinant of substrate specificity in AdoIspS, able to cause in vivo a dramatic change in total volatile emission from hemi- to monoterpenes and supporting the evolution of isoprene synthases from ocimene synthases. The mechanism responsible for IspS neofunctionalization demonstrated in this study, modulation of active site size by a single amino acid mutation, might be general, as the very same amino acid position is implicated in the parallel evolution of different short-chain terpene synthases from both angiosperms and gymnosperms. Based on these results, we present a model reconciling in a unified conceptual framework the apparently contrasting patterns previously observed for isoprene synthase evolution in plants. These results indicate that parallel evolution may be driven by relatively simple biophysical constraints, and illustrate the intimate molecular evolutionary links between the structural and functional bases of traits with global relevance. PMID:28637270

  6. 77 FR 57086 - Radio Broadcasting Services; AM or FM Proposals To Change The Community of License.

    Federal Register 2010, 2011, 2012, 2013, 2014

    2012-09-17

    ...The following applicants filed AM or FM proposals to change the community of license: ALEXANDRA COMMUNICATIONS, INC., Station KRKZ-FM, Facility ID 189499, BPH-20120725AHL, From NETARTS, OR, To CHINOOK, WA; ALEXANDRA COMMUNICATIONS, INC., Station KTIL, Facility ID 50554, BMP-20120725AHO, From TILLAMOOK, OR, To NETARTS, OR; BIRACH BROADCASTING CORPORATION, Station NEW, Facility ID 136069, BMP-20120813ABI, From TERRE HAUTE, IN, To PEOTONE, IN; BRAHMIN BROADCASTING CORPORATION, Station KPAD, Facility ID 166006, BMPH-20111230ABO, From RAWLINS, WY, To WHEATLAND, WY; CITICASTERS LICENSES, INC., Station WOGB, Facility ID 89, BPH-20120720ACQ, From KAUKAUNA, WI, To REEDSVILLE, WI; CLEAR CHANNEL BROADCASTING LICENSES, INC., Station WQNS, Facility ID 41008, BPH-20120807ACK, From WAYNESVILLE, NC, To WOODFIN, NC; CORPORATION FOR NATIVE BROADCASTING, Station KXSW, Facility ID 171940, BPED-20120717AAL, From SISSETON, SD, To AGENCY VILLAGE, SD; CRAIN MEDIA GROUP, LLC, Station KEAZ, Facility ID 48748, BPH-20120716ADV, From HEBER SPRINGS, AR, To KENSETT, AR; DAIJ MEDIA, LLC, Station KJOZ, Facility ID 20625, BP-20120731AAA, From CONROE, TX, To FRIENDSWOOD, TX; ENTERTAINMENT MEDIA TRUST, DENNIS J. WATKINS, TRUSTEE, Station KQQZ, Facility ID 5281, BMP-20120628AAL, From FAIRVIEW HEIGHTS, IL, To DESOTO, MO; GOOD TIDINGS TRUST, INC., Station WAYR, Facility ID 24625, BP-20120724ABN, From ORANGE PARK, FL, To FLEMING ISLAND, FL; IHR EDUCATIONAL BROADCASTING, Station NEW, Facility ID 160745, BMP-20120821AAF, From MERRILL, OR, To ALTAMONT, OR; JER LICENSES, LLC, Station NEW, Facility ID 190382, BNPH-20120529ALR, From GUNNISON, CO, To DOTSERO, CO; KIERTRON, INC., Station KBRT, Facility ID 34588, BMP-20120809AAQ, From AVALON, CA, To COSTA MESA, CA; MALVERN ENTERTAINMENT CORPORATION, Station KHAN, Facility ID 164210, BPH-20120716ADT, From KENSETT, AR, To MAGNESS, AR; SYNERGY BROADCAST NORTH DAKOTA, LLC, Station KLTQ, Facility ID 164305, BPH-20120727AHW, From NEW ENGLAND, ND, To BEULAH, ND; SYNERGY BROADCAST NORTH DAKOTA, LLC, Station KQLZ, Facility ID 166059, BPH-20120727AID, From BEULAH, ND, To NEW ENGLAND, ND; THE OPP BROADCASTING CO., INC., Station WAMI-FM, Facility ID 66211, BPH-20120612ACO, From FORT DEPOSIT, AL, To OPP, AL; TRI STATE RADIO, LLC, Station KYLZ, Facility ID 170181, BPH-20120807ACF, From PAROWAN, UT, To ENOCH, UT.

  7. 50 CFR 12.6 - Bonded release.

    Code of Federal Regulations, 2014 CFR

    2014-10-01

    ..., payment of the value as determined under § 12.12) in place of any property seized under the Endangered Species Act, 16 U.S.C. 1531 et seq.; Marine Mammal Protection Act, 16 U.S.C. 1361 et seq.; Lacey Act, 18 U.S.C. 43; Lacey Act Amendments of 1981, 16 U.S.C. 3371 et seq.; Airborne Hunting Act, 16 U.S.C. 742j...

  8. 50 CFR 12.6 - Bonded release.

    Code of Federal Regulations, 2010 CFR

    2010-10-01

    ..., payment of the value as determined under § 12.12) in place of any property seized under the Endangered Species Act, 16 U.S.C. 1531 et seq.; Marine Mammal Protection Act, 16 U.S.C. 1361 et seq.; Lacey Act, 18 U.S.C. 43; Lacey Act Amendments of 1981, 16 U.S.C. 3371 et seq.; Airborne Hunting Act, 16 U.S.C. 742j...

  9. 50 CFR 12.6 - Bonded release.

    Code of Federal Regulations, 2011 CFR

    2011-10-01

    ..., payment of the value as determined under § 12.12) in place of any property seized under the Endangered Species Act, 16 U.S.C. 1531 et seq.; Marine Mammal Protection Act, 16 U.S.C. 1361 et seq.; Lacey Act, 18 U.S.C. 43; Lacey Act Amendments of 1981, 16 U.S.C. 3371 et seq.; Airborne Hunting Act, 16 U.S.C. 742j...

  10. 50 CFR 12.6 - Bonded release.

    Code of Federal Regulations, 2013 CFR

    2013-10-01

    ..., payment of the value as determined under § 12.12) in place of any property seized under the Endangered Species Act, 16 U.S.C. 1531 et seq.; Marine Mammal Protection Act, 16 U.S.C. 1361 et seq.; Lacey Act, 18 U.S.C. 43; Lacey Act Amendments of 1981, 16 U.S.C. 3371 et seq.; Airborne Hunting Act, 16 U.S.C. 742j...

  11. 50 CFR 12.6 - Bonded release.

    Code of Federal Regulations, 2012 CFR

    2012-10-01

    ..., payment of the value as determined under § 12.12) in place of any property seized under the Endangered Species Act, 16 U.S.C. 1531 et seq.; Marine Mammal Protection Act, 16 U.S.C. 1361 et seq.; Lacey Act, 18 U.S.C. 43; Lacey Act Amendments of 1981, 16 U.S.C. 3371 et seq.; Airborne Hunting Act, 16 U.S.C. 742j...

  12. 50 CFR 12.41 - Petition for restoration of proceeds.

    Code of Federal Regulations, 2012 CFR

    2012-10-01

    ... which has been forfeited under the Endangered Species Act, 16 U.S.C. 1531 et seq.; Eagle Protection Act, 16 U.S.C. 668 et seq.; Airborne Hunting Act, 16 U.S.C. 742j-1; or the Lacey Act Amendments of 1981... Marine Mammal Protection Act, 16 U.S.C. 1361 et seq., and sold according to law, may file with the...

  13. 50 CFR 12.41 - Petition for restoration of proceeds.

    Code of Federal Regulations, 2014 CFR

    2014-10-01

    ... which has been forfeited under the Endangered Species Act, 16 U.S.C. 1531 et seq.; Eagle Protection Act, 16 U.S.C. 668 et seq.; Airborne Hunting Act, 16 U.S.C. 742j-1; or the Lacey Act Amendments of 1981... Marine Mammal Protection Act, 16 U.S.C. 1361 et seq., and sold according to law, may file with the...

  14. 50 CFR 12.41 - Petition for restoration of proceeds.

    Code of Federal Regulations, 2010 CFR

    2010-10-01

    ... which has been forfeited under the Endangered Species Act, 16 U.S.C. 1531 et seq.; Eagle Protection Act, 16 U.S.C. 668 et seq.; Airborne Hunting Act, 16 U.S.C. 742j-1; or the Lacey Act Amendments of 1981... Marine Mammal Protection Act, 16 U.S.C. 1361 et seq., and sold according to law, may file with the...

  15. 50 CFR 12.41 - Petition for restoration of proceeds.

    Code of Federal Regulations, 2011 CFR

    2011-10-01

    ... which has been forfeited under the Endangered Species Act, 16 U.S.C. 1531 et seq.; Eagle Protection Act, 16 U.S.C. 668 et seq.; Airborne Hunting Act, 16 U.S.C. 742j-1; or the Lacey Act Amendments of 1981... Marine Mammal Protection Act, 16 U.S.C. 1361 et seq., and sold according to law, may file with the...

  16. 50 CFR 12.41 - Petition for restoration of proceeds.

    Code of Federal Regulations, 2013 CFR

    2013-10-01

    ... which has been forfeited under the Endangered Species Act, 16 U.S.C. 1531 et seq.; Eagle Protection Act, 16 U.S.C. 668 et seq.; Airborne Hunting Act, 16 U.S.C. 742j-1; or the Lacey Act Amendments of 1981... Marine Mammal Protection Act, 16 U.S.C. 1361 et seq., and sold according to law, may file with the...

  17. 33 CFR 148.737 - What environmental statutes must an applicant follow?

    Code of Federal Regulations, 2010 CFR

    2010-07-01

    ...; Archeological Resources Protection Act (AHPA), 16 U.S.C. 470 aa-ll, et. seq.; Architectural Barriers Act, 42 U.S... 1977 (CWA), Pub. L. 95-217, 33 U.S.C. 1251, et. seq.; Coastal Barrier Resources Act (CBRA), Pub. L. 97-348, 16 U.S.C. 3510, et. seq.; Coastal Zone Management Act (CZMA), Pub. L. 92-583, 16 U.S.C. 1451, et...

  18. 33 CFR 148.737 - What environmental statutes must an applicant follow?

    Code of Federal Regulations, 2011 CFR

    2011-07-01

    ...; Archeological Resources Protection Act (AHPA), 16 U.S.C. 470 aa-ll, et. seq.; Architectural Barriers Act, 42 U.S... 1977 (CWA), Pub. L. 95-217, 33 U.S.C. 1251, et. seq.; Coastal Barrier Resources Act (CBRA), Pub. L. 97-348, 16 U.S.C. 3510, et. seq.; Coastal Zone Management Act (CZMA), Pub. L. 92-583, 16 U.S.C. 1451, et...

  19. 17 CFR 200.30-14 - Delegation of authority to the General Counsel.

    Code of Federal Regulations, 2010 CFR

    2010-04-01

    ...., the Investment Advisers Act of 1940, 15 U.S.C. 80b-1 et seq., the Securities Investor Protection Act... U.S.C. 80b-1 et seq.), the Securities Investor Protection Act of 1970 (15 U.S.C. 78aaa et seq.) and...] Editorial Note: For Federal Register citations affecting § 200.30-14, see the List of CFR Sections Affected...

  20. 28 CFR Appendix to Subpart Y of... - Redelegations of Authority To Compromise and Close Civil Claims

    Code of Federal Regulations, 2011 CFR

    2011-07-01

    ... of 1937, as amended, 7 U.S.C. 601 et seq. (4) Suits by social security beneficiaries under the Social Security Act, 42 U.S.C. 402 et seq. (5) Social Security disability suits under 42 U.S.C. 423 et seq. (6... from local officials and the media when the action is commenced. Because the actual situation covered...

  1. 28 CFR Appendix to Subpart Y of... - Redelegations of Authority To Compromise and Close Civil Claims

    Code of Federal Regulations, 2010 CFR

    2010-07-01

    ... of 1937, as amended, 7 U.S.C. 601 et seq. (4) Suits by social security beneficiaries under the Social Security Act, 42 U.S.C. 402 et seq. (5) Social Security disability suits under 42 U.S.C. 423 et seq. (6... from local officials and the media when the action is commenced. Because the actual situation covered...

  2. 28 CFR Appendix to Subpart Y of... - Redelegations of Authority To Compromise and Close Civil Claims

    Code of Federal Regulations, 2013 CFR

    2013-07-01

    ... of 1937, as amended, 7 U.S.C. 601 et seq. (4) Suits by social security beneficiaries under the Social Security Act, 42 U.S.C. 402 et seq. (5) Social Security disability suits under 42 U.S.C. 423 et seq. (6... from local officials and the media when the action is commenced. Because the actual situation covered...

  3. 28 CFR Appendix to Subpart Y of... - Redelegations of Authority To Compromise and Close Civil Claims

    Code of Federal Regulations, 2014 CFR

    2014-07-01

    ... of 1937, as amended, 7 U.S.C. 601 et seq. (4) Suits by social security beneficiaries under the Social Security Act, 42 U.S.C. 402 et seq. (5) Social Security disability suits under 42 U.S.C. 423 et seq. (6... from local officials and the media when the action is commenced. Because the actual situation covered...

  4. 28 CFR Appendix to Subpart Y of... - Redelegations of Authority To Compromise and Close Civil Claims

    Code of Federal Regulations, 2012 CFR

    2012-07-01

    ... of 1937, as amended, 7 U.S.C. 601 et seq. (4) Suits by social security beneficiaries under the Social Security Act, 42 U.S.C. 402 et seq. (5) Social Security disability suits under 42 U.S.C. 423 et seq. (6... from local officials and the media when the action is commenced. Because the actual situation covered...

  5. Under representation of people with epilepsy and intellectual disability in research.

    PubMed

    Shankar, Rohit; Rowe, Charles; Van Hoorn, Alje; Henley, William; Laugharne, Richard; Cox, David; Pande, Raj; Roy, Ashok; Sander, Josemir W

    2018-01-01

    One quarter of people with epilepsy have an intellectual disability (ID) and one fifth of people with an ID have epilepsy. Both conditions are associated with higher levels of morbidity, stigma and premature mortality. There have been calls for action to promote more research in this group. We examined whether this group is adequately represented in current research. The proportion of research output in epilepsy conferences and publications relevant to ID and the proportion in ID conferences and publications on epilepsy for 2015-2016 were identified. As children account for 17% of the population with epilepsy, the research output on this group was compared with that on the ID group. Recognised material was classified based on whether it applied to general epilepsy/ID research, children with epilepsy or people with epilepsy and ID. Data were analysed to determine the proportion of presented research specifically identifying people with epilepsy and ID. Fewer than 2% of presentations at epilepsy conferences specifically related to the ID and epilepsy group, compared with 15% relating to children with epilepsy. Similarly, only 1.4% of the research presented at major ID conferences related to people with epilepsy and ID. About 5% of published research in the field of epilepsy related to those with ID, compared with 24% for children with epilepsy. Twelve percent of published research in ID specifically identified epilepsy. The population with epilepsy and comorbid ID is under-represented in publications and conference presentations. Increased research in this area might assist in improving the quality of care for this relatively neglected group.

  6. Evaluation of red blood cell and platelet antigen genotyping platforms (ID CORE XT/ID HPA XT) in routine clinical practice.

    PubMed

    Finning, Kirstin; Bhandari, Radhika; Sellers, Fiona; Revelli, Nicoletta; Villa, Maria Antonietta; Muñiz-Díaz, Eduardo; Nogués, Núria

    2016-03-01

    High-throughput genotyping platforms enable simultaneous analysis of multiple polymorphisms for blood group typing. BLOODchip® ID is a genotyping platform based on Luminex® xMAP technology for simultaneous determination of 37 red blood cell (RBC) antigens (ID CORE XT) and 18 human platelet antigens (HPA) (ID HPA XT) using the BIDS XT software. In this international multicentre study, the performance of ID CORE XT and ID HPA XT, using the centres' current genotyping methods as the reference for comparison, and the usability and practicality of these systems, were evaluated under working laboratory conditions. DNA was extracted from whole blood in EDTA with Qiagen methodologies. Ninety-six previously phenotyped/genotyped samples were processed per assay: 87 testing samples plus five positive controls and four negative controls. Results were available for 519 samples: 258 with ID CORE XT and 261 with ID HPA XT. There were three "no calls" that were either caused by human error or resolved after repeating the test. Agreement between the tests and reference methods was 99.94% for ID CORE XT (9,540/9,546 antigens determined) and 100% for ID HPA XT (all 4,698 alleles determined). There were six discrepancies in antigen results in five RBC samples, four of which (in VS, N, S and Do(a)) could not be investigated due to lack of sufficient sample to perform additional tests and two of which (in S and C) were resolved in favour of ID CORE XT (100% accuracy). The total hands-on time was 28-41 minutes for a batch of 16 samples. Compared with the reference platforms, ID CORE XT and ID HPA XT were considered simpler to use and had shorter processing times. ID CORE XT and ID HPA XT genotyping platforms for RBC and platelet systems were accurate and user-friendly in working laboratory settings.
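
    The agreement figures quoted in the abstract above can be checked with a few lines of arithmetic. This is only an illustrative sketch: the function name percent_agreement is hypothetical and not part of the study or of the BIDS XT software, and the counts are the ones stated in the abstract.

```python
def percent_agreement(concordant: int, total: int) -> float:
    """Return concordant calls as a percentage of all calls compared."""
    if total <= 0:
        raise ValueError("total must be positive")
    return 100.0 * concordant / total


# Counts as quoted in the abstract (antigens / alleles determined).
core_xt = percent_agreement(9540, 9546)  # ID CORE XT
hpa_xt = percent_agreement(4698, 4698)   # ID HPA XT

print(f"ID CORE XT agreement: {core_xt:.2f}%")
print(f"ID HPA XT agreement: {hpa_xt:.2f}%")
```

    Rounded to two decimals, the first value matches the 99.94% reported for ID CORE XT; the six discordant antigen results account for the remainder.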

  7. A Genome-Wide Association Study for Culm Cellulose Content in Barley Reveals Candidate Genes Co-Expressed with Members of the CELLULOSE SYNTHASE A Gene Family

    PubMed Central

    Houston, Kelly; Burton, Rachel A.; Sznajder, Beata; Rafalski, Antoni J.; Dhugga, Kanwarpal S.; Mather, Diane E.; Taylor, Jillian; Steffenson, Brian J.; Waugh, Robbie; Fincher, Geoffrey B.

    2015-01-01

    Cellulose is a fundamentally important component of cell walls of higher plants. It provides a scaffold that allows the development and growth of the plant to occur in an ordered fashion. Cellulose also provides mechanical strength, which is crucial for both normal development and to enable the plant to withstand both abiotic and biotic stresses. We quantified the cellulose concentration in the culm of 288 two-rowed and 288 six-rowed spring type barley accessions that were part of the USDA funded barley Coordinated Agricultural Project (CAP) program in the USA. When the population structure of these accessions was analysed we identified six distinct populations, four of which we considered to be comprised of a sufficient number of accessions to be suitable for genome-wide association studies (GWAS). These lines had been genotyped with 3072 SNPs so we combined the trait and genetic data to carry out GWAS. The analysis allowed us to identify regions of the genome containing significant associations between molecular markers and cellulose concentration data, including one region cross-validated in multiple populations. To identify candidate genes we assembled the gene content of these regions and used these to query a comprehensive RNA-seq based gene expression atlas. This provided us with gene annotations and associated expression data across multiple tissues, which allowed us to formulate a supported list of candidate genes that regulate cellulose biosynthesis. Several regions identified by our analysis contain genes that are co-expressed with CELLULOSE SYNTHASE A (HvCesA) across a range of tissues and developmental stages. These genes are involved in both primary and secondary cell wall development. In addition, genes that have been previously linked with cellulose synthesis by biochemical methods, such as HvCOBRA, a gene of unknown function, were also associated with cellulose levels in the association panel. Our analyses provide new insights into the genes that contribute to cellulose content in cereal culms and to a greater understanding of the interactions between them. PMID:26154104

  8. dCLIP: a computational approach for comparative CLIP-seq analyses

    PubMed Central

    2014-01-01

    Although comparison of RNA-protein interaction profiles across different conditions has become increasingly important to understanding the function of RNA-binding proteins (RBPs), few computational approaches have been developed for quantitative comparison of CLIP-seq datasets. Here, we present an easy-to-use command line tool, dCLIP, for quantitative CLIP-seq comparative analysis. The two-stage method implemented in dCLIP, including a modified MA normalization method and a hidden Markov model, is shown to be able to effectively identify differential binding regions of RBPs in four CLIP-seq datasets, generated by HITS-CLIP, iCLIP and PAR-CLIP protocols. dCLIP is freely available at http://qbrc.swmed.edu/software/. PMID:24398258
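
    For readers unfamiliar with MA normalization, the sketch below shows the standard MA transform on which such methods are built: M is the log2 ratio of two counts and A is their mean log2 intensity. It is not dCLIP's modified method, which the abstract does not describe in detail; the function name ma_transform and the pseudocount of 1 are assumptions made purely for illustration.

```python
import math


def ma_transform(count_a: float, count_b: float, pseudocount: float = 1.0):
    """Return (M, A) for a pair of read counts from two conditions.

    M = log2(a) - log2(b)          (log fold change)
    A = 0.5 * (log2(a) + log2(b))  (mean log intensity)
    The pseudocount is an illustrative assumption that avoids log2(0).
    """
    log_a = math.log2(count_a + pseudocount)
    log_b = math.log2(count_b + pseudocount)
    return log_a - log_b, 0.5 * (log_a + log_b)


# Hypothetical tag counts for one genomic bin in two CLIP-seq conditions.
m_value, a_value = ma_transform(7, 1)
print(m_value, a_value)
```

    In a typical workflow, bins with similar A values are used to estimate a systematic offset in M, which is then subtracted before downstream modelling; how dCLIP modifies this step is documented by its authors, not here.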

  9. Accumulation of prenyl alcohols by terpenoid biosynthesis inhibitors in various microorganisms.

    PubMed

    Muramatsu, Masayoshi; Ohto, Chikara; Obata, Shusei; Sakuradani, Eiji; Shimizu, Sakayu

    2008-09-01

    Squalene synthase inhibitors significantly accelerate the production of farnesol by various microorganisms. However, farnesol production by Saccharomyces cerevisiae ATCC 64031, in which the squalene synthase gene is deleted, was not affected by the inhibitors, indicating that farnesol accumulation is enhanced in the absence of squalene synthase activity. The combination of diphenylamine as an inhibitor of carotenoid biosynthesis and a squalene synthase inhibitor increases geranylgeraniol production by a yeast, Rhodotorula rubra NBRC 0870. An ent-kaurene synthase inhibitor also enhances the production of farnesol and geranylgeraniol by a filamentous fungus, Gibberella fujikuroi NBRC 30336. These results indicate that the inhibition of enzymes downstream of prenyl diphosphate synthase leads to the production of farnesol and geranylgeraniol.

  10. An Intrusion Detection System Based on Multi-Level Clustering for Hierarchical Wireless Sensor Networks

    PubMed Central

    Butun, Ismail; Ra, In-Ho; Sankar, Ravi

    2015-01-01

    In this work, an intrusion detection system (IDS) framework based on multi-level clustering for hierarchical wireless sensor networks is proposed. The framework employs two types of intrusion detection approaches: (1) “downward-IDS (D-IDS)” to detect the abnormal behavior (intrusion) of the subordinate (member) nodes; and (2) “upward-IDS (U-IDS)” to detect the abnormal behavior of the cluster heads. By using analytical calculations, the optimum parameters for the D-IDS (number of maximum hops) and U-IDS (monitoring group size) of the framework are evaluated and presented. PMID:26593915

  11. Diff-seq: A high throughput sequencing-based mismatch detection assay for DNA variant enrichment and discovery

    PubMed Central

    Karas, Vlad O; Sinnott-Armstrong, Nicholas A; Varghese, Vici; Shafer, Robert W; Greenleaf, William J; Sherlock, Gavin

    2018-01-01

    Much of the within-species genetic variation is in the form of single nucleotide polymorphisms (SNPs), typically detected by whole genome sequencing (WGS) or microarray-based technologies. However, WGS produces mostly uninformative reads that perfectly match the reference, while microarrays require genome-specific reagents. We have developed Diff-seq, a sequencing-based mismatch detection assay for SNP discovery without the requirement for specialized nucleic-acid reagents. Diff-seq leverages the Surveyor endonuclease to cleave mismatched DNA molecules that are generated after cross-annealing of a complex pool of DNA fragments. Sequencing libraries enriched for Surveyor-cleaved molecules result in increased coverage at the variant sites. Diff-seq detected all mismatches present in an initial test substrate, with specific enrichment dependent on the identity and context of the variation. Application to viral sequences resulted in increased observation of variant alleles in a biologically relevant context. Diff-seq has the potential to increase the sensitivity and efficiency of high-throughput sequencing in the detection of variation. PMID:29361139

  12. A survey of motif finding Web tools for detecting binding site motifs in ChIP-Seq data

    PubMed Central

    2014-01-01

    ChIP-Seq (chromatin immunoprecipitation sequencing) offers an advantage for motif finding because ChIP-Seq experiments narrow the search to binding site locations. Recent motif finding tools facilitate motif detection by providing user-friendly Web interfaces. In this work, we reviewed nine motif finding Web tools that are capable of detecting binding site motifs in ChIP-Seq data. We showed that each motif finding Web tool has its own advantages for detecting motifs that other tools may not discover. We recommend that users apply multiple motif finding Web tools that implement different algorithms to obtain significant motifs, overlapping similar motifs, and non-overlapping motifs. Finally, we provide suggestions for the future development of motif finding Web tools that better assist researchers in finding motifs in ChIP-Seq data. Reviewers: This article was reviewed by Prof. Sandor Pongor, Dr. Yuriy Gusev, and Dr. Shyam Prabhakar (nominated by Prof. Limsoon Wong). PMID:24555784

  13. How B-Cell Receptor Repertoire Sequencing Can Be Enriched with Structural Antibody Data

    PubMed Central

    Kovaltsuk, Aleksandr; Krawczyk, Konrad; Galson, Jacob D.; Kelly, Dominic F.; Deane, Charlotte M.; Trück, Johannes

    2017-01-01

    Next-generation sequencing of immunoglobulin gene repertoires (Ig-seq) allows the investigation of large-scale antibody dynamics at a sequence level. However, structural information, a crucial descriptor of antibody binding capability, is not collected in Ig-seq protocols. Developing systematic relationships between the antibody sequence information gathered from Ig-seq and low-throughput techniques such as X-ray crystallography could radically improve our understanding of antibodies. The mapping of Ig-seq datasets to known antibody structures can indicate structurally, and perhaps functionally, uncharted areas. Furthermore, contrasting naïve and antigenically challenged datasets using structural antibody descriptors should provide insights into antibody maturation. As the number of antibody structures steadily increases and more and more Ig-seq datasets become available, the opportunities that arise from combining the two types of information increase as well. Here, we review how these data types enrich one another and show potential for advancing our knowledge of the immune system and improving antibody engineering. PMID:29276518

  14. Technical variations in low-input RNA-seq methodologies.

    PubMed

    Bhargava, Vipul; Head, Steven R; Ordoukhanian, Phillip; Mercola, Mark; Subramaniam, Shankar

    2014-01-14

    Recent advances in RNA-seq methodologies from limiting amounts of mRNA have facilitated the characterization of rare cell-types in various biological systems. So far, however, technical variations in these methods have not been adequately characterized, vis-à-vis sensitivity, starting with reduced levels of mRNA. Here, we generated sequencing libraries from limiting amounts of mRNA using three amplification-based methods, viz. Smart-seq, DP-seq and CEL-seq, and demonstrated significant technical variations in these libraries. Reduction in mRNA levels led to inefficient amplification of the majority of low to moderately expressed transcripts. Furthermore, noise in primer hybridization and/or enzyme incorporation was magnified during the amplification step resulting in significant distortions in fold changes of the transcripts. Consequently, the majority of the differentially expressed transcripts identified were either high-expressed and/or exhibited high fold changes. High technical variations ultimately masked subtle biological differences mandating the development of improved amplification-based strategies for quantitative transcriptomics from limiting amounts of mRNA.

  15. Single-cell transcriptional dynamics of flavivirus infection

    PubMed Central

    Bekerman, Elena

    2018-01-01

    Dengue and Zika viral infections affect millions of people annually and can be complicated by hemorrhage and shock or neurological manifestations, respectively. However, a thorough understanding of the host response to these viruses is lacking, partly because conventional approaches ignore heterogeneity in virus abundance across cells. We present viscRNA-Seq (virus-inclusive single cell RNA-Seq), an approach to probe the host transcriptome together with intracellular viral RNA at the single cell level. We applied viscRNA-Seq to monitor dengue and Zika virus infection in cultured cells and discovered extreme heterogeneity in virus abundance. We exploited this variation to identify host factors that show complex dynamics and a high degree of specificity for either virus, including proteins involved in the endoplasmic reticulum translocon, signal peptide processing, and membrane trafficking. We validated the viscRNA-Seq hits and discovered novel proviral and antiviral factors. viscRNA-Seq is a powerful approach to assess the genome-wide virus-host dynamics at single cell level. PMID:29451494

  16. RNA-seq Data: Challenges in and Recommendations for Experimental Design and Analysis.

    PubMed

    Williams, Alexander G; Thomas, Sean; Wyman, Stacia K; Holloway, Alisha K

    2014-10-01

    RNA-seq is widely used to determine differential expression of genes or transcripts as well as identify novel transcripts, identify allele-specific expression, and precisely measure translation of transcripts. Thoughtful experimental design and choice of analysis tools are critical to ensure high-quality data and interpretable results. Important considerations for experimental design include number of replicates, whether to collect paired-end or single-end reads, sequence length, and sequencing depth. Common analysis steps in all RNA-seq experiments include quality control, read alignment, assigning reads to genes or transcripts, and estimating gene or transcript abundance. Our aims are two-fold: to make recommendations for common components of experimental design and assess tool capabilities for each of these steps. We also test tools designed to detect differential expression, since this is the most widespread application of RNA-seq. We hope that these analyses will help guide those who are new to RNA-seq and will generate discussion about remaining needs for tool improvement and development. Copyright © 2014 John Wiley & Sons, Inc.

  17. Comprehensive RNA-Seq profiling to evaluate lactating sheep mammary gland transcriptome

    PubMed Central

    Suárez-Vega, Aroa; Gutiérrez-Gil, Beatriz; Klopp, Christophe; Tosser-Klopp, Gwenola; Arranz, Juan-José

    2016-01-01

    RNA-Seq enables the generation of extensive transcriptome information providing the capability to characterize transcripts (including alternative isoforms and polymorphism), to quantify expression and to identify differential regulation in a single experiment. Our aim in this study was to take advantage of using RNA-Seq high-throughput technology to provide a comprehensive transcriptome profiling of the sheep lactating mammary gland. Eight ewes of two dairy sheep breeds with differences in milk production traits were used in this experiment (four Churra and four Assaf ewes). Milk samples from these animals were collected on days 10, 50, 120 and 150 after lambing to cover the various physiological stages of the mammary gland across the complete lactation. RNA samples were extracted from milk somatic cells. The RNA-Seq dataset was generated using an Illumina HiSeq 2000 sequencer. The information reported here will be useful to understand the biology of lactation in sheep, providing also an opportunity to characterize their different patterns on milk production aptitude. PMID:27377755

  18. Integrating single-cell transcriptomic data across different conditions, technologies, and species.

    PubMed

    Butler, Andrew; Hoffman, Paul; Smibert, Peter; Papalexi, Efthymia; Satija, Rahul

    2018-06-01

    Computational single-cell RNA-seq (scRNA-seq) methods have been successfully applied to experiments representing a single condition, technology, or species to discover and define cellular phenotypes. However, identifying subpopulations of cells that are present across multiple data sets remains challenging. Here, we introduce an analytical strategy for integrating scRNA-seq data sets based on common sources of variation, enabling the identification of shared populations across data sets and downstream comparative analysis. We apply this approach, implemented in our R toolkit Seurat (http://satijalab.org/seurat/), to align scRNA-seq data sets of peripheral blood mononuclear cells under resting and stimulated conditions, hematopoietic progenitors sequenced using two profiling technologies, and pancreatic cell 'atlases' generated from human and mouse islets. In each case, we learn distinct or transitional cell states jointly across data sets, while boosting statistical power through integrated analysis. Our approach facilitates general comparisons of scRNA-seq data sets, potentially deepening our understanding of how distinct cell states respond to perturbation, disease, and evolution.

  19. A multiplexed single-cell CRISPR screening platform enables systematic dissection of the unfolded protein response

    PubMed Central

    Adamson, Britt; Norman, Thomas M.; Jost, Marco; Cho, Min Y.; Nuñez, James K.; Chen, Yuwen; Villalta, Jacqueline E.; Gilbert, Luke A.; Horlbeck, Max A.; Hein, Marco Y.; Pak, Ryan A.; Gray, Andrew N.; Gross, Carol A.; Dixit, Atray; Parnas, Oren; Regev, Aviv; Weissman, Jonathan S.

    2016-01-01

    Functional genomics efforts face tradeoffs between number of perturbations examined and complexity of phenotypes measured. We bridge this gap with Perturb-seq, which combines droplet-based single-cell RNA-seq with a strategy for barcoding CRISPR-mediated perturbations, allowing many perturbations to be profiled in pooled format. We applied Perturb-seq to dissect the mammalian unfolded protein response (UPR) using single and combinatorial CRISPR perturbations. Two genome-scale CRISPR interference (CRISPRi) screens identified genes whose repression perturbs ER homeostasis. Subjecting ~100 hits to Perturb-seq enabled high-precision functional clustering of genes. Single-cell analyses decoupled the three UPR branches, revealed bifurcated UPR branch activation among cells subject to the same perturbation, and uncovered differential activation of the branches across hits, including an isolated feedback loop between the translocon and IRE1α. These studies provide insight into how the three sensors of ER homeostasis monitor distinct types of stress and highlight the ability of Perturb-seq to dissect complex cellular responses. PMID:27984733

  20. Using single nuclei for RNA-seq to capture the transcriptome of postmortem neurons

    PubMed Central

    Krishnaswami, Suguna Rani; Grindberg, Rashel V; Novotny, Mark; Venepally, Pratap; Lacar, Benjamin; Bhutani, Kunal; Linker, Sara B; Pham, Son; Erwin, Jennifer A; Miller, Jeremy A; Hodge, Rebecca; McCarthy, James K; Kelder, Martin; McCorrison, Jamison; Aevermann, Brian D; Fuertes, Francisco Diez; Scheuermann, Richard H; Lee, Jun; Lein, Ed S; Schork, Nicholas; McConnell, Michael J; Gage, Fred H; Lasken, Roger S

    2016-01-01

    A protocol is described for sequencing the transcriptome of a cell nucleus. Nuclei are isolated from specimens and sorted by FACS, cDNA libraries are constructed and RNA-seq is performed, followed by data analysis. Some steps follow published methods (Smart-seq2 for cDNA synthesis and Nextera XT barcoded library preparation) and are not described in detail here. Previous single-cell approaches for RNA-seq from tissues include cell dissociation using protease treatment at 30 °C, which is known to alter the transcriptome. We isolate nuclei at 4 °C from tissue homogenates, which cause minimal damage. Nuclear transcriptomes can be obtained from postmortem human brain tissue stored at −80 °C, making brain archives accessible for RNA-seq from individual neurons. The method also allows investigation of biological features unique to nuclei, such as enrichment of certain transcripts and precursors of some noncoding RNAs. By following this procedure, it takes about 4 d to construct cDNA libraries that are ready for sequencing. PMID:26890679

  1. An interactive environment for agile analysis and visualization of ChIP-sequencing data.

    PubMed

    Lerdrup, Mads; Johansen, Jens Vilstrup; Agrawal-Singh, Shuchi; Hansen, Klaus

    2016-04-01

    To empower experimentalists with a means for fast and comprehensive chromatin immunoprecipitation sequencing (ChIP-seq) data analyses, we introduce an integrated computational environment, EaSeq. The software combines the exploratory power of genome browsers with an extensive set of interactive and user-friendly tools for genome-wide abstraction and visualization. It enables experimentalists to easily extract information and generate hypotheses from their own data and public genome-wide datasets. For demonstration purposes, we performed meta-analyses of public Polycomb ChIP-seq data and established a new screening approach to analyze more than 900 datasets from mouse embryonic stem cells for factors potentially associated with Polycomb recruitment. EaSeq, which is freely available and works on a standard personal computer, can substantially increase the throughput of many analysis workflows, facilitate transparency and reproducibility by automatically documenting and organizing analyses, and enable a broader group of scientists to gain insights from ChIP-seq data.

  2. DTWscore: differential expression and cell clustering analysis for time-series single-cell RNA-seq data.

    PubMed

    Wang, Zhuo; Jin, Shuilin; Liu, Guiyou; Zhang, Xiurui; Wang, Nan; Wu, Deliang; Hu, Yang; Zhang, Chiping; Jiang, Qinghua; Xu, Li; Wang, Yadong

    2017-05-23

    The development of single-cell RNA sequencing has enabled profound discoveries in biology, ranging from the dissection of the composition of complex tissues to the identification of novel cell types and dynamics in some specialized cellular environments. However, the large-scale generation of single-cell RNA-seq (scRNA-seq) data collected at multiple time points remains a challenge to the effective measurement of gene expression patterns in transcriptome analysis. We present an algorithm based on the Dynamic Time Warping score (DTWscore) combined with time-series data that enables the detection of gene expression changes across scRNA-seq samples and recovery of potential cell types from complex mixtures of multiple cell types. The DTWscore successfully classifies cells of different types using the most highly variable genes from time-series scRNA-seq data. The study was confined to methods that are implemented and available within the R framework. Sample datasets and R packages are available at https://github.com/xiaoxiaoxier/DTWscore .
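
    The dynamic time warping idea that the abstract builds on can be illustrated with a minimal Python sketch. This is the textbook DTW distance with an absolute-difference cost, not the authors' DTWscore implementation (which is distributed as an R package); the function name `dtw_distance` and the toy expression profiles are assumptions made for illustration only.

```python
import math
from typing import Sequence


def dtw_distance(a: Sequence[float], b: Sequence[float]) -> float:
    """Textbook dynamic time warping distance (absolute-difference cost).

    Illustrative only: this is NOT the DTWscore statistic from the paper.
    """
    n, m = len(a), len(b)
    # cost[i][j] holds the minimal cumulative cost of aligning a[:i] with b[:j].
    cost = [[math.inf] * (m + 1) for _ in range(n + 1)]
    cost[0][0] = 0.0
    for i in range(1, n + 1):
        for j in range(1, m + 1):
            step = abs(a[i - 1] - b[j - 1])
            # Extend the cheapest of the three allowed predecessor alignments.
            cost[i][j] = step + min(
                cost[i - 1][j],      # advance in a only
                cost[i][j - 1],      # advance in b only
                cost[i - 1][j - 1],  # advance in both
            )
    return cost[n][m]


# Hypothetical expression profiles of one gene across time points in two cells.
cell_1 = [1.0, 2.0, 3.0]
cell_2 = [1.0, 2.0, 2.0, 3.0]  # same trend, stretched in time
print(dtw_distance(cell_1, cell_2))
```

    Because warping lets one time point match several, two profiles with the same shape but different pacing receive a small distance, which is the property that makes DTW attractive for time-series expression data.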

  3. From reads to genes to pathways: differential expression analysis of RNA-Seq experiments using Rsubread and the edgeR quasi-likelihood pipeline.

    PubMed

    Chen, Yunshun; Lun, Aaron T L; Smyth, Gordon K

    2016-01-01

    In recent years, RNA sequencing (RNA-seq) has become a very widely used technology for profiling gene expression. One of the most common aims of RNA-seq profiling is to identify genes or molecular pathways that are differentially expressed (DE) between two or more biological conditions. This article demonstrates a computational workflow for the detection of DE genes and pathways from RNA-seq data by providing a complete analysis of an RNA-seq experiment profiling epithelial cell subsets in the mouse mammary gland. The workflow uses R software packages from the open-source Bioconductor project and covers all steps of the analysis pipeline, including alignment of read sequences, data exploration, differential expression analysis, visualization and pathway analysis. Read alignment and count quantification is conducted using the Rsubread package and the statistical analyses are performed using the edgeR package. The differential expression analysis uses the quasi-likelihood functionality of edgeR.

  4. Comprehensive RNA-Seq profiling to evaluate lactating sheep mammary gland transcriptome.

    PubMed

    Suárez-Vega, Aroa; Gutiérrez-Gil, Beatriz; Klopp, Christophe; Tosser-Klopp, Gwenola; Arranz, Juan-José

    2016-07-05

    RNA-Seq enables the generation of extensive transcriptome information providing the capability to characterize transcripts (including alternative isoforms and polymorphism), to quantify expression and to identify differential regulation in a single experiment. Our aim in this study was to take advantage of using RNA-Seq high-throughput technology to provide a comprehensive transcriptome profiling of the sheep lactating mammary gland. Eight ewes of two dairy sheep breeds with differences in milk production traits were used in this experiment (four Churra and four Assaf ewes). Milk samples from these animals were collected on days 10, 50, 120 and 150 after lambing to cover the various physiological stages of the mammary gland across the complete lactation. RNA samples were extracted from milk somatic cells. The RNA-Seq dataset was generated using an Illumina HiSeq 2000 sequencer. The information reported here will be useful to understand the biology of lactation in sheep, providing also an opportunity to characterize their different patterns on milk production aptitude.

  5. Structural Basis for a Unique ATP Synthase Core Complex from Nanoarcheaum equitans

    PubMed Central

    Mohanty, Soumya; Jobichen, Chacko; Chichili, Vishnu Priyanka Reddy; Velázquez-Campoy, Adrián; Low, Boon Chuan; Hogue, Christopher W. V.; Sivaraman, J.

    2015-01-01

    ATP synthesis is a critical and universal life process carried out by ATP synthases. Whereas eukaryotic and prokaryotic ATP synthases are well characterized, archaeal ATP synthases are relatively poorly understood. The hyperthermophilic archaeal parasite, Nanoarcheaum equitans, lacks several subunits of the ATP synthase and is suspected to be energetically dependent on its host, Ignicoccus hospitalis. This suggests that this ATP synthase might be a rudimentary machine. Here, we report the crystal structures and biophysical studies of the regulatory subunit, NeqB, the apo-NeqAB, and NeqAB in complex with nucleotides, ADP, and adenylyl-imidodiphosphate (non-hydrolysable analog of ATP). NeqB is ∼20 amino acids shorter at its C terminus than its homologs, but this does not impede its binding with NeqA to form the complex. The heterodimeric NeqAB complex assumes a closed, rigid conformation irrespective of nucleotide binding; this differs from its homologs, which require conformational changes for catalytic activity. Thus, although N. equitans possesses an ATP synthase core A3B3 hexameric complex, it might not function as a bona fide ATP synthase. PMID:26370083

  6. Interaction of Constitutive Nitric Oxide Synthases with Cyclooxygenases in Regulation of Bicarbonate Secretion in the Gastric Mucosa.

    PubMed

    Zolotarev, V A; Andreeva, Yu V; Vershinina, E; Khropycheva, R P

    2017-05-01

    Neuronal NO synthase blocker 7-nitroindazole suppressed bicarbonate secretion in rat gastric mucosa induced by mild local irritation with 1 M NaCl (pH 2.0). Non-selective blocker of neuronal and endothelial synthases, Nω-nitro-L-arginine (L-NNA), did not affect HCO₃⁻ production, but inhibited secretion after pretreatment with omeprazole. Non-selective cyclooxygenase blocker indomethacin inhibited HCO₃⁻ production under conditions of normal synthase activity and in the presence of L-NNA, but was ineffective when co-administered with 7-nitroindazole. It was concluded that neuronal and endothelial synthases are involved in different mechanisms of regulation of HCO₃⁻ secretion in the gastric mucosa induced by mild irritation. Activation of neuronal synthase stimulated HCO₃⁻ production, which is mediated mainly through activation of cyclooxygenase. Theoretically, activation of endothelial synthase should suppress HCO₃⁻ production. The effect of endothelial synthase depends on acid secretion in the stomach and bicarbonate concentration in the submucosa, as it was demonstrated in experiments with intravenous NaHCO₃ infusion.

  7. Nitric Oxide Synthase and Neuronal NADPH Diaphorase are Identical in Brain and Peripheral Tissues

    NASA Astrophysics Data System (ADS)

    Dawson, Ted M.; Bredt, David S.; Fotuhi, Majid; Hwang, Paul M.; Snyder, Solomon H.

    1991-09-01

    NADPH diaphorase staining neurons, uniquely resistant to toxic insults and neurodegenerative disorders, have been colocalized with neurons in the brain and peripheral tissue containing nitric oxide synthase (EC 1.14.23.-), which generates nitric oxide (NO), a recently identified neuronal messenger molecule. In the corpus striatum and cerebral cortex, NO synthase immunoreactivity and NADPH diaphorase staining are colocalized in medium to large aspiny neurons. These same neurons colocalize with somatostatin and neuropeptide Y immunoreactivity. NO synthase immunoreactivity and NADPH diaphorase staining are colocalized in the pedunculopontine nucleus with choline acetyltransferase-containing cells and are also colocalized in amacrine cells of the inner nuclear layer and ganglion cells of the retina, myenteric plexus neurons of the intestine, and ganglion cells of the adrenal medulla. Transfection of human kidney cells with NO synthase cDNA elicits NADPH diaphorase staining. The ratio of NO synthase to NADPH diaphorase staining in the transfected cells is the same as in neurons, indicating that NO synthase fully accounts for observed NADPH staining. The identity of neuronal NO synthase and NADPH diaphorase suggests a role for NO in modulating neurotoxicity.

  8. Isolation and functional effects of monoclonal antibodies binding to thymidylate synthase.

    PubMed

    Jastreboff, M M; Todd, M B; Malech, H L; Bertino, J R

    1985-01-29

    Monoclonal antibodies against electrophoretically pure thymidylate synthase from HeLa cells have been produced. Antibodies (M-TS-4 and M-TS-9) from hybridoma clones were shown by enzyme-linked immunoassay to recognize thymidylate synthase from a variety of human cell lines, but they did not bind to thymidylate synthase from mouse cell lines. The strongest binding of antibodies was observed to enzyme from HeLa cells. These two monoclonal antibodies bind simultaneously to different antigenic sites on thymidylate synthase purified from HeLa cells, as reflected by a high additivity index and results of cross-linked radioimmunoassay. Both monoclonal antibodies inhibit the activity of thymidylate synthase from human cell lines. The strongest inhibition was observed with thymidylate synthase from HeLa cells. Monoclonal antibody M-TS-9 (IgM subclass) decreased the rate of binding of [3H]FdUMP to thymidylate synthase in the presence of 5,10-methylenetetrahydrofolate while M-TS-4 (IgG1) did not change the rate of ternary complex formation. These data indicate that the antibodies recognize different epitopes on the enzyme molecule.

  9. Friedelin Synthase from Maytenus ilicifolia: Leucine 482 Plays an Essential Role in the Production of the Most Rearranged Pentacyclic Triterpene

    PubMed Central

    Souza-Moreira, Tatiana M.; Alves, Thaís B.; Pinheiro, Karina A.; Felippe, Lidiane G.; De Lima, Gustavo M. A.; Watanabe, Tatiana F.; Barbosa, Cristina C.; Santos, Vânia A. F. F. M.; Lopes, Norberto P.; Valentini, Sandro R.; Guido, Rafael V. C.; Furlan, Maysa; Zanelli, Cleslei F.

    2016-01-01

    Among the biologically active triterpenes, friedelin has the most-rearranged structure produced by the oxidosqualene cyclases and is the only one containing a ketone group. In this study, we cloned and functionally characterized friedelin synthase and one cycloartenol synthase from Maytenus ilicifolia (Celastraceae). The complete coding sequences of these 2 genes were cloned from leaf mRNA, and their functions were characterized by heterologous expression in yeast. The cycloartenol synthase sequence is very similar to other known OSCs of this type (approximately 80% identity), although the M. ilicifolia friedelin synthase amino acid sequence is more closely related to β-amyrin synthases (65–74% identity), which is similar to the friedelin synthase cloned from Kalanchoe daigremontiana. Multiple sequence alignments demonstrated the presence of a leucine residue two positions upstream of the friedelin synthase Asp-Cys-Thr-Ala-Glu (DCTAE) active site motif, while the vast majority of OSCs identified so far have a valine or isoleucine residue at the same position. The substitution of the leucine residue with valine, threonine or isoleucine in M. ilicifolia friedelin synthase interfered with substrate recognition and led to the production of different pentacyclic triterpenes. Hence, our data indicate a key role for the leucine residue in the structure and function of this oxidosqualene cyclase. PMID:27874020

  10. Friedelin Synthase from Maytenus ilicifolia: Leucine 482 Plays an Essential Role in the Production of the Most Rearranged Pentacyclic Triterpene

    NASA Astrophysics Data System (ADS)

    Souza-Moreira, Tatiana M.; Alves, Thaís B.; Pinheiro, Karina A.; Felippe, Lidiane G.; de Lima, Gustavo M. A.; Watanabe, Tatiana F.; Barbosa, Cristina C.; Santos, Vânia A. F. F. M.; Lopes, Norberto P.; Valentini, Sandro R.; Guido, Rafael V. C.; Furlan, Maysa; Zanelli, Cleslei F.

    2016-11-01

    Among the biologically active triterpenes, friedelin has the most-rearranged structure produced by the oxidosqualene cyclases and is the only one containing a ketone group. In this study, we cloned and functionally characterized friedelin synthase and one cycloartenol synthase from Maytenus ilicifolia (Celastraceae). The complete coding sequences of these 2 genes were cloned from leaf mRNA, and their functions were characterized by heterologous expression in yeast. The cycloartenol synthase sequence is very similar to other known OSCs of this type (approximately 80% identity), although the M. ilicifolia friedelin synthase amino acid sequence is more closely related to β-amyrin synthases (65-74% identity), which is similar to the friedelin synthase cloned from Kalanchoe daigremontiana. Multiple sequence alignments demonstrated the presence of a leucine residue two positions upstream of the friedelin synthase Asp-Cys-Thr-Ala-Glu (DCTAE) active site motif, while the vast majority of OSCs identified so far have a valine or isoleucine residue at the same position. The substitution of the leucine residue with valine, threonine or isoleucine in M. ilicifolia friedelin synthase interfered with substrate recognition and led to the production of different pentacyclic triterpenes. Hence, our data indicate a key role for the leucine residue in the structure and function of this oxidosqualene cyclase.

  11. 50 CFR 12.22 - Civil actions to obtain forfeiture.

    Code of Federal Regulations, 2012 CFR

    2012-10-01

    ... Endangered Species Act, 16 U.S.C. 1531 et seq. Before any such action is filed against property subject to... property subject to forfeiture under the Airborne Hunting Act, 16 U.S.C. 742j-1; Lacey Act, 18 U.S.C. 43-44; Lacey Act Amendments of 1981, 16 U.S.C. 3371 et seq.; Black Bass Act, 16 U.S.C. 851 et seq.; Marine...

  12. 50 CFR 12.22 - Civil actions to obtain forfeiture.

    Code of Federal Regulations, 2014 CFR

    2014-10-01

    ... Endangered Species Act, 16 U.S.C. 1531 et seq. Before any such action is filed against property subject to... property subject to forfeiture under the Airborne Hunting Act, 16 U.S.C. 742j-1; Lacey Act, 18 U.S.C. 43-44; Lacey Act Amendments of 1981, 16 U.S.C. 3371 et seq.; Black Bass Act, 16 U.S.C. 851 et seq.; Marine...

  13. 50 CFR 12.22 - Civil actions to obtain forfeiture.

    Code of Federal Regulations, 2010 CFR

    2010-10-01

    ... Endangered Species Act, 16 U.S.C. 1531 et seq. Before any such action is filed against property subject to... property subject to forfeiture under the Airborne Hunting Act, 16 U.S.C. 742j-1; Lacey Act, 18 U.S.C. 43-44; Lacey Act Amendments of 1981, 16 U.S.C. 3371 et seq.; Black Bass Act, 16 U.S.C. 851 et seq.; Marine...

  14. 50 CFR 12.22 - Civil actions to obtain forfeiture.

    Code of Federal Regulations, 2013 CFR

    2013-10-01

    ... Endangered Species Act, 16 U.S.C. 1531 et seq. Before any such action is filed against property subject to... property subject to forfeiture under the Airborne Hunting Act, 16 U.S.C. 742j-1; Lacey Act, 18 U.S.C. 43-44; Lacey Act Amendments of 1981, 16 U.S.C. 3371 et seq.; Black Bass Act, 16 U.S.C. 851 et seq.; Marine...

  15. 50 CFR 12.22 - Civil actions to obtain forfeiture.

    Code of Federal Regulations, 2011 CFR

    2011-10-01

    ... Endangered Species Act, 16 U.S.C. 1531 et seq. Before any such action is filed against property subject to... property subject to forfeiture under the Airborne Hunting Act, 16 U.S.C. 742j-1; Lacey Act, 18 U.S.C. 43-44; Lacey Act Amendments of 1981, 16 U.S.C. 3371 et seq.; Black Bass Act, 16 U.S.C. 851 et seq.; Marine...

  16. Environmental Assessment: Hurlburt Field Soundside Boathouse and Restroom Facility Construction

    DTIC Science & Technology

    2007-08-01

    seq., and Air Force Instruction (AFI) 32-7061, The Environmental Impact Analysis Process, the USAF concludes that the Proposed Action will have no ... U.S.C.) §4321, et seq., and Air Force Instruction (AFI) 32-7061, The Environmental Impact Analysis Process, the USAF concludes that the Proposed ... et seq. • AFI 32-7061, The Environmental Impact Analysis Process These regulations require federal agencies to analyze the potential environmental

  17. 78 FR 9435 - Submission for OMB Review; Comment Request

    Federal Register 2010, 2011, 2012, 2013, 2014

    2013-02-08

    ... is hereby given that, pursuant to the Paperwork Reduction Act of 1995 (44 U.S.C. 3501 et seq.), the... securities on Forms F-7, F-8, F-9 or F-10 under the Securities Act of 1933 (U.S.C. 77a et seq.), or filing periodic reports on Form 40-F under the Exchange Act of 1934 (15 U.S.C. 78a et seq.). The information...

  18. Model-based clustering for RNA-seq data.

    PubMed

    Si, Yaqing; Liu, Peng; Li, Pinghua; Brutnell, Thomas P

    2014-01-15

    RNA-seq technology has been widely adopted as an attractive alternative to microarray-based methods to study global gene expression. However, robust statistical tools to analyze these complex datasets are still lacking. By grouping genes with similar expression profiles across treatments, cluster analysis provides insight into gene functions and networks, and hence is an important technique for RNA-seq data analysis. In this manuscript, we derive clustering algorithms based on appropriate probability models for RNA-seq data. An expectation-maximization algorithm and another two stochastic versions of expectation-maximization algorithms are described. In addition, a strategy for initialization based on likelihood is proposed to improve the clustering algorithms. Moreover, we present a model-based hybrid-hierarchical clustering method to generate a tree structure that allows visualization of relationships among clusters as well as flexibility of choosing the number of clusters. Results from both simulation studies and analysis of a maize RNA-seq dataset show that our proposed methods provide better clustering results than alternative methods such as the K-means algorithm and hierarchical clustering methods that are not based on probability models. An R package, MBCluster.Seq, has been developed to implement our proposed algorithms. This R package provides fast computation and is publicly available at http://www.r-project.org
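
    To make the expectation-maximization idea in this abstract concrete, the following Python sketch fits a plain mixture of Poisson distributions to one-dimensional counts. It is a deliberately simplified stand-in, not the MBCluster.Seq algorithm: the paper's probability models and likelihood-based initialization are not reproduced here, and the function names, starting rates and toy counts are all assumptions chosen for illustration.

```python
import math
from typing import List, Sequence, Tuple


def poisson_logpmf(k: int, lam: float) -> float:
    """Log probability of count k under a Poisson distribution with rate lam."""
    return k * math.log(lam) - lam - math.lgamma(k + 1)


def em_poisson_mixture(
    counts: Sequence[int], init_rates: Sequence[float], n_iter: int = 50
) -> Tuple[List[float], List[float], List[int]]:
    """Simplified EM for a Poisson mixture (illustrative; not MBCluster.Seq)."""
    k = len(init_rates)
    weights = [1.0 / k] * k
    rates = list(init_rates)

    def e_step() -> List[List[float]]:
        # Responsibility of each cluster for each count, computed in log space.
        resp = []
        for x in counts:
            logp = [
                math.log(max(w, 1e-300)) + poisson_logpmf(x, lam)
                for w, lam in zip(weights, rates)
            ]
            top = max(logp)
            unnorm = [math.exp(v - top) for v in logp]
            total = sum(unnorm)
            resp.append([u / total for u in unnorm])
        return resp

    for _ in range(n_iter):
        resp = e_step()
        # M-step: update mixing weights and rates from the responsibilities.
        for j in range(k):
            nj = sum(r[j] for r in resp)
            weights[j] = nj / len(counts)
            weighted_sum = sum(r[j] * x for r, x in zip(resp, counts))
            rates[j] = max(weighted_sum / max(nj, 1e-300), 1e-12)

    # Hard cluster labels from the final responsibilities.
    labels = [max(range(k), key=lambda j: r[j]) for r in e_step()]
    return weights, rates, labels


# Hypothetical read counts: five lowly and five highly expressed genes.
toy_counts = [1, 2, 0, 1, 3, 20, 22, 19, 25, 21]
weights, rates, labels = em_poisson_mixture(toy_counts, init_rates=[1.0, 10.0])
print(labels)
```

    The E-step assigns each count a soft membership in every cluster and the M-step re-estimates each cluster's rate as a membership-weighted mean; the stochastic variants and the hybrid-hierarchical method mentioned in the abstract are outside the scope of this sketch.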

  19. Optimizing exosomal RNA isolation for RNA-Seq analyses of archival sera specimens.

    PubMed

    Prendergast, Emily N; de Souza Fonseca, Marcos Abraão; Dezem, Felipe Segato; Lester, Jenny; Karlan, Beth Y; Noushmehr, Houtan; Lin, Xianzhi; Lawrenson, Kate

    2018-01-01

    Exosomes are endosome-derived membrane vesicles that contain proteins, lipids, and nucleic acids. The exosomal transcriptome mediates intercellular communication, and represents an understudied reservoir of novel biomarkers for human diseases. Next-generation sequencing enables complex quantitative characterization of exosomal RNAs from diverse sources. However, detailed protocols describing exosome purification for preparation of exosomal RNA-sequence (RNA-Seq) libraries are lacking. Here we compared methods for isolation of exosomes and extraction of exosomal RNA from human cell-free serum, as well as strategies for attaining equal representation of samples within pooled RNA-Seq libraries. We compared commercial precipitation with ultracentrifugation for exosome purification and confirmed the presence of exosomes via both transmission electron microscopy and immunoblotting. Exosomal RNA extraction was compared using four different RNA purification methods. We determined the minimal starting volume of serum required for exosome preparation and showed that high quality exosomal RNA can be isolated from sera stored for over a decade. Finally, RNA-Seq libraries were successfully prepared with exosomal RNAs extracted from human cell-free serum, cataloguing both coding and non-coding exosomal transcripts. This method provides researchers with strategic options to prepare RNA-Seq libraries and compare RNA-Seq data quantitatively from minimal volumes of fresh and archival human cell-free serum for disease biomarker discovery.

  20. Detection and quantitation of chromosomal mosaicism in human blastocysts using copy number variation sequencing.

    PubMed

    Ruttanajit, Tida; Chanchamroen, Sujin; Cram, David S; Sawakwongpra, Kritchakorn; Suksalak, Wanwisa; Leng, Xue; Fan, Junmei; Wang, Li; Yao, Yuanqing; Quangkananurug, Wiwat

    2016-02-01

    Currently, our understanding of the nature and reproductive potential of blastocysts associated with trophectoderm (TE) lineage chromosomal mosaicism is limited. The objective of this study was first to validate copy number variation sequencing (CNV-Seq) for measuring the level of mosaicism and second to examine the nature and level of mosaicism in TE biopsies of patients' blastocysts. TE biopsy samples were analysed by array comparative genomic hybridization (CGH) and CNV-Seq to discriminate between euploid, aneuploid and mosaic blastocysts. Using artificial models of TE mosaicism for five different chromosomes, CNV-Seq accurately and reproducibly quantitated mosaicism at levels of 50% and 20%. In a comparative 24-chromosome study of 49 blastocysts by array CGH and CNV-Seq, 43 blastocysts (87.8%) had a concordant diagnosis and 6 blastocysts (12.2%) were discordant. The discordance was attributed to low to medium levels of chromosomal mosaicism (30-70%) not detected by array CGH. In an expanded study of 399 blastocysts using CNV-Seq as the sole diagnostic method, the proportion of diploid-aneuploid mosaics (34, 8.5%) was significantly higher than aneuploid mosaics (18, 4.5%) (p < 0.02). Mosaicism is a significant chromosomal abnormality associated with the TE lineage of human blastocysts that can be reliably and accurately detected by CNV-Seq. © 2015 John Wiley & Sons, Ltd.
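
    The percentages quoted in this abstract follow directly from the reported counts, as the short Python check below shows. The helper name `percent` is an assumption introduced for illustration; the counts are the ones stated in the abstract.

```python
def percent(part: int, whole: int) -> float:
    """Return part/whole as a percentage rounded to one decimal place."""
    return round(100 * part / whole, 1)


# Comparative study of 49 blastocysts (array CGH vs CNV-Seq).
concordant = percent(43, 49)
discordant = percent(6, 49)

# Expanded study of 399 blastocysts (CNV-Seq only).
diploid_aneuploid_mosaics = percent(34, 399)
aneuploid_mosaics = percent(18, 399)

print(concordant, discordant, diploid_aneuploid_mosaics, aneuploid_mosaics)
# prints: 87.8 12.2 8.5 4.5
```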

  1. FREQ-Seq: A Rapid, Cost-Effective, Sequencing-Based Method to Determine Allele Frequencies Directly from Mixed Populations

    PubMed Central

    Delaney, Nigel F.; Marx, Christopher J.

    2012-01-01

    Understanding evolutionary dynamics within microbial populations requires the ability to accurately follow allele frequencies through time. Here we present a rapid, cost-effective method (FREQ-Seq) that leverages Illumina next-generation sequencing for localized, quantitative allele frequency detection. Analogous to RNA-Seq, FREQ-Seq relies upon counts from the >10^5 reads generated per locus per time-point to determine allele frequencies. Loci of interest are directly amplified from a mixed population via two rounds of PCR using inexpensive, user-designed oligonucleotides and a bar-coded bridging primer system that can be regenerated in-house. The resulting bar-coded PCR products contain the adapters needed for Illumina sequencing, eliminating further library preparation. We demonstrate the utility of FREQ-Seq by determining the order and dynamics of beneficial alleles that arose as a microbial population, founded with an engineered strain of Methylobacterium, evolved to grow on methanol. Quantifying allele frequencies with minimal bias down to 1% abundance allowed effective analysis of SNPs, small in-dels and insertions of transposable elements. Our data reveal large-scale clonal interference during the early stages of adaptation and illustrate the utility of FREQ-Seq as a cost-effective tool for tracking allele frequencies in populations. PMID:23118913
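
    The counting step described in this abstract, in which allele frequencies are estimated from read counts at a locus, can be sketched in a few lines of Python. This is only an illustration of the arithmetic and not the FREQ-Seq software; the function name `allele_frequencies` and the toy allele calls are assumptions, and barcode demultiplexing and read filtering are omitted.

```python
from collections import Counter
from typing import Dict, Iterable


def allele_frequencies(allele_calls: Iterable[str]) -> Dict[str, float]:
    """Estimate allele frequencies as the fraction of reads supporting each allele."""
    counts = Counter(allele_calls)
    total = sum(counts.values())
    if total == 0:
        raise ValueError("no reads supplied for this locus")
    return {allele: n / total for allele, n in counts.items()}


# Hypothetical locus at one time point: 990 reads with the ancestral allele
# and 10 reads with a derived allele (a 1% minority).
reads = ["ancestral"] * 990 + ["derived"] * 10
print(allele_frequencies(reads))
```

    With read depths on the order of 10^5 per locus per time point, as the abstract reports, a 1% allele is supported by roughly a thousand reads, which gives a sense of why frequencies that low can still be quantified.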

  2. RNA-Rocket: an RNA-Seq analysis resource for infectious disease research

    PubMed Central

    Warren, Andrew S.; Aurrecoechea, Cristina; Brunk, Brian; Desai, Prerak; Emrich, Scott; Giraldo-Calderón, Gloria I.; Harb, Omar; Hix, Deborah; Lawson, Daniel; Machi, Dustin; Mao, Chunhong; McClelland, Michael; Nordberg, Eric; Shukla, Maulik; Vosshall, Leslie B.; Wattam, Alice R.; Will, Rebecca; Yoo, Hyun Seung; Sobral, Bruno

    2015-01-01

    Motivation: RNA-Seq is a method for profiling transcription using high-throughput sequencing and is an important component of many research projects that wish to study transcript isoforms, condition specific expression and transcriptional structure. The methods, tools and technologies used to perform RNA-Seq analysis continue to change, creating a bioinformatics challenge for researchers who wish to exploit these data. Resources that bring together genomic data, analysis tools, educational material and computational infrastructure can minimize the overhead required of life science researchers. Results: RNA-Rocket is a free service that provides access to RNA-Seq and ChIP-Seq analysis tools for studying infectious diseases. The site makes available thousands of pre-indexed genomes, their annotations and the ability to stream results to the bioinformatics resources VectorBase, EuPathDB and PATRIC. The site also provides a combination of experimental data and metadata, examples of pre-computed analysis, step-by-step guides and a user interface designed to enable both novice and experienced users of RNA-Seq data. Availability and implementation: RNA-Rocket is available at rnaseq.pathogenportal.org. Source code for this project can be found at github.com/cidvbi/PathogenPortal. Contact: anwarren@vt.edu Supplementary information: Supplementary materials are available at Bioinformatics online. PMID:25573919

  3. RNA-Rocket: an RNA-Seq analysis resource for infectious disease research.

    PubMed

    Warren, Andrew S; Aurrecoechea, Cristina; Brunk, Brian; Desai, Prerak; Emrich, Scott; Giraldo-Calderón, Gloria I; Harb, Omar; Hix, Deborah; Lawson, Daniel; Machi, Dustin; Mao, Chunhong; McClelland, Michael; Nordberg, Eric; Shukla, Maulik; Vosshall, Leslie B; Wattam, Alice R; Will, Rebecca; Yoo, Hyun Seung; Sobral, Bruno

    2015-05-01

    RNA-Seq is a method for profiling transcription using high-throughput sequencing and is an important component of many research projects that wish to study transcript isoforms, condition specific expression and transcriptional structure. The methods, tools and technologies used to perform RNA-Seq analysis continue to change, creating a bioinformatics challenge for researchers who wish to exploit these data. Resources that bring together genomic data, analysis tools, educational material and computational infrastructure can minimize the overhead required of life science researchers. RNA-Rocket is a free service that provides access to RNA-Seq and ChIP-Seq analysis tools for studying infectious diseases. The site makes available thousands of pre-indexed genomes, their annotations and the ability to stream results to the bioinformatics resources VectorBase, EuPathDB and PATRIC. The site also provides a combination of experimental data and metadata, examples of pre-computed analysis, step-by-step guides and a user interface designed to enable both novice and experienced users of RNA-Seq data. RNA-Rocket is available at rnaseq.pathogenportal.org. Source code for this project can be found at github.com/cidvbi/PathogenPortal. anwarren@vt.edu Supplementary materials are available at Bioinformatics online. © The Author 2015. Published by Oxford University Press.

  4. Inference of chromosomal inversion dynamics from Pool-Seq data in natural and laboratory populations of Drosophila melanogaster.

    PubMed

    Kapun, Martin; van Schalkwyk, Hester; McAllister, Bryant; Flatt, Thomas; Schlötterer, Christian

    2014-04-01

    Sequencing of pools of individuals (Pool-Seq) represents a reliable and cost-effective approach for estimating genome-wide SNP and transposable element insertion frequencies. However, Pool-Seq does not provide direct information on haplotypes so that, for example, obtaining inversion frequencies has not been possible until now. Here, we have developed a new set of diagnostic marker SNPs for seven cosmopolitan inversions in Drosophila melanogaster that can be used to infer inversion frequencies from Pool-Seq data. We applied our novel marker set to Pool-Seq data from an experimental evolution study and from North American and Australian latitudinal clines. In the experimental evolution data, we find evidence that positive selection has driven the frequencies of In(3R)C and In(3R)Mo to increase over time. In the clinal data, we confirm the existence of frequency clines for In(2L)t, In(3L)P and In(3R)Payne in both North America and Australia and detect a previously unknown latitudinal cline for In(3R)Mo in North America. The inversion markers developed here provide a versatile and robust tool for characterizing inversion frequencies and their dynamics in Pool-Seq data from diverse D. melanogaster populations. © 2013 The Authors. Molecular Ecology Published by John Wiley & Sons Ltd.

  5. Inference of chromosomal inversion dynamics from Pool-Seq data in natural and laboratory populations of Drosophila melanogaster

    PubMed Central

    Kapun, Martin; van Schalkwyk, Hester; McAllister, Bryant; Flatt, Thomas; Schlötterer, Christian

    2014-01-01

    Sequencing of pools of individuals (Pool-Seq) represents a reliable and cost-effective approach for estimating genome-wide SNP and transposable element insertion frequencies. However, Pool-Seq does not provide direct information on haplotypes so that, for example, obtaining inversion frequencies has not been possible until now. Here, we have developed a new set of diagnostic marker SNPs for seven cosmopolitan inversions in Drosophila melanogaster that can be used to infer inversion frequencies from Pool-Seq data. We applied our novel marker set to Pool-Seq data from an experimental evolution study and from North American and Australian latitudinal clines. In the experimental evolution data, we find evidence that positive selection has driven the frequencies of In(3R)C and In(3R)Mo to increase over time. In the clinal data, we confirm the existence of frequency clines for In(2L)t, In(3L)P and In(3R)Payne in both North America and Australia and detect a previously unknown latitudinal cline for In(3R)Mo in North America. The inversion markers developed here provide a versatile and robust tool for characterizing inversion frequencies and their dynamics in Pool-Seq data from diverse D. melanogaster populations. PMID:24372777

  6. Multiple defects in muscle glycogen synthase activity contribute to reduced glycogen synthesis in non-insulin dependent diabetes mellitus.

    PubMed Central

    Thorburn, A W; Gumbiner, B; Bulacan, F; Brechtel, G; Henry, R R

    1991-01-01

    To define the mechanisms of impaired muscle glycogen synthase and reduced glycogen formation in non-insulin dependent diabetes mellitus (NIDDM), glycogen synthase activity was kinetically analyzed during the basal state and three glucose clamp studies (insulin approximately equal to 300, 700, and 33,400 pmol/liter) in eight matched nonobese NIDDM and eight control subjects. Muscle glycogen content was measured in the basal state and following clamps at insulin levels of 33,400 pmol/liter. NIDDM subjects had glucose uptake matched to controls in each clamp by raising serum glucose to 15-20 mmol/liter. The insulin concentration required to half-maximally activate glycogen synthase (ED50) was approximately fourfold greater for NIDDM than control subjects (1,004 +/- 264 vs. 257 +/- 110 pmol/liter, P less than 0.02) but the maximal insulin effect was similar. Total glycogen synthase activity was reduced approximately 38% and glycogen content was approximately 30% lower in NIDDM. A positive correlation was present between glycogen content and glycogen synthase activity (r = 0.51, P less than 0.01). In summary, defects in muscle glycogen synthase activity and reduced glycogen content are present in NIDDM. NIDDM subjects also have less total glycogen synthase activity consistent with reduced functional mass of the enzyme. These findings and the correlation between glycogen synthase activity and glycogen content support the theory that multiple defects in glycogen synthase activity combine to cause reduced glycogen formation in NIDDM. PMID:1899428

  7. Glycogen synthase activation by sugars in isolated hepatocytes.

    PubMed

    Ciudad, C J; Carabaza, A; Bosch, F; Gòmez I Foix, A M; Guinovart, J J

    1988-07-01

    We have investigated the activation by sugars of glycogen synthase in relation to (i) phosphorylase a activity and (ii) changes in the intracellular concentration of glucose 6-phosphate and adenine nucleotides. All the sugars tested in this work present the common denominator of activating glycogen synthase. On the other hand, phosphorylase a activity is decreased by mannose and glucose, unchanged by galactose and xylitol, and increased by tagatose, glyceraldehyde, and fructose. Dihydroxyacetone exerts a biphasic effect on phosphorylase. These findings provide additional evidence proving that glycogen synthase can be activated regardless of the levels of phosphorylase a, clearly establishing that a nonsequential mechanism for the activation of glycogen synthase occurs in liver cells. The glycogen synthase activation state is related to the concentrations of glucose 6-phosphate and adenine nucleotides. In this respect, tagatose, glyceraldehyde, and fructose deplete ATP and increase AMP contents, whereas glucose, mannose, galactose, xylitol, and dihydroxyacetone do not alter the concentration of these nucleotides. In addition, all these sugars, except glyceraldehyde, increase the intracellular content of glucose 6-phosphate. The activation of glycogen synthase by sugars is reflected in decreases in both kinetic constants of the enzyme, M0.5 (for glucose 6-phosphate) and S0.5 (for UDP-glucose). We propose that hepatocyte glycogen synthase is activated by monosaccharides by a mechanism triggered by changes in glucose 6-phosphate and adenine nucleotide concentrations which have been described to modify glycogen synthase phosphatase activity. This mechanism represents a metabolite control of the sugar-induced activation of hepatocyte glycogen synthase.

  8. In Planta Recapitulation of Isoprene Synthase Evolution from Ocimene Synthases.

    PubMed

    Li, Mingai; Xu, Jia; Algarra Alarcon, Alberto; Carlin, Silvia; Barbaro, Enrico; Cappellin, Luca; Velikova, Violeta; Vrhovsek, Urska; Loreto, Francesco; Varotto, Claudio

    2017-10-01

    Isoprene is the most abundant biogenic volatile hydrocarbon compound naturally emitted by plants and plays a major role in atmospheric chemistry. It has been proposed that isoprene synthases (IspS) may readily evolve from other terpene synthases, but this hypothesis has not been experimentally investigated. We isolated and functionally validated in Arabidopsis the first isoprene synthase gene, AdoIspS, from a monocotyledonous species (Arundo donax L., Poaceae). Phylogenetic reconstruction indicates that AdoIspS and dicot isoprene synthases most likely originated by parallel evolution from TPS-b monoterpene synthases. Site-directed mutagenesis demonstrated in vivo the functional and evolutionary relevance of the residues considered diagnostic for IspS function. One of these positions was identified by saturating mutagenesis as a major determinant of substrate specificity in AdoIspS, able to cause in vivo a dramatic change in total volatile emission from hemi- to monoterpenes and supporting evolution of isoprene synthases from ocimene synthases. The mechanism responsible for IspS neofunctionalization by active site size modulation by a single amino acid mutation demonstrated in this study might be general, as the very same amino acidic position is implicated in the parallel evolution of different short-chain terpene synthases from both angiosperms and gymnosperms. Based on these results, we present a model reconciling in a unified conceptual framework the apparently contrasting patterns previously observed for isoprene synthase evolution in plants. These results indicate that parallel evolution may be driven by relatively simple biophysical constraints, and illustrate the intimate molecular evolutionary links between the structural and functional bases of traits with global relevance. © The Author 2017. Published by Oxford University Press on behalf of the Society for Molecular Biology and Evolution.

  9. Intellectual disability is associated with increased risk for obesity in a nationally representative sample of U.S. children.

    PubMed

    Segal, Mary; Eliasziw, Misha; Phillips, Sarah; Bandini, Linda; Curtin, Carol; Kral, Tanja V E; Sherwood, Nancy E; Sikich, Lin; Stanish, Heidi; Must, Aviva

    2016-07-01

    Data on obesity prevalence in children with intellectual disability (ID) are scarce. We estimated rates of obesity among children aged 10-17 years with and without ID in a nationally representative dataset that included measures of child weight and ID status, as well as family meal frequency, physical activity, and sedentary behavior. Chi-square tests compared prevalence of obesity, demographic and behavioral characteristics between children with and without ID as reported in the 2011 National Survey of Children's Health. Tests for interaction in logistic regression models determined whether associations between obesity and behavioral characteristics were different between children with/without ID. Obesity prevalence was 28.9% for children with ID and 15.5% for children without ID. After adjusting for age, sex, race/ethnicity and poverty level, the odds of obesity were significantly greater among children with ID than among those without ID (odds ratio 1.89; 95% CI: 1.14 to 3.12). Among children with ID, 49.8% ate at least one meal with family members every day compared to 35.0% without ID (p < 0.002), and 49.5% with ID participated in frequent physical activity compared to 62.9% (p < 0.005). Prevalence of obesity was higher among all children who ate family meals every day compared to fewer days per week, and the effect was significantly more pronounced among those with ID (p = 0.05). Prevalence of obesity among youth with ID was almost double that of the general population. Prospective studies are needed in this population to examine the impact of consistent family mealtimes and infrequent physical activity. Copyright © 2015 Elsevier Inc. All rights reserved.

  10. Polyester synthases: natural catalysts for plastics.

    PubMed Central

    Rehm, Bernd H A

    2003-01-01

    Polyhydroxyalkanoates (PHAs) are biopolyesters composed of hydroxy fatty acids, which represent a complex class of storage polyesters. They are synthesized by a wide range of different Gram-positive and Gram-negative bacteria, as well as by some Archaea, and are deposited as insoluble cytoplasmic inclusions. Polyester synthases are the key enzymes of polyester biosynthesis and catalyse the conversion of (R)-hydroxyacyl-CoA thioesters to polyesters with the concomitant release of CoA. These soluble enzymes turn into amphipathic enzymes upon covalent catalysis of polyester-chain formation. A self-assembly process is initiated resulting in the formation of insoluble cytoplasmic inclusions with a phospholipid monolayer and covalently attached polyester synthases at the surface. Surface-attached polyester synthases show a marked increase in enzyme activity. These polyester synthases have only recently been biochemically characterized. An overview of these recent findings is provided. At present, 59 polyester synthase structural genes from 45 different bacteria have been cloned and the nucleotide sequences have been obtained. The multiple alignment of the primary structures of these polyester synthases shows an overall identity of 8-96% with only eight strictly conserved amino acid residues. Polyester synthases can be assigned to four classes based on their substrate specificity and subunit composition. The current knowledge on the organization of the polyester synthase genes, and other genes encoding proteins related to PHA metabolism, is compiled. In addition, the primary structures of the 59 PHA synthases are aligned and analysed with respect to highly conserved amino acids, and biochemical features of polyester synthases are described. The proposed catalytic mechanism based on similarities to alpha/beta-hydrolases and mutational analysis is discussed. Different threading algorithms suggest that polyester synthases belong to the alpha/beta-hydrolase superfamily, with a conserved cysteine residue as catalytic nucleophile. This review provides a survey of the known biochemical features of these unique enzymes and their proposed catalytic mechanism. PMID:12954080

  11. JingleBells: A Repository of Immune-Related Single-Cell RNA-Sequencing Datasets.

    PubMed

    Ner-Gaon, Hadas; Melchior, Ariel; Golan, Nili; Ben-Haim, Yael; Shay, Tal

    2017-05-01

    Recent advances in single-cell RNA-sequencing (scRNA-seq) technology increase the understanding of immune differentiation and activation processes, as well as the heterogeneity of immune cell types. Although the number of available immune-related scRNA-seq datasets increases rapidly, their large size and various formats render them hard for the wider immunology community to use, and read-level data are practically inaccessible to the non-computational immunologist. To facilitate dataset reuse, we created the JingleBells repository for immune-related scRNA-seq datasets ready for analysis and visualization of reads at the single-cell level (http://jinglebells.bgu.ac.il/). To this end, we collected the raw data of publicly available immune-related scRNA-seq datasets, aligned the reads to the relevant genome, and saved aligned reads in a uniform format, annotated for cell of origin. We also added scripts and a step-by-step tutorial for visualizing each dataset at the single-cell level, through the commonly used Integrative Genomics Viewer (www.broadinstitute.org/igv/). The uniform scRNA-seq format used in JingleBells can facilitate reuse of scRNA-seq data by computational biologists. It also enables immunologists who are interested in a specific gene to visualize the reads aligned to this gene to estimate cell-specific preferences for splicing, mutation load, or alleles. Thus JingleBells is a resource that will extend the usefulness of scRNA-seq datasets outside the programming aficionado realm. Copyright © 2017 by The American Association of Immunologists, Inc.

  12. "Sequential" boron neutron capture therapy (BNCT): a novel approach to BNCT for the treatment of oral cancer in the hamster cheek pouch model.

    PubMed

    Molinari, Ana J; Pozzi, Emiliano C C; Monti Hughes, Andrea; Heber, Elisa M; Garabalino, Marcela A; Thorp, Silvia I; Miller, Marcelo; Itoiz, Maria E; Aromando, Romina F; Nigg, David W; Quintana, Jorge; Santa Cruz, Gustavo A; Trivillin, Verónica A; Schwint, Amanda E

    2011-04-01

    In the present study the therapeutic effect and potential toxicity of the novel "Sequential" boron neutron capture therapy (Seq-BNCT) for the treatment of oral cancer was evaluated in the hamster cheek pouch model at the RA-3 Nuclear Reactor. Two groups of animals were treated with "Sequential" BNCT, i.e., BNCT mediated by boronophenylalanine (BPA) followed by BNCT mediated by sodium decahydrodecaborate (GB-10) either 24 h (Seq-24h-BNCT) or 48 h (Seq-48h-BNCT) later. In an additional group of animals, BPA and GB-10 were administered concomitantly [(BPA + GB-10)-BNCT]. The single-application BNCT was to the same total physical tumor dose as the "Sequential" BNCT treatments. At 28 days post-treatment, Seq-24h-BNCT and Seq-48h-BNCT induced, respectively, overall tumor responses of 95 ± 2% and 91 ± 3%, with no statistically significant differences between protocols. Overall response for the single treatment with (BPA + GB-10)-BNCT was 75 ± 5%, significantly lower than for Seq-BNCT. Both Seq-BNCT protocols and (BPA + GB-10)-BNCT induced reversible mucositis in the dose-limiting precancerous tissue around treated tumors, reaching Grade 3/4 mucositis in 47 ± 12% and 60 ± 22% of the animals, respectively. No normal tissue toxicity was associated with tumor response for any of the protocols. "Sequential" BNCT enhanced tumor response without an increase in mucositis in dose-limiting precancerous tissue. © 2011 by Radiation Research Society

  13. Design of RNA splicing analysis null models for post hoc filtering of Drosophila head RNA-Seq data with the splicing analysis kit (Spanki)

    PubMed Central

    2013-01-01

    Background: The production of multiple transcript isoforms from one gene is a major source of transcriptome complexity. RNA-Seq experiments, in which transcripts are converted to cDNA and sequenced, allow the resolution and quantification of alternative transcript isoforms. However, methods to analyze splicing are underdeveloped and errors resulting in incorrect splicing calls occur in every experiment. Results: We used RNA-Seq data to develop sequencing and aligner error models. By applying these error models to known input from simulations, we found that errors result from false alignment to minor splice motifs and antisense strands, shifted junction positions, paralog joining, and repeat induced gaps. By using a series of quantitative and qualitative filters, we eliminated diagnosed errors in the simulation, and applied this to RNA-Seq data from Drosophila melanogaster heads. We used high-confidence junction detections to specifically interrogate local splicing differences between transcripts. This method out-performed commonly used RNA-seq methods to identify known alternative splicing events in the Drosophila sex determination pathway. We describe a flexible software package to perform these tasks called Splicing Analysis Kit (Spanki), available at http://www.cbcb.umd.edu/software/spanki. Conclusions: Splice-junction centric analysis of RNA-Seq data provides advantages in specificity for detection of alternative splicing. Our software provides tools to better understand error profiles in RNA-Seq data and improve inference from this new technology. The splice-junction centric approach that this software enables will provide more accurate estimates of differentially regulated splicing than current tools. PMID:24209455

  14. Traditional karyotyping vs copy number variation sequencing for detection of chromosomal abnormalities associated with spontaneous miscarriage.

    PubMed

    Liu, S; Song, L; Cram, D S; Xiong, L; Wang, K; Wu, R; Liu, J; Deng, K; Jia, B; Zhong, M; Yang, F

    2015-10-01

    To compare the performance of traditional G-banding karyotyping with that of copy number variation sequencing (CNV-Seq) for detection of chromosomal abnormalities associated with miscarriage. Products of conception (POC) were collected from spontaneous miscarriages. Chromosomal abnormalities were detected using high-resolution G-banding karyotyping and CNV sequencing. Quantitative fluorescent polymerase chain reaction analysis of maternal and POC DNA for short tandem repeat (STR) markers was used to both monitor maternal cell contamination and confirm the chromosomal status and sex of the miscarriage tissue. A total of 64 samples of POC, comprising 16 with an abnormal and 48 with a normal karyotype, were selected and coded for analysis by CNV-Seq. CNV-Seq results were concordant for 14 (87.5%) of the 16 gross chromosomal abnormalities identified by karyotyping, including 11 autosomal trisomies and three sex chromosomal aneuploidies (45,X). Of the two discordant results, a 69,XXX polyploidy was missed by CNV-Seq, although supporting STR marker analysis confirmed the triploidy. In contrast, CNV-Seq identified a sample with 45,X karyotype as a 45,X/46,XY mosaic. In the remaining 48 samples of POC with a normal karyotype, CNV-Seq detected a 2.58-Mb 22q deletion associated with DiGeorge syndrome and nine different smaller CNVs of no apparent clinical significance. CNV-Seq used in parallel with STR profiling is a reliable and accurate alternative to karyotyping for identifying chromosome copy number abnormalities associated with spontaneous miscarriage. Copyright © 2015 ISUOG. Published by John Wiley & Sons Ltd.

  15. Sensitivity of the ViroSeq HIV-1 Genotyping System for Detection of the K103N Resistance Mutation in HIV-1 Subtypes A, C, and D

    PubMed Central

    Church, Jessica D.; Jones, Dana; Flys, Tamara; Hoover, Donald; Marlowe, Natalia; Chen, Shu; Shi, Chanjuan; Eshleman, James R.; Guay, Laura A.; Jackson, J. Brooks; Kumwenda, Newton; Taha, Taha E.; Eshleman, Susan H.

    2006-01-01

    The US Food and Drug Administration-cleared ViroSeq HIV-1 Genotyping System (ViroSeq) and other population sequencing-based human immunodeficiency virus type 1 (HIV-1) genotyping methods detect antiretroviral drug resistance mutations present in the major viral population of a test sample. These assays also detect some mutations in viral variants that are present as mixtures. We compared detection of the K103N nevirapine resistance mutation using ViroSeq and a sensitive, quantitative point mutation assay, LigAmp. The LigAmp assay measured the percentage of K103N-containing variants in the viral population (percentage of K103N). We analyzed 305 samples with HIV-1 subtypes A, C, and D collected from African women after nevirapine administration. ViroSeq detected K103N in 100% of samples with >20% K103N, 77.8% of samples with 10 to 20% K103N, 71.4% of samples with 5 to 10% K103N, and 16.9% of samples with 1 to 5% K103N. The sensitivity of ViroSeq for detection of K103N was similar for subtypes A, C, and D. These data indicate that the ViroSeq system reliably detects the K103N mutation at levels above 20% and frequently detects the mutation at lower levels. Further studies are needed to compare the sensitivity of different assays for detection of HIV-1 drug resistance mutations and to determine the clinical relevance of HIV-1 minority variants. PMID:16931582

  16. Design of RNA splicing analysis null models for post hoc filtering of Drosophila head RNA-Seq data with the splicing analysis kit (Spanki).

    PubMed

    Sturgill, David; Malone, John H; Sun, Xia; Smith, Harold E; Rabinow, Leonard; Samson, Marie-Laure; Oliver, Brian

    2013-11-09

    The production of multiple transcript isoforms from one gene is a major source of transcriptome complexity. RNA-Seq experiments, in which transcripts are converted to cDNA and sequenced, allow the resolution and quantification of alternative transcript isoforms. However, methods to analyze splicing are underdeveloped and errors resulting in incorrect splicing calls occur in every experiment. We used RNA-Seq data to develop sequencing and aligner error models. By applying these error models to known input from simulations, we found that errors result from false alignment to minor splice motifs and antisense strands, shifted junction positions, paralog joining, and repeat induced gaps. By using a series of quantitative and qualitative filters, we eliminated diagnosed errors in the simulation, and applied this to RNA-Seq data from Drosophila melanogaster heads. We used high-confidence junction detections to specifically interrogate local splicing differences between transcripts. This method out-performed commonly used RNA-seq methods to identify known alternative splicing events in the Drosophila sex determination pathway. We describe a flexible software package to perform these tasks called Splicing Analysis Kit (Spanki), available at http://www.cbcb.umd.edu/software/spanki. Splice-junction centric analysis of RNA-Seq data provides advantages in specificity for detection of alternative splicing. Our software provides tools to better understand error profiles in RNA-Seq data and improve inference from this new technology. The splice-junction centric approach that this software enables will provide more accurate estimates of differentially regulated splicing than current tools.

  17. What's New | Galaxy of Images

    Science.gov Websites


  18. DOE Office of Scientific and Technical Information (OSTI.GOV)

    Zhang, Long; Shi, Songting; Zhang, Juan

    Highlights: ► Expression of Id3 but not Id1 is induced by Wnt3a stimulation in C2C12 cells. ► Wnt3a induces Id3 expression via canonical Wnt/β-catenin pathway. ► Wnt3a-induced Id3 expression does not depend on BMP signaling activation. ► Induction of Id3 expression is critical determinant in Wnt3a-induced cell proliferation and differentiation. -- Abstract: Canonical Wnt signaling plays important roles in regulating cell proliferation and differentiation. In this study, we report that inhibitor of differentiation (Id)3 is a Wnt-inducible gene in mouse C2C12 myoblasts. Wnt3a induced Id3 expression in a β-catenin-dependent manner. Bone morphogenetic protein (BMP) also potently induced Id3 expression. However, Wnt-induced Id3 expression occurred independent of the BMP/Smad pathway. Functional studies showed that Id3 depletion in C2C12 cells impaired Wnt3a-induced cell proliferation and alkaline phosphatase activity, an early marker of osteoblast cells. Id3 depletion elevated myogenin induction during myogenic differentiation and partially impaired Wnt3a suppressed myogenin expression in C2C12 cells. These results suggest that Id3 is an important Wnt/β-catenin induced gene in myoblast cell fate determination.

  19. ACE: an efficient and sensitive tool to detect insecticide resistance-associated mutations in insect acetylcholinesterase from RNA-Seq data.

    PubMed

    Guo, Dianhao; Luo, Jiapeng; Zhou, Yuenan; Xiao, Huamei; He, Kang; Yin, Chuanlin; Xu, Jianhua; Li, Fei

    2017-07-10

    Insecticide resistance is a substantial problem in controlling agricultural and medical pests. Detecting target site mutations is crucial to manage insecticide resistance. Though PCR-based methods have been widely used in this field, they are time-consuming and inefficient, and typically have a high false positive rate. Acetylcholinesterase (Ace) is the neural target of the widely used organophosphate (OP) and carbamate insecticides. However, no software is currently available to detect insecticide resistance-associated mutations in RNA-Seq data. A computational pipeline, ACE, was developed to detect resistance mutations of ace in insect RNA-Seq data. Known ace resistance mutations were collected and used as a reference. We constructed a Web server for ACE, and the standalone software in both Linux and Windows versions is available for download. ACE was used to analyse 971 RNA-Seq datasets from 136 studies in 7 insect pests. The mutation frequency of each RNA-Seq dataset was calculated. The results indicated that the resistance frequency was 30%-44% in an eastern Ugandan Anopheles population, thus suggesting this resistance-conferring mutation has reached high frequency in these mosquitoes in Uganda. Analyses of RNA-Seq data from the diamondback moth Plutella xylostella indicated that the G227A mutation was positively related with resistance levels to organophosphate or carbamate insecticides. The wasp Nasonia vitripennis had a low frequency of resistant reads (<5%), but the agricultural pests Chilo suppressalis and Bemisia tabaci had a high resistance frequency. All ace reads in the 30 B. tabaci RNA-Seq datasets were resistant reads, suggesting that insecticide resistance has spread to very high frequency in B. tabaci. To the best of our knowledge, the ACE pipeline is the first tool to detect resistance mutations from RNA-Seq data, and it facilitates the full utilization of large-scale genetic data obtained by using next-generation sequencing.

  20. Broadband 0.25-um Gallium Nitride (GaN) Power Amplifier Designs

    DTIC Science & Technology

    2017-08-14

    CP pF RES ID=R1 R=RP Ohm PORT P=1 Z=50 Ohm RP=87.5ohm/mm... CP =-0.31pF/mm For 1.75mm, RP=50ohms, CP =0.54pf CP = 0.31 * size size=1.75 RP = 87.5 / size CAP ID=C1 C=CP1 pF RES ID=R1 R=RP Ohm IND ID=L1 L=LP1 nH CAP...ID=C2 C=Cser2 pF IND ID=L2 L=Lser2 nH IND ID=L3 L=LP1 nH CAP ID=C3 C=CP1 pF PORT P=1 Z=50 Ohm PORT P=2 Z=50 Ohm size=1.75 RP = 87.5 / size CP =
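
    The fragment above appears to scale two parasitic elements with device size: RP = 87.5 / size and CP = 0.31 * size, with size in mm. The short sketch below only reproduces that arithmetic for the quoted 1.75 mm case; the function and parameter names are hypothetical, and the positive 0.31 pF/mm coefficient is taken from the "CP = 0.31 * size" expression, since the sign in the fragment is ambiguous.

```python
def scaled_parasitics(size_mm, rp_ohm_mm=87.5, cp_pf_per_mm=0.31):
    """Return (RP in ohm, CP in pF) for a device of the given size in mm.

    RP scales inversely with size and CP scales linearly with size,
    following the two expressions quoted in the record.
    """
    rp_ohm = rp_ohm_mm / size_mm
    cp_pf = cp_pf_per_mm * size_mm
    return rp_ohm, cp_pf


rp, cp = scaled_parasitics(1.75)
print(f"RP = {rp:.1f} ohm, CP = {cp:.2f} pF")
```

    For size = 1.75 this gives RP = 50 ohm and CP = 0.5425 pF, which matches the "RP=50ohms, CP =0.54pf" values in the fragment after rounding.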

  1. Infundibular dilations of the posterior communicating arteries: pathogenesis, anatomical variants, aneurysm formation, and subarachnoid hemorrhage.

    PubMed

    Chen, Ching-Jen; Moosa, Shayan; Ding, Dale; Raper, Daniel M; Burke, Rebecca M; Lee, Cheng-Chia; Chivukula, Srinivas; Wang, Tony R; Starke, Robert M; Crowley, R Webster; Liu, Kenneth C

    2016-08-01

    Cerebrovascular infundibular dilations (IDs) are triangular-shaped widenings less than 3 mm in diameter, which are most commonly found at the posterior communicating artery (PCoA). The aims of this systematic review are to elucidate the natural histories of IDs, determine their risk of progression to significant pathology, and discuss potential management options. A comprehensive literature search of PubMed was used to find all case reports and series relating to cerebral IDs. IDs were classified into three types: type I IDs do not exhibit morphological change over a long follow-up period, type II IDs evolve into saccular aneurysms, while type III IDs are those that result in subarachnoid hemorrhage without prior aneurysmal progression. Data were extracted from studies that demonstrated type II or III IDs. We reviewed 16 cases of type II and seven cases of type III IDs. For type II IDs, 81.3% of patients were female with a median age at diagnosis of 38. All type II IDs were located at the PCoA without a clear predilection for sidedness. Median time to aneurysm progression was 7.5 years. For type III IDs there was no clear gender preponderance and the median age at diagnosis was 51. The PCoA was involved in 85.7% of cases, with 57.1% of IDs occurring on the left. Most patients were treated with clipping. Risk factors for aneurysm formation appear to be female gender, young age, left-sided localization, coexisting aneurysms, and hypertension. IDs can rarely progress to aneurysms or rupture. Young patients with type II or III IDs with coexisting aneurysms or hypertension may benefit from long-term imaging surveillance. Published by the BMJ Publishing Group Limited.

  2. Sucrose synthase in wild tomato Lycopersicon chmielewskii and tomato fruit sink strength

    Treesearch

    Shi-Jean S. Sung; T. Loboda; S.S. Sung; C.C. Black

    1992-01-01

    Here it is reported that sucrose synthase can be readily measured in growing wild tomato fruits (Lycopersicon chmielewskii) when suitable methods are adopted during fruit extraction. The enzyme also was present in fruit pericarp tissues, in seeds, and in flowers. In mature, nongrowing fruits, sucrose synthase activities approached nil values. Therefore, sucrose synthase...

  3. Exome Sequencing Identifies Three Novel Candidate Genes Implicated in Intellectual Disability

    PubMed Central

    Azam, Maleeha; Ayub, Humaira; Vissers, Lisenka E. L. M.; Gilissen, Christian; Ali, Syeda Hafiza Benish; Riaz, Moeen; Veltman, Joris A.; Pfundt, Rolph; van Bokhoven, Hans; Qamar, Raheel

    2014-01-01

    Intellectual disability (ID) is a major health problem mostly with an unknown etiology. Recently exome sequencing of individuals with ID identified novel genes implicated in the disease. Therefore the purpose of the present study was to identify the genetic cause of ID in one syndromic and two non-syndromic Pakistani families. The whole exome of three ID probands was sequenced. Missense variations were identified in two plausible novel genes implicated in autosomal recessive ID, lysine (K)-specific methyltransferase 2B (KMT2B) and zinc finger protein 589 (ZNF589), as well as in hedgehog acyltransferase (HHAT), which carried a de novo mutation with an autosomal dominant mode of inheritance. The KMT2B recessive variant is the first report of a recessive Kleefstra syndrome-like phenotype. Identification of plausible causative mutations for two recessive and a dominant type of ID, in genes not previously implicated in disease, underscores the large genetic heterogeneity of ID. These results also support the viewpoint that a large number of ID genes converge on a limited number of common networks: ZNF589 belongs to the KRAB-domain zinc-finger proteins previously implicated in ID; HHAT is predicted to affect sonic hedgehog, which is involved in several disorders with ID; and KMT2B, associated with syndromic ID, fits the epigenetic module underlying the Kleefstra syndromic spectrum. The association of these novel genes in three different Pakistani ID families highlights the importance of screening these genes in more families with similar phenotypes from different populations to confirm the involvement of these genes in pathogenesis of ID. PMID:25405613

  4. Regulation of Id2 expression in EL4 T lymphoma cells overexpressing growth hormone.

    PubMed

    Weigent, Douglas A

    2009-01-01

    In previous studies, we have shown that overexpression of growth hormone (GH) in cells of the immune system upregulates proteins involved in cell growth and protects from apoptosis. Here, we report that overexpression of GH in EL4 T lymphoma cells (GHo) also significantly increased levels of the inhibitor of differentiation-2 (Id2). The increase in Id2 was suggested in both Id2 promoter luciferase assays and by Western analysis for Id2 protein. To identify the regulatory elements that mediate transcriptional activation by GH in the Id2 promoter, promoter deletion analysis was performed. Deletion analysis revealed that transactivation involved a 301-132bp region upstream to the Id2 transcriptional start site. The pattern in the human GHo Jurkat T lymphoma cell line paralleled that found in the mouse GHo EL4 T lymphoma cell line. Significantly less Id2 was detected in the nucleus of GHo EL4 T lymphoma cells compared to vector alone controls. Although serum increased the levels of Id2 in control vector alone cells, no difference was found in the total levels of Id2 in GHo EL4 T lymphoma cells treated with or without serum. The increase in Id2 expression in GHo EL4 T lymphoma cells measured by Id2 promoter luciferase expression and Western blot analysis was blocked by the overexpression of a dominant-negative mutant of STAT5. The results suggest that in EL4 T lymphoma cells overexpressing GH, there is an upregulation of Id2 protein that appears to involve STAT protein activity.

  5. Id1, Id2 and Id3 are induced in rat melanotrophs of the pituitary gland by dopamine suppression under continuous stress.

    PubMed

    Konishi, H; Ogawa, T; Nakagomi, S; Inoue, K; Tohyama, M; Kiyama, H

    2010-09-15

    In rats under continuous stress (CS) there is decreased hypothalamic dopaminergic innervation to the intermediate lobe (IL) of the pituitary gland, which causes hyperactivation and subsequent degeneration of melanotrophs in the IL. In this study, we investigated the molecular basis for the changes that occur in melanotrophs during CS. Using microarray analysis, we identified several genes differentially expressed in the IL under CS conditions. Among the genes up-regulated under CS conditions, we focused on the inhibitor of DNA binding/differentiation (Id) family of dominant negative basic helix-loop-helix (bHLH) transcription factors. RT-PCR, Western blotting and in situ hybridization confirmed the significant inductions of Id1, Id2 and Id3 in the IL of CS rats. Administration of the dopamine D2 receptor agonist bromocriptine prevented the inductions of Id1-3 in the IL of CS rats, whereas application of the dopamine D2 antagonist sulpiride induced significant expressions of Id1-3 in the IL of normal rats. Moreover, an in vitro study using primary cultured melanotrophs demonstrated a direct effect on Id1-3 inductions by dopamine suppression. These results suggest that the decreased dopamine levels in the IL during CS induce Id1-3 expressions in melanotrophs. Because Id family members inhibit various bHLH transcription factors, it is conceivable that the induced Id1-3 would cooperatively modulate gene expressions in melanotrophs under CS conditions to induce hormone secretion. (c) 2010 IBRO. Published by Elsevier Ltd. All rights reserved.

  6. Maleimide conjugation markedly enhances the immunogenicity of both human and murine idiotype-KLH vaccines

    PubMed Central

    Kafi, Kamran; Betting, David J.; Yamada, Reiko E.; Bacica, Michael; Steward, Kristopher K.; Timmerman, John M.

    2009-01-01

    The collection of epitopes present within the variable regions of the tumor-specific clonal immunoglobulin expressed by B cell lymphomas (idiotype, Id) can serve as a target for active immunotherapy. Traditionally, tumor-derived Id protein is chemically-conjugated to the immunogenic foreign carrier protein keyhole limpet hemocyanin (KLH) using glutaraldehyde to serve as a therapeutic vaccine. While this approach offered promising results for some patients treated in early clinical trials, glutaraldehyde Id-KLH vaccines have failed to induce immune and clinical responses in many vaccinated subjects. We recently described an alternative conjugation method employing maleimide-sulfhydryl chemistry that significantly increased the therapeutic efficacy of Id-KLH vaccines in three different murine B cell lymphoma models, with protection mediated by either CD8+ T cells or antibodies. We now define in detail the methods and parameters critical for enhancing the in vivo immunogenicity of human as well as murine Id-KLH conjugate vaccines. Optimal conditions for Id sulfhydryl pre-reduction were determined, and maleimide Id-KLH conjugates maintained stability and potency even after prolonged storage. Field flow fractionation analysis of Id-KLH particle size revealed that maleimide conjugates were far more uniform in size than glutaraldehyde conjugates. Under increasingly stringent conditions, maleimide Id-KLH vaccines maintained superior efficacy over glutaraldehyde Id-KLH in treating established, disseminated murine lymphoma. More importantly, human maleimide Id-KLH conjugates were consistently superior to glutaraldehyde Id-KLH conjugates in inducing Id-specific antibody and T cell responses. The described methods should be easily adaptable to the production of clinical grade vaccines for human trials in B cell malignancies. PMID:19046770

  7. Discontinuity in the genetic and environmental causes of the intellectual disability spectrum.

    PubMed

    Reichenberg, Abraham; Cederlöf, Martin; McMillan, Andrew; Trzaskowski, Maciej; Kapra, Ori; Fruchter, Eyal; Ginat, Karen; Davidson, Michael; Weiser, Mark; Larsson, Henrik; Plomin, Robert; Lichtenstein, Paul

    2016-01-26

    Intellectual disability (ID) occurs in almost 3% of newborns. Despite substantial research, a fundamental question about its origin and links to intelligence (IQ) still remains. ID has been shown to be inherited and has been accepted as the extreme low of the normal IQ distribution. However, ID displays a complex pattern of inheritance. Previously, noninherited rare mutations were shown to contribute to severe ID risk in individual families, but in the majority of cases causes remain unknown. Common variants associated with ID risk in the population have not been systematically established. Here we evaluate the hypothesis, originally proposed almost 1 century ago, that most ID is caused by the same genetic and environmental influences responsible for the normal distribution of IQ, but that severe ID is not. We studied more than 1,000,000 sibling pairs and 9,000 twin pairs assessed for IQ and for the presence of ID. We evaluated whether genetic and environmental influences at the extremes of the distribution are different from those operating in the normal range. Here we show that factors influencing mild ID (lowest 3% of IQ distribution) were similar to those influencing IQ in the normal range. In contrast, the factors influencing severe ID (lowest 0.5% of IQ distribution) differ from those influencing mild ID or IQ scores in the normal range. Taken together, our results suggest that most severe ID is a distinct condition, qualitatively different from the preponderance of ID, which, in turn, represents the low extreme of the normal distribution of intelligence.

  8. Discontinuity in the genetic and environmental causes of the intellectual disability spectrum

    PubMed Central

    Reichenberg, Abraham; Cederlöf, Martin; McMillan, Andrew; Trzaskowski, Maciej; Kapra, Ori; Fruchter, Eyal; Ginat, Karen; Davidson, Michael; Weiser, Mark; Larsson, Henrik; Plomin, Robert; Lichtenstein, Paul

    2016-01-01

    Intellectual disability (ID) occurs in almost 3% of newborns. Despite substantial research, a fundamental question about its origin and links to intelligence (IQ) still remains. ID has been shown to be inherited and has been accepted as the extreme low of the normal IQ distribution. However, ID displays a complex pattern of inheritance. Previously, noninherited rare mutations were shown to contribute to severe ID risk in individual families, but in the majority of cases causes remain unknown. Common variants associated with ID risk in the population have not been systematically established. Here we evaluate the hypothesis, originally proposed almost 1 century ago, that most ID is caused by the same genetic and environmental influences responsible for the normal distribution of IQ, but that severe ID is not. We studied more than 1,000,000 sibling pairs and 9,000 twin pairs assessed for IQ and for the presence of ID. We evaluated whether genetic and environmental influences at the extremes of the distribution are different from those operating in the normal range. Here we show that factors influencing mild ID (lowest 3% of IQ distribution) were similar to those influencing IQ in the normal range. In contrast, the factors influencing severe ID (lowest 0.5% of IQ distribution) differ from those influencing mild ID or IQ scores in the normal range. Taken together, our results suggest that most severe ID is a distinct condition, qualitatively different from the preponderance of ID, which, in turn, represents the low extreme of the normal distribution of intelligence. PMID:26711998

  9. Brain-targeted stem cell gene therapy corrects mucopolysaccharidosis type II via multiple mechanisms.

    PubMed

    Gleitz, Hélène Fe; Liao, Ai Yin; Cook, James R; Rowlston, Samuel F; Forte, Gabriella Ma; D'Souza, Zelpha; O'Leary, Claire; Holley, Rebecca J; Bigger, Brian W

    2018-06-08

    The pediatric lysosomal storage disorder mucopolysaccharidosis type II is caused by mutations in IDS, resulting in accumulation of heparan and dermatan sulfate, causing severe neurodegeneration, skeletal disease, and cardiorespiratory disease. Most patients manifest with cognitive symptoms, which cannot be treated with enzyme replacement therapy, as native IDS does not cross the blood-brain barrier. We tested a brain-targeted hematopoietic stem cell gene therapy approach using lentiviral IDS fused to ApoEII (IDS.ApoEII) compared to a lentivirus expressing normal IDS or a normal bone marrow transplant. In mucopolysaccharidosis II mice, all treatments corrected peripheral disease, but only IDS.ApoEII mediated complete normalization of brain pathology and behavior, providing significantly enhanced correction compared to IDS. A normal bone marrow transplant achieved no brain correction. Whilst corrected macrophages traffic to the brain, secreting IDS/IDS.ApoEII enzyme for cross-correction, IDS.ApoEII was additionally more active in plasma and was taken up and transcytosed across brain endothelia significantly better than IDS via both heparan sulfate/ApoE-dependent receptors and mannose-6-phosphate receptors. Brain-targeted hematopoietic stem cell gene therapy provides a promising therapy for MPS II patients. © 2018 The Authors. Published under the terms of the CC BY 4.0 license.

  10. New metrics for evaluating channel networks extracted in grid digital elevation models

    NASA Astrophysics Data System (ADS)

    Orlandini, S.; Moretti, G.

    2017-12-01

    Channel networks are critical components of drainage basins and delta regions. Despite the important role played by these systems in hydrology and geomorphology, there are at present no well-defined methods to evaluate numerically how far apart two complex channel networks are geometrically. The present study introduces new metrics for numerically evaluating channel networks extracted in grid digital elevation models with respect to a reference channel network (see the figure below). Streams of the evaluated network (EN) are delineated as in the Horton ordering system and examined through a priority climbing algorithm based on the triple index (ID1,ID2,ID3), where ID1 is a stream identifier that increases as the elevation of the lower end of the stream increases, ID2 indicates the ID1 of the draining stream, and ID3 is the ID1 of the corresponding stream in the reference network (RN). Streams of the RN are identified by the double index (ID1,ID2). Streams of the EN are processed in the order of increasing ID1 (plots a-l in the figure below). For each processed stream of the EN, the closest stream of the RN is sought by considering all the streams of the RN sharing the same ID2. This ID2 in the RN is equal in the EN to the ID3 of the stream draining the processed stream, the one having ID1 equal to the ID2 of the processed stream. The mean stream planar distance (MSPD) and the mean stream elevation drop (MSED) are computed as the mean distance and drop, respectively, between corresponding streams. The MSPD is shown to be useful for evaluating slope direction methods and thresholds for channel initiation, whereas the MSED is shown to indicate the ability of grid coarsening strategies to retain the profiles of observed channels. The developed metrics fill a gap in the existing literature by allowing hydrologists and geomorphologists to compare descriptions of a fixed physical system obtained by using different terrain analysis methods, or different physical systems described by using the same methods.
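
    The matching procedure described in this abstract can be sketched roughly as follows. This is an illustrative reading of the abstract, not the authors' implementation: the dictionary layout, the use of ID2 = 0 to mark the outlet stream, the point-to-nearest-point distance between streams, and the toy coordinates are all assumptions made for the example.

```python
import math


def stream_distance(pts_a, pts_b):
    """Assumed planar distance between two streams: mean, over the points
    of pts_a, of the distance to the nearest point of pts_b."""
    return sum(min(math.dist(p, q) for q in pts_b) for p in pts_a) / len(pts_a)


def match_streams(en, rn):
    """Match each EN stream to its closest RN stream.

    en, rn: {ID1: (ID2, [(x, y), ...])}; ID2 == 0 marks the outlet (assumption).
    Returns {EN ID1: (ID3, distance)}.
    """
    matches = {}
    for id1 in sorted(en):  # process EN streams in order of increasing ID1
        id2, pts = en[id1]
        if id2 != 0 and id2 not in matches:
            continue  # the draining stream has no counterpart in the RN
        # ID2 to look for in the RN: the ID3 of the draining EN stream.
        rn_id2 = 0 if id2 == 0 else matches[id2][0]
        candidates = [
            (stream_distance(pts, rn_pts), rn_id1)
            for rn_id1, (cand_id2, rn_pts) in rn.items()
            if cand_id2 == rn_id2
        ]
        if candidates:
            dist, id3 = min(candidates)
            matches[id1] = (id3, dist)
    return matches


# Toy networks: one outlet stream and two tributaries; the EN is the RN shifted by 0.1 in x.
rn = {1: (0, [(0.0, 0.0), (0.0, 1.0)]),
      2: (1, [(0.0, 1.0), (-1.0, 2.0)]),
      3: (1, [(0.0, 1.0), (1.0, 2.0)])}
en = {1: (0, [(0.1, 0.0), (0.1, 1.0)]),
      2: (1, [(0.1, 1.0), (-0.9, 2.0)]),
      3: (1, [(0.1, 1.0), (1.1, 2.0)])}

matches = match_streams(en, rn)
mspd = sum(dist for _, dist in matches.values()) / len(matches)
print({k: v[0] for k, v in matches.items()}, round(mspd, 3))
```

    Because ID1 grows with the elevation of a stream's lower end, a draining stream is always processed before the streams it drains, so its ID3 is available when its tributaries are matched. The sketch does not prevent two EN streams from matching the same RN stream, and it computes only a planar distance; the elevation-drop metric would need elevation data along each stream.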

  11. 20 CFR 645.255 - What nondiscrimination protections apply to participants in Welfare-to-Work programs?

    Code of Federal Regulations, 2012 CFR

    2012-04-01

    ... regulations, including: (1) The Age Discrimination Act of 1975 (42 U.S.C. 6101 et seq.); (2) Section 504 of the Rehabilitation Act of 1973 (29 U.S.C. 794); (3) The Americans with Disabilities Act of 1990 (42 U.S.C. 12101 et seq.); and (4) Title VI of the Civil Rights Act of 1964 (42 U.S.C. 2000d et seq.). (b...

  12. 20 CFR 645.255 - What nondiscrimination protections apply to participants in Welfare-to-Work programs?

    Code of Federal Regulations, 2010 CFR

    2010-04-01

    ... regulations, including: (1) The Age Discrimination Act of 1975 (42 U.S.C. 6101 et seq.); (2) Section 504 of the Rehabilitation Act of 1973 (29 U.S.C. 794); (3) The Americans with Disabilities Act of 1990 (42 U.S.C. 12101 et seq.); and (4) Title VI of the Civil Rights Act of 1964 (42 U.S.C. 2000d et seq.). (b...

  13. 20 CFR 645.255 - What nondiscrimination protections apply to participants in Welfare-to-Work programs?

    Code of Federal Regulations, 2014 CFR

    2014-04-01

    ... regulations, including: (1) The Age Discrimination Act of 1975 (42 U.S.C. 6101 et seq.); (2) Section 504 of the Rehabilitation Act of 1973 (29 U.S.C. 794); (3) The Americans with Disabilities Act of 1990 (42 U.S.C. 12101 et seq.); and (4) Title VI of the Civil Rights Act of 1964 (42 U.S.C. 2000d et seq.). (b...

  14. 20 CFR 645.255 - What nondiscrimination protections apply to participants in Welfare-to-Work programs?

    Code of Federal Regulations, 2013 CFR

    2013-04-01

    ... regulations, including: (1) The Age Discrimination Act of 1975 (42 U.S.C. 6101 et seq.); (2) Section 504 of the Rehabilitation Act of 1973 (29 U.S.C. 794); (3) The Americans with Disabilities Act of 1990 (42 U.S.C. 12101 et seq.); and (4) Title VI of the Civil Rights Act of 1964 (42 U.S.C. 2000d et seq.). (b...

  15. 20 CFR 645.255 - What nondiscrimination protections apply to participants in Welfare-to-Work programs?

    Code of Federal Regulations, 2011 CFR

    2011-04-01

    ... regulations, including: (1) The Age Discrimination Act of 1975 (42 U.S.C. 6101 et seq.); (2) Section 504 of the Rehabilitation Act of 1973 (29 U.S.C. 794); (3) The Americans with Disabilities Act of 1990 (42 U.S.C. 12101 et seq.); and (4) Title VI of the Civil Rights Act of 1964 (42 U.S.C. 2000d et seq.). (b...

  16. TruSeq Stranded mRNA and Total RNA Sample Preparation Kits

    Cancer.gov

    Total RNA-Seq enabled by ribosomal RNA (rRNA) reduction is compatible with formalin-fixed paraffin embedded (FFPE) samples, which contain potentially critical biological information. The family of TruSeq Stranded Total RNA sample preparation kits provides a unique combination of unmatched data quality for both mRNA and whole-transcriptome analyses, robust interrogation of both standard and low-quality samples and workflows compatible with a wide range of study designs.

  17. SC3 - consensus clustering of single-cell RNA-Seq data

    PubMed Central

    Kiselev, Vladimir Yu.; Kirschner, Kristina; Schaub, Michael T.; Andrews, Tallulah; Yiu, Andrew; Chandra, Tamir; Natarajan, Kedar N; Reik, Wolf; Barahona, Mauricio; Green, Anthony R; Hemberg, Martin

    2017-01-01

    Single-cell RNA-seq (scRNA-seq) enables a quantitative cell-type characterisation based on global transcriptome profiles. We present Single-Cell Consensus Clustering (SC3), a user-friendly tool for unsupervised clustering which achieves high accuracy and robustness by combining multiple clustering solutions through a consensus approach. We demonstrate that SC3 is capable of identifying subclones based on the transcriptomes from neoplastic cells collected from patients. PMID:28346451
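
    Combining several clustering solutions by consensus is often done through a co-association (consensus) matrix, whose entry (i, j) is the fraction of clusterings that place cells i and j in the same cluster. The sketch below shows only that generic construction on hypothetical label vectors; it is not SC3's code or API, and the function name is made up for the example.

```python
def consensus_matrix(labelings):
    """Co-association matrix from several clusterings of the same n items.

    labelings: list of label lists, each of length n. Label names need not
    agree between clusterings; only co-membership is used.
    """
    n = len(labelings[0])
    counts = [[0] * n for _ in range(n)]
    for labels in labelings:
        for i in range(n):
            for j in range(n):
                if labels[i] == labels[j]:
                    counts[i][j] += 1
    k = len(labelings)
    return [[c / k for c in row] for row in counts]


# Three hypothetical clusterings of four cells.
labelings = [
    [0, 0, 1, 1],
    [1, 1, 0, 0],  # same partition as the first, with swapped label names
    [0, 0, 0, 1],
]
cm = consensus_matrix(labelings)
for row in cm:
    print([round(v, 2) for v in row])
```

    Working with co-membership rather than raw labels sidesteps the problem that different clustering runs name their clusters arbitrarily. A final partition can then be obtained by clustering the consensus matrix itself.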

  18. Dysregulated microRNA Activity in Shwachman-Diamond Syndrome

    DTIC Science & Technology

    2016-09-01

    define transcriptional signatures of bone marrow failure in SDS using single cell RNA-seq of patient cells. We will analyze these datasets to test the...microRNA expression profiles from HSPCs to be overlaid onto mRNA profiles. Subject terms: single cell RNA-seq; bone marrow failure; hematopoiesis...myelopoiesis; targeted RNA-seq

  19. Determination of in vivo RNA kinetics using RATE-seq.

    PubMed

    Neymotin, Benjamin; Athanasiadou, Rodoniki; Gresham, David

    2014-10-01

    The abundance of a transcript is determined by its rate of synthesis and its rate of degradation; however, global methods for quantifying RNA abundance cannot distinguish variation in these two processes. Here, we introduce RNA approach to equilibrium sequencing (RATE-seq), which uses in vivo metabolic labeling of RNA and approach to equilibrium kinetics, to determine absolute RNA degradation and synthesis rates. RATE-seq does not disturb cellular physiology, uses straightforward normalization with exogenous spike-ins, and can be readily adapted for studies in most organisms. We demonstrate the use of RATE-seq to estimate genome-wide kinetic parameters for coding and noncoding transcripts in Saccharomyces cerevisiae. © 2014 Neymotin et al.; Published by Cold Spring Harbor Laboratory Press for the RNA Society.
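
    The abstract does not state the underlying equations, but approach-to-equilibrium labeling is commonly described with a first-order model. As a sketch under that assumption, let $L(t)$ be the labeled abundance of a transcript, $k_s$ a constant synthesis rate and $k_d$ a first-order degradation rate constant (symbols introduced here for illustration; dilution by cell growth is ignored):

```latex
\frac{dL}{dt} = k_s - k_d\,L(t), \qquad L(0) = 0
\quad\Longrightarrow\quad
L(t) = \frac{k_s}{k_d}\left(1 - e^{-k_d t}\right),
\qquad
\lim_{t \to \infty} L(t) = \frac{k_s}{k_d}.
```

    In this model the speed at which labeling approaches its plateau depends only on $k_d$, while the plateau height fixes the ratio $k_s / k_d$, so a labeling time course can in principle separate the two rates that a single abundance measurement confounds.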

  20. SeqPig: simple and scalable scripting for large sequencing data sets in Hadoop.

    PubMed

    Schumacher, André; Pireddu, Luca; Niemenmaa, Matti; Kallio, Aleksi; Korpelainen, Eija; Zanetti, Gianluigi; Heljanko, Keijo

    2014-01-01

    Hadoop MapReduce-based approaches have become increasingly popular due to their scalability in processing large sequencing datasets. However, as these methods typically require in-depth expertise in Hadoop and Java, they are still out of reach of many bioinformaticians. To solve this problem, we have created SeqPig, a library and a collection of tools to manipulate, analyze and query sequencing datasets in a scalable and simple manner. SeqPig scripts use the Hadoop-based distributed scripting engine Apache Pig, which automatically parallelizes and distributes data processing tasks. We demonstrate SeqPig's scalability over many computing nodes and illustrate its use with example scripts. Available under the open source MIT license at http://sourceforge.net/projects/seqpig/
