efficient sequence-specific gene: Topics by Science.gov

Sample records for efficient sequence-specific gene

Individual sequences in large sets of gene sequences may be distinguished efficiently by combinations of shared sub-sequences

PubMed Central

Gibbs, Mark J; Armstrong, John S; Gibbs, Adrian J

2005-01-01

Background Most current DNA diagnostic tests for identifying organisms use specific oligonucleotide probes that are complementary in sequence to, and hence only hybridise with the DNA of one target species. By contrast, in traditional taxonomy, specimens are usually identified by 'dichotomous keys' that use combinations of characters shared by different members of the target set. Using one specific character for each target is the least efficient strategy for identification. Using combinations of shared bisectionally-distributed characters is much more efficient, and this strategy is most efficient when they separate the targets in a progressively binary way. Results We have developed a practical method for finding minimal sets of sub-sequences that identify individual sequences, and could be targeted by combinations of probes, so that the efficient strategy of traditional taxonomic identification could be used in DNA diagnosis. The sizes of minimal sub-sequence sets depended mostly on sequence diversity and sub-sequence length and interactions between these parameters. We found that 201 distinct cytochrome oxidase subunit-1 (CO1) genes from moths (Lepidoptera) were distinguished using only 15 sub-sequences 20 nucleotides long, whereas only 8–10 sub-sequences 6–10 nucleotides long were required to distinguish the CO1 genes of 92 species from the 9 largest orders of insects. Conclusion The presence/absence of sub-sequences in a set of gene sequences can be used like the questions in a traditional dichotomous taxonomic key; hybridisation probes complementary to such sub-sequences should provide a very efficient means for identifying individual species, subtypes or genotypes. Sequence diversity and sub-sequence length are the major factors that determine the numbers of distinguishing sub-sequences in any set of sequences. PMID:15817134
An Efficient Approach for the Development of Locus Specific Primers in Bread Wheat (Triticum aestivum L.) and Its Application to Re-Sequencing of Genes Involved in Frost Tolerance

PubMed Central

Babben, Steve; Perovic, Dragan; Koch, Michael; Ordon, Frank

2015-01-01

Recent declines in costs accelerated sequencing of many species with large genomes, including hexaploid wheat (Triticum aestivum L.). Although the draft sequence of bread wheat is known, it is still one of the major challenges to developlocus specific primers suitable to be used in marker assisted selection procedures, due to the high homology of the three genomes. In this study we describe an efficient approach for the development of locus specific primers comprising four steps, i.e. (i) identification of genomic and coding sequences (CDS) of candidate genes, (ii) intron- and exon-structure reconstruction, (iii) identification of wheat A, B and D sub-genome sequences and primer development based on sequence differences between the three sub-genomes, and (iv); testing of primers for functionality, correct size and localisation. This approach was applied to single, low and high copy genes involved in frost tolerance in wheat. In summary for 27 of these genes for which sequences were derived from Triticum aestivum, Triticum monococcum and Hordeum vulgare, a set of 119 primer pairs was developed and after testing on Nulli-tetrasomic (NT) lines, a set of 65 primer pairs (54.6%), corresponding to 19 candidate genes, turned out to be specific. Out of these a set of 35 fragments was selected for validation via Sanger's amplicon re-sequencing. All fragments, with the exception of one, could be assigned to the original reference sequence. The approach presented here showed a much higher specificity in primer development in comparison to techniques used so far in bread wheat and can be applied to other polyploid species with a known draft sequence. PMID:26565976
A regulatory sequence from the retinoid X receptor γ gene directs expression to horizontal cells and photoreceptors in the embryonic chicken retina.

PubMed

Blixt, Maria K E; Hallböök, Finn

2016-01-01

Combining techniques of episomal vector gene-specific Cre expression and genomic integration using the piggyBac transposon system enables studies of gene expression-specific cell lineage tracing in the chicken retina. In this work, we aimed to target the retinal horizontal cell progenitors. A 208 bp gene regulatory sequence from the chicken retinoid X receptor γ gene (RXRγ208) was used to drive Cre expression. RXRγ is expressed in progenitors and photoreceptors during development. The vector was combined with a piggyBac "donor" vector containing a floxed STOP sequence followed by enhanced green fluorescent protein (EGFP), as well as a piggyBac helper vector for efficient integration into the host cell genome. The vectors were introduced into the embryonic chicken retina with in ovo electroporation. Tissue electroporation targets specific developmental time points and in specific structures. Cells that drove Cre expression from the regulatory RXRγ208 sequence excised the floxed STOP-sequence and expressed GFP. The approach generated a stable lineage with robust expression of GFP in retinal cells that have activated transcription from the RXRγ208 sequence. Furthermore, GFP was expressed in cells that express horizontal or photoreceptor markers when electroporation was performed between developmental stages 22 and 28. Electroporation of a stage 12 optic cup gave multiple cell types in accordance with RXRγ gene expression in the early retina. In this study, we describe an easy, cost-effective, and time-efficient method for testing regulatory sequences in general. More specifically, our results open up the possibility for further studies of the RXRγ-gene regulatory network governing the formation of photoreceptor and horizontal cells. In addition, the method presents approaches to target the expression of effector genes, such as regulators of cell fate or cell cycle progression, to these cells and their progenitor.
Construction and Evaluation of Normalized cDNA Libraries Enriched with Full-Length Sequences for Rapid Discovery of New Genes from Sisal (Agave sisalana Perr.) Different Developmental Stages

PubMed Central

Zhou, Wen-Zhao; Zhang, Yan-Mei; Lu, Jun-Ying; Li, Jun-Feng

2012-01-01

To provide a resource of sisal-specific expressed sequence data and facilitate this powerful approach in new gene research, the preparation of normalized cDNA libraries enriched with full-length sequences is necessary. Four libraries were produced with RNA pooled from Agave sisalana multiple tissues to increase efficiency of normalization and maximize the number of independent genes by SMART™ method and the duplex-specific nuclease (DSN). This procedure kept the proportion of full-length cDNAs in the subtracted/normalized libraries and dramatically enhanced the discovery of new genes. Sequencing of 3875 cDNA clones of libraries revealed 3320 unigenes with an average insert length about 1.2 kb, indicating that the non-redundancy of libraries was about 85.7%. These unigene functions were predicted by comparing their sequences to functional domain databases and extensively annotated with Gene Ontology (GO) terms. Comparative analysis of sisal unigenes and other plant genomes revealed that four putative MADS-box genes and knotted-like homeobox (knox) gene were obtained from a total of 1162 full-length transcripts. Furthermore, real-time PCR showed that the characteristics of their transcripts mainly depended on the tight expression regulation of a number of genes during the leaf and flower development. Analysis of individual library sequence data indicated that the pooled-tissue approach was highly effective in discovering new genes and preparing libraries for efficient deep sequencing. PMID:23202944
Properties of a U1 RNA enhancer-like sequence.

PubMed Central

Ciliberto, G; Palla, F; Tebb, G; Mattaj, I W; Philipson, L

1987-01-01

The properties of a X.laevis U1B snRNA gene enhancer have been studied by microinjection in Xenopus oocytes. The enhancer-like sequence, defined as a short DNA stretch that is able to activate transcription in an orientation independent manner, is interchangeable between different U snRNA genes. The enhancer sequence alone does not, however, efficiently activate transcription from an SV40 pol II promoter but regains its activity when combined with the U-gene specific proximal sequence element. DNase I protection experiments show that the X.laevis U1B enhancer can interact specifically with a nuclear factor present in mammalian cells. Images PMID:3031597
Whole exome sequencing is an efficient, sensitive and specific method for determining the genetic cause of short-rib thoracic dystrophies.

PubMed

McInerney-Leo, A M; Harris, J E; Leo, P J; Marshall, M S; Gardiner, B; Kinning, E; Leong, H Y; McKenzie, F; Ong, W P; Vodopiutz, J; Wicking, C; Brown, M A; Zankl, A; Duncan, E L

2015-12-01

Short-rib thoracic dystrophies (SRTDs) are congenital disorders due to defects in primary cilium function. SRTDs are recessively inherited with mutations identified in 14 genes to date (comprising 398 exons). Conventional mutation detection (usually by iterative Sanger sequencing) is inefficient and expensive, and often not undertaken. Whole exome massive parallel sequencing has been used to identify new genes for SRTD (WDR34, WDR60 and IFT172); however, the clinical utility of whole exome sequencing (WES) has not been established. WES was performed in 11 individuals with SRTDs. Compound heterozygous or homozygous mutations were identified in six confirmed SRTD genes in 10 individuals (IFT172, DYNC2H1, TTC21B, WDR60, WDR34 and NEK1), giving overall sensitivity of 90.9%. WES data from 993 unaffected individuals sequenced using similar technology showed two individuals with rare (minor allele frequency <0.005) compound heterozygous variants of unknown significance in SRTD genes (specificity >99%). Costs for consumables, laboratory processing and bioinformatic analysis were
Strategies to Improve Efficiency and Specificity of Degenerate Primers in PCR.

PubMed

Campos, Maria Jorge; Quesada, Alberto

2017-01-01

PCR with degenerate primers can be used to identify the coding sequence of an unknown protein or to detect a genetic variant within a gene family. These primers, which are complex mixtures of slightly different oligonucleotide sequences, can be optimized to increase the efficiency and/or specificity of PCR in the amplification of a sequence of interest by the introduction of mismatches with the target sequence and balancing their position toward the primers 5'- or 3'-ends. In this work, we explain in detail examples of rational design of primers in two different applications, including the use of specific determinants at the 3'-end, to: (1) improve PCR efficiency with coding sequences for members of a protein family by fully degeneration at a core box of conserved genetic information, with the reduction of degeneration at the 5'-end, and (2) optimize specificity of allelic discrimination of closely related orthologous by 5'-end degenerate primers.
Large scale RNAi screen in Tribolium reveals novel target genes for pest control and the proteasome as prime target.

PubMed

Ulrich, Julia; Dao, Van Anh; Majumdar, Upalparna; Schmitt-Engel, Christian; Schwirz, Jonas; Schultheis, Dorothea; Ströhlein, Nadi; Troelenberg, Nicole; Grossmann, Daniela; Richter, Tobias; Dönitz, Jürgen; Gerischer, Lizzy; Leboulle, Gérard; Vilcinskas, Andreas; Stanke, Mario; Bucher, Gregor

2015-09-03

Insect pest control is challenged by insecticide resistance and negative impact on ecology and health. One promising pest specific alternative is the generation of transgenic plants, which express double stranded RNAs targeting essential genes of a pest species. Upon feeding, the dsRNA induces gene silencing in the pest resulting in its death. However, the identification of efficient RNAi target genes remains a major challenge as genomic tools and breeding capacity is limited in most pest insects impeding whole-animal-high-throughput-screening. We use the red flour beetle Tribolium castaneum as a screening platform in order to identify the most efficient RNAi target genes. From about 5,000 randomly screened genes of the iBeetle RNAi screen we identify 11 novel and highly efficient RNAi targets. Our data allowed us to determine GO term combinations that are predictive for efficient RNAi target genes with proteasomal genes being most predictive. Finally, we show that RNAi target genes do not appear to act synergistically and that protein sequence conservation does not correlate with the number of potential off target sites. Our results will aid the identification of RNAi target genes in many pest species by providing a manageable number of excellent candidate genes to be tested and the proteasome as prime target. Further, the identified GO term combinations will help to identify efficient target genes from organ specific transcriptomes. Our off target analysis is relevant for the sequence selection used in transgenic plants.
Self-Cloning CRISPR.

PubMed

Arbab, Mandana; Sherwood, Richard I

2016-08-17

CRISPR/Cas9-gene editing has emerged as a revolutionary technology to easily modify specific genomic loci by designing complementary sgRNA sequences and introducing these into cells along with Cas9. Self-cloning CRISPR/Cas9 (scCRISPR) uses a self-cleaving palindromic sgRNA plasmid (sgPal) that recombines with short PCR-amplified site-specific sgRNA sequences within the target cell by homologous recombination to circumvent the process of sgRNA plasmid construction. Through this mechanism, scCRISPR enables gene editing within 2 hr once sgRNA oligos are available, with high efficiency equivalent to conventional sgRNA targeting: >90% gene knockout in both mouse and human embryonic stem cells and cancer cell lines. Furthermore, using PCR-based addition of short homology arms, we achieve efficient site-specific knock-in of transgenes such as GFP without traditional plasmid cloning or genome-integrated selection cassette (2% to 4% knock-in rate). The methods in this paper describe the most rapid and efficient means of CRISPR gene editing. © 2016 by John Wiley & Sons, Inc. Copyright © 2016 John Wiley & Sons, Inc.
Transcriptional insulation of the human keratin 18 gene in transgenic mice.

PubMed Central

Neznanov, N; Thorey, I S; Ceceña, G; Oshima, R G

1993-01-01

Expression of the 10-kb human keratin 18 (K18) gene in transgenic mice results in efficient and appropriate tissue-specific expression in a variety of internal epithelial organs, including liver, lung, intestine, kidney, and the ependymal epithelium of brain, but not in spleen, heart, or skeletal muscle. Expression at the RNA level is directly proportional to the number of integrated K18 transgenes. These results indicate that the K18 gene is able to insulate itself both from the commonly observed cis-acting effects of the sites of integration and from the potential complications of duplicated copies of the gene arranged in head-to-tail fashion. To begin to identify the K18 gene sequences responsible for this property of transcriptional insulation, additional transgenic mouse lines containing deletions of either the 5' or 3' distal end of the K18 gene have been characterized. Deletion of 1.5 kb of the distal 5' flanking sequence has no effect upon either the tissue specificity or the copy number-dependent behavior of the transgene. In contrast, deletion of the 3.5-kb 3' flanking sequence of the gene results in the loss of the copy number-dependent behavior of the gene in liver and intestine. However, expression in kidney, lung, and brain remains efficient and copy number dependent in these transgenic mice. Furthermore, herpes simplex virus thymidine kinase gene expression is copy number dependent in transgenic mice when the gene is located between the distal 5'- and 3'-flanking sequences of the K18 gene. Each adult transgenic male expressed the thymidine kinase gene in testes and brain and proportionally to the number of integrated transgenes. We conclude that the characteristic of copy number-dependent expression of the K18 gene is tissue specific because the sequence requirements for transcriptional insulation in adult liver and intestine are different from those for lung and kidney. In addition, the behavior of the transgenic thymidine kinase gene in testes and brain suggests that the property of transcriptional insulation of the K18 gene may be conferred by the distal flanking sequences of the K18 gene and, additionally, may function for other genes. Images PMID:7681143
Sox2 regulatory region 2 sequence works as a DNA nuclear targeting sequence enhancing the efficiency of an exogenous gene expression in ES cells

DOE Office of Scientific and Technical Information (OSTI.GOV)

Funabashi, Hisakage; Takatsu, Makoto; Saito, Mikako

2010-10-01

Research highlights: {yields} SV40-DTS worked as a DTS in ES cells as well as other types of cells. {yields} Sox2 regulatory region 2 worked as a DTS in ES cells and thus was termed as SRR2-DTS. {yields} SRR2-DTS was suggested as an ES cell-specific DTS. -- Abstract: In this report, the effects of two DNA nuclear targeting sequence (DTS) candidates on the gene expression efficiency in ES cells were investigated. Reporter plasmids containing the simian virus 40 (SV40) promoter/enhancer sequence (SV40-DTS), a DTS for various types of cells but not being reported yet for ES cells, and the 81 basemore » pairs of Sox2 regulatory region 2 (SRR2) where two transcriptional factors in ES cells, Oct3/4 and Sox2, are bound (SRR2-DTS), were introduced into cytoplasm in living cells by femtoinjection. The gene expression efficiencies of each plasmid in mouse insulinoma cell line MIN6 cells and mouse ES cells were then evaluated. Plasmids including SV40-DTS and SRR2-DTS exhibited higher gene expression efficiency comparing to plasmids without these DTSs, and thus it was concluded that both sequences work as a DTS in ES cells. In addition, it was suggested that SRR2-DTS works as an ES cell-specific DTS. To the best of our knowledge, this is the first report to confirm the function of DTSs in ES cells.« less
[Efficient genome editing in human pluripotent stem cells through CRISPR/Cas9].

PubMed

Liu, Gai-gai; Li, Shuang; Wei, Yu-da; Zhang, Yong-xian; Ding, Qiu-rong

2015-11-01

The RNA-guided CRISPR (clustered regularly interspaced short palindromic repeat)-associated Cas9 nuclease has offered a new platform for genome editing with high efficiency. Here, we report the use of CRISPR/Cas9 technology to target a specific genomic region in human pluripotent stem cells. We show that CRISPR/Cas9 can be used to disrupt a gene by introducing frameshift mutations to gene coding region; to knock in specific sequences (e.g. FLAG tag DNA sequence) to targeted genomic locus via homology directed repair; to induce large genomic deletion through dual-guide multiplex. Our results demonstrate the versatile application of CRISPR/Cas9 in stem cell genome editing, which can be widely utilized for functional studies of genes or genome loci in human pluripotent stem cells.
CRISPR-Cas9-Edited Site Sequencing (CRES-Seq): An Efficient and High-Throughput Method for the Selection of CRISPR-Cas9-Edited Clones.

PubMed

Veeranagouda, Yaligara; Debono-Lagneaux, Delphine; Fournet, Hamida; Thill, Gilbert; Didier, Michel

2018-01-16

The emergence of clustered regularly interspaced short palindromic repeats-Cas9 (CRISPR-Cas9) gene editing systems has enabled the creation of specific mutants at low cost, in a short time and with high efficiency, in eukaryotic cells. Since a CRISPR-Cas9 system typically creates an array of mutations in targeted sites, a successful gene editing project requires careful selection of edited clones. This process can be very challenging, especially when working with multiallelic genes and/or polyploid cells (such as cancer and plants cells). Here we described a next-generation sequencing method called CRISPR-Cas9 Edited Site Sequencing (CRES-Seq) for the efficient and high-throughput screening of CRISPR-Cas9-edited clones. CRES-Seq facilitates the precise genotyping up to 96 CRISPR-Cas9-edited sites (CRES) in a single MiniSeq (Illumina) run with an approximate sequencing cost of $6/clone. CRES-Seq is particularly useful when multiple genes are simultaneously targeted by CRISPR-Cas9, and also for screening of clones generated from multiallelic genes/polyploid cells. © 2018 by John Wiley & Sons, Inc. Copyright © 2018 John Wiley & Sons, Inc.
DNA sequence requirements for the accurate transcription of a protein-coding plastid gene in a plastid in vitro system from mustard (Sinapis alba L.)

PubMed Central

Link, Gerhard

1984-01-01

A nuclease-treated plastid extract from mustard (Sinapis alba L.) allows efficient transcription of cloned plastid DNA templates. In this in vitro system, the major runoff transcript of the truncated gene for the 32 000 mol. wt. photosystem II protein was accurately initiated from a site close to or identical with the in vivo start site. By using plasmids with deletions in the 5'-flanking region of this gene as templates, a DNA region required for efficient and selective initiation was detected ˜28-35 nucleotides upstream of the transcription start site. This region contains the sequence element TTGACA, which matches the consensus sequence for prokaryotic `−35' promoter elements. In the absence of this region, a region ˜13-27 nucleotides upstream of the start site still enables a basic level of specific transcription. This second region contains the sequence element TATATAA, which matches the consensus sequence for the `TATA' box of genes transcribed by RNA polymerase II (or B). The region between the `TATA'-like element and the transcription start site is not sufficient but may be required for specific transcription of the plastid gene. This latter region contains the sequence element TATACT, which resembles the prokaryotic `−10' (Pribnow) box. Based on the structural and transcriptional features of the 5' upstream region, a `promoter switch' mechanism is proposed, which may account for the developmentally regulated expression of this plastid gene. ImagesFig. 1.Fig. 2.Fig. 3.Fig. 4.Figure 5. PMID:16453540
Definition of Cis-Acting Elements Regulating Expression of the Drosophila Melanogaster Ninae Opsin Gene by Oligonucleotide-Directed Mutagenesis

PubMed Central

Mismer, D.; Rubin, G. M.

1989-01-01

We have analyzed the cis-acting regulatory sequences of the Rh1 (ninaE) gene in Drosophila melanogaster by P-element-mediated germline transformation of indicator genes transcribed from mutant ninaE promoter sequences. We have previously shown that a 200-bp region extending from -120 to +67 relative to the transcription start site is sufficient to obtain eye-specific expression from the ninaE promoter. In the present study, 22 different 4-13-bp sequences in the -120/+67 promoter region were altered by oligonucleotide-directed mutagenesis. Several of these sequences were found to be required for proper promoter function; two of these are conserved in the promoter of the homologous gene isolated from the related species Drosophila virilis. Alteration of a conserved 9-bp sequence results in aberrant, low level expression in the body. Alteration of a separate 11-bp sequence, found in the promoter regions of several photoreceptor-specific genes of Drosophila, results in an approximately 15-fold reduction in promoter efficiency but without apparent alteration of tissue-specificity. A protein factor capable of interacting with this 11-bp sequence has been detected by DNaseI footprinting in embryonic nuclear extracts. Finally, we have further characterized two separable enhancer sequences previously shown to be required for normal levels of expression from this promoter. PMID:2521839
An active role for endogenous beta-1,3-glucanase genes in transgene-mediated co-suppression in tobacco.

PubMed

Sanders, Matthew; Maddelein, Wendy; Depicker, Anna; Van Montagu, Marc; Cornelissen, Marc; Jacobs, John

2002-11-01

Post-transcriptional gene silencing (PTGS) is characterized by the accumulation of short interfering RNAs that are proposed to mediate sequence-specific degradation of cognate and secondary target mRNAs. In plants, it is unclear to what extent endogenous genes contribute to this process. Here, we address the role of the endogenous target genes in transgene-mediated PTGS of beta-1,3-glucanases in tobacco. We found that mRNA sequences of the endogenous glucanase glb gene with varying degrees of homology to the Nicotiana plumbaginifolia gn1 transgene are targeted by the silencing machinery, although less efficiently than corresponding transgene regions. Importantly, we show that endogene-specific nucleotides in the glb sequence provide specificity to the silencing process. Consistent with this finding, small sense and antisense 21- to 23-nucleotide RNAs homologous to the endogenous glb gene were detected. Combined, these data demonstrate that a co-suppressed endogenous glucan ase gene is involved in signal amplification and selection of homologous targets, and show that endogenous genes can actively participate in PTGS in plants. The findings are introduced as a further sophistication of the post-transciptional silencing model.
Maximizing mutagenesis with solubilized CRISPR-Cas9 ribonucleoprotein complexes.

PubMed

Burger, Alexa; Lindsay, Helen; Felker, Anastasia; Hess, Christopher; Anders, Carolin; Chiavacci, Elena; Zaugg, Jonas; Weber, Lukas M; Catena, Raul; Jinek, Martin; Robinson, Mark D; Mosimann, Christian

2016-06-01

CRISPR-Cas9 enables efficient sequence-specific mutagenesis for creating somatic or germline mutants of model organisms. Key constraints in vivo remain the expression and delivery of active Cas9-sgRNA ribonucleoprotein complexes (RNPs) with minimal toxicity, variable mutagenesis efficiencies depending on targeting sequence, and high mutation mosaicism. Here, we apply in vitro assembled, fluorescent Cas9-sgRNA RNPs in solubilizing salt solution to achieve maximal mutagenesis efficiency in zebrafish embryos. MiSeq-based sequence analysis of targeted loci in individual embryos using CrispRVariants, a customized software tool for mutagenesis quantification and visualization, reveals efficient bi-allelic mutagenesis that reaches saturation at several tested gene loci. Such virtually complete mutagenesis exposes loss-of-function phenotypes for candidate genes in somatic mutant embryos for subsequent generation of stable germline mutants. We further show that targeting of non-coding elements in gene regulatory regions using saturating mutagenesis uncovers functional control elements in transgenic reporters and endogenous genes in injected embryos. Our results establish that optimally solubilized, in vitro assembled fluorescent Cas9-sgRNA RNPs provide a reproducible reagent for direct and scalable loss-of-function studies and applications beyond zebrafish experiments that require maximal DNA cutting efficiency in vivo. © 2016. Published by The Company of Biologists Ltd.
Use of tuf Sequences for Genus-Specific PCR Detection and Phylogenetic Analysis of 28 Streptococcal Species

PubMed Central

Picard, François J.; Ke, Danbing; Boudreau, Dominique K.; Boissinot, Maurice; Huletsky, Ann; Richard, Dave; Ouellette, Marc; Roy, Paul H.; Bergeron, Michel G.

2004-01-01

A 761-bp portion of the tuf gene (encoding the elongation factor Tu) from 28 clinically relevant streptococcal species was obtained by sequencing amplicons generated using broad-range PCR primers. These tuf sequences were used to select Streptococcus-specific PCR primers and to perform phylogenetic analysis. The specificity of the PCR assay was verified using 102 different bacterial species, including the 28 streptococcal species. Genomic DNA purified from all streptococcal species was efficiently detected, whereas there was no amplification with DNA from 72 of the 74 nonstreptococcal bacterial species tested. There was cross-amplification with DNAs from Enterococcus durans and Lactococcus lactis. However, the 15 to 31% nucleotide sequence divergence in the 761-bp tuf portion of these two species compared to any streptococcal tuf sequence provides ample sequence divergence to allow the development of internal probes specific to streptococci. The Streptococcus-specific assay was highly sensitive for all 28 streptococcal species tested (i.e., detection limit of 1 to 10 genome copies per PCR). The tuf sequence data was also used to perform extensive phylogenetic analysis, which was generally in agreement with phylogeny determined on the basis of 16S rRNA gene data. However, the tuf gene provided a better discrimination at the streptococcal species level that should be particularly useful for the identification of very closely related species. In conclusion, tuf appears more suitable than the 16S ribosomal RNA gene for the development of diagnostic assays for the detection and identification of streptococcal species because of its higher level of species-specific genetic divergence. PMID:15297518
CRISPR-FOCUS: A web server for designing focused CRISPR screening experiments.

PubMed

Cao, Qingyi; Ma, Jian; Chen, Chen-Hao; Xu, Han; Chen, Zhi; Li, Wei; Liu, X Shirley

2017-01-01

The recently developed CRISPR screen technology, based on the CRISPR/Cas9 genome editing system, enables genome-wide interrogation of gene functions in an efficient and cost-effective manner. Although many computational algorithms and web servers have been developed to design single-guide RNAs (sgRNAs) with high specificity and efficiency, algorithms specifically designed for conducting CRISPR screens are still lacking. Here we present CRISPR-FOCUS, a web-based platform to search and prioritize sgRNAs for CRISPR screen experiments. With official gene symbols or RefSeq IDs as the only mandatory input, CRISPR-FOCUS filters and prioritizes sgRNAs based on multiple criteria, including efficiency, specificity, sequence conservation, isoform structure, as well as genomic variations including Single Nucleotide Polymorphisms and cancer somatic mutations. CRISPR-FOCUS also provides pre-defined positive and negative control sgRNAs, as well as other necessary sequences in the construct (e.g., U6 promoters to drive sgRNA transcription and RNA scaffolds of the CRISPR/Cas9). These features allow users to synthesize oligonucleotides directly based on the output of CRISPR-FOCUS. Overall, CRISPR-FOCUS provides a rational and high-throughput approach for sgRNA library design that enables users to efficiently conduct a focused screen experiment targeting up to thousands of genes. (CRISPR-FOCUS is freely available at http://cistrome.org/crispr-focus/).
Delivery methods for site-specific nucleases: Achieving the full potential of therapeutic gene editing.

PubMed

Liu, Jia; Shui, Sai-Lan

2016-12-28

The advent of site-specific nucleases, particularly CRISPR/Cas9, provides researchers with the unprecedented ability to manipulate genomic sequences. These nucleases are used to create model cell lines, engineer metabolic pathways, produce transgenic animals and plants, perform genome-wide functional screen and, most importantly, treat human diseases that are difficult to tackle by traditional medications. Considerable efforts have been devoted to improving the efficiency and specificity of nucleases for clinical applications. However, safe and efficient delivery methods remain the major obstacle for therapeutic gene editing. In this review, we summarize the recent progress on nuclease delivery methods, highlight their impact on the outcomes of gene editing and discuss the potential of different delivery approaches for therapeutic gene editing. Copyright © 2016 Elsevier B.V. All rights reserved.

Characterization of genetic elements required for site-specific integration of Lactobacillus delbrueckii subsp. bulgaricus bacteriophage mv4 and construction of an integration-proficient vector for Lactobacillus plantarum.

PubMed Central

Dupont, L; Boizet-Bonhoure, B; Coddeville, M; Auvray, F; Ritzenthaler, P

1995-01-01

Temperate phage mv4 integrates its DNA into the chromosome of Lactobacillus delbrueckii subsp. bulgaricus strains via site-specific recombination. Nucleotide sequencing of a 2.2-kb attP-containing phage fragment revealed the presence of four open reading frames. The larger open reading frame, close to the attP site, encoded a 427-amino-acid polypeptide with similarity in its C-terminal domain to site-specific recombinases of the integrase family. Comparison of the sequences of attP, bacterial attachment site attB, and host-phage junctions attL and attR identified a 17-bp common core sequence, where strand exchange occurs during recombination. Analysis of the attB sequence indicated that the core region overlaps the 3' end of a tRNA(Ser) gene. Phage mv4 DNA integration into the tRNA(Ser) gene preserved an intact tRNA(Ser) gene at the attL site. An integration vector based on the mv4 attP site and int gene was constructed. This vector transforms a heterologous host, L. plantarum, through site-specific integration into the tRNA(Ser) gene of the genome and will be useful for development of an efficient integration system for a number of additional bacterial species in which an identical tRNA gene is present. PMID:7836291
High-throughput identification of antigen-specific TCRs by TCR gene capture.

PubMed

Linnemann, Carsten; Heemskerk, Bianca; Kvistborg, Pia; Kluin, Roelof J C; Bolotin, Dmitriy A; Chen, Xiaojing; Bresser, Kaspar; Nieuwland, Marja; Schotte, Remko; Michels, Samira; Gomez-Eerland, Raquel; Jahn, Lorenz; Hombrink, Pleun; Legrand, Nicolas; Shu, Chengyi Jenny; Mamedov, Ilgar Z; Velds, Arno; Blank, Christian U; Haanen, John B A G; Turchaninova, Maria A; Kerkhoven, Ron M; Spits, Hergen; Hadrup, Sine Reker; Heemskerk, Mirjam H M; Blankenstein, Thomas; Chudakov, Dmitriy M; Bendle, Gavin M; Schumacher, Ton N M

2013-11-01

The transfer of T cell receptor (TCR) genes into patient T cells is a promising approach for the treatment of both viral infections and cancer. Although efficient methods exist to identify antibodies for the treatment of these diseases, comparable strategies to identify TCRs have been lacking. We have developed a high-throughput DNA-based strategy to identify TCR sequences by the capture and sequencing of genomic DNA fragments encoding the TCR genes. We establish the value of this approach by assembling a large library of cancer germline tumor antigen-reactive TCRs. Furthermore, by exploiting the quantitative nature of TCR gene capture, we show the feasibility of identifying antigen-specific TCRs in oligoclonal T cell populations from either human material or TCR-humanized mice. Finally, we demonstrate the ability to identify tumor-reactive TCRs within intratumoral T cell subsets without knowledge of antigen specificities, which may be the first step toward the development of autologous TCR gene therapy to target patient-specific neoantigens in human cancer.
Hybridization-based antibody cDNA recovery for the production of recombinant antibodies identified by repertoire sequencing.

PubMed

Valdés-Alemán, Javier; Téllez-Sosa, Juan; Ovilla-Muñoz, Marbella; Godoy-Lozano, Elizabeth; Velázquez-Ramírez, Daniel; Valdovinos-Torres, Humberto; Gómez-Barreto, Rosa E; Martinez-Barnetche, Jesús

2014-01-01

High-throughput sequencing of the antibody repertoire is enabling a thorough analysis of B cell diversity and clonal selection, which may improve the novel antibody discovery process. Theoretically, an adequate bioinformatic analysis could allow identification of candidate antigen-specific antibodies, requiring their recombinant production for experimental validation of their specificity. Gene synthesis is commonly used for the generation of recombinant antibodies identified in silico. Novel strategies that bypass gene synthesis could offer more accessible antibody identification and validation alternatives. We developed a hybridization-based recovery strategy that targets the complementarity-determining region 3 (CDRH3) for the enrichment of cDNA of candidate antigen-specific antibody sequences. Ten clonal groups of interest were identified through bioinformatic analysis of the heavy chain antibody repertoire of mice immunized with hen egg white lysozyme (HEL). cDNA from eight of the targeted clonal groups was recovered efficiently, leading to the generation of recombinant antibodies. One representative heavy chain sequence from each clonal group recovered was paired with previously reported anti-HEL light chains to generate full antibodies, later tested for HEL-binding capacity. The recovery process proposed represents a simple and scalable molecular strategy that could enhance antibody identification and specificity assessment, enabling a more cost-efficient generation of recombinant antibodies.
Canine olfactory receptor gene polymorphism and its relation to odor detection performance by sniffer dogs.

PubMed

Lesniak, Anna; Walczak, Marta; Jezierski, Tadeusz; Sacharczuk, Mariusz; Gawkowski, Maciej; Jaszczak, Kazimierz

2008-01-01

The outstanding sensitivity of the canine olfactory system has been acknowledged by using sniffer dogs in military and civilian service for detection of a variety of odors. It is hypothesized that the canine olfactory ability is determined by polymorphisms in olfactory receptor (OR) genes. We investigated 5 OR genes for polymorphic sites which might affect the olfactory ability of service dogs in different fields of specific substance detection. All investigated OR DNA sequences proved to have allelic variants, the majority of which lead to protein sequence alteration. Homozygous individuals at 2 gene loci significantly differed in their detection skills from other genotypes. This suggests a role of specific alleles in odor detection and a linkage between single-nucleotide polymorphism and odor recognition efficiency.
Typing of artiodactyl MHC-DRB genes with the help of intronic simple repeated DNA sequences.

PubMed

Schwaiger, F W; Buitkamp, J; Weyers, E; Epplen, J T

1993-02-01

An efficient oligonucleotide typing method for the highly polymorphic MHC-DRB genes is described for artiodactyls like cattle, sheep and goat. By means of the polymerase chain reaction, the second exon of MHC-DRB is amplified as well as part of the adjacent intron containing a mixed simple repeat sequence. Using this primer combination we were able to amplify the MHC-DRB exons 2 and adjacent introns from all of the investigated 10 species of the family of Bovidae and giraffes. Therefore, the DRB genes of novel artiodactyl species can also be readily studied. Oligonucleotide probes specific for the polymorphisms of ungulate DRB genes are used with which sequences differing in at least one single base can be distinguished. Exonic polymorphism was found to be correlated with the allele lengths and the patterns of the repeat structures. Hence oligonucleotide probes specific for different simple repeats and polymorphic positions serve also for typing across species barriers. The strict correlation of sequence length and exonic polymorphism permits a preselection of specific oligonucleotides for hybridization. Thus more than 20 alleles can already be differentiated from each of the three species.
Detecting novel genes with sparse arrays

PubMed Central

Haiminen, Niina; Smit, Bart; Rautio, Jari; Vitikainen, Marika; Wiebe, Marilyn; Martinez, Diego; Chee, Christine; Kunkel, Joe; Sanchez, Charles; Nelson, Mary Anne; Pakula, Tiina; Saloheimo, Markku; Penttilä, Merja; Kivioja, Teemu

2014-01-01

Species-specific genes play an important role in defining the phenotype of an organism. However, current gene prediction methods can only efficiently find genes that share features such as sequence similarity or general sequence characteristics with previously known genes. Novel sequencing methods and tiling arrays can be used to find genes without prior information and they have demonstrated that novel genes can still be found from extensively studied model organisms. Unfortunately, these methods are expensive and thus are not easily applicable, e.g., to finding genes that are expressed only in very specific conditions. We demonstrate a method for finding novel genes with sparse arrays, applying it on the 33.9 Mb genome of the filamentous fungus Trichoderma reesei. Our computational method does not require normalisations between arrays and it takes into account the multiple-testing problem typical for analysis of microarray data. In contrast to tiling arrays, that use overlapping probes, only one 25mer microarray oligonucleotide probe was used for every 100 b. Thus, only relatively little space on a microarray slide was required to cover the intergenic regions of a genome. The analysis was done as a by-product of a conventional microarray experiment with no additional costs. We found at least 23 good candidates for novel transcripts that could code for proteins and all of which were expressed at high levels. Candidate genes were found to neighbour ire1 and cre1 and many other regulatory genes. Our simple, low-cost method can easily be applied to finding novel species-specific genes without prior knowledge of their sequence properties. PMID:20691772
Comparison of taxon-specific versus general locus sets for targeted sequence capture in plant phylogenomics.

PubMed

Chau, John H; Rahfeldt, Wolfgang A; Olmstead, Richard G

2018-03-01

Targeted sequence capture can be used to efficiently gather sequence data for large numbers of loci, such as single-copy nuclear loci. Most published studies in plants have used taxon-specific locus sets developed individually for a clade using multiple genomic and transcriptomic resources. General locus sets can also be developed from loci that have been identified as single-copy and have orthologs in large clades of plants. We identify and compare a taxon-specific locus set and three general locus sets (conserved ortholog set [COSII], shared single-copy nuclear [APVO SSC] genes, and pentatricopeptide repeat [PPR] genes) for targeted sequence capture in Buddleja (Scrophulariaceae) and outgroups. We evaluate their performance in terms of assembly success, sequence variability, and resolution and support of inferred phylogenetic trees. The taxon-specific locus set had the most target loci. Assembly success was high for all locus sets in Buddleja samples. For outgroups, general locus sets had greater assembly success. Taxon-specific and PPR loci had the highest average variability. The taxon-specific data set produced the best-supported tree, but all data sets showed improved resolution over previous non-sequence capture data sets. General locus sets can be a useful source of sequence capture targets, especially if multiple genomic resources are not available for a taxon.
How Changes in Anti-SD Sequences Would Affect SD Sequences in Escherichia coli and Bacillus subtilis.

PubMed

Abolbaghaei, Akram; Silke, Jordan R; Xia, Xuhua

2017-05-05

The 3' end of the small ribosomal RNAs (ssu rRNA) in bacteria is directly involved in the selection and binding of mRNA transcripts during translation initiation via well-documented interactions between a Shine-Dalgarno (SD) sequence located upstream of the initiation codon and an anti-SD (aSD) sequence at the 3' end of the ssu rRNA. Consequently, the 3' end of ssu rRNA (3'TAIL) is strongly conserved among bacterial species because a change in the region may impact the translation of many protein-coding genes. Escherichia coli and Bacillus subtilis differ in their 3' ends of ssu rRNA, being GAUC ACCUCCUUA 3' in E. coli and GAUC ACCUCCUU UCU3' or GAUC ACCUCCUU UCUA3' in B. subtilis Such differences in 3'TAIL lead to species-specific SDs (designated SD Ec for E. coli and SD Bs for B. subtilis ) that can form strong and well-positioned SD/aSD pairing in one species but not in the other. Selection mediated by the species-specific 3'TAIL is expected to favor SD Bs against SD Ec in B. subtilis , but favor SD Ec against SD Bs in E. coli Among well-positioned SDs, SD Ec is used more in E. coli than in B. subtilis , and SD Bs more in B. subtilis than in E. coli Highly expressed genes and genes of high translation efficiency tend to have longer SDs than lowly expressed genes and genes with low translation efficiency in both species, but more so in B. subtilis than in E. coli Both species overuse SDs matching the bolded part of the 3'TAIL shown above. The 3'TAIL difference contributes to the host specificity of phages. Copyright © 2017 Abolbaghaei et al.
Gene replacements and insertions in rice by intron targeting using CRISPR-Cas9.

PubMed

Li, Jun; Meng, Xiangbing; Zong, Yuan; Chen, Kunling; Zhang, Huawei; Liu, Jinxing; Li, Jiayang; Gao, Caixia

2016-09-12

Sequence-specific nucleases have been exploited to create targeted gene knockouts in various plants(1), but replacing a fragment and even obtaining gene insertions at specific loci in plant genomes remain a serious challenge. Here, we report efficient intron-mediated site-specific gene replacement and insertion approaches that generate mutations using the non-homologous end joining (NHEJ) pathway using the clustered regularly interspaced short palindromic repeats (CRISPR)-CRISPR-associated protein 9 (Cas9) system. Using a pair of single guide RNAs (sgRNAs) targeting adjacent introns and a donor DNA template including the same pair of sgRNA sites, we achieved gene replacements in the rice endogenous gene 5-enolpyruvylshikimate-3-phosphate synthase (EPSPS) at a frequency of 2.0%. We also obtained targeted gene insertions at a frequency of 2.2% using a sgRNA targeting one intron and a donor DNA template including the same sgRNA site. Rice plants harbouring the OsEPSPS gene with the intended substitutions were glyphosate-resistant. Furthermore, the site-specific gene replacements and insertions were faithfully transmitted to the next generation. These newly developed approaches can be generally used to replace targeted gene fragments and to insert exogenous DNA sequences into specific genomic sites in rice and other plants.
An efficient annotation and gene-expression derivation tool for Illumina Solexa datasets.

PubMed

Hosseini, Parsa; Tremblay, Arianne; Matthews, Benjamin F; Alkharouf, Nadim W

2010-07-02

The data produced by an Illumina flow cell with all eight lanes occupied, produces well over a terabyte worth of images with gigabytes of reads following sequence alignment. The ability to translate such reads into meaningful annotation is therefore of great concern and importance. Very easily, one can get flooded with such a great volume of textual, unannotated data irrespective of read quality or size. CASAVA, a optional analysis tool for Illumina sequencing experiments, enables the ability to understand INDEL detection, SNP information, and allele calling. To not only extract from such analysis, a measure of gene expression in the form of tag-counts, but furthermore to annotate such reads is therefore of significant value. We developed TASE (Tag counting and Analysis of Solexa Experiments), a rapid tag-counting and annotation software tool specifically designed for Illumina CASAVA sequencing datasets. Developed in Java and deployed using jTDS JDBC driver and a SQL Server backend, TASE provides an extremely fast means of calculating gene expression through tag-counts while annotating sequenced reads with the gene's presumed function, from any given CASAVA-build. Such a build is generated for both DNA and RNA sequencing. Analysis is broken into two distinct components: DNA sequence or read concatenation, followed by tag-counting and annotation. The end result produces output containing the homology-based functional annotation and respective gene expression measure signifying how many times sequenced reads were found within the genomic ranges of functional annotations. TASE is a powerful tool to facilitate the process of annotating a given Illumina Solexa sequencing dataset. Our results indicate that both homology-based annotation and tag-count analysis are achieved in very efficient times, providing researchers to delve deep in a given CASAVA-build and maximize information extraction from a sequencing dataset. TASE is specially designed to translate sequence data in a CASAVA-build into functional annotations while producing corresponding gene expression measurements. Achieving such analysis is executed in an ultrafast and highly efficient manner, whether the analysis be a single-read or paired-end sequencing experiment. TASE is a user-friendly and freely available application, allowing rapid analysis and annotation of any given Illumina Solexa sequencing dataset with ease.
Sequence comparison of prefrontal cortical brain transcriptome from a tame and an aggressive silver fox (Vulpes vulpes).

PubMed

Kukekova, Anna V; Johnson, Jennifer L; Teiling, Clotilde; Li, Lewyn; Oskina, Irina N; Kharlamova, Anastasiya V; Gulevich, Rimma G; Padte, Ravee; Dubreuil, Michael M; Vladimirova, Anastasiya V; Shepeleva, Darya V; Shikhevich, Svetlana G; Sun, Qi; Ponnala, Lalit; Temnykh, Svetlana V; Trut, Lyudmila N; Acland, Gregory M

2011-10-03

Two strains of the silver fox (Vulpes vulpes), with markedly different behavioral phenotypes, have been developed by long-term selection for behavior. Foxes from the tame strain exhibit friendly behavior towards humans, paralleling the sociability of canine puppies, whereas foxes from the aggressive strain are defensive and exhibit aggression to humans. To understand the genetic differences underlying these behavioral phenotypes fox-specific genomic resources are needed. cDNA from mRNA from pre-frontal cortex of a tame and an aggressive fox was sequenced using the Roche 454 FLX Titanium platform (> 2.5 million reads & 0.9 Gbase of tame fox sequence; >3.3 million reads & 1.2 Gbase of aggressive fox sequence). Over 80% of the fox reads were assembled into contigs. Mapping fox reads against the fox transcriptome assembly and the dog genome identified over 30,000 high confidence fox-specific SNPs. Fox transcripts for approximately 14,000 genes were identified using SwissProt and the dog RefSeq databases. An at least 2-fold expression difference between the two samples (p < 0.05) was observed for 335 genes, fewer than 3% of the total number of genes identified in the fox transcriptome. Transcriptome sequencing significantly expanded genomic resources available for the fox, a species without a sequenced genome. In a very cost efficient manner this yielded a large number of fox-specific SNP markers for genetic studies and provided significant insights into the gene expression profile of the fox pre-frontal cortex; expression differences between the two fox samples; and a catalogue of potentially important gene-specific sequence variants. This result demonstrates the utility of this approach for developing genomic resources in species with limited genomic information.
Sequence comparison of prefrontal cortical brain transcriptome from a tame and an aggressive silver fox (Vulpes vulpes)

PubMed Central

2011-01-01

Background Two strains of the silver fox (Vulpes vulpes), with markedly different behavioral phenotypes, have been developed by long-term selection for behavior. Foxes from the tame strain exhibit friendly behavior towards humans, paralleling the sociability of canine puppies, whereas foxes from the aggressive strain are defensive and exhibit aggression to humans. To understand the genetic differences underlying these behavioral phenotypes fox-specific genomic resources are needed. Results cDNA from mRNA from pre-frontal cortex of a tame and an aggressive fox was sequenced using the Roche 454 FLX Titanium platform (> 2.5 million reads & 0.9 Gbase of tame fox sequence; >3.3 million reads & 1.2 Gbase of aggressive fox sequence). Over 80% of the fox reads were assembled into contigs. Mapping fox reads against the fox transcriptome assembly and the dog genome identified over 30,000 high confidence fox-specific SNPs. Fox transcripts for approximately 14,000 genes were identified using SwissProt and the dog RefSeq databases. An at least 2-fold expression difference between the two samples (p < 0.05) was observed for 335 genes, fewer than 3% of the total number of genes identified in the fox transcriptome. Conclusions Transcriptome sequencing significantly expanded genomic resources available for the fox, a species without a sequenced genome. In a very cost efficient manner this yielded a large number of fox-specific SNP markers for genetic studies and provided significant insights into the gene expression profile of the fox pre-frontal cortex; expression differences between the two fox samples; and a catalogue of potentially important gene-specific sequence variants. This result demonstrates the utility of this approach for developing genomic resources in species with limited genomic information. PMID:21967120
Sequence-defined cMET/HGFR-targeted Polymers as Gene Delivery Vehicles for the Theranostic Sodium Iodide Symporter (NIS) Gene

PubMed Central

Urnauer, Sarah; Morys, Stephan; Krhac Levacic, Ana; Müller, Andrea M; Schug, Christina; Schmohl, Kathrin A; Schwenk, Nathalie; Zach, Christian; Carlsen, Janette; Bartenstein, Peter; Wagner, Ernst; Spitzweg, Christine

2016-01-01

The sodium iodide symporter (NIS) as well-characterized theranostic gene represents an outstanding tool to target different cancer types allowing noninvasive imaging of functional NIS expression and therapeutic radioiodide application. Based on its overexpression on the surface of most cancer types, the cMET/hepatocyte growth factor receptor serves as ideal target for tumor-selective gene delivery. Sequence-defined polymers as nonviral gene delivery vehicles comprising polyethylene glycol (PEG) and cationic (oligoethanoamino) amide cores coupled with a cMET-binding peptide (cMBP2) were complexed with NIS-DNA and tested for receptor-specificity, transduction efficiency, and therapeutic efficacy in hepatocellular cancer cells HuH7. In vitro iodide uptake studies demonstrated high transduction efficiency and cMET-specificity of NIS-encoding polyplexes (cMBP2-PEG-Stp/NIS) compared to polyplexes without targeting ligand (Ala-PEG-Stp/NIS) and without coding DNA (cMBP2-PEG-Stp/Antisense-NIS). Tumor recruitment and vector biodistribution were investigated in vivo in a subcutaneous xenograft mouse model showing high tumor-selective iodide accumulation in cMBP2-PEG-Stp/NIS-treated mice (6.6 ± 1.6% ID/g 123I, biological half-life 3 hours) by 123I-scintigraphy. Therapy studies with three cycles of polyplexes and 131I application resulted in significant delay in tumor growth and prolonged survival. These data demonstrate the enormous potential of cMET-targeted sequence-defined polymers combined with the unique theranostic function of NIS allowing for optimized transfection efficiency while eliminating toxicity. PMID:27157666
Efficient CRISPR-rAAV engineering of endogenous genes to study protein function by allele-specific RNAi.

PubMed

Kaulich, Manuel; Lee, Yeon J; Lönn, Peter; Springer, Aaron D; Meade, Bryan R; Dowdy, Steven F

2015-04-20

Gene knockout strategies, RNAi and rescue experiments are all employed to study mammalian gene function. However, the disadvantages of these approaches include: loss of function adaptation, reduced viability and gene overexpression that rarely matches endogenous levels. Here, we developed an endogenous gene knockdown/rescue strategy that combines RNAi selectivity with a highly efficient CRISPR directed recombinant Adeno-Associated Virus (rAAV) mediated gene targeting approach to introduce allele-specific mutations plus an allele-selective siRNA Sensitive (siSN) site that allows for studying gene mutations while maintaining endogenous expression and regulation of the gene of interest. CRISPR/Cas9 plus rAAV targeted gene-replacement and introduction of allele-specific RNAi sensitivity mutations in the CDK2 and CDK1 genes resulted in a >85% site-specific recombination of Neo-resistant clones versus ∼8% for rAAV alone. RNAi knockdown of wild type (WT) Cdk2 with siWT in heterozygotic knockin cells resulted in the mutant Cdk2 phenotype cell cycle arrest, whereas allele specific knockdown of mutant CDK2 with siSN resulted in a wild type phenotype. Together, these observations demonstrate the ability of CRISPR plus rAAV to efficiently recombine a genomic locus and tag it with a selective siRNA sequence that allows for allele-selective phenotypic assays of the gene of interest while it remains expressed and regulated under endogenous control mechanisms. © The Author(s) 2015. Published by Oxford University Press on behalf of Nucleic Acids Research.
Single-Stranded γPNAs for In Vivo Site-Specific Genome Editing via Watson-Crick Recognition

PubMed Central

Bahal, Raman; Quijano, Elias; McNeer, Nicole Ali; Liu, Yanfeng; Bhunia, Dinesh C.; López-Giráldez, Francesco; Fields, Rachel J.; Saltzman, W. Mark; Ly, Danith H.; Glazer, Peter M.

2014-01-01

Triplex-forming peptide nucleic acids (PNAs) facilitate gene editing by stimulating recombination of donor DNAs within genomic DNA via site-specific formation of altered helical structures that further stimulate DNA repair. However, PNAs designed for triplex formation are sequence restricted to homopurine sites. Herein we describe a novel strategy where next generation single-stranded gamma PNAs (γPNAs) containing miniPEG substitutions at the gamma position can target genomic DNA in mouse bone marrow at mixed-sequence sites to induce targeted gene editing. In addition to enhanced binding, γPNAs confer increased solubility and improved formulation into poly(lactic-co-glycolic acid) (PLGA) nanoparticles for efficient intracellular delivery. Single-stranded γPNAs induce targeted gene editing at frequencies of 0.8% in mouse bone marrow cells treated ex vivo and 0.1% in vivo via IV injection, without detectable toxicity. These results suggest that γPNAs may provide a new tool for induced gene editing based on Watson-Crick recognition without sequence restriction. PMID:25174576
Single-stranded γPNAs for in vivo site-specific genome editing via Watson-Crick recognition.

PubMed

Bahal, Raman; Quijano, Elias; McNeer, Nicole A; Liu, Yanfeng; Bhunia, Dinesh C; Lopez-Giraldez, Francesco; Fields, Rachel J; Saltzman, William M; Ly, Danith H; Glazer, Peter M

2014-01-01

Triplex-forming peptide nucleic acids (PNAs) facilitate gene editing by stimulating recombination of donor DNAs within genomic DNA via site-specific formation of altered helical structures that further stimulate DNA repair. However, PNAs designed for triplex formation are sequence restricted to homopurine sites. Herein we describe a novel strategy where next generation single-stranded gamma PNAs (γPNAs) containing miniPEG substitutions at the gamma position can target genomic DNA in mouse bone marrow at mixed-sequence sites to induce targeted gene editing. In addition to enhanced binding, γPNAs confer increased solubility and improved formulation into poly(lactic-co-glycolic acid) (PLGA) nanoparticles for efficient intracellular delivery. Single-stranded γPNAs induce targeted gene editing at frequencies of 0.8% in mouse bone marrow cells treated ex vivo and 0.1% in vivo via IV injection, without detectable toxicity. These results suggest that γPNAs may provide a new tool for induced gene editing based on Watson-Crick recognition without sequence restriction.
The effectiveness of three regions in mitochondrial genome for aphid DNA barcoding: a case in Lachininae.

PubMed

Chen, Rui; Jiang, Li-Yun; Qiao, Ge-Xia

2012-01-01

The mitochondrial gene COI has been widely used by taxonomists as a standard DNA barcode sequence for the identification of many animal species. However, the COI region is of limited use for identifying certain species and is not efficiently amplified by PCR in all animal taxa. To evaluate the utility of COI as a DNA barcode and to identify other barcode genes, we chose the aphid subfamily Lachninae (Hemiptera: Aphididae) as the focus of our study. We compared the results obtained using COI with two other mitochondrial genes, COII and Cytb. In addition, we propose a new method to improve the efficiency of species identification using DNA barcoding. Three mitochondrial genes (COI, COII and Cytb) were sequenced and were used in the identification of over 80 species of Lachninae. The COI and COII genes demonstrated a greater PCR amplification efficiency than Cytb. Species identification using COII sequences had a higher frequency of success (96.9% in "best match" and 90.8% in "best close match") and yielded lower intra- and higher interspecific genetic divergence values than the other two markers. The use of "tag barcodes" is a new approach that involves attaching a species-specific tag to the standard DNA barcode. With this method, the "barcoding overlap" can be nearly eliminated. As a result, we were able to increase the identification success rate from 83.9% to 95.2% by using COI and the "best close match" technique. A COII-based identification system should be more effective in identifying lachnine species than COI or Cytb. However, the Cytb gene is an effective marker for the study of aphid population genetics due to its high sequence diversity. Furthermore, the use of "tag barcodes" can improve the accuracy of DNA barcoding identification by reducing or removing the overlap between intra- and inter-specific genetic divergence values.
Sequence-specific antimicrobials using efficiently delivered RNA-guided nucleases.

PubMed

Citorik, Robert J; Mimee, Mark; Lu, Timothy K

2014-11-01

Current antibiotics tend to be broad spectrum, leading to indiscriminate killing of commensal bacteria and accelerated evolution of drug resistance. Here, we use CRISPR-Cas technology to create antimicrobials whose spectrum of activity is chosen by design. RNA-guided nucleases (RGNs) targeting specific DNA sequences are delivered efficiently to microbial populations using bacteriophage or bacteria carrying plasmids transmissible by conjugation. The DNA targets of RGNs can be undesirable genes or polymorphisms, including antibiotic resistance and virulence determinants in carbapenem-resistant Enterobacteriaceae and enterohemorrhagic Escherichia coli. Delivery of RGNs significantly improves survival in a Galleria mellonella infection model. We also show that RGNs enable modulation of complex bacterial populations by selective knockdown of targeted strains based on genetic signatures. RGNs constitute a class of highly discriminatory, customizable antimicrobials that enact selective pressure at the DNA level to reduce the prevalence of undesired genes, minimize off-target effects and enable programmable remodeling of microbiota.
Influence of sequence mismatches on the specificity of recombinase polymerase amplification technology.

PubMed

Daher, Rana K; Stewart, Gale; Boissinot, Maurice; Boudreau, Dominique K; Bergeron, Michel G

2015-04-01

Recombinase polymerase amplification (RPA) technology relies on three major proteins, recombinase proteins, single-strand binding proteins, and polymerases, to specifically amplify nucleic acid sequences in an isothermal format. The performance of RPA with respect to sequence mismatches of closely-related non-target molecules is not well documented and the influence of the number and distribution of mismatches in DNA sequences on RPA amplification reaction is not well understood. We investigated the specificity of RPA by testing closely-related species bearing naturally occurring mismatches for the tuf gene sequence of Pseudomonas aeruginosa and/or Mycobacterium tuberculosis and for the cfb gene sequence of Streptococcus agalactiae. In addition, the impact of the number and distribution of mismatches on RPA efficiency was assessed by synthetically generating 14 types of mismatched forward primers for detecting five bacterial species of high diagnostic relevance such as Clostridium difficile, Staphylococcus aureus, S. agalactiae, P. aeruginosa, and M. tuberculosis as well as Bacillus atropheus subsp. globigii for which we use the spores as internal control in diagnostic assays. A total of 87 mismatched primers were tested in this study. We observed that target specific RPA primers with mismatches (n > 1) at their 3'extrimity hampered RPA reaction. In addition, 3 mismatches covering both extremities and the center of the primer sequence negatively affected RPA yield. We demonstrated that the specificity of RPA was multifactorial. Therefore its application in clinical settings must be selected and validated a priori. We recommend that the selection of a target gene must consider the presence of closely-related non-target genes. It is advisable to choose target regions with a high number of mismatches (≥36%, relative to the size of amplicon) with respect to closely-related species and the best case scenario would be by choosing a unique target gene. Copyright © 2014 Elsevier Ltd. All rights reserved.
Adeno-associated virus inverted terminal repeats stimulate gene editing.

PubMed

Hirsch, M L

2015-02-01

Advancements in genome editing have relied on technologies to specifically damage DNA which, in turn, stimulates DNA repair including homologous recombination (HR). As off-target concerns complicate the therapeutic translation of site-specific DNA endonucleases, an alternative strategy to stimulate gene editing based on fragile DNA was investigated. To do this, an episomal gene-editing reporter was generated by a disruptive insertion of the adeno-associated virus (AAV) inverted terminal repeat (ITR) into the egfp gene. Compared with a non-structured DNA control sequence, the ITR induced DNA damage as evidenced by increased gamma-H2AX and Mre11 foci formation. As local DNA damage stimulates HR, ITR-mediated gene editing was investigated using DNA oligonucleotides as repair substrates. The AAV ITR stimulated gene editing >1000-fold in a replication-independent manner and was not biased by the polarity of the repair oligonucleotide. Analysis of additional human DNA sequences demonstrated stimulation of gene editing to varying degrees. In particular, inverted yet not direct, Alu repeats induced gene editing, suggesting a role for DNA structure in the repair event. Collectively, the results demonstrate that inverted DNA repeats stimulate gene editing via double-strand break repair in an episomal context and allude to efficient gene editing of the human chromosome using fragile DNA sequences.

Histidine-rich stabilized polyplexes for cMet-directed tumor-targeted gene transfer

NASA Astrophysics Data System (ADS)

Kos, Petra; Lächelt, Ulrich; Herrmann, Annika; Mickler, Frauke Martina; Döblinger, Markus; He, Dongsheng; Krhač Levačić, Ana; Morys, Stephan; Bräuchle, Christoph; Wagner, Ernst

2015-03-01

Overexpression of the hepatocyte growth factor receptor/c-Met proto oncogene on the surface of a variety of tumor cells gives an opportunity to specifically target cancerous tissues. Herein, we report the first use of c-Met as receptor for non-viral tumor-targeted gene delivery. Sequence-defined oligomers comprising the c-Met binding peptide ligand cMBP2 for targeting, a monodisperse polyethylene glycol (PEG) for polyplex surface shielding, and various cationic (oligoethanamino) amide cores containing terminal cysteines for redox-sensitive polyplex stabilization, were assembled by solid-phase supported syntheses. The resulting oligomers exhibited a greatly enhanced cellular uptake and gene transfer over non-targeted control sequences, confirming the efficacy and target-specificity of the formed polyplexes. Implementation of endosomal escape-promoting histidines in the cationic core was required for gene expression without additional endosomolytic agent. The histidine-enriched polyplexes demonstrated stability in serum as well as receptor-specific gene transfer in vivo upon intratumoral injection. The co-formulation with an analogous PEG-free cationic oligomer led to a further compaction of pDNA polyplexes with an obvious change of shape as demonstrated by transmission electron microscopy. Such compaction was critically required for efficient intravenous gene delivery which resulted in greatly enhanced, cMBP2 ligand-dependent gene expression in the distant tumor.Overexpression of the hepatocyte growth factor receptor/c-Met proto oncogene on the surface of a variety of tumor cells gives an opportunity to specifically target cancerous tissues. Herein, we report the first use of c-Met as receptor for non-viral tumor-targeted gene delivery. Sequence-defined oligomers comprising the c-Met binding peptide ligand cMBP2 for targeting, a monodisperse polyethylene glycol (PEG) for polyplex surface shielding, and various cationic (oligoethanamino) amide cores containing terminal cysteines for redox-sensitive polyplex stabilization, were assembled by solid-phase supported syntheses. The resulting oligomers exhibited a greatly enhanced cellular uptake and gene transfer over non-targeted control sequences, confirming the efficacy and target-specificity of the formed polyplexes. Implementation of endosomal escape-promoting histidines in the cationic core was required for gene expression without additional endosomolytic agent. The histidine-enriched polyplexes demonstrated stability in serum as well as receptor-specific gene transfer in vivo upon intratumoral injection. The co-formulation with an analogous PEG-free cationic oligomer led to a further compaction of pDNA polyplexes with an obvious change of shape as demonstrated by transmission electron microscopy. Such compaction was critically required for efficient intravenous gene delivery which resulted in greatly enhanced, cMBP2 ligand-dependent gene expression in the distant tumor. Electronic supplementary information (ESI) available. See DOI: 10.1039/c4nr06556e
A Single Transcriptome of a Green Toad (Bufo viridis) Yields Candidate Genes for Sex Determination and -Differentiation and Non-Anonymous Population Genetic Markers

PubMed Central

Gerchen, Jörn F.; Reichert, Samuel J.; Röhr, Johannes T.; Dieterich, Christoph; Kloas, Werner

2016-01-01

Large genome size, including immense repetitive and non-coding fractions, still present challenges for capacity, bioinformatics and thus affordability of whole genome sequencing in most amphibians. Here, we test the performance of a single transcriptome to understand whether it can provide a cost-efficient resource for species with large unknown genomes. Using RNA from six different tissues from a single Palearctic green toad (Bufo viridis) specimen and Hiseq2000, we obtained 22,5 Mio reads and publish >100,000 unigene sequences. To evaluate efficacy and quality, we first use this data to identify green toad specific candidate genes, known from other vertebrates for their role in sex determination and differentiation. Of a list of 37 genes, the transcriptome yielded 32 (87%), many of which providing the first such data for this non-model anuran species. However, for many of these genes, only fragments could be retrieved. In order to allow also applications to population genetics, we further used the transcriptome for the targeted development of 21 non-anonymous microsatellites and tested them in genetic families and backcrosses. Eleven markers were specifically developed to be located on the B. viridis sex chromosomes; for eight markers we can indeed demonstrate sex-specific transmission in genetic families. Depending on phylogenetic distance, several markers, which are sex-linked in green toads, show high cross-amplification success across the anuran phylogeny, involving nine systematic anuran families. Our data support the view that single transcriptome sequencing (based on multiple tissues) provides a reliable genomic resource and cost-efficient method for non-model amphibian species with large genome size and, despite limitations, should be considered as long as genome sequencing remains unaffordable for most species. PMID:27232626
Translation efficiency of heterologous proteins is significantly affected by the genetic context of RBS sequences in engineered cyanobacterium Synechocystis sp. PCC 6803.

PubMed

Thiel, Kati; Mulaku, Edita; Dandapani, Hariharan; Nagy, Csaba; Aro, Eva-Mari; Kallio, Pauli

2018-03-02

Photosynthetic cyanobacteria have been studied as potential host organisms for direct solar-driven production of different carbon-based chemicals from CO 2 and water, as part of the development of sustainable future biotechnological applications. The engineering approaches, however, are still limited by the lack of comprehensive information on most optimal expression strategies and validated species-specific genetic elements which are essential for increasing the intricacy, predictability and efficiency of the systems. This study focused on the systematic evaluation of the key translational control elements, ribosome binding sites (RBS), in the cyanobacterial host Synechocystis sp. PCC 6803, with the objective of expanding the palette of tools for more rigorous engineering approaches. An expression system was established for the comparison of 13 selected RBS sequences in Synechocystis, using several alternative reporter proteins (sYFP2, codon-optimized GFPmut3 and ethylene forming enzyme) as quantitative indicators of the relative translation efficiencies. The set-up was shown to yield highly reproducible expression patterns in independent analytical series with low variation between biological replicates, thus allowing statistical comparison of the activities of the different RBSs in vivo. While the RBSs covered a relatively broad overall expression level range, the downstream gene sequence was demonstrated in a rigorous manner to have a clear impact on the resulting translational profiles. This was expected to reflect interfering sequence-specific mRNA-level interaction between the RBS and the coding region, yet correlation between potential secondary structure formation and observed translation levels could not be resolved with existing in silico prediction tools. The study expands our current understanding on the potential and limitations associated with the regulation of protein expression at translational level in engineered cyanobacteria. The acquired information can be used for selecting appropriate RBSs for optimizing over-expression constructs or multicistronic pathways in Synechocystis, while underlining the complications in predicting the activity due to gene-specific interactions which may reduce the translational efficiency for a given RBS-gene combination. Ultimately, the findings emphasize the need for additional characterized insulator sequence elements to decouple the interaction between the RBS and the coding region for future engineering approaches.
Crispr-mediated Gene Targeting of Human Induced Pluripotent Stem Cells.

PubMed

Byrne, Susan M; Church, George M

2015-01-01

CRISPR/Cas9 nuclease systems can create double-stranded DNA breaks at specific sequences to efficiently and precisely disrupt, excise, mutate, insert, or replace genes. However, human embryonic stem or induced pluripotent stem cells (iPSCs) are more difficult to transfect and less resilient to DNA damage than immortalized tumor cell lines. Here, we describe an optimized protocol for genome engineering of human iPSCs using a simple transient transfection of plasmids and/or single-stranded oligonucleotides. With this protocol, we achieve transfection efficiencies greater than 60%, with gene disruption efficiencies from 1-25% and gene insertion/replacement efficiencies from 0.5-10% without any further selection or enrichment steps. We also describe how to design and assess optimal sgRNA target sites and donor targeting vectors; cloning individual iPSC by single cell FACS sorting, and genotyping successfully edited cells.
Gene Composer: database software for protein construct design, codon engineering, and gene synthesis

PubMed Central

Lorimer, Don; Raymond, Amy; Walchli, John; Mixon, Mark; Barrow, Adrienne; Wallace, Ellen; Grice, Rena; Burgin, Alex; Stewart, Lance

2009-01-01

Background To improve efficiency in high throughput protein structure determination, we have developed a database software package, Gene Composer, which facilitates the information-rich design of protein constructs and their codon engineered synthetic gene sequences. With its modular workflow design and numerous graphical user interfaces, Gene Composer enables researchers to perform all common bio-informatics steps used in modern structure guided protein engineering and synthetic gene engineering. Results An interactive Alignment Viewer allows the researcher to simultaneously visualize sequence conservation in the context of known protein secondary structure, ligand contacts, water contacts, crystal contacts, B-factors, solvent accessible area, residue property type and several other useful property views. The Construct Design Module enables the facile design of novel protein constructs with altered N- and C-termini, internal insertions or deletions, point mutations, and desired affinity tags. The modifications can be combined and permuted into multiple protein constructs, and then virtually cloned in silico into defined expression vectors. The Gene Design Module uses a protein-to-gene algorithm that automates the back-translation of a protein amino acid sequence into a codon engineered nucleic acid gene sequence according to a selected codon usage table with minimal codon usage threshold, defined G:C% content, and desired sequence features achieved through synonymous codon selection that is optimized for the intended expression system. The gene-to-oligo algorithm of the Gene Design Module plans out all of the required overlapping oligonucleotides and mutagenic primers needed to synthesize the desired gene constructs by PCR, and for physically cloning them into selected vectors by the most popular subcloning strategies. Conclusion We present a complete description of Gene Composer functionality, and an efficient PCR-based synthetic gene assembly procedure with mis-match specific endonuclease error correction in combination with PIPE cloning. In a sister manuscript we present data on how Gene Composer designed genes and protein constructs can result in improved protein production for structural studies. PMID:19383142
Gene composer: database software for protein construct design, codon engineering, and gene synthesis.

PubMed

Lorimer, Don; Raymond, Amy; Walchli, John; Mixon, Mark; Barrow, Adrienne; Wallace, Ellen; Grice, Rena; Burgin, Alex; Stewart, Lance

2009-04-21

To improve efficiency in high throughput protein structure determination, we have developed a database software package, Gene Composer, which facilitates the information-rich design of protein constructs and their codon engineered synthetic gene sequences. With its modular workflow design and numerous graphical user interfaces, Gene Composer enables researchers to perform all common bio-informatics steps used in modern structure guided protein engineering and synthetic gene engineering. An interactive Alignment Viewer allows the researcher to simultaneously visualize sequence conservation in the context of known protein secondary structure, ligand contacts, water contacts, crystal contacts, B-factors, solvent accessible area, residue property type and several other useful property views. The Construct Design Module enables the facile design of novel protein constructs with altered N- and C-termini, internal insertions or deletions, point mutations, and desired affinity tags. The modifications can be combined and permuted into multiple protein constructs, and then virtually cloned in silico into defined expression vectors. The Gene Design Module uses a protein-to-gene algorithm that automates the back-translation of a protein amino acid sequence into a codon engineered nucleic acid gene sequence according to a selected codon usage table with minimal codon usage threshold, defined G:C% content, and desired sequence features achieved through synonymous codon selection that is optimized for the intended expression system. The gene-to-oligo algorithm of the Gene Design Module plans out all of the required overlapping oligonucleotides and mutagenic primers needed to synthesize the desired gene constructs by PCR, and for physically cloning them into selected vectors by the most popular subcloning strategies. We present a complete description of Gene Composer functionality, and an efficient PCR-based synthetic gene assembly procedure with mis-match specific endonuclease error correction in combination with PIPE cloning. In a sister manuscript we present data on how Gene Composer designed genes and protein constructs can result in improved protein production for structural studies.
An efficient annotation and gene-expression derivation tool for Illumina Solexa datasets

PubMed Central

2010-01-01

Background The data produced by an Illumina flow cell with all eight lanes occupied, produces well over a terabyte worth of images with gigabytes of reads following sequence alignment. The ability to translate such reads into meaningful annotation is therefore of great concern and importance. Very easily, one can get flooded with such a great volume of textual, unannotated data irrespective of read quality or size. CASAVA, a optional analysis tool for Illumina sequencing experiments, enables the ability to understand INDEL detection, SNP information, and allele calling. To not only extract from such analysis, a measure of gene expression in the form of tag-counts, but furthermore to annotate such reads is therefore of significant value. Findings We developed TASE (Tag counting and Analysis of Solexa Experiments), a rapid tag-counting and annotation software tool specifically designed for Illumina CASAVA sequencing datasets. Developed in Java and deployed using jTDS JDBC driver and a SQL Server backend, TASE provides an extremely fast means of calculating gene expression through tag-counts while annotating sequenced reads with the gene's presumed function, from any given CASAVA-build. Such a build is generated for both DNA and RNA sequencing. Analysis is broken into two distinct components: DNA sequence or read concatenation, followed by tag-counting and annotation. The end result produces output containing the homology-based functional annotation and respective gene expression measure signifying how many times sequenced reads were found within the genomic ranges of functional annotations. Conclusions TASE is a powerful tool to facilitate the process of annotating a given Illumina Solexa sequencing dataset. Our results indicate that both homology-based annotation and tag-count analysis are achieved in very efficient times, providing researchers to delve deep in a given CASAVA-build and maximize information extraction from a sequencing dataset. TASE is specially designed to translate sequence data in a CASAVA-build into functional annotations while producing corresponding gene expression measurements. Achieving such analysis is executed in an ultrafast and highly efficient manner, whether the analysis be a single-read or paired-end sequencing experiment. TASE is a user-friendly and freely available application, allowing rapid analysis and annotation of any given Illumina Solexa sequencing dataset with ease. PMID:20598141
Gene calling and bacterial genome annotation with BG7.

PubMed

Tobes, Raquel; Pareja-Tobes, Pablo; Manrique, Marina; Pareja-Tobes, Eduardo; Kovach, Evdokim; Alekhin, Alexey; Pareja, Eduardo

2015-01-01

New massive sequencing technologies are providing many bacterial genome sequences from diverse taxa but a refined annotation of these genomes is crucial for obtaining scientific findings and new knowledge. Thus, bacterial genome annotation has emerged as a key point to investigate in bacteria. Any efficient tool designed specifically to annotate bacterial genomes sequenced with massively parallel technologies has to consider the specific features of bacterial genomes (absence of introns and scarcity of nonprotein-coding sequence) and of next-generation sequencing (NGS) technologies (presence of errors and not perfectly assembled genomes). These features make it convenient to focus on coding regions and, hence, on protein sequences that are the elements directly related with biological functions. In this chapter we describe how to annotate bacterial genomes with BG7, an open-source tool based on a protein-centered gene calling/annotation paradigm. BG7 is specifically designed for the annotation of bacterial genomes sequenced with NGS. This tool is sequence error tolerant maintaining their capabilities for the annotation of highly fragmented genomes or for annotating mixed sequences coming from several genomes (as those obtained through metagenomics samples). BG7 has been designed with scalability as a requirement, with a computing infrastructure completely based on cloud computing (Amazon Web Services).
Cell type-specific termination of transcription by transposable element sequences.

PubMed

Conley, Andrew B; Jordan, I King

2012-09-30

Transposable elements (TEs) encode sequences necessary for their own transposition, including signals required for the termination of transcription. TE sequences within the introns of human genes show an antisense orientation bias, which has been proposed to reflect selection against TE sequences in the sense orientation owing to their ability to terminate the transcription of host gene transcripts. While there is evidence in support of this model for some elements, the extent to which TE sequences actually terminate transcription of human gene across the genome remains an open question. Using high-throughput sequencing data, we have characterized over 9,000 distinct TE-derived sequences that provide transcription termination sites for 5,747 human genes across eight different cell types. Rarefaction curve analysis suggests that there may be twice as many TE-derived termination sites (TE-TTS) genome-wide among all human cell types. The local chromatin environment for these TE-TTS is similar to that seen for 3' UTR canonical TTS and distinct from the chromatin environment of other intragenic TE sequences. However, those TE-TTS located within the introns of human genes were found to be far more cell type-specific than the canonical TTS. TE-TTS were much more likely to be found in the sense orientation than other intragenic TE sequences of the same TE family and TE-TTS in the sense orientation terminate transcription more efficiently than those found in the antisense orientation. Alu sequences were found to provide a large number of relatively weak TTS, whereas LTR elements provided a smaller number of much stronger TTS. TE sequences provide numerous termination sites to human genes, and TE-derived TTS are particularly cell type-specific. Thus, TE sequences provide a powerful mechanism for the diversification of transcriptional profiles between cell types and among evolutionary lineages, since most TE-TTS are evolutionarily young. The extent of transcription termination by TEs seen here, along with the preference for sense-oriented TE insertions to provide TTS, is consistent with the observed antisense orientation bias of human TEs.
Simian virus 40 major late promoter: an upstream DNA sequence required for efficient in vitro transcription.

PubMed Central

Brady, J; Radonovich, M; Thoren, M; Das, G; Salzman, N P

1984-01-01

We have previously identified an 11-base DNA sequence, 5'-G-G-T-A-C-C-T-A-A-C-C-3' (simian virus 40 [SV40] map position 294 to 304), which is important in the control of SV40 late RNA expression in vitro and in vivo (Brady et al., Cell 31:625-633, 1982). We report here the identification of another domain of the SV40 late promoter. A series of mutants with deletions extending from SV40 map position 0 to 300 was prepared by nuclease BAL 31 treatment. The cloned templates were then analyzed for efficiency and accuracy of late SV40 RNA expression in the Manley in vitro transcription system. Our studies showed that, in addition to the promoter domain near map position 300, there are essential DNA sequences between nucleotide positions 74 and 95 that are required for efficient expression of late SV40 RNA. Included in this SV40 DNA sequence were two of the six GGGCGG SV40 repeat sequences and an 11-nucleotide segment which showed strong homology with the upstream sequences required for the efficient in vitro and in vivo expression of the histone H2A gene. This upstream promoter sequence supported transcription with the same efficiency even when it was moved 72 nucleotides closer to the major late cap site. In vitro promoter competition analysis demonstrated that the upstream promoter sequence, independent of the 294 to 304 promoter element, is capable of binding polymerase-transcription factors required for SV40 late gene transcription. Finally, we show that DNA sequences which control the specificity of RNA initiation at nucleotide 325 lie downstream of map position 294. Images PMID:6321950
Adaptable gene-specific dye bias correction for two-channel DNA microarrays.

PubMed

Margaritis, Thanasis; Lijnzaad, Philip; van Leenen, Dik; Bouwmeester, Diane; Kemmeren, Patrick; van Hooff, Sander R; Holstege, Frank C P

2009-01-01

DNA microarray technology is a powerful tool for monitoring gene expression or for finding the location of DNA-bound proteins. DNA microarrays can suffer from gene-specific dye bias (GSDB), causing some probes to be affected more by the dye than by the sample. This results in large measurement errors, which vary considerably for different probes and also across different hybridizations. GSDB is not corrected by conventional normalization and has been difficult to address systematically because of its variance. We show that GSDB is influenced by label incorporation efficiency, explaining the variation of GSDB across different hybridizations. A correction method (Gene- And Slide-Specific Correction, GASSCO) is presented, whereby sequence-specific corrections are modulated by the overall bias of individual hybridizations. GASSCO outperforms earlier methods and works well on a variety of publically available datasets covering a range of platforms, organisms and applications, including ChIP on chip. A sequence-based model is also presented, which predicts which probes will suffer most from GSDB, useful for microarray probe design and correction of individual hybridizations. Software implementing the method is publicly available.
Adaptable gene-specific dye bias correction for two-channel DNA microarrays

PubMed Central

Margaritis, Thanasis; Lijnzaad, Philip; van Leenen, Dik; Bouwmeester, Diane; Kemmeren, Patrick; van Hooff, Sander R; Holstege, Frank CP

2009-01-01

DNA microarray technology is a powerful tool for monitoring gene expression or for finding the location of DNA-bound proteins. DNA microarrays can suffer from gene-specific dye bias (GSDB), causing some probes to be affected more by the dye than by the sample. This results in large measurement errors, which vary considerably for different probes and also across different hybridizations. GSDB is not corrected by conventional normalization and has been difficult to address systematically because of its variance. We show that GSDB is influenced by label incorporation efficiency, explaining the variation of GSDB across different hybridizations. A correction method (Gene- And Slide-Specific Correction, GASSCO) is presented, whereby sequence-specific corrections are modulated by the overall bias of individual hybridizations. GASSCO outperforms earlier methods and works well on a variety of publically available datasets covering a range of platforms, organisms and applications, including ChIP on chip. A sequence-based model is also presented, which predicts which probes will suffer most from GSDB, useful for microarray probe design and correction of individual hybridizations. Software implementing the method is publicly available. PMID:19401678
snoSeeker: an advanced computational package for screening of guide and orphan snoRNA genes in the human genome.

PubMed

Yang, Jian-Hua; Zhang, Xiao-Chen; Huang, Zhan-Peng; Zhou, Hui; Huang, Mian-Bo; Zhang, Shu; Chen, Yue-Qin; Qu, Liang-Hu

2006-01-01

Small nucleolar RNAs (snoRNAs) represent an abundant group of non-coding RNAs in eukaryotes. They can be divided into guide and orphan snoRNAs according to the presence or absence of antisense sequence to rRNAs or snRNAs. Current snoRNA-searching programs, which are essentially based on sequence complementarity to rRNAs or snRNAs, exist only for the screening of guide snoRNAs. In this study, we have developed an advanced computational package, snoSeeker, which includes CDseeker and ACAseeker programs, for the highly efficient and specific screening of both guide and orphan snoRNA genes in mammalian genomes. By using these programs, we have systematically scanned four human-mammal whole-genome alignment (WGA) sequences and identified 54 novel candidates including 26 orphan candidates as well as 266 known snoRNA genes. Eighteen novel snoRNAs were further experimentally confirmed with four snoRNAs exhibiting a tissue-specific or restricted expression pattern. The results of this study provide the most comprehensive listing of two families of snoRNA genes in the human genome till date.
An Efficient Method for High-Fidelity BAC/PAC Retrofitting with a Selectable Marker for Mammalian Cell Transfection

PubMed Central

Wang, Zunde; Engler, Peter; Longacre, Angelika; Storb, Ursula

2001-01-01

Large-scale genomic sequencing projects have provided DNA sequence information for many genes, but the biological functions for most of them will only be known through functional studies. Bacterial artificial chromosomes (BACs) and P1-derived artificial chromosomes (PACs) are large genomic clones stably maintained in bacteria and are very important in functional studies through transfection because of their large size and stability. Because most BAC or PAC vectors do not have a mammalian selection marker, transfecting mammalian cells with genes cloned in BACs or PACs requires the insertion into the BAC/PAC of a mammalian selectable marker. However, currently available procedures are not satisfactory in efficiency and fidelity. We describe a very simple and efficient procedure that allows one to retrofit dozens of BACs in a day with no detectable deletions or unwanted recombination. We use a BAC/PAC retrofitting vector that, on transformation into competent BAC or PAC strains, will catalyze the specific insertion of itself into BAC/PAC vectors through in vivo cre/loxP site-specific recombination. PMID:11156622
LOVD: easy creation of a locus-specific sequence variation database using an "LSDB-in-a-box" approach.

PubMed

Fokkema, Ivo F A C; den Dunnen, Johan T; Taschner, Peter E M

2005-08-01

The completion of the human genome project has initiated, as well as provided the basis for, the collection and study of all sequence variation between individuals. Direct access to up-to-date information on sequence variation is currently provided most efficiently through web-based, gene-centered, locus-specific databases (LSDBs). We have developed the Leiden Open (source) Variation Database (LOVD) software approaching the "LSDB-in-a-Box" idea for the easy creation and maintenance of a fully web-based gene sequence variation database. LOVD is platform-independent and uses PHP and MySQL open source software only. The basic gene-centered and modular design of the database follows the recommendations of the Human Genome Variation Society (HGVS) and focuses on the collection and display of DNA sequence variations. With minimal effort, the LOVD platform is extendable with clinical data. The open set-up should both facilitate and promote functional extension with scripts written by the community. The LOVD software is freely available from the Leiden Muscular Dystrophy pages (www.DMD.nl/LOVD/). To promote the use of LOVD, we currently offer curators the possibility to set up an LSDB on our Leiden server. (c) 2005 Wiley-Liss, Inc.
Quantification of Functionalised Gold Nanoparticle-Targeted Knockdown of Gene Expression in HeLa Cells

PubMed Central

Jiwaji, Meesbah; Sandison, Mairi E.; Reboud, Julien; Stevenson, Ross; Daly, Rónán; Barkess, Gráinne; Faulds, Karen; Kolch, Walter; Graham, Duncan; Girolami, Mark A.; Cooper, Jonathan M.; Pitt, Andrew R.

2014-01-01

Introduction Gene therapy continues to grow as an important area of research, primarily because of its potential in the treatment of disease. One significant area where there is a need for better understanding is in improving the efficiency of oligonucleotide delivery to the cell and indeed, following delivery, the characterization of the effects on the cell. Methods In this report, we compare different transfection reagents as delivery vehicles for gold nanoparticles functionalized with DNA oligonucleotides, and quantify their relative transfection efficiencies. The inhibitory properties of small interfering RNA (siRNA), single-stranded RNA (ssRNA) and single-stranded DNA (ssDNA) sequences targeted to human metallothionein hMT-IIa are also quantified in HeLa cells. Techniques used in this study include fluorescence and confocal microscopy, qPCR and Western analysis. Findings We show that the use of transfection reagents does significantly increase nanoparticle transfection efficiencies. Furthermore, siRNA, ssRNA and ssDNA sequences all have comparable inhibitory properties to ssDNA sequences immobilized onto gold nanoparticles. We also show that functionalized gold nanoparticles can co-localize with autophagosomes and illustrate other factors that can affect data collection and interpretation when performing studies with functionalized nanoparticles. Conclusions The desired outcome for biological knockdown studies is the efficient reduction of a specific target; which we demonstrate by using ssDNA inhibitory sequences targeted to human metallothionein IIa gene transcripts that result in the knockdown of both the mRNA transcript and the target protein. PMID:24926959
Seamless Genetic Conversion of SMN2 to SMN1 via CRISPR/Cpf1 and Single-Stranded Oligodeoxynucleotides in Spinal Muscular Atrophy Patient-Specific Induced Pluripotent Stem Cells.

PubMed

Zhou, Miaojin; Hu, Zhiqing; Qiu, Liyan; Zhou, Tao; Feng, Mai; Hu, Qian; Zeng, Baitao; Li, Zhuo; Sun, Qianru; Wu, Yong; Liu, Xionghao; Wu, Lingqian; Liang, Desheng

2018-05-09

Spinal muscular atrophy (SMA) is a kind of neuromuscular disease characterized by progressive motor neuron loss in the spinal cord. It is caused by mutations in the survival motor neuron 1 (SMN1) gene. SMN1 has a paralogous gene, survival motor neuron 2 (SMN2), in humans that is present in almost all SMA patients. The generation and genetic correction of SMA patient-specific induced pluripotent stem cells (iPSCs) is a viable, autologous therapeutic strategy for the disease. Here, c-Myc-free and non-integrating iPSCs were generated from the urine cells of an SMA patient using an episomal iPSC reprogramming vector, and a unique crRNA was designed that does not have similar sequences (≤3 mismatches) anywhere in the human reference genome. In situ gene conversion of the SMN2 gene to an SMN1-like gene in SMA-iPSCs was achieved using CRISPR/Cpf1 and single-stranded oligodeoxynucleotide with a high efficiency of 4/36. Seamlessly gene-converted iPSC lines contained no exogenous sequences and retained a normal karyotype. Significantly, the SMN expression and gems localization were rescued in the gene-converted iPSCs and their derived motor neurons. This is the first report of an efficient gene conversion mediated by Cpf1 homology-directed repair in human cells and may provide a universal gene therapeutic approach for most SMA patients.
Genomic sequencing and the impact of molecular diagnosis on patient care.

PubMed

Solomon, Benjamin D

2015-02-01

Evolving sequencing technologies allow more accurate, efficient and affordable genomic analysis. As a result, these technologies are increasingly available, especially to provide molecular diagnoses for patients with suspected genetic disorders. However, there are many challenges to using genomic sequencing to benefit patients, including concerns that there is insufficient evidence that identifying an underlying molecular explanation may positively impact a patient's healthcare. This concern has many repercussions, including funding and/or (in some countries and healthcare systems) insurance reimbursement for genomic sequencing. To investigate this concern, all monogenic disorders were analyzed based on the impact of achieving molecular diagnosis. Of the 2,849 individual genes in which germline mutations cause disorders (not including contiguous gene syndromes or what may be categorized as susceptibility alleles), our analyses showed a specific, available intervention related to at least one affected organ system for 1,419 (49.8%) genes. In 95.6% of these genes, the intervention(s) would be recommended during the pediatric time frame.
The CRISPR/Cas9 system produces specific and homozygous targeted gene editing in rice in one generation.

PubMed

Zhang, Hui; Zhang, Jinshan; Wei, Pengliang; Zhang, Botao; Gou, Feng; Feng, Zhengyan; Mao, Yanfei; Yang, Lan; Zhang, Heng; Xu, Nanfei; Zhu, Jian-Kang

2014-08-01

The CRISPR/Cas9 system has been demonstrated to efficiently induce targeted gene editing in a variety of organisms including plants. Recent work showed that CRISPR/Cas9-induced gene mutations in Arabidopsis were mostly somatic mutations in the early generation, although some mutations could be stably inherited in later generations. However, it remains unclear whether this system will work similarly in crops such as rice. In this study, we tested in two rice subspecies 11 target genes for their amenability to CRISPR/Cas9-induced editing and determined the patterns, specificity and heritability of the gene modifications. Analysis of the genotypes and frequency of edited genes in the first generation of transformed plants (T0) showed that the CRISPR/Cas9 system was highly efficient in rice, with target genes edited in nearly half of the transformed embryogenic cells before their first cell division. Homozygotes of edited target genes were readily found in T0 plants. The gene mutations were passed to the next generation (T1) following classic Mendelian law, without any detectable new mutation or reversion. Even with extensive searches including whole genome resequencing, we could not find any evidence of large-scale off-targeting in rice for any of the many targets tested in this study. By specifically sequencing the putative off-target sites of a large number of T0 plants, low-frequency mutations were found in only one off-target site where the sequence had 1-bp difference from the intended target. Overall, the data in this study point to the CRISPR/Cas9 system being a powerful tool in crop genome engineering. © 2014 Society for Experimental Biology, Association of Applied Biologists and John Wiley & Sons Ltd.
Establishing gene models from the Pinus pinaster genome using gene capture and BAC sequencing.

PubMed

Seoane-Zonjic, Pedro; Cañas, Rafael A; Bautista, Rocío; Gómez-Maldonado, Josefa; Arrillaga, Isabel; Fernández-Pozo, Noé; Claros, M Gonzalo; Cánovas, Francisco M; Ávila, Concepción

2016-02-27

In the era of DNA throughput sequencing, assembling and understanding gymnosperm mega-genomes remains a challenge. Although drafts of three conifer genomes have recently been published, this number is too low to understand the full complexity of conifer genomes. Using techniques focused on specific genes, gene models can be established that can aid in the assembly of gene-rich regions, and this information can be used to compare genomes and understand functional evolution. In this study, gene capture technology combined with BAC isolation and sequencing was used as an experimental approach to establish de novo gene structures without a reference genome. Probes were designed for 866 maritime pine transcripts to sequence genes captured from genomic DNA. The gene models were constructed using GeneAssembler, a new bioinformatic pipeline, which reconstructed over 82% of the gene structures, and a high proportion (85%) of the captured gene models contained sequences from the promoter regulatory region. In a parallel experiment, the P. pinaster BAC library was screened to isolate clones containing genes whose cDNA sequence were already available. BAC clones containing the asparagine synthetase, sucrose synthase and xyloglucan endotransglycosylase gene sequences were isolated and used in this study. The gene models derived from the gene capture approach were compared with the genomic sequences derived from the BAC clones. This combined approach is a particularly efficient way to capture the genomic structures of gene families with a small number of members. The experimental approach used in this study is a valuable combined technique to study genomic gene structures in species for which a reference genome is unavailable. It can be used to establish exon/intron boundaries in unknown gene structures, to reconstruct incomplete genes and to obtain promoter sequences that can be used for transcriptional studies. A bioinformatics algorithm (GeneAssembler) is also provided as a Ruby gem for this class of analyses.

TALE-mediated epigenetic suppression of CDKN2A increases replication in human fibroblasts.

PubMed

Bernstein, Diana L; Le Lay, John E; Ruano, Elena G; Kaestner, Klaus H

2015-05-01

Current strategies to alter disease-associated epigenetic modifications target ubiquitously expressed epigenetic regulators. This approach does not allow specific genes to be controlled in specific cell types; therefore, tools to selectively target epigenetic modifications in the desired cell type and strategies to more efficiently correct aberrant gene expression in disease are needed. Here, we have developed a method for directing DNA methylation to specific gene loci by conjugating catalytic domains of DNA methyltransferases (DNMTs) to engineered transcription activator-like effectors (TALEs). We demonstrated that these TALE-DNMTs direct DNA methylation specifically to the targeted gene locus in human cells. Further, we determined that minimizing direct nucleotide sequence repeats within the TALE moiety permits efficient lentivirus transduction, allowing easy targeting of primary cell types. Finally, we demonstrated that directed DNA methylation with a TALE-DNMT targeting the CDKN2A locus, which encodes the cyclin-dependent kinase inhibitor p16, decreased CDKN2A expression and increased replication of primary human fibroblasts, as intended. Moreover, overexpression of p16 in these cells reversed the proliferative phenotype, demonstrating the specificity of our epigenetic targeting. Together, our results demonstrate that TALE-DNMTs can selectively target specific genes and suggest that this strategy has potential application for the development of locus-specific epigenetic therapeutics.
TALE-mediated epigenetic suppression of CDKN2A increases replication in human fibroblasts

PubMed Central

Bernstein, Diana L.; Le Lay, John E.; Ruano, Elena G.; Kaestner, Klaus H.

2015-01-01

Current strategies to alter disease-associated epigenetic modifications target ubiquitously expressed epigenetic regulators. This approach does not allow specific genes to be controlled in specific cell types; therefore, tools to selectively target epigenetic modifications in the desired cell type and strategies to more efficiently correct aberrant gene expression in disease are needed. Here, we have developed a method for directing DNA methylation to specific gene loci by conjugating catalytic domains of DNA methyltransferases (DNMTs) to engineered transcription activator–like effectors (TALEs). We demonstrated that these TALE-DNMTs direct DNA methylation specifically to the targeted gene locus in human cells. Further, we determined that minimizing direct nucleotide sequence repeats within the TALE moiety permits efficient lentivirus transduction, allowing easy targeting of primary cell types. Finally, we demonstrated that directed DNA methylation with a TALE-DNMT targeting the CDKN2A locus, which encodes the cyclin-dependent kinase inhibitor p16, decreased CDKN2A expression and increased replication of primary human fibroblasts, as intended. Moreover, overexpression of p16 in these cells reversed the proliferative phenotype, demonstrating the specificity of our epigenetic targeting. Together, our results demonstrate that TALE-DNMTs can selectively target specific genes and suggest that this strategy has potential application for the development of locus-specific epigenetic therapeutics. PMID:25866970
Chromosomal location and genetic mapping of the mismatch repair gene homologs MSH2, MSH3, and MSH6 in rye and wheat

PubMed

Korzun; Borner; Siebert; Malyshev; Hilpert; Kunze; Puchta

1999-12-01

The efficiency of homeologous recombination is influenced by mismatch repair genes in bacteria, yeast, and mammals. To elucidate a possible role of these genes in homeologous pairing and cross-compatibility in plants, gene probes of wheat (Triticum aestivum) specific for the mismatch repair gene homologues MSH2, MSH3, and MSH6 were used to map them to their genomic positions in rye (Secale cereale). Whereas MSH2 was mapped to the short arm of chromosome 1R, MSH3 was mapped to the long arm of chromosome 2R and MSH6 to the long arm of chromosome 5R. Southern blots with nullisomic-tetrasomic (NT) lines of wheat indicated the presence of the sequences on the respective homeologous group of wheat chromosomes. Additionally, an MSH6-specific homologue could also be detected on homoeologous group 3 of wheat. However, in the well-known, highly homoeologous pairing wheat mutant ph1b the MSH6-specific sequence is not within the deleted part of chromosome 5BL, indicating that the pairing phenotype is not due to a loss of one of the mismatch repair genes tested.
Mining new crystal protein genes from Bacillus thuringiensis on the basis of mixed plasmid-enriched genome sequencing and a computational pipeline.

PubMed

Ye, Weixing; Zhu, Lei; Liu, Yingying; Crickmore, Neil; Peng, Donghai; Ruan, Lifang; Sun, Ming

2012-07-01

We have designed a high-throughput system for the identification of novel crystal protein genes (cry) from Bacillus thuringiensis strains. The system was developed with two goals: (i) to acquire the mixed plasmid-enriched genomic sequence of B. thuringiensis using next-generation sequencing biotechnology, and (ii) to identify cry genes with a computational pipeline (using BtToxin_scanner). In our pipeline method, we employed three different kinds of well-developed prediction methods, BLAST, hidden Markov model (HMM), and support vector machine (SVM), to predict the presence of Cry toxin genes. The pipeline proved to be fast (average speed, 1.02 Mb/min for proteins and open reading frames [ORFs] and 1.80 Mb/min for nucleotide sequences), sensitive (it detected 40% more protein toxin genes than a keyword extraction method using genomic sequences downloaded from GenBank), and highly specific. Twenty-one strains from our laboratory's collection were selected based on their plasmid pattern and/or crystal morphology. The plasmid-enriched genomic DNA was extracted from these strains and mixed for Illumina sequencing. The sequencing data were de novo assembled, and a total of 113 candidate cry sequences were identified using the computational pipeline. Twenty-seven candidate sequences were selected on the basis of their low level of sequence identity to known cry genes, and eight full-length genes were obtained with PCR. Finally, three new cry-type genes (primary ranks) and five cry holotypes, which were designated cry8Ac1, cry7Ha1, cry21Ca1, cry32Fa1, and cry21Da1 by the B. thuringiensis Toxin Nomenclature Committee, were identified. The system described here is both efficient and cost-effective and can greatly accelerate the discovery of novel cry genes.
NURD: an implementation of a new method to estimate isoform expression from non-uniform RNA-seq data

PubMed Central

2013-01-01

Background RNA-Seq technology has been used widely in transcriptome study, and one of the most important applications is to estimate the expression level of genes and their alternative splicing isoforms. There have been several algorithms published to estimate the expression based on different models. Recently Wu et al. published a method that can accurately estimate isoform level expression by considering position-related sequencing biases using nonparametric models. The method has advantages in handling different read distributions, but there hasn’t been an efficient program to implement this algorithm. Results We developed an efficient implementation of the algorithm in the program NURD. It uses a binary interval search algorithm. The program can correct both the global tendency of sequencing bias in the data and local sequencing bias specific to each gene. The correction makes the isoform expression estimation more reliable under various read distributions. And the implementation is computationally efficient in both the memory cost and running time and can be readily scaled up for huge datasets. Conclusion NURD is an efficient and reliable tool for estimating the isoform expression level. Given the reads mapping result and gene annotation file, NURD will output the expression estimation result. The package is freely available for academic use at http://bioinfo.au.tsinghua.edu.cn/software/NURD/. PMID:23837734
Cis-acting elements in the promoter region of the human aldolase C gene.

PubMed

Buono, P; de Conciliis, L; Olivetta, E; Izzo, P; Salvatore, F

1993-08-16

We investigated the cis-acting sequences involved in the expression of the human aldolase C gene by transient transfections into human neuroblastoma cells (SKNBE). We demonstrate that 420 bp of the 5'-flanking DNA direct at high efficiency the transcription of the CAT reporter gene. A deletion between -420 bp and -164 bp causes a 60% decrease of CAT activity. Gel shift and DNase I footprinting analyses revealed four protected elements: A, B, C and D. Competition analyses indicate that Sp1 or factors sharing a similar sequence specificity bind to elements A and B, but not to elements C and D. Sequence analysis shows a half palindromic ERE motif (GGTCA), in elements B and D. Region D binds a transactivating factor which appears also essential to stabilize the initiation complex.
The largest subunit of RNA polymerase II as a new marker gene to study assemblages of arbuscular mycorrhizal fungi in the field.

PubMed

Stockinger, Herbert; Peyret-Guzzon, Marine; Koegel, Sally; Bouffaud, Marie-Lara; Redecker, Dirk

2014-01-01

Due to the potential of arbuscular mycorrhizal fungi (AMF, Glomeromycota) to improve plant growth and soil quality, the influence of agricultural practice on their diversity continues to be an important research question. Up to now studies of community diversity in AMF have exclusively been based on nuclear ribosomal gene regions, which in AMF show high intra-organism polymorphism, seriously complicating interpretation of these data. We designed specific PCR primers for 454 sequencing of a region of the largest subunit of RNA polymerase II gene, and established a new reference dataset comprising all major AMF lineages. This gene is known to be monomorphic within fungal isolates but shows an excellent barcode gap between species. We designed a primer set to amplify all known lineages of AMF and demonstrated its applicability in combination with high-throughput sequencing in a long-term tillage experiment. The PCR primers showed a specificity of 99.94% for glomeromycotan sequences. We found evidence of significant shifts of the AMF communities caused by soil management and showed that tillage effects on different AMF taxa are clearly more complex than previously thought. The high resolving power of high-throughput sequencing highlights the need for quantitative measurements to efficiently detect these effects.
Floral gene resources from basal angiosperms for comparative genomics research

PubMed Central

Albert, Victor A; Soltis, Douglas E; Carlson, John E; Farmerie, William G; Wall, P Kerr; Ilut, Daniel C; Solow, Teri M; Mueller, Lukas A; Landherr, Lena L; Hu, Yi; Buzgo, Matyas; Kim, Sangtae; Yoo, Mi-Jeong; Frohlich, Michael W; Perl-Treves, Rafael; Schlarbaum, Scott E; Bliss, Barbara J; Zhang, Xiaohong; Tanksley, Steven D; Oppenheimer, David G; Soltis, Pamela S; Ma, Hong; dePamphilis, Claude W; Leebens-Mack, James H

2005-01-01

Background The Floral Genome Project was initiated to bridge the genomic gap between the most broadly studied plant model systems. Arabidopsis and rice, although now completely sequenced and under intensive comparative genomic investigation, are separated by at least 125 million years of evolutionary time, and cannot in isolation provide a comprehensive perspective on structural and functional aspects of flowering plant genome dynamics. Here we discuss new genomic resources available to the scientific community, comprising cDNA libraries and Expressed Sequence Tag (EST) sequences for a suite of phylogenetically basal angiosperms specifically selected to bridge the evolutionary gaps between model plants and provide insights into gene content and genome structure in the earliest flowering plants. Results Random sequencing of cDNAs from representatives of phylogenetically important eudicot, non-grass monocot, and gymnosperm lineages has so far (as of 12/1/04) generated 70,514 ESTs and 48,170 assembled unigenes. Efficient sorting of EST sequences into putative gene families based on whole Arabidopsis/rice proteome comparison has permitted ready identification of cDNA clones for finished sequencing. Preliminarily, (i) proportions of functional categories among sequenced floral genes seem representative of the entire Arabidopsis transcriptome, (ii) many known floral gene homologues have been captured, and (iii) phylogenetic analyses of ESTs are providing new insights into the process of gene family evolution in relation to the origin and diversification of the angiosperms. Conclusion Initial comparisons illustrate the utility of the EST data sets toward discovery of the basic floral transcriptome. These first findings also afford the opportunity to address a number of conspicuous evolutionary genomic questions, including reproductive organ transcriptome overlap between angiosperms and gymnosperms, genome-wide duplication history, lineage-specific gene duplication and functional divergence, and analyses of adaptive molecular evolution. Since not all genes in the floral transcriptome will be associated with flowering, these EST resources will also be of interest to plant scientists working on other functions, such as photosynthesis, signal transduction, and metabolic pathways. PMID:15799777
Phylogeny of nodulation genes and symbiotic diversity of Acacia senegal (L.) Willd. and A. seyal (Del.) Mesorhizobium strains from different regions of Senegal.

PubMed

Bakhoum, Niokhor; Galiana, Antoine; Le Roux, Christine; Kane, Aboubacry; Duponnois, Robin; Ndoye, Fatou; Fall, Dioumacor; Noba, Kandioura; Sylla, Samba Ndao; Diouf, Diégane

2015-04-01

Acacia senegal and Acacia seyal are small, deciduous legume trees, most highly valued for nitrogen fixation and for the production of gum arabic, a commodity of international trade since ancient times. Symbiotic nitrogen fixation by legumes represents the main natural input of atmospheric N2 into ecosystems which may ultimately benefit all organisms. We analyzed the nod and nif symbiotic genes and symbiotic properties of root-nodulating bacteria isolated from A. senegal and A. seyal in Senegal. The symbiotic genes of rhizobial strains from the two Acacia species were closed to those of Mesorhizobium plurifarium and grouped separately in the phylogenetic trees. Phylogeny of rhizobial nitrogen fixation gene nifH was similar to those of nodulation genes (nodA and nodC). All A. senegal rhizobial strains showed identical nodA, nodC, and nifH gene sequences. By contrast, A. seyal rhizobial strains exhibited different symbiotic gene sequences. Efficiency tests demonstrated that inoculation of both Acacia species significantly affected nodulation, total dry weight, acetylene reduction activity (ARA), and specific acetylene reduction activity (SARA) of plants. However, these cross-inoculation tests did not show any specificity of Mesorhizobium strains toward a given Acacia host species in terms of infectivity and efficiency as stated by principal component analysis (PCA). This study demonstrates that large-scale inoculation of A. senegal and A. seyal in the framework of reafforestation programs requires a preliminary step of rhizobial strain selection for both Acacia species.
A-WINGS: an integrated genome database for Pleurocybella porrigens (Angel's wing oyster mushroom, Sugihiratake).

PubMed

Yamamoto, Naoki; Suzuki, Tomohiro; Kobayashi, Masaaki; Dohra, Hideo; Sasaki, Yohei; Hirai, Hirofumi; Yokoyama, Koji; Kawagishi, Hirokazu; Yano, Kentaro

2014-12-03

The angel's wing oyster mushroom (Pleurocybella porrigens, Sugihiratake) is a well-known delicacy. However, its potential risk in acute encephalopathy was recently revealed by a food poisoning incident. To disclose the genes underlying the accident and provide mechanistic insight, we seek to develop an information infrastructure containing omics data. In our previous work, we sequenced the genome and transcriptome using next-generation sequencing techniques. The next step in achieving our goal is to develop a web database to facilitate the efficient mining of large-scale omics data and identification of genes specifically expressed in the mushroom. This paper introduces a web database A-WINGS (http://bioinf.mind.meiji.ac.jp/a-wings/) that provides integrated genomic and transcriptomic information for the angel's wing oyster mushroom. The database contains structure and functional annotations of transcripts and gene expressions. Functional annotations contain information on homologous sequences from NCBI nr and UniProt, Gene Ontology, and KEGG Orthology. Digital gene expression profiles were derived from RNA sequencing (RNA-seq) analysis in the fruiting bodies and mycelia. The omics information stored in the database is freely accessible through interactive and graphical interfaces by search functions that include 'GO TREE VIEW' browsing, keyword searches, and BLAST searches. The A-WINGS database will accelerate omics studies on specific aspects of the angel's wing oyster mushroom and the family Tricholomataceae.
Inducible Transgenic Models of BRCA1 Function

DTIC Science & Technology

2000-10-01

four different hammerhead ribozymes designed to specifically cleave the Brcal transcript. Hammerhead ribozymes are catalytic RNAs that efficiently...cleave RNA and thereby down- regulate gene expression. Hammerhead ribozymes can cleave any RNA containing a 5’-UH-3’ consensus sequence where U can be...replaced by C, and H=C, U or A. Hammerhead ribozymes have been shown to effectively and selectively inhibit gene expression in bacteria, plants, cell
Genome engineering in human cells.

PubMed

Song, Minjung; Kim, Young-Hoon; Kim, Jin-Soo; Kim, Hyongbum

2014-01-01

Genome editing in human cells is of great value in research, medicine, and biotechnology. Programmable nucleases including zinc-finger nucleases, transcription activator-like effector nucleases, and RNA-guided engineered nucleases recognize a specific target sequence and make a double-strand break at that site, which can result in gene disruption, gene insertion, gene correction, or chromosomal rearrangements. The target sequence complexities of these programmable nucleases are higher than 3.2 mega base pairs, the size of the haploid human genome. Here, we briefly introduce the structure of the human genome and the characteristics of each programmable nuclease, and review their applications in human cells including pluripotent stem cells. In addition, we discuss various delivery methods for nucleases, programmable nickases, and enrichment of gene-edited human cells, all of which facilitate efficient and precise genome editing in human cells.
A multicolor panel of TALE-KRAB based transcriptional repressor vectors enabling knockdown of multiple gene targets

PubMed Central

Zhang, Zhonghui; Wu, Elise; Qian, Zhijian; Wu, Wen-Shu

2014-01-01

Stable and efficient knockdown of multiple gene targets is highly desirable for dissection of molecular pathways. Because it allows sequence-specific DNA binding, transcription activator-like effector (TALE) offers a new genetic perturbation technique that allows for gene-specific repression. Here, we constructed a multicolor lentiviral TALE-Kruppel-associated box (KRAB) expression vector platform that enables knockdown of multiple gene targets. This platform is fully compatible with the Golden Gate TALEN and TAL Effector Kit 2.0, a widely used and efficient method for TALE assembly. We showed that this multicolor TALE-KRAB vector system when combined together with bone marrow transplantation could quickly knock down c-kit and PU.1 genes in hematopoietic stem and progenitor cells of recipient mice. Furthermore, our data demonstrated that this platform simultaneously knocked down both c-Kit and PU.1 genes in the same primary cell populations. Together, our results suggest that this multicolor TALE-KRAB vector platform is a promising and versatile tool for knockdown of multiple gene targets and could greatly facilitate dissection of molecular pathways. PMID:25475013
A multicolor panel of TALE-KRAB based transcriptional repressor vectors enabling knockdown of multiple gene targets.

PubMed

Zhang, Zhonghui; Wu, Elise; Qian, Zhijian; Wu, Wen-Shu

2014-12-05

Stable and efficient knockdown of multiple gene targets is highly desirable for dissection of molecular pathways. Because it allows sequence-specific DNA binding, transcription activator-like effector (TALE) offers a new genetic perturbation technique that allows for gene-specific repression. Here, we constructed a multicolor lentiviral TALE-Kruppel-associated box (KRAB) expression vector platform that enables knockdown of multiple gene targets. This platform is fully compatible with the Golden Gate TALEN and TAL Effector Kit 2.0, a widely used and efficient method for TALE assembly. We showed that this multicolor TALE-KRAB vector system when combined together with bone marrow transplantation could quickly knock down c-kit and PU.1 genes in hematopoietic stem and progenitor cells of recipient mice. Furthermore, our data demonstrated that this platform simultaneously knocked down both c-Kit and PU.1 genes in the same primary cell populations. Together, our results suggest that this multicolor TALE-KRAB vector platform is a promising and versatile tool for knockdown of multiple gene targets and could greatly facilitate dissection of molecular pathways.
A Simple and Efficient Method for Assembling TALE Protein Based on Plasmid Library

PubMed Central

Xu, Huarong; Xin, Ying; Zhang, Tingting; Ma, Lixia; Wang, Xin; Chen, Zhilong; Zhang, Zhiying

2013-01-01

DNA binding domain of the transcription activator-like effectors (TALEs) from Xanthomonas sp. consists of tandem repeats that can be rearranged according to a simple cipher to target new DNA sequences with high DNA-binding specificity. This technology has been successfully applied in varieties of species for genome engineering. However, assembling long TALE tandem repeats remains a big challenge precluding wide use of this technology. Although several new methodologies for efficiently assembling TALE repeats have been recently reported, all of them require either sophisticated facilities or skilled technicians to carry them out. Here, we described a simple and efficient method for generating customized TALE nucleases (TALENs) and TALE transcription factors (TALE-TFs) based on TALE repeat tetramer library. A tetramer library consisting of 256 tetramers covers all possible combinations of 4 base pairs. A set of unique primers was designed for amplification of these tetramers. PCR products were assembled by one step of digestion/ligation reaction. 12 TALE constructs including 4 TALEN pairs targeted to mouse Gt(ROSA)26Sor gene and mouse Mstn gene sequences as well as 4 TALE-TF constructs targeted to mouse Oct4, c-Myc, Klf4 and Sox2 gene promoter sequences were generated by using our method. The construction routines took 3 days and parallel constructions were available. The rate of positive clones during colony PCR verification was 64% on average. Sequencing results suggested that all TALE constructs were performed with high successful rate. This is a rapid and cost-efficient method using the most common enzymes and facilities with a high success rate. PMID:23840477
A simple and efficient method for assembling TALE protein based on plasmid library.

PubMed

Zhang, Zhiqiang; Li, Duo; Xu, Huarong; Xin, Ying; Zhang, Tingting; Ma, Lixia; Wang, Xin; Chen, Zhilong; Zhang, Zhiying

2013-01-01

DNA binding domain of the transcription activator-like effectors (TALEs) from Xanthomonas sp. consists of tandem repeats that can be rearranged according to a simple cipher to target new DNA sequences with high DNA-binding specificity. This technology has been successfully applied in varieties of species for genome engineering. However, assembling long TALE tandem repeats remains a big challenge precluding wide use of this technology. Although several new methodologies for efficiently assembling TALE repeats have been recently reported, all of them require either sophisticated facilities or skilled technicians to carry them out. Here, we described a simple and efficient method for generating customized TALE nucleases (TALENs) and TALE transcription factors (TALE-TFs) based on TALE repeat tetramer library. A tetramer library consisting of 256 tetramers covers all possible combinations of 4 base pairs. A set of unique primers was designed for amplification of these tetramers. PCR products were assembled by one step of digestion/ligation reaction. 12 TALE constructs including 4 TALEN pairs targeted to mouse Gt(ROSA)26Sor gene and mouse Mstn gene sequences as well as 4 TALE-TF constructs targeted to mouse Oct4, c-Myc, Klf4 and Sox2 gene promoter sequences were generated by using our method. The construction routines took 3 days and parallel constructions were available. The rate of positive clones during colony PCR verification was 64% on average. Sequencing results suggested that all TALE constructs were performed with high successful rate. This is a rapid and cost-efficient method using the most common enzymes and facilities with a high success rate.
The Sequences of 1504 Mutants in the Model Rice Variety Kitaake Facilitate Rapid Functional Genomic Studies

PubMed Central

Pham, Nikki T.; Wei, Tong; Schackwitz, Wendy S.; Lipzen, Anna M.; Duong, Phat Q.; Jones, Kyle C.; Ruan, Deling; Bauer, Diane; Peng, Yi; Schmutz, Jeremy

2017-01-01

The availability of a whole-genome sequenced mutant population and the cataloging of mutations of each line at a single-nucleotide resolution facilitate functional genomic analysis. To this end, we generated and sequenced a fast-neutron-induced mutant population in the model rice cultivar Kitaake (Oryza sativa ssp japonica), which completes its life cycle in 9 weeks. We sequenced 1504 mutant lines at 45-fold coverage and identified 91,513 mutations affecting 32,307 genes, i.e., 58% of all rice genes. We detected an average of 61 mutations per line. Mutation types include single-base substitutions, deletions, insertions, inversions, translocations, and tandem duplications. We observed a high proportion of loss-of-function mutations. We identified an inversion affecting a single gene as the causative mutation for the short-grain phenotype in one mutant line. This result reveals the usefulness of the resource for efficient, cost-effective identification of genes conferring specific phenotypes. To facilitate public access to this genetic resource, we established an open access database called KitBase that provides access to sequence data and seed stocks. This population complements other available mutant collections and gene-editing technologies. This work demonstrates how inexpensive next-generation sequencing can be applied to generate a high-density catalog of mutations. PMID:28576844
Flow cytometric purification of Colletotrichum higginsianum biotrophic hyphae from Arabidopsis leaves for stage-specific transcriptome analysis.

PubMed

Takahara, Hiroyuki; Dolf, Andreas; Endl, Elmar; O'Connell, Richard

2009-08-01

Generation of stage-specific cDNA libraries is a powerful approach to identify pathogen genes that are differentially expressed during plant infection. Biotrophic pathogens develop specialized infection structures inside living plant cells, but sampling the transcriptome of these structures is problematic due to the low ratio of fungal to plant RNA, and the lack of efficient methods to isolate them from infected plants. Here we established a method, based on fluorescence-activated cell sorting (FACS), to purify the intracellular biotrophic hyphae of Colletotrichum higginsianum from homogenates of infected Arabidopsis leaves. Specific selection of viable hyphae using a fluorescent vital marker provided intact RNA for cDNA library construction. Pilot-scale sequencing showed that the library was enriched with plant-induced and pathogenicity-related fungal genes, including some encoding small, soluble secreted proteins that represent candidate fungal effectors. The high purity of the hyphae (94%) prevented contamination of the library by sequences derived from host cells or other fungal cell types. RT-PCR confirmed that genes identified in the FACS-purified hyphae were also expressed in planta. The method has wide applicability for isolating the infection structures of other plant pathogens, and will facilitate cell-specific transcriptome analysis via deep sequencing and microarray hybridization, as well as proteomic analyses.
Alignment-free genome tree inference by learning group-specific distance metrics.

PubMed

Patil, Kaustubh R; McHardy, Alice C

2013-01-01

Understanding the evolutionary relationships between organisms is vital for their in-depth study. Gene-based methods are often used to infer such relationships, which are not without drawbacks. One can now attempt to use genome-scale information, because of the ever increasing number of genomes available. This opportunity also presents a challenge in terms of computational efficiency. Two fundamentally different methods are often employed for sequence comparisons, namely alignment-based and alignment-free methods. Alignment-free methods rely on the genome signature concept and provide a computationally efficient way that is also applicable to nonhomologous sequences. The genome signature contains evolutionary signal as it is more similar for closely related organisms than for distantly related ones. We used genome-scale sequence information to infer taxonomic distances between organisms without additional information such as gene annotations. We propose a method to improve genome tree inference by learning specific distance metrics over the genome signature for groups of organisms with similar phylogenetic, genomic, or ecological properties. Specifically, our method learns a Mahalanobis metric for a set of genomes and a reference taxonomy to guide the learning process. By applying this method to more than a thousand prokaryotic genomes, we showed that, indeed, better distance metrics could be learned for most of the 18 groups of organisms tested here. Once a group-specific metric is available, it can be used to estimate the taxonomic distances for other sequenced organisms from the group. This study also presents a large scale comparison between 10 methods--9 alignment-free and 1 alignment-based.
Regulation of ecmF gene expression and genetic hierarchy among STATa, CudA, and MybC on several prestalk A-specific gene expressions in Dictyostelium.

PubMed

Saga, Yukika; Inamura, Tomoka; Shimada, Nao; Kawata, Takefumi

2016-05-01

STATa, a Dictyostelium homologue of metazoan signal transducer and activator of transcription, is important for the organizer function in the tip region of the migrating Dictyostelium slug. We previously showed that ecmF gene expression depends on STATa in prestalk A (pstA) cells, where STATa is activated. Deletion and site-directed mutagenesis analysis of the ecmF/lacZ fusion gene in wild-type and STATa null strains identified an imperfect inverted repeat sequence, ACAAATANTATTTGT, as a STATa-responsive element. An upstream sequence element was required for efficient expression in the rear region of pstA zone; an element downstream of the inverted repeat was necessary for sufficient prestalk expression during culmination. Band shift analyses using purified STATa protein detected no sequence-specific binding to those ecmF elements. The only verified upregulated target gene of STATa is cudA gene; CudA directly activates expL7 gene expression in prestalk cells. However, ecmF gene expression was almost unaffected in a cudA null mutant. Several previously reported putative STATa target genes were also expressed in cudA null mutant but were downregulated in STATa null mutant. Moreover, mybC, which encodes another transcription factor, belonged to this category, and ecmF expression was downregulated in a mybC null mutant. These findings demonstrate the existence of a genetic hierarchy for pstA-specific genes, which can be classified into two distinct STATa downstream pathways, CudA dependent and independent. The ecmF expression is indirectly upregulated by STATa in a CudA-independent activation manner but dependent on MybC, whose expression is positively regulated by STATa. © 2016 Japanese Society of Developmental Biologists.

Transcriptome analysis of Pseudomonas syringae identifies new genes, ncRNAs, and antisense activity

USDA-ARS?s Scientific Manuscript database

To fully understand how bacteria respond to their environment, it is essential to assess genome-wide transcriptional activity. New high throughput sequencing technologies make it possible to query the transcriptome of an organism in an efficient unbiased manner. We applied a strand-specific method t...
Minimal doses of a sequence-optimized transgene mediate high-level and long-term EPO expression in vivo: challenging CpG-free gene design.

PubMed

Kosovac, D; Wild, J; Ludwig, C; Meissner, S; Bauer, A P; Wagner, R

2011-02-01

Advanced gene delivery techniques can be combined with rational gene design to further improve the efficiency of plasmid DNA (pDNA)-mediated transgene expression in vivo. Herein, we analyzed the influence of intragenic sequence modifications on transgene expression in vitro and in vivo using murine erythropoietin (mEPO) as a transgene model. A single electro-gene transfer of an RNA- and codon-optimized mEPOopt gene into skeletal muscle resulted in a 3- to 4-fold increase of mEPO production sustained for >1 year and triggered a significant increase in hematocrit and hemoglobin without causing adverse effects. mEPO expression and hematologic levels were significantly lower when using comparable amounts of the wild type (mEPOwt) gene and only marginal effects were induced by mEPOΔCpG lacking intragenic CpG dinucleotides, even at high pDNA amounts. Corresponding with these observations, in vitro analysis of transfected cells revealed a 2- to 3-fold increased (mEPOopt) and 50% decreased (mEPOΔCpG) erythropoietin expression compared with mEPOwt, respectively. RNA analyses demonstrated that the specific design of the transgene sequence influenced expression levels by modulating transcriptional activity and nuclear plus cytoplasmic RNA amounts rather than translation. In sum, whereas CpG depletion negatively interferes with efficient expression in postmitotic tissues, mEPOopt doses <0.5 μg were sufficient to trigger optimal long-term hematologic effects encouraging the use of sequence-optimized transgenes to further reduce effective pDNA amounts.
Unique core genomes of the bacterial family vibrionaceae: insights into niche adaptation and speciation.

PubMed

Kahlke, Tim; Goesmann, Alexander; Hjerde, Erik; Willassen, Nils Peder; Haugen, Peik

2012-05-10

The criteria for defining bacterial species and even the concept of bacterial species itself are under debate, and the discussion is apparently intensifying as more genome sequence data is becoming available. However, it is still unclear how the new advances in genomics should be used most efficiently to address this question. In this study we identify genes that are common to any group of genomes in our dataset, to determine whether genes specific to a particular taxon exist and to investigate their potential role in adaptation of bacteria to their specific niche. These genes were named unique core genes. Additionally, we investigate the existence and importance of unique core genes that are found in isolates of phylogenetically non-coherent groups. These groups of isolates, that share a genetic feature without sharing a closest common ancestor, are termed genophyletic groups. The bacterial family Vibrionaceae was used as the model, and we compiled and compared genome sequences of 64 different isolates. Using the software orthoMCL we determined clusters of homologous genes among the investigated genome sequences. We used multilocus sequence analysis to build a host phylogeny and mapped the numbers of unique core genes of all distinct groups of isolates onto the tree. The results show that unique core genes are more likely to be found in monophyletic groups of isolates. Genophyletic groups of isolates, in contrast, are less common especially for large groups of isolate. The subsequent annotation of unique core genes that are present in genophyletic groups indicate a high degree of horizontally transferred genes. Finally, the annotation of the unique core genes of Vibrio cholerae revealed genes involved in aerotaxis and biosynthesis of the iron-chelator vibriobactin. The presented work indicates that genes specific for any taxon inside the bacterial family Vibrionaceae exist. These unique core genes encode conserved metabolic functions that can shed light on the adaptation of a species to its ecological niche. Additionally, our study suggests that unique core genes can be used to aid classification of bacteria and contribute to a bacterial species definition on a genomic level. Furthermore, these genes may be of importance in clinical diagnostics and drug development.
The genome sequence of the model ascomycete fungus Podospora anserina.

PubMed

Espagne, Eric; Lespinet, Olivier; Malagnac, Fabienne; Da Silva, Corinne; Jaillon, Olivier; Porcel, Betina M; Couloux, Arnaud; Aury, Jean-Marc; Ségurens, Béatrice; Poulain, Julie; Anthouard, Véronique; Grossetete, Sandrine; Khalili, Hamid; Coppin, Evelyne; Déquard-Chablat, Michelle; Picard, Marguerite; Contamine, Véronique; Arnaise, Sylvie; Bourdais, Anne; Berteaux-Lecellier, Véronique; Gautheret, Daniel; de Vries, Ronald P; Battaglia, Evy; Coutinho, Pedro M; Danchin, Etienne Gj; Henrissat, Bernard; Khoury, Riyad El; Sainsard-Chanet, Annie; Boivin, Antoine; Pinan-Lucarré, Bérangère; Sellem, Carole H; Debuchy, Robert; Wincker, Patrick; Weissenbach, Jean; Silar, Philippe

2008-01-01

The dung-inhabiting ascomycete fungus Podospora anserina is a model used to study various aspects of eukaryotic and fungal biology, such as ageing, prions and sexual development. We present a 10X draft sequence of P. anserina genome, linked to the sequences of a large expressed sequence tag collection. Similar to higher eukaryotes, the P. anserina transcription/splicing machinery generates numerous non-conventional transcripts. Comparison of the P. anserina genome and orthologous gene set with the one of its close relatives, Neurospora crassa, shows that synteny is poorly conserved, the main result of evolution being gene shuffling in the same chromosome. The P. anserina genome contains fewer repeated sequences and has evolved new genes by duplication since its separation from N. crassa, despite the presence of the repeat induced point mutation mechanism that mutates duplicated sequences. We also provide evidence that frequent gene loss took place in the lineages leading to P. anserina and N. crassa. P. anserina contains a large and highly specialized set of genes involved in utilization of natural carbon sources commonly found in its natural biotope. It includes genes potentially involved in lignin degradation and efficient cellulose breakdown. The features of the P. anserina genome indicate a highly dynamic evolution since the divergence of P. anserina and N. crassa, leading to the ability of the former to use specific complex carbon sources that match its needs in its natural biotope.
Sequence-specific bias correction for RNA-seq data using recurrent neural networks.

PubMed

Zhang, Yao-Zhong; Yamaguchi, Rui; Imoto, Seiya; Miyano, Satoru

2017-01-25

The recent success of deep learning techniques in machine learning and artificial intelligence has stimulated a great deal of interest among bioinformaticians, who now wish to bring the power of deep learning to bare on a host of bioinformatical problems. Deep learning is ideally suited for biological problems that require automatic or hierarchical feature representation for biological data when prior knowledge is limited. In this work, we address the sequence-specific bias correction problem for RNA-seq data redusing Recurrent Neural Networks (RNNs) to model nucleotide sequences without pre-determining sequence structures. The sequence-specific bias of a read is then calculated based on the sequence probabilities estimated by RNNs, and used in the estimation of gene abundance. We explore the application of two popular RNN recurrent units for this task and demonstrate that RNN-based approaches provide a flexible way to model nucleotide sequences without knowledge of predetermined sequence structures. Our experiments show that training a RNN-based nucleotide sequence model is efficient and RNN-based bias correction methods compare well with the-state-of-the-art sequence-specific bias correction method on the commonly used MAQC-III data set. RNNs provides an alternative and flexible way to calculate sequence-specific bias without explicitly pre-determining sequence structures.
Development of Genome Engineering Tools from Plant-Specific PPR Proteins Using Animal Cultured Cells.

PubMed

Kobayashi, Takehito; Yagi, Yusuke; Nakamura, Takahiro

2016-01-01

The pentatricopeptide repeat (PPR) motif is a sequence-specific RNA/DNA-binding module. Elucidation of the RNA/DNA recognition mechanism has enabled engineering of PPR motifs as new RNA/DNA manipulation tools in living cells, including for genome editing. However, the biochemical characteristics of PPR proteins remain unknown, mostly due to the instability and/or unfolding propensities of PPR proteins in heterologous expression systems such as bacteria and yeast. To overcome this issue, we constructed reporter systems using animal cultured cells. The cell-based system has highly attractive features for PPR engineering: robust eukaryotic gene expression; availability of various vectors, reagents, and antibodies; highly efficient DNA delivery ratio (>80 %); and rapid, high-throughput data production. In this chapter, we introduce an example of such reporter systems: a PPR-based sequence-specific translational activation system. The cell-based reporter system can be applied to characterize plant genes of interested and to PPR engineering.
New technology and resources for cryptococcal research

PubMed Central

Zhang, Nannan; Park, Yoon-Dong; Williamson, Peter R.

2014-01-01

Rapid advances in molecular biology and genome sequencing have enabled the generation of new technology and resources for cryptococcal research. RNAi-mediated specific gene knock down has become routine and more efficient by utilizing modified shRNA plasmids and convergent promoter RNAi constructs. This system was recently applied in a high-throughput screen to identify genes involved in host-pathogen interactions. Gene deletion efficiencies have also been improved by increasing rates of homologous recombination through a number of approaches, including a combination of double-joint PCR with split-marker transformation, the use of dominant selectable markers and the introduction of Cre-Loxp systems into Cryptococcus. Moreover, visualization of cryptococcal proteins has become more facile using fusions with codon-optimized fluorescent tags, such as green or red fluorescent proteins or, mCherry. Using recent genome-wide analytical tools, new transcriptional factors and regulatory proteins have been identified in novel virulence-related signaling pathways by employing microarray analysis, RNA-sequencing and proteomic analysis. PMID:25460849
A cDNA from a mouse pancreatic beta cell encoding a putative transcription factor of the insulin gene.

PubMed Central

Walker, M D; Park, C W; Rosen, A; Aronheim, A

1990-01-01

Cell specific expression of the insulin gene is achieved through transcriptional mechanisms operating on multiple DNA sequence elements located in the 5' flanking region of the gene. Of particular importance in the rat insulin I gene are two closely similar 9 bp sequences (IEB1 and IEB2): mutation of either of these leads to 5-10 fold reduction in transcriptional activity. We have screened an expression cDNA library derived from mouse pancreatic endocrine beta cells with a radioactive DNA probe containing multiple copies of the IEB1 sequence. A cDNA clone (A1) isolated by this procedure encodes a protein which shows efficient binding to the IEB1 probe, but much weaker binding to either an unrelated DNA probe or to a probe bearing a single base pair insertion within the recognition sequence. DNA sequence analysis indicates a protein belonging to the helix-loop-helix family of DNA-binding proteins. The ability of the protein encoded by clone A1 to recognize a number of wild type and mutant DNA sequences correlates closely with the ability of each sequence element to support transcription in vivo in the context of the insulin 5' flanking DNA. We conclude that the isolated cDNA may encode a transcription factor that participates in control of insulin gene expression. Images PMID:2181401
Gene-carried hepatoma targeting complex induced high gene transfection efficiency with low toxicity and significant antitumor activity.

PubMed

Zhao, Qing-Qing; Hu, Yu-Lan; Zhou, Yang; Li, Ni; Han, Min; Tang, Gu-Ping; Qiu, Feng; Tabata, Yasuhiko; Gao, Jian-Qing

2012-01-01

The success of gene transfection is largely dependent on the development of a vehicle or vector that can efficiently deliver a gene to cells with minimal toxicity. A liver cancer-targeted specific peptide (FQHPSF sequence) was successfully synthesized and linked with chitosan-linked polyethylenimine (CP) to form a new targeted gene delivery vector called CPT (CP/peptide). The structure of CPT was confirmed by (1)H nuclear magnetic resonance spectroscopy and ultraviolet spectrophotometry. The particle size of CPT/ DNA complexes was measured using laser diffraction spectrometry and the cytotoxicity of the copolymer was evaluated by methylthiazol tetrazolium method. The transfection efficiency evaluation of the CP copolymer was performed using luciferase activity assay. Cellular internalization of the CP/DNA complex was observed under confocal laser scanning microscopy. The targeting specificity of the polymer coupled to peptide was measured by competitive inhibition transfection study. The liver targeting specificity of the CPT copolymer in vivo was demonstrated by combining the copolymer with a therapeutic gene, interleukin-12, and assessed by its abilities in suppressing the growth of ascites tumor in mouse model. The results showed that the liver cancer-targeted specific peptide was successfully synthesized and linked with CP to form a new targeted gene delivery vector called CPT. The composition of CPT was confirmed and the vector showed low cytotoxicity and strong targeting specificity to liver tumors in vitro. The in vivo study results showed that interleukin-12 delivered by the new gene vector CPT/DNA significantly enhanced the antitumor effect on ascites tumor-bearing imprinting control region mice as compared with polyethylenimine (25 kDa), CP, and other controls, which further demonstrate the targeting specificity of the new synthesized polymer. The synthesized CPT copolymer was proven to be an effective liver cancer-targeted vector for therapeutic gene delivery, which could be a potential candidate for targeted cancer gene therapy.
The opportunities and challenges of large-scale molecular approaches to songbird neurobiology

PubMed Central

Mello, C.V.; Clayton, D.F.

2014-01-01

High-through put methods for analyzing genome structure and function are having a large impact in song-bird neurobiology. Methods include genome sequencing and annotation, comparative genomics, DNA microarrays and transcriptomics, and the development of a brain atlas of gene expression. Key emerging findings include the identification of complex transcriptional programs active during singing, the robust brain expression of non-coding RNAs, evidence of profound variations in gene expression across brain regions, and the identification of molecular specializations within song production and learning circuits. Current challenges include the statistical analysis of large datasets, effective genome curations, the efficient localization of gene expression changes to specific neuronal circuits and cells, and the dissection of behavioral and environmental factors that influence brain gene expression. The field requires efficient methods for comparisons with organisms like chicken, which offer important anatomical, functional and behavioral contrasts. As sequencing costs plummet, opportunities emerge for comparative approaches that may help reveal evolutionary transitions contributing to vocal learning, social behavior and other properties that make songbirds such compelling research subjects. PMID:25280907
Unbiased Combinatorial Genomic Approaches to Identify Alternative Therapeutic Targets within the TSC Signaling Network

DTIC Science & Technology

2014-06-01

Specifically, we combined the CRISPR genome editing system with a novel approach allowing efficient single cell cloning of Drosophila cells with the aim of...and culture these to produce cultures completely lacking wildtype sequence at the target locus. No robust methods existed to clone single Drosophila ...targeting all kinases and phosphatases (563 genes) in the Drosophila genome . 65 samples that displayed synthetic lethality (15 genes) or synthetic
Tissue-specifically regulated site-specific excision of selectable marker genes in bivalent insecticidal, genetically-modified rice.

PubMed

Hu, Zhan; Ding, Xuezhi; Hu, Shengbiao; Sun, Yunjun; Xia, Liqiu

2013-12-01

Marker-free, genetically-modified rice was created by the tissue-specifically regulated Cre/loxP system, in which the Cre recombinase gene and hygromycin phosphotransferase gene (hpt) were flanked by two directly oriented loxP sites. Cre expression was activated by the tissue-specific promoter OsMADS45 in flower or napin in seed, resulting in simultaneous excision of the recombinase and marker genes. Segregation of T1 progeny was performed to select recombined plants. The excision was confirmed by PCR, Southern blot and sequence analyses indicating that efficiency varied from 10 to 53 % for OsMADS45 and from 12 to 36 % for napin. The expression of cry1Ac and vip3A was detected by RT-PCR analysis in marker-free transgenic rice. These results suggested that our tissue-specifically regulated Cre/loxP system could auto-excise marker genes from transgenic rice and alleviate public concerns about the security of GM crops.
Improved PCR assay for the specific detection and quantitation of Escherichia coli serotype O157 in water.

PubMed

Cho, Min Seok; Joh, Kiseong; Ahn, Tae-Young; Park, Dong Suk

2014-09-01

Escherichia coli serotype O157 is still a major global healthcare problem. However, only limited information is now available on the molecular and serological detection of pathogenic bacteria. Therefore, the development of appropriate strategies for their rapid identification and monitoring is still needed. In general, the sequence analysis based on stx, slt, eae, hlyA, rfb, and fliCh7 genes is widely employed for the identification of E. coli serotype O157; but there have been critical defects in the diagnosis and identification of E. coli serotype O157, in that they are also present in other E. coli serogroups. In this study, NCBI-BLAST searches using the nucleotide sequences of the putative regulatory protein gene from E. coli O157:H7 str. Sakai found sequence difference at the serotype level. The specific primers from the putative regulatory protein gene were designed and investigated for their sensitivity and specificity for detecting the pathogen in environment water samples. The specificity of the primer set was evaluated using genomic DNA from 8 isolates of E. coli serotype O157 and 32 other reference strains. In addition, the sensitivity and specificity of this assay were confirmed by successful identification of E. coli serotype O157 in environmental water samples. In conclusion, this study showed that the newly developed quantitative serotype-specific PCR method is a highly specific and efficient tool for the surveillance and rapid detection of high-risk E. coli serotype O157.
miRNA-embedded shRNAs for Lineage-specific BCL11A Knockdown and Hemoglobin F Induction

PubMed Central

Guda, Swaroopa; Brendel, Christian; Renella, Raffaele; Du, Peng; Bauer, Daniel E; Canver, Matthew C; Grenier, Jennifer K; Grimson, Andrew W; Kamran, Sophia C; Thornton, James; de Boer, Helen; Root, David E; Milsom, Michael D; Orkin, Stuart H; Gregory, Richard I; Williams, David A

2015-01-01

RNA interference (RNAi) technology using short hairpin RNAs (shRNAs) expressed via RNA polymerase (pol) III promoters has been widely exploited to modulate gene expression in a variety of mammalian cell types. For certain applications, such as lineage-specific knockdown, embedding targeting sequences into pol II-driven microRNA (miRNA) architecture is required. Here, using the potential therapeutic target BCL11A, we demonstrate that pol III-driven shRNAs lead to significantly increased knockdown but also increased cytotoxcity in comparison to pol II-driven miRNA adapted shRNAs (shRNAmiR) in multiple hematopoietic cell lines. We show that the two expression systems yield mature guide strand sequences that differ by a 4 bp shift. This results in alternate seed sequences and consequently influences the efficacy of target gene knockdown. Incorporating a corresponding 4 bp shift into the guide strand of shRNAmiRs resulted in improved knockdown efficiency of BCL11A. This was associated with a significant de-repression of the hemoglobin target of BCL11A, human γ-globin or the murine homolog Hbb-y. Our results suggest the requirement for optimization of shRNA sequences upon incorporation into a miRNA backbone. These findings have important implications in future design of shRNAmiRs for RNAi-based therapy in hemoglobinopathies and other diseases requiring lineage-specific expression of gene silencing sequences. PMID:26080908
A gene-specific non-enhancer sequence is critical for expression from the promoter of the small heat shock protein gene αB-crystallin

PubMed Central

2014-01-01

Background Deciphering of the information content of eukaryotic promoters has remained confined to universal landmarks and conserved sequence elements such as enhancers and transcription factor binding motifs, which are considered sufficient for gene activation and regulation. Gene-specific sequences, interspersed between the canonical transacting factor binding sites or adjoining them within a promoter, are generally taken to be devoid of any regulatory information and have therefore been largely ignored. An unanswered question therefore is, do gene-specific sequences within a eukaryotic promoter have a role in gene activation? Here, we present an exhaustive experimental analysis of a gene-specific sequence adjoining the heat shock element (HSE) in the proximal promoter of the small heat shock protein gene, αB-crystallin (cryab). These sequences are highly conserved between the rodents and the humans. Results Using human retinal pigment epithelial cells in culture as the host, we have identified a 10-bp gene-specific promoter sequence (GPS), which, unlike an enhancer, controls expression from the promoter of this gene, only when in appropriate position and orientation. Notably, the data suggests that GPS in comparison with the HSE works in a context-independent fashion. Additionally, when moved upstream, about a nucleosome length of DNA (−154 bp) from the transcription start site (TSS), the activity of the promoter is markedly inhibited, suggesting its involvement in local promoter access. Importantly, we demonstrate that deletion of the GPS results in complete loss of cryab promoter activity in transgenic mice. Conclusions These data suggest that gene-specific sequences such as the GPS, identified here, may have critical roles in regulating gene-specific activity from eukaryotic promoters. PMID:24589182
Genotypic, Phenotypic and Clinical Validation of GeneXpert in Extra-Pulmonary and Pulmonary Tuberculosis in India

PubMed Central

Singh, Urvashi B.; Pandey, Pooja; Mehta, Girija; Bhatnagar, Anuj K.; Mohan, Anant; Goyal, Vinay; Ahuja, Vineet; Ramachandran, Ranjani; Sachdeva, Kuldeep S.; Samantaray, Jyotish C.

2016-01-01

Background Newer molecular diagnostics have brought paradigm shift in early diagnosis of tuberculosis [TB]. WHO recommended use of GeneXpert MTB/RIF [Xpert] for Extra-pulmonary [EP] TB; critics have since questioned its efficiency. Methods The present study was designed to assess the performance of GeneXpert in 761 extra-pulmonary and 384 pulmonary specimens from patients clinically suspected of TB and compare with Phenotypic, Genotypic and Composite reference standards [CRS]. Results Comparison of GeneXpert results to CRS, demonstrated sensitivity of 100% and 90.68%, specificity of 100% and 99.62% for pulmonary and extra-pulmonary samples. On comparison with culture, sensitivity for Rifampicin [Rif] resistance detection was 87.5% and 81.82% respectively, while specificity was 100% for both pulmonary and extra-pulmonary TB. On comparison to sequencing of rpoB gene [Rif resistance determining region, RRDR], sensitivity was respectively 93.33% and 90% while specificity was 100% in both pulmonary and extra-pulmonary TB. GeneXpert assay missed 533CCG mutation in one sputum and dual mutation [517 & 519] in one pus sample, detected by sequencing. Sequencing picked dual mutation [529, 530] in a sputum sample sensitive to Rif, demonstrating, not all RRDR mutations lead to resistance. Conclusions Current study reports observations in a patient care setting in a high burden region, from a large collection of pulmonary and extra-pulmonary samples and puts to rest questions regarding sensitivity, specificity, detection of infrequent mutations and mutations responsible for low-level Rif resistance by GeneXpert. Improvements in the assay could offer further improvement in sensitivity of detection in different patient samples; nevertheless it may be difficult to improve sensitivity of Rif resistance detection if only one gene is targeted. Assay specificity was high both for TB detection and Rif resistance detection. Despite a few misses, the assay offers major boost to early diagnosis of TB and MDR-TB, in difficult to diagnose pauci-bacillary TB. PMID:26894283
SigEMD: A powerful method for differential gene expression analysis in single-cell RNA sequencing data.

PubMed

Wang, Tianyu; Nabavi, Sheida

2018-04-24

Differential gene expression analysis is one of the significant efforts in single cell RNA sequencing (scRNAseq) analysis to discover the specific changes in expression levels of individual cell types. Since scRNAseq exhibits multimodality, large amounts of zero counts, and sparsity, it is different from the traditional bulk RNA sequencing (RNAseq) data. The new challenges of scRNAseq data promote the development of new methods for identifying differentially expressed (DE) genes. In this study, we proposed a new method, SigEMD, that combines a data imputation approach, a logistic regression model and a nonparametric method based on the Earth Mover's Distance, to precisely and efficiently identify DE genes in scRNAseq data. The regression model and data imputation are used to reduce the impact of large amounts of zero counts, and the nonparametric method is used to improve the sensitivity of detecting DE genes from multimodal scRNAseq data. By additionally employing gene interaction network information to adjust the final states of DE genes, we further reduce the false positives of calling DE genes. We used simulated datasets and real datasets to evaluate the detection accuracy of the proposed method and to compare its performance with those of other differential expression analysis methods. Results indicate that the proposed method has an overall powerful performance in terms of precision in detection, sensitivity, and specificity. Copyright © 2018 Elsevier Inc. All rights reserved.
High-efficiency transformation of Pichia stipitis based on its URA3 gene and a homologous autonomous replication sequence, ARS2.

PubMed Central

Yang, V W; Marks, J A; Davis, B P; Jeffries, T W

1994-01-01

This paper describes the first high-efficiency transformation system for the xylose-fermenting yeast Pichia stipitis. The system includes integrating and autonomously replicating plasmids based on the gene for orotidine-5'-phosphate decarboxylase (URA3) and an autonomous replicating sequence (ARS) element (ARS2) isolated from P. stipitis CBS 6054. Ura- auxotrophs were obtained by selecting for resistance to 5-fluoroorotic acid and were identified as ura3 mutants by transformation with P. stipitis URA3. P. stipitis URA3 was cloned by its homology to Saccharomyces cerevisiae URA3, with which it is 69% identical in the coding region. P. stipitis ARS elements were cloned functionally through plasmid rescue. These sequences confer autonomous replication when cloned into vectors bearing the P. stipitis URA3 gene. P. stipitis ARS2 has features similar to those of the consensus ARS of S. cerevisiae and other ARS elements. Circular plasmids bearing the P. stipitis URA3 gene with various amounts of flanking sequences produced 600 to 8,600 Ura+ transformants per micrograms of DNA by electroporation. Most transformants obtained with circular vectors arose without integration of vector sequences. One vector yielded 5,200 to 12,500 Ura+ transformants per micrograms of DNA after it was linearized at various restriction enzyme sites within the P. stipitis URA3 insert. Transformants arising from linearized vectors produced stable integrants, and integration events were site specific for the genomic ura3 in 20% of the transformants examined. Plasmids bearing the P. stipitis URA3 gene and ARS2 element produced more than 30,000 transformants per micrograms of plasmid DNA. Autonomously replicating plasmids were stable for at least 50 generations in selection medium and were present at an average of 10 copies per nucleus. Images PMID:7811063
Codon-Anticodon Recognition in the Bacillus subtilis glyQS T Box Riboswitch

PubMed Central

Caserta, Enrico; Liu, Liang-Chun; Grundy, Frank J.; Henkin, Tina M.

2015-01-01

Many amino acid-related genes in Gram-positive bacteria are regulated by the T box riboswitch. The leader RNA of genes in the T box family controls the expression of downstream genes by monitoring the aminoacylation status of the cognate tRNA. Previous studies identified a three-nucleotide codon, termed the “Specifier Sequence,” in the riboswitch that corresponds to the amino acid identity of the downstream genes. Pairing of the Specifier Sequence with the anticodon of the cognate tRNA is the primary determinant of specific tRNA recognition. This interaction mimics codon-anticodon pairing in translation but occurs in the absence of the ribosome. The goal of the current study was to determine the effect of a full range of mismatches for comparison with codon recognition in translation. Mutations were individually introduced into the Specifier Sequence of the glyQS leader RNA and tRNAGly anticodon to test the effect of all possible pairing combinations on tRNA binding affinity and antitermination efficiency. The functional role of the conserved purine 3′ of the Specifier Sequence was also verifiedin this study. We found that substitutions at the Specifier Sequence resulted in reduced binding, the magnitude of which correlates well with the predicted stability of the RNA-RNA pairing. However, the tolerance for specific mismatches in antitermination was generally different from that during decoding, which reveals a unique tRNA recognition pattern in the T box antitermination system. PMID:26229106
DOE Office of Scientific and Technical Information (OSTI.GOV)

Li, Guotian; Jain, Rashmi; Chern, Mawsheng

The availability of a whole-genome sequenced mutant population and the cataloging of mutations of each line at a single-nucleotide resolution facilitate functional genomic analysis. To this end, we generated and sequenced a fast-neutron-induced mutant population in the model rice cultivar Kitaake (Oryza sativa ssp japonica), which completes its life cycle in 9 weeks. We sequenced 1504 mutant lines at 45-fold coverage and identified 91,513 mutations affecting 32,307 genes, i.e., 58% of all rice genes. We detected an average of 61 mutations per line. Mutation types include single-base substitutions, deletions, insertions, inversions, translocations, and tandem duplications. We observed a high proportionmore » of loss-of-function mutations. We identified an inversion affecting a single gene as the causative mutation for the short-grain phenotype in one mutant line. This result reveals the usefulness of the resource for efficient, cost-effective identification of genes conferring specific phenotypes. To facilitate public access to this genetic resource, we established an open access database called KitBase that provides access to sequence data and seed stocks. This population complements other available mutant collections and gene-editing technologies. In conclusion, this work demonstrates how inexpensive next-generation sequencing can be applied to generate a high-density catalog of mutations.« less

Mapping of a Novel Race Specific Resistance Gene to Phytophthora Root Rot of Pepper (Capsicum annuum) Using Bulked Segregant Analysis Combined with Specific Length Amplified Fragment Sequencing Strategy.

PubMed

Xu, Xiaomei; Chao, Juan; Cheng, Xueli; Wang, Rui; Sun, Baojuan; Wang, Hengming; Luo, Shaobo; Xu, Xiaowan; Wu, Tingquan; Li, Ying

2016-01-01

Phytophthora root rot caused by Phytophthora capsici (P. capsici) is a serious limitation to pepper production in Southern China, with high temperature and humidity. Mapping PRR resistance genes can provide linked DNA markers for breeding PRR resistant varieties by molecular marker-assisted selection (MAS). Two BC1 populations and an F2 population derived from a cross between P. capsici-resistant accession, Criollo de Morelos 334 (CM334) and P. capsici-susceptible accession, New Mexico Capsicum Accession 10399 (NMCA10399) were used to investigate the genetic characteristics of PRR resistance. PRR resistance to isolate Byl4 (race 3) was controlled by a single dominant gene, PhR10, that was mapped to an interval of 16.39Mb at the end of the long arm of chromosome 10. Integration of bulked segregant analysis (BSA) and Specific Length Amplified Fragment sequencing (SLAF-seq) provided an efficient genetic mapping strategy. Ten polymorphic Simple Sequence Repeat (SSR) markers were found within this region and used to screen the genotypes of 636 BC1 plants, delimiting PhR10 to a 2.57 Mb interval between markers P52-11-21 (1.5 cM away) and P52-11-41 (1.1 cM). A total of 163 genes were annotated within this region and 31 were predicted to be associated with disease resistance. PhR10 is a novel race specific gene for PRR, and this paper describes linked SSR markers suitable for marker-assisted selection of PRR resistant varieties, also laying a foundation for cloning the resistance gene.
DNA context represents transcription regulation of the gene in mouse embryonic stem cells

NASA Astrophysics Data System (ADS)

Ha, Misook; Hong, Soondo

2016-04-01

Understanding gene regulatory information in DNA remains a significant challenge in biomedical research. This study presents a computational approach to infer gene regulatory programs from primary DNA sequences. Using DNA around transcription start sites as attributes, our model predicts gene regulation in the gene. We find that H3K27ac around TSS is an informative descriptor of the transcription program in mouse embryonic stem cells. We build a computational model inferring the cell-type-specific H3K27ac signatures in the DNA around TSS. A comparison of embryonic stem cell and liver cell-specific H3K27ac signatures in DNA shows that the H3K27ac signatures in DNA around TSS efficiently distinguish the cell-type specific H3K27ac peaks and the gene regulation. The arrangement of the H3K27ac signatures inferred from the DNA represents the transcription regulation of the gene in mESC. We show that the DNA around transcription start sites is associated with the gene regulatory program by specific interaction with H3K27ac.
DNA context represents transcription regulation of the gene in mouse embryonic stem cells.

PubMed

Ha, Misook; Hong, Soondo

2016-04-14

Understanding gene regulatory information in DNA remains a significant challenge in biomedical research. This study presents a computational approach to infer gene regulatory programs from primary DNA sequences. Using DNA around transcription start sites as attributes, our model predicts gene regulation in the gene. We find that H3K27ac around TSS is an informative descriptor of the transcription program in mouse embryonic stem cells. We build a computational model inferring the cell-type-specific H3K27ac signatures in the DNA around TSS. A comparison of embryonic stem cell and liver cell-specific H3K27ac signatures in DNA shows that the H3K27ac signatures in DNA around TSS efficiently distinguish the cell-type specific H3K27ac peaks and the gene regulation. The arrangement of the H3K27ac signatures inferred from the DNA represents the transcription regulation of the gene in mESC. We show that the DNA around transcription start sites is associated with the gene regulatory program by specific interaction with H3K27ac.
Site-directed mutagenesis in Petunia × hybrida protoplast system using direct delivery of purified recombinant Cas9 ribonucleoproteins.

PubMed

Subburaj, Saminathan; Chung, Sung Jin; Lee, Choongil; Ryu, Seuk-Min; Kim, Duk Hyoung; Kim, Jin-Soo; Bae, Sangsu; Lee, Geung-Joo

2016-07-01

Site-directed mutagenesis of nitrate reductase genes using direct delivery of purified Cas9 protein preassembled with guide RNA produces mutations efficiently in Petunia × hybrida protoplast system. The clustered, regularly interspaced, short palindromic repeat (CRISPR)-CRISPR associated endonuclease 9 (CRISPR/Cas9) system has been recently announced as a powerful molecular breeding tool for site-directed mutagenesis in higher plants. Here, we report a site-directed mutagenesis method targeting Petunia nitrate reductase (NR) gene locus. This method could create mutations efficiently using direct delivery of purified Cas9 protein and single guide RNA (sgRNA) into protoplast cells. After transient introduction of RNA-guided endonuclease (RGEN) ribonucleoproteins (RNPs) with different sgRNAs targeting NR genes, mutagenesis at the targeted loci was detected by T7E1 assay and confirmed by targeted deep sequencing. T7E1 assay showed that RGEN RNPs induced site-specific mutations at frequencies ranging from 2.4 to 21 % at four different sites (NR1, 2, 4 and 6) in the PhNR gene locus with average mutation efficiency of 14.9 ± 2.2 %. Targeted deep DNA sequencing revealed mutation rates of 5.3-17.8 % with average mutation rate of 11.5 ± 2 % at the same NR gene target sites in DNA fragments of analyzed protoplast transfectants. Further analysis from targeted deep sequencing showed that the average ratio of deletion to insertion produced collectively by the four NR-RGEN target sites (NR1, 2, 4, and 6) was about 63:37. Our results demonstrated that direct delivery of RGEN RNPs into protoplast cells of Petunia can be exploited as an efficient tool for site-directed mutagenesis of genes or genome editing in plant systems.
Biosynthesis of riboflavin: an unusual riboflavin synthase of Methanobacterium thermoautotrophicum.

PubMed Central

Eberhardt, S; Korn, S; Lottspeich, F; Bacher, A

1997-01-01

Riboflavin synthase was purified by a factor of about 1,500 from cell extract of Methanobacterium thermoautotrophicum. The enzyme had a specific activity of about 2,700 nmol mg(-1) h(-1) at 65 degrees C, which is relatively low compared to those of riboflavin synthases of eubacteria and yeast. Amino acid sequences obtained after proteolytic cleavage had no similarity with known riboflavin synthases. The gene coding for riboflavin synthase (designated ribC) was subsequently cloned by marker rescue with a ribC mutant of Escherichia coli. The ribC gene of M. thermoautotrophicum specifies a protein of 153 amino acid residues. The predicted amino acid sequence agrees with the information gleaned from Edman degradation of the isolated protein and shows 67% identity with the sequence predicted for the unannotated reading frame MJ1184 of Methanococcus jannaschii. The ribC gene is adjacent to a cluster of four genes with similarity to the genes cbiMNQO of Salmonella typhimurium, which form part of the cob operon (this operon contains most of the genes involved in the biosynthesis of vitamin B12). The amino acid sequence predicted by the ribC gene of M. thermoautotrophicum shows no similarity whatsoever to the sequences of riboflavin synthases of eubacteria and yeast. Most notably, the M. thermoautotrophicum protein does not show the internal sequence homology characteristic of eubacterial and yeast riboflavin synthases. The protein of M. thermoautotrophicum can be expressed efficiently in a recombinant E. coli strain. The specific activity of the purified, recombinant protein is 1,900 nmol mg(-1) h(-1) at 65 degrees C. In contrast to riboflavin synthases from eubacteria and fungi, the methanobacterial enzyme has an absolute requirement for magnesium ions. The 5' phosphate of 6,7-dimethyl-8-ribityllumazine does not act as a substrate. The findings suggest that riboflavin synthase has evolved independently in eubacteria and methanobacteria. PMID:9139911
Efficient introduction of specific homozygous and heterozygous mutations using CRISPR/Cas9.

PubMed

Paquet, Dominik; Kwart, Dylan; Chen, Antonia; Sproul, Andrew; Jacob, Samson; Teo, Shaun; Olsen, Kimberly Moore; Gregg, Andrew; Noggle, Scott; Tessier-Lavigne, Marc

2016-05-05

The bacterial CRISPR/Cas9 system allows sequence-specific gene editing in many organisms and holds promise as a tool to generate models of human diseases, for example, in human pluripotent stem cells. CRISPR/Cas9 introduces targeted double-stranded breaks (DSBs) with high efficiency, which are typically repaired by non-homologous end-joining (NHEJ) resulting in nonspecific insertions, deletions or other mutations (indels). DSBs may also be repaired by homology-directed repair (HDR) using a DNA repair template, such as an introduced single-stranded oligo DNA nucleotide (ssODN), allowing knock-in of specific mutations. Although CRISPR/Cas9 is used extensively to engineer gene knockouts through NHEJ, editing by HDR remains inefficient and can be corrupted by additional indels, preventing its widespread use for modelling genetic disorders through introducing disease-associated mutations. Furthermore, targeted mutational knock-in at single alleles to model diseases caused by heterozygous mutations has not been reported. Here we describe a CRISPR/Cas9-based genome-editing framework that allows selective introduction of mono- and bi-allelic sequence changes with high efficiency and accuracy. We show that HDR accuracy is increased dramatically by incorporating silent CRISPR/Cas-blocking mutations along with pathogenic mutations, and establish a method termed 'CORRECT' for scarless genome editing. By characterizing and exploiting a stereotyped inverse relationship between a mutation's incorporation rate and its distance to the DSB, we achieve predictable control of zygosity. Homozygous introduction requires a guide RNA targeting close to the intended mutation, whereas heterozygous introduction can be accomplished by distance-dependent suboptimal mutation incorporation or by use of mixed repair templates. Using this approach, we generated human induced pluripotent stem cells with heterozygous and homozygous dominant early onset Alzheimer's disease-causing mutations in amyloid precursor protein (APP(Swe)) and presenilin 1 (PSEN1(M146V)) and derived cortical neurons, which displayed genotype-dependent disease-associated phenotypes. Our findings enable efficient introduction of specific sequence changes with CRISPR/Cas9, facilitating study of human disease.
The Sequences of 1504 Mutants in the Model Rice Variety Kitaake Facilitate Rapid Functional Genomic Studies.

PubMed

Li, Guotian; Jain, Rashmi; Chern, Mawsheng; Pham, Nikki T; Martin, Joel A; Wei, Tong; Schackwitz, Wendy S; Lipzen, Anna M; Duong, Phat Q; Jones, Kyle C; Jiang, Liangrong; Ruan, Deling; Bauer, Diane; Peng, Yi; Barry, Kerrie W; Schmutz, Jeremy; Ronald, Pamela C

2017-06-01

The availability of a whole-genome sequenced mutant population and the cataloging of mutations of each line at a single-nucleotide resolution facilitate functional genomic analysis. To this end, we generated and sequenced a fast-neutron-induced mutant population in the model rice cultivar Kitaake ( Oryza sativa ssp japonica ), which completes its life cycle in 9 weeks. We sequenced 1504 mutant lines at 45-fold coverage and identified 91,513 mutations affecting 32,307 genes, i.e., 58% of all rice genes. We detected an average of 61 mutations per line. Mutation types include single-base substitutions, deletions, insertions, inversions, translocations, and tandem duplications. We observed a high proportion of loss-of-function mutations. We identified an inversion affecting a single gene as the causative mutation for the short-grain phenotype in one mutant line. This result reveals the usefulness of the resource for efficient, cost-effective identification of genes conferring specific phenotypes. To facilitate public access to this genetic resource, we established an open access database called KitBase that provides access to sequence data and seed stocks. This population complements other available mutant collections and gene-editing technologies. This work demonstrates how inexpensive next-generation sequencing can be applied to generate a high-density catalog of mutations. © 2017 American Society of Plant Biologists. All rights reserved.
The Sequences of 1,504 Mutants in the Model Rice Variety Kitaake Facilitate Rapid Functional Genomic Studies

DOE PAGES

Li, Guotian; Jain, Rashmi; Chern, Mawsheng; ...

2017-06-02

The availability of a whole-genome sequenced mutant population and the cataloging of mutations of each line at a single-nucleotide resolution facilitate functional genomic analysis. To this end, we generated and sequenced a fast-neutron-induced mutant population in the model rice cultivar Kitaake (Oryza sativa ssp japonica), which completes its life cycle in 9 weeks. We sequenced 1504 mutant lines at 45-fold coverage and identified 91,513 mutations affecting 32,307 genes, i.e., 58% of all rice genes. We detected an average of 61 mutations per line. Mutation types include single-base substitutions, deletions, insertions, inversions, translocations, and tandem duplications. We observed a high proportionmore » of loss-of-function mutations. We identified an inversion affecting a single gene as the causative mutation for the short-grain phenotype in one mutant line. This result reveals the usefulness of the resource for efficient, cost-effective identification of genes conferring specific phenotypes. To facilitate public access to this genetic resource, we established an open access database called KitBase that provides access to sequence data and seed stocks. This population complements other available mutant collections and gene-editing technologies. In conclusion, this work demonstrates how inexpensive next-generation sequencing can be applied to generate a high-density catalog of mutations.« less
Resistance gene homologues in Theobroma cacao as useful genetic markers.

PubMed

Kuhn, D N; Heath, M; Wisser, R J; Meerow, A; Brown, J S; Lopes, U; Schnell, R J

2003-07-01

Resistance gene homologue (RGH) sequences have been developed into useful genetic markers for marker-assisted selection (MAS) of disease resistant Theobroma cacao. A plasmid library of amplified fragments was created from seven different cultivars of cacao. Over 600 cloned recombinant amplicons were evaluated. From these, 74 unique RGHs were identified that could be placed into 11 categories based on sequence analysis. Primers specific to each category were designed. The primers specific for a single RGH category amplified fragments of equal length from the seven different cultivars used to create the library. However, these fragments exhibited single-strand conformational polymorphism (SSCP), which allowed us to map six of the RGH categories in an F(2) population of T. cacao. RGHs 1, 4 and 5 were in the same linkage group, with RGH 4 and 5 separated by less than 4 cM. As SSCP can be efficiently performed on our automated sequencer, we have developed a convenient and rapid high throughput assay for RGH alleles.
Molecular identification of the ompL1 gene within Leptospira interrogans standard serovars.

PubMed

Dezhbord, Mehrangiz; Esmaelizad, Majid; Khaki, Pejvak; Fotohi, Fariba; Zarehparvar Moghaddam, Athena

2014-06-11

Leptospirosis, caused by infection with pathogenic Leptospira species, is one of the most prevalent zoonotic diseases in the world. Current leptospiral vaccines are mainly multivalent dead whole-cell mixtures made of several local dominant serovars. Therefore, design and construction of an efficient recombinant vaccine for leptospirosis control is very important. OmpL1 is an immunogenic porin protein that could be of special significance in vaccination and serodiagnosis for leptospirosis. Three strains belonging to pathogenic L. interrogans were analyzed. The specific primers for proliferation of the ompL1 gene were designed. The amplified gene was cloned. In order to investigate the ompL1 nucleotide sequence and homological analysis of this gene, ompL1 genes cloned from standard vaccinal Leptospira serovars prevalent in Iran were sequenced and cloned. PCR amplification of the ompL1 gene using the designed primers resulted in a 963 bp ompL1 gene product. The PCR based on the ompL1 gene detected all pathogenic reference serovars of Leptospira spp. tested. Based on alignment and phylogenetic analysis, although the ompL1 nucleotide sequence was slightly different within three vaccinal serovars (100%-85% identity), amino acid alignment of the OmpL1 proteins revealed that there would be inconsiderable difference among them. The ompL1 gene of the three isolates was well conserved, differing only by a total of 6 bp and the proteins by 2 amino acids. The cloned gene could be further used for expression and recombinant OmpL1 as an efficient and conserved antigen, and may be a useful vaccine candidate against leptospirosis in our region.
Improved efficiency in amplification of Escherichia coli o-antigen gene clusters using genome-wide sequence comparison

USDA-ARS?s Scientific Manuscript database

Background: In many bacteria including E. coli, genes encoding O-antigens are clustered in the chromosome, with a 39-bp JUMPstart sequence and gnd gene located upstream and downstream of the cluster, respectively. For determining the DNA sequence of the E. coli O-antigen gene cluster, one set of P...
Discrimination of the Lactobacillus acidophilus group using sequencing, species-specific PCR and SNaPshot mini-sequencing technology based on the recA gene.

PubMed

Huang, Chien-Hsun; Chang, Mu-Tzu; Huang, Mu-Chiou; Wang, Li-Tin; Huang, Lina; Lee, Fwu-Ling

2012-10-01

To clearly identify specific species and subspecies of the Lactobacillus acidophilus group using phenotypic and genotypic (16S rDNA sequence analysis) techniques alone is difficult. The aim of this study was to use the recA gene for species discrimination in the L. acidophilus group, as well as to develop a species-specific primer and single nucleotide polymorphism primer based on the recA gene sequence for species and subspecies identification. The average sequence similarity for the recA gene among type strains was 80.0%, and most members of the L. acidophilus group could be clearly distinguished. The species-specific primer was designed according to the recA gene sequencing, which was employed for polymerase chain reaction with the template DNA of Lactobacillus strains. A single 231-bp species-specific band was found only in L. delbrueckii. A SNaPshot mini-sequencing assay using recA as a target gene was also developed. The specificity of the mini-sequencing assay was evaluated using 31 strains of L. delbrueckii species and was able to unambiguously discriminate strains belonging to the subspecies L. delbrueckii subsp. bulgaricus. The phylogenetic relationships of most strains in the L. acidophilus group can be resolved using recA gene sequencing, and a novel method to identify the species and subspecies of the L. delbrueckii and L. delbrueckii subsp. bulgaricus was developed by species-specific polymerase chain reaction combined with SNaPshot mini-sequencing. Copyright © 2012 Society of Chemical Industry.
In situ genetic correction of F8 intron 22 inversion in hemophilia A patient-specific iPSCs.

PubMed

Wu, Yong; Hu, Zhiqing; Li, Zhuo; Pang, Jialun; Feng, Mai; Hu, Xuyun; Wang, Xiaolin; Lin-Peng, Siyuan; Liu, Bo; Chen, Fangping; Wu, Lingqian; Liang, Desheng

2016-01-08

Nearly half of severe Hemophilia A (HA) cases are caused by F8 intron 22 inversion (Inv22). This 0.6-Mb inversion splits the 186-kb F8 into two parts with opposite transcription directions. The inverted 5' part (141 kb) preserves the first 22 exons that are driven by the intrinsic F8 promoter, leading to a truncated F8 transcript due to the lack of the last 627 bp coding sequence of exons 23-26. Here we describe an in situ genetic correction of Inv22 in patient-specific induced pluripotent stem cells (iPSCs). By using TALENs, the 627 bp sequence plus a polyA signal was precisely targeted at the junction of exon 22 and intron 22 via homologous recombination (HR) with high targeting efficiencies of 62.5% and 52.9%. The gene-corrected iPSCs retained a normal karyotype following removal of drug selection cassette using a Cre-LoxP system. Importantly, both F8 transcription and FVIII secretion were rescued in the candidate cell types for HA gene therapy including endothelial cells (ECs) and mesenchymal stem cells (MSCs) derived from the gene-corrected iPSCs. This is the first report of an efficient in situ genetic correction of the large inversion mutation using a strategy of targeted gene addition.
In situ genetic correction of F8 intron 22 inversion in hemophilia A patient-specific iPSCs

PubMed Central

Wu, Yong; Hu, Zhiqing; Li, Zhuo; Pang, Jialun; Feng, Mai; Hu, Xuyun; Wang, Xiaolin; Lin-Peng, Siyuan; Liu, Bo; Chen, Fangping; Wu, Lingqian; Liang, Desheng

2016-01-01

Nearly half of severe Hemophilia A (HA) cases are caused by F8 intron 22 inversion (Inv22). This 0.6-Mb inversion splits the 186-kb F8 into two parts with opposite transcription directions. The inverted 5′ part (141 kb) preserves the first 22 exons that are driven by the intrinsic F8 promoter, leading to a truncated F8 transcript due to the lack of the last 627 bp coding sequence of exons 23–26. Here we describe an in situ genetic correction of Inv22 in patient-specific induced pluripotent stem cells (iPSCs). By using TALENs, the 627 bp sequence plus a polyA signal was precisely targeted at the junction of exon 22 and intron 22 via homologous recombination (HR) with high targeting efficiencies of 62.5% and 52.9%. The gene-corrected iPSCs retained a normal karyotype following removal of drug selection cassette using a Cre-LoxP system. Importantly, both F8 transcription and FVIII secretion were rescued in the candidate cell types for HA gene therapy including endothelial cells (ECs) and mesenchymal stem cells (MSCs) derived from the gene-corrected iPSCs. This is the first report of an efficient in situ genetic correction of the large inversion mutation using a strategy of targeted gene addition. PMID:26743572
Function-Based Algorithms for Biological Sequences

ERIC Educational Resources Information Center

Mohanty, Pragyan Sheela P.

2015-01-01

Two problems at two different abstraction levels of computational biology are studied. At the molecular level, efficient pattern matching algorithms in DNA sequences are presented. For gene order data, an efficient data structure is presented capable of storing all gene re-orderings in a systematic manner. A common characteristic of presented…
Simultaneous knockdown of six non-family genes using a single synthetic RNAi fragment in Arabidopsis thaliana

DOE Office of Scientific and Technical Information (OSTI.GOV)

Czarnecki, Olaf; Bryan, Anthony C.; Jawdy, Sara S.

Genetic engineering of plants that results in successful establishment of new biochemical or regulatory pathways requires stable introduction of one or more genes into the plant genome. It might also be necessary to down-regulate or turn off expression of endogenous genes in order to reduce activity of competing pathways. An established way to knockdown gene expression in plants is expressing a hairpin-RNAi construct, eventually leading to degradation of a specifically targeted mRNA. Knockdown of multiple genes that do not share homologous sequences is still challenging and involves either sophisticated cloning strategies to create vectors with different serial expression constructs ormore » multiple transformation events that is often restricted by a lack of available transformation markers. Synthetic RNAi fragments were assembled in yeast carrying homologous sequences to six or seven non-family genes and introduced into pAGRIKOLA. Transformation of Arabidopsis thaliana and subsequent expression analysis of targeted genes proved efficient knockdown of all target genes. In conclusion, we present a simple and cost-effective method to create constructs to simultaneously knockdown multiple non-family genes or genes that do not share sequence homology. The presented method can be applied in plant and animal synthetic biology as well as traditional plant and animal genetic engineering.« less
Simultaneous knockdown of six non-family genes using a single synthetic RNAi fragment in Arabidopsis thaliana

DOE PAGES

Czarnecki, Olaf; Bryan, Anthony C.; Jawdy, Sara S.; ...

2016-02-17

Genetic engineering of plants that results in successful establishment of new biochemical or regulatory pathways requires stable introduction of one or more genes into the plant genome. It might also be necessary to down-regulate or turn off expression of endogenous genes in order to reduce activity of competing pathways. An established way to knockdown gene expression in plants is expressing a hairpin-RNAi construct, eventually leading to degradation of a specifically targeted mRNA. Knockdown of multiple genes that do not share homologous sequences is still challenging and involves either sophisticated cloning strategies to create vectors with different serial expression constructs ormore » multiple transformation events that is often restricted by a lack of available transformation markers. Synthetic RNAi fragments were assembled in yeast carrying homologous sequences to six or seven non-family genes and introduced into pAGRIKOLA. Transformation of Arabidopsis thaliana and subsequent expression analysis of targeted genes proved efficient knockdown of all target genes. In conclusion, we present a simple and cost-effective method to create constructs to simultaneously knockdown multiple non-family genes or genes that do not share sequence homology. The presented method can be applied in plant and animal synthetic biology as well as traditional plant and animal genetic engineering.« less
Process parameter optimization for hydantoinase-mediated synthesis of optically pure carbamoyl amino acids of industrial value using Pseudomonas aeruginosa resting cells.

PubMed

Engineer, Anupama S; Dhakephalkar, Anita P; Gaikaiwari, Raghavendra P; Dhakephalkar, Prashant K

2013-12-01

Hydantoinase-mediated enzymatic synthesis of optically pure carbamoyl amino acids was investigated as an environmentally friendly, energy-efficient alternative to the otherwise energy-intensive, polluting chemical synthesis. Hydantoinase-producing bacterial strain was identified as Pseudomonas aeruginosa by 16S rRNA gene sequencing and biochemical profiling using the BIOLOG Microbial Identification System. Hydantoinase activity was assessed using hydantoin analogs and 5-monosubstituted hydantoins as substrates in a colorimetric assay. The hydantoinase gene was PCR amplified using gene-specific primers and sequenced on an automated gene analyzer. Hydantoinase gene sequence of P. aeruginosa MCM B-887 revealed maximum homology of only 87 % with proven hydantoinase gene sequences in GenBank. MCM B-887 resting cells converted >99 % of substrate into N-carbamoyl amino acids under optimized condition at 42 °C, pH 8.0, and 100 mM substrate concentration in <120 min. Hydantoin hydrolyzing activity was D-selective and included broad substrate profile of 5-methyl hydantoin, 5-phenyl hydantoin, 5-hydroxyphenyl hydantoin, o-chlorophenyl hydantoin, as well as hydantoin analogs such as allantoin, dihydrouracil, etc. MCM B-887 resting cells may thus be suitable for bio-transformations leading to the synthesis of optically pure, unnatural carbamoyl amino acids of industrial importance.
An analysis by metabolic labelling of the encephalomyocarditis virus ribosomal frameshifting efficiency and stimulators.

PubMed

Ling, Roger; Firth, Andrew E

2017-08-01

Programmed -1 ribosomal frameshifting is a mechanism of gene expression whereby specific signals within messenger RNAs direct a proportion of ribosomes to shift -1 nt and continue translating in the new reading frame. Such frameshifting normally depends on an RNA structure stimulator 3'-adjacent to a 'slippery' heptanucleotide shift site sequence. Recently we identified an unusual frameshifting mechanism in encephalomyocarditis virus, where the stimulator involves a trans-acting virus protein. Thus, in contrast to other examples of -1 frameshifting, the efficiency of frameshifting in encephalomyocarditis virus is best studied in the context of virus infection. Here we use metabolic labelling to analyse the frameshifting efficiency of wild-type and mutant viruses. Confirming previous results, frameshifting depends on a G_GUU_UUU shift site sequence and a 3'-adjacent stem-loop structure, but is not appreciably affected by the 'StopGo' sequence present ~30 nt upstream. At late timepoints, frameshifting was estimated to be 46-76 % efficient.
The genome sequence of the model ascomycete fungus Podospora anserina

PubMed Central

Espagne, Eric; Lespinet, Olivier; Malagnac, Fabienne; Da Silva, Corinne; Jaillon, Olivier; Porcel, Betina M; Couloux, Arnaud; Aury, Jean-Marc; Ségurens, Béatrice; Poulain, Julie; Anthouard, Véronique; Grossetete, Sandrine; Khalili, Hamid; Coppin, Evelyne; Déquard-Chablat, Michelle; Picard, Marguerite; Contamine, Véronique; Arnaise, Sylvie; Bourdais, Anne; Berteaux-Lecellier, Véronique; Gautheret, Daniel; de Vries, Ronald P; Battaglia, Evy; Coutinho, Pedro M; Danchin, Etienne GJ; Henrissat, Bernard; Khoury, Riyad EL; Sainsard-Chanet, Annie; Boivin, Antoine; Pinan-Lucarré, Bérangère; Sellem, Carole H; Debuchy, Robert; Wincker, Patrick; Weissenbach, Jean; Silar, Philippe

2008-01-01

Background The dung-inhabiting ascomycete fungus Podospora anserina is a model used to study various aspects of eukaryotic and fungal biology, such as ageing, prions and sexual development. Results We present a 10X draft sequence of P. anserina genome, linked to the sequences of a large expressed sequence tag collection. Similar to higher eukaryotes, the P. anserina transcription/splicing machinery generates numerous non-conventional transcripts. Comparison of the P. anserina genome and orthologous gene set with the one of its close relatives, Neurospora crassa, shows that synteny is poorly conserved, the main result of evolution being gene shuffling in the same chromosome. The P. anserina genome contains fewer repeated sequences and has evolved new genes by duplication since its separation from N. crassa, despite the presence of the repeat induced point mutation mechanism that mutates duplicated sequences. We also provide evidence that frequent gene loss took place in the lineages leading to P. anserina and N. crassa. P. anserina contains a large and highly specialized set of genes involved in utilization of natural carbon sources commonly found in its natural biotope. It includes genes potentially involved in lignin degradation and efficient cellulose breakdown. Conclusion The features of the P. anserina genome indicate a highly dynamic evolution since the divergence of P. anserina and N. crassa, leading to the ability of the former to use specific complex carbon sources that match its needs in its natural biotope. PMID:18460219

Functionally Relevant Microsatellite Markers From Chickpea Transcription Factor Genes for Efficient Genotyping Applications and Trait Association Mapping

PubMed Central

Kujur, Alice; Bajaj, Deepak; Saxena, Maneesha S.; Tripathi, Shailesh; Upadhyaya, Hari D.; Gowda, C.L.L.; Singh, Sube; Jain, Mukesh; Tyagi, Akhilesh K.; Parida, Swarup K.

2013-01-01

We developed 1108 transcription factor gene-derived microsatellite (TFGMS) and 161 transcription factor functional domain-associated microsatellite (TFFDMS) markers from 707 TFs of chickpea. The robust amplification efficiency (96.5%) and high intra-specific polymorphic potential (34%) detected by markers suggest their immense utilities in efficient large-scale genotyping applications, including construction of both physical and functional transcript maps and understanding population structure. Candidate gene-based association analysis revealed strong genetic association of TFFDMS markers with three major seed and pod traits. Further, TFGMS markers in the 5′ untranslated regions of TF genes showing differential expression during seed development had higher trait association potential. The significance of TFFDMS markers was demonstrated by correlating their allelic variation with amino acid sequence expansion/contraction in the functional domain and alteration of secondary protein structure encoded by genes. The seed weight-associated markers were validated through traditional bi-parental genetic mapping. The determination of gene-specific linkage disequilibrium (LD) patterns in desi and kabuli based on single nucleotide polymorphism-microsatellite marker haplotypes revealed extended LD decay, enhanced LD resolution and trait association potential of genes. The evolutionary history of a strong seed-size/weight-associated TF based on natural variation and haplotype sharing among desi, kabuli and wild unravelled useful information having implication for seed-size trait evolution during chickpea domestication. PMID:23633531
Chromatin accessibility and guide sequence secondary structure affect CRISPR-Cas9 gene editing efficiency.

PubMed

Jensen, Kristopher Torp; Fløe, Lasse; Petersen, Trine Skov; Huang, Jinrong; Xu, Fengping; Bolund, Lars; Luo, Yonglun; Lin, Lin

2017-07-01

Clustered Regularly Interspaced Short Palindromic Repeats (CRISPR)-associated protein 9 (CRISPR-Cas9) systems have emerged as the method of choice for genome editing, but large variations in on-target efficiencies continue to limit their applicability. Here, we investigate the effect of chromatin accessibility on Cas9-mediated gene editing efficiency for 20 gRNAs targeting 10 genomic loci in HEK293T cells using both SpCas9 and the eSpCas9(1.1) variant. Our study indicates that gene editing is more efficient in euchromatin than in heterochromatin, and we validate this finding in HeLa cells and in human fibroblasts. Furthermore, we investigate the gRNA sequence determinants of CRISPR-Cas9 activity using a surrogate reporter system and find that the efficiency of Cas9-mediated gene editing is dependent on guide sequence secondary structure formation. This knowledge can aid in the further improvement of tools for gRNA design. © 2017 Federation of European Biochemical Societies.
Development of a multiplex PCR assay for detection and discrimination of Theileria annulata and Theileria sergenti in cattle.

PubMed

Junlong, Liu; Li, Youquan; Liu, Aihong; Guan, Guiquan; Xie, Junren; Yin, Hong; Luo, Jianxun

2015-07-01

Aim to construct a simple and efficient diagnostic assay for Theileria annulata and Theileria sergenti, a multiplex polymerase chain reaction (PCR) method was developed in this study. Following the alignment of the related sequences, two primer sets were designed specific targeting on T. annulata cytochrome b (COB) gene and T. sergenti internal transcribed spacer (ITS) sequences. It was found that the designed primers could react in one PCR system and generating amplifications of 818 and 393 base pair for T. sergenti and T. annulata, respectively. The standard genomic DNA of both species Theileria was serial tenfold diluted for testing the sensitivity, while specificity test confirmed both primer sets have no cross-reaction with other Theileria and Babesia species. In addition, 378 field samples were used for evaluation of the utility of the multiplex PCR assay for detection of the pathogens infection. The detection results were compared with the other two published PCR methods which targeting on T. annulata COB gene and T. sergenti major piroplasm surface protein (MPSP) gene, respectively. The developed multiplex PCR assay has similar efficient detection with COB and MPSP PCR, which indicates this multiplex PCR may be a valuable assay for the epidemiological studies for T. annulata and T. sergenti.
Single-cell analyses of transcriptional heterogeneity during drug tolerance transition in cancer cells by RNA sequencing.

PubMed

Lee, Mei-Chong Wendy; Lopez-Diaz, Fernando J; Khan, Shahid Yar; Tariq, Muhammad Akram; Dayn, Yelena; Vaske, Charles Joseph; Radenbaugh, Amie J; Kim, Hyunsung John; Emerson, Beverly M; Pourmand, Nader

2014-11-04

The acute cellular response to stress generates a subpopulation of reversibly stress-tolerant cells under conditions that are lethal to the majority of the population. Stress tolerance is attributed to heterogeneity of gene expression within the population to ensure survival of a minority. We performed whole transcriptome sequencing analyses of metastatic human breast cancer cells subjected to the chemotherapeutic agent paclitaxel at the single-cell and population levels. Here we show that specific transcriptional programs are enacted within untreated, stressed, and drug-tolerant cell groups while generating high heterogeneity between single cells within and between groups. We further demonstrate that drug-tolerant cells contain specific RNA variants residing in genes involved in microtubule organization and stabilization, as well as cell adhesion and cell surface signaling. In addition, the gene expression profile of drug-tolerant cells is similar to that of untreated cells within a few doublings. Thus, single-cell analyses reveal the dynamics of the stress response in terms of cell-specific RNA variants driving heterogeneity, the survival of a minority population through generation of specific RNA variants, and the efficient reconversion of stress-tolerant cells back to normalcy.
Single-cell analyses of transcriptional heterogeneity during drug tolerance transition in cancer cells by RNA sequencing

PubMed Central

Lee, Mei-Chong Wendy; Lopez-Diaz, Fernando J.; Khan, Shahid Yar; Tariq, Muhammad Akram; Dayn, Yelena; Vaske, Charles Joseph; Radenbaugh, Amie J.; Kim, Hyunsung John; Emerson, Beverly M.; Pourmand, Nader

2014-01-01

The acute cellular response to stress generates a subpopulation of reversibly stress-tolerant cells under conditions that are lethal to the majority of the population. Stress tolerance is attributed to heterogeneity of gene expression within the population to ensure survival of a minority. We performed whole transcriptome sequencing analyses of metastatic human breast cancer cells subjected to the chemotherapeutic agent paclitaxel at the single-cell and population levels. Here we show that specific transcriptional programs are enacted within untreated, stressed, and drug-tolerant cell groups while generating high heterogeneity between single cells within and between groups. We further demonstrate that drug-tolerant cells contain specific RNA variants residing in genes involved in microtubule organization and stabilization, as well as cell adhesion and cell surface signaling. In addition, the gene expression profile of drug-tolerant cells is similar to that of untreated cells within a few doublings. Thus, single-cell analyses reveal the dynamics of the stress response in terms of cell-specific RNA variants driving heterogeneity, the survival of a minority population through generation of specific RNA variants, and the efficient reconversion of stress-tolerant cells back to normalcy. PMID:25339441
Transcription Factor Information System (TFIS): A Tool for Detection of Transcription Factor Binding Sites.

PubMed

Narad, Priyanka; Kumar, Abhishek; Chakraborty, Amlan; Patni, Pranav; Sengupta, Abhishek; Wadhwa, Gulshan; Upadhyaya, K C

2017-09-01

Transcription factors are trans-acting proteins that interact with specific nucleotide sequences known as transcription factor binding site (TFBS), and these interactions are implicated in regulation of the gene expression. Regulation of transcriptional activation of a gene often involves multiple interactions of transcription factors with various sequence elements. Identification of these sequence elements is the first step in understanding the underlying molecular mechanism(s) that regulate the gene expression. For in silico identification of these sequence elements, we have developed an online computational tool named transcription factor information system (TFIS) for detecting TFBS for the first time using a collection of JAVA programs and is mainly based on TFBS detection using position weight matrix (PWM). The database used for obtaining position frequency matrices (PFM) is JASPAR and HOCOMOCO, which is an open-access database of transcription factor binding profiles. Pseudo-counts are used while converting PFM to PWM, and TFBS detection is carried out on the basis of percent score taken as threshold value. TFIS is equipped with advanced features such as direct sequence retrieving from NCBI database using gene identification number and accession number, detecting binding site for common TF in a batch of gene sequences, and TFBS detection after generating PWM from known raw binding sequences in addition to general detection methods. TFIS can detect the presence of potential TFBSs in both the directions at the same time. This feature increases its efficiency. And the results for this dual detection are presented in different colors specific to the orientation of the binding site. Results obtained by the TFIS are more detailed and specific to the detected TFs as integration of more informative links from various related web servers are added in the result pages like Gene Ontology, PAZAR database and Transcription Factor Encyclopedia in addition to NCBI and UniProt. Common TFs like SP1, AP1 and NF-KB of the Amyloid beta precursor gene is easily detected using TFIS along with multiple binding sites. In another scenario of embryonic developmental process, TFs of the FOX family (FOXL1 and FOXC1) were also identified. TFIS is platform-independent which is publicly available along with its support and documentation at http://tfistool.appspot.com and http://www.bioinfoplus.com/tfis/ . TFIS is licensed under the GNU General Public License, version 3 (GPL-3.0).
Isolation and Identification of Gene-Specific MicroRNAs.

PubMed

Lin, Shi-Lung; Chang, Donald C; Ying, Shao-Yao

2018-01-01

Computer programming has identified hundreds of genomic hairpin sequences, many with functions yet to be determined. Because transfection of hairpin-like microRNA precursors (pre-miRNAs) into mammalian cells is not always sufficient to trigger RNA-induced gene silencing complex (RISC) assembly, a key step for inducing RNA interference (RNAi)-related gene silencing, we have developed an intronic miRNA expression system to overcome this problem by inserting a hairpin-like pre-miRNA structure into the intron region of a gene, and hence successfully increase the efficiency and effectiveness of miRNA-associated RNAi induction in vitro and in vivo. This intronic miRNA biogenesis mechanism has been found to depend on a coupled interaction of nascent messenger RNA transcription and intron excision within a specific nuclear region proximal to genomic perichromatin fibrils. The intronic miRNA so obtained is transcribed by type-II RNA polymerases, coexpressed within a primary gene transcript, and then excised out of the gene transcript by intracellular RNA splicing and processing machineries. After that, ribonuclease III (RNaseIII) endonucleases further process the spliced introns into mature miRNAs. Using this intronic miRNA expression system, we have shown for the first time that the intron-derived miRNAs are able to elicit strong RNAi effects in not only human and mouse cells in vitro but also in zebrafishes, chicken embryos, and adult mice in vivo. We have also developed a miRNA isolation protocol, based on the complementarity between the designed miRNA and its targeted gene sequence, to purify and identify the mature miRNAs generated. As a result, several intronic miRNA identities and structures have been confirmed. According to this proof-of-principle methodology, we now have full knowledge to design various intronic pre-miRNA inserts that are more efficient and effective for inducing specific gene silencing effects in vitro and in vivo.
Evaluation of two main RNA-seq approaches for gene quantification in clinical RNA sequencing: polyA+ selection versus rRNA depletion.

PubMed

Zhao, Shanrong; Zhang, Ying; Gamini, Ramya; Zhang, Baohong; von Schack, David

2018-03-19

To allow efficient transcript/gene detection, highly abundant ribosomal RNAs (rRNA) are generally removed from total RNA either by positive polyA+ selection or by rRNA depletion (negative selection) before sequencing. Comparisons between the two methods have been carried out by various groups, but the assessments have relied largely on non-clinical samples. In this study, we evaluated these two RNA sequencing approaches using human blood and colon tissue samples. Our analyses showed that rRNA depletion captured more unique transcriptome features, whereas polyA+ selection outperformed rRNA depletion with higher exonic coverage and better accuracy of gene quantification. For blood- and colon-derived RNAs, we found that 220% and 50% more reads, respectively, would have to be sequenced to achieve the same level of exonic coverage in the rRNA depletion method compared with the polyA+ selection method. Therefore, in most cases we strongly recommend polyA+ selection over rRNA depletion for gene quantification in clinical RNA sequencing. Our evaluation revealed that a small number of lncRNAs and small RNAs made up a large fraction of the reads in the rRNA depletion RNA sequencing data. Thus, we recommend that these RNAs are specifically depleted to improve the sequencing depth of the remaining RNAs.
Evaluation of the Bacterial Diversity in the Human Tongue Coating Based on Genus-Specific Primers for 16S rRNA Sequencing.

PubMed

Sun, Beili; Zhou, Dongrui; Tu, Jing; Lu, Zuhong

2017-01-01

The characteristics of tongue coating are very important symbols for disease diagnosis in traditional Chinese medicine (TCM) theory. As a habitat of oral microbiota, bacteria on the tongue dorsum have been proved to be the cause of many oral diseases. The high-throughput next-generation sequencing (NGS) platforms have been widely applied in the analysis of bacterial 16S rRNA gene. We developed a methodology based on genus-specific multiprimer amplification and ligation-based sequencing for microbiota analysis. In order to validate the efficiency of the approach, we thoroughly analyzed six tongue coating samples from lung cancer patients with different TCM types, and more than 600 genera of bacteria were detected by this platform. The results showed that ligation-based parallel sequencing combined with enzyme digestion and multiamplification could expand the effective length of sequencing reads and could be applied in the microbiota analysis.
Strategies for Improving siRNA-Induced Gene Silencing Efficiency

PubMed Central

Safari, Fatemeh; Rahmani Barouji, Solmaz; Tamaddon, Ali Mohammad

2017-01-01

Purpose: Human telomerase reverse transcriptase (hTERT) plays a crucial role in tumorigenesis and progression of cancers. Gene silencing of hTERT by short interfering RNA (siRNA) is considered as a promising strategy for cancer gene therapy. Various algorithms have been devised for designing a high efficient siRNA which is a significant issue in the clinical usage. Thereby, in the present study, the relation of siRNA designing criteria and the gene silencing efficiency was evaluated. Methods: The siRNA sequences were designed and characterized by using on line soft wares. Cationic co-polymer (polyethylene glycol-g-polyethylene imine (PEG-g-PEI)) was used for the construction of polyelectrolyte complexes (PECs) containing siRNAs. The cellular uptake of the PECs was evaluated. The gene silencing efficiency of different siRNA sequences was investigated and the effect of observing the rational designing on the functionality of siRNAs was assessed. Results: The size of PEG-g-PEI siRNA with N/P (Nitrogen/Phosphate) ratio of 2.5 was 114 ± 0.645 nm. The transfection efficiency of PECs was desirable (95.5% ± 2.4%.). The results of Real-Time PCR showed that main sequence (MS) reduced the hTERT expression up to 90% and control positive sequence (CPS) up to 63%. These findings demonstrated that the accessibility to the target site has priority than the other criteria such as sequence preferences and thermodynamic features. Conclusion: siRNA opens a hopeful window in cancer therapy which provides a convenient and tolerable therapeutic approach. Thereby, using the set of criteria and rational algorithms in the designing of siRNA remarkably affect the gene silencing efficiency. PMID:29399550
Strategies for Improving siRNA-Induced Gene Silencing Efficiency.

PubMed

Safari, Fatemeh; Rahmani Barouji, Solmaz; Tamaddon, Ali Mohammad

2017-12-01

Purpose: Human telomerase reverse transcriptase (hTERT) plays a crucial role in tumorigenesis and progression of cancers. Gene silencing of hTERT by short interfering RNA (siRNA) is considered as a promising strategy for cancer gene therapy. Various algorithms have been devised for designing a high efficient siRNA which is a significant issue in the clinical usage. Thereby, in the present study, the relation of siRNA designing criteria and the gene silencing efficiency was evaluated. Methods: The siRNA sequences were designed and characterized by using on line soft wares. Cationic co-polymer (polyethylene glycol-g-polyethylene imine (PEG-g-PEI)) was used for the construction of polyelectrolyte complexes (PECs) containing siRNAs. The cellular uptake of the PECs was evaluated. The gene silencing efficiency of different siRNA sequences was investigated and the effect of observing the rational designing on the functionality of siRNAs was assessed. Results: The size of PEG-g-PEI siRNA with N/P (Nitrogen/Phosphate) ratio of 2.5 was 114 ± 0.645 nm. The transfection efficiency of PECs was desirable (95.5% ± 2.4%.). The results of Real-Time PCR showed that main sequence (MS) reduced the hTERT expression up to 90% and control positive sequence (CPS) up to 63%. These findings demonstrated that the accessibility to the target site has priority than the other criteria such as sequence preferences and thermodynamic features. Conclusion: siRNA opens a hopeful window in cancer therapy which provides a convenient and tolerable therapeutic approach. Thereby, using the set of criteria and rational algorithms in the designing of siRNA remarkably affect the gene silencing efficiency.
BG7: A New Approach for Bacterial Genome Annotation Designed for Next Generation Sequencing Data

PubMed Central

Pareja-Tobes, Pablo; Manrique, Marina; Pareja-Tobes, Eduardo; Pareja, Eduardo; Tobes, Raquel

2012-01-01

BG7 is a new system for de novo bacterial, archaeal and viral genome annotation based on a new approach specifically designed for annotating genomes sequenced with next generation sequencing technologies. The system is versatile and able to annotate genes even in the step of preliminary assembly of the genome. It is especially efficient detecting unexpected genes horizontally acquired from bacterial or archaeal distant genomes, phages, plasmids, and mobile elements. From the initial phases of the gene annotation process, BG7 exploits the massive availability of annotated protein sequences in databases. BG7 predicts ORFs and infers their function based on protein similarity with a wide set of reference proteins, integrating ORF prediction and functional annotation phases in just one step. BG7 is especially tolerant to sequencing errors in start and stop codons, to frameshifts, and to assembly or scaffolding errors. The system is also tolerant to the high level of gene fragmentation which is frequently found in not fully assembled genomes. BG7 current version – which is developed in Java, takes advantage of Amazon Web Services (AWS) cloud computing features, but it can also be run locally in any operating system. BG7 is a fast, automated and scalable system that can cope with the challenge of analyzing the huge amount of genomes that are being sequenced with NGS technologies. Its capabilities and efficiency were demonstrated in the 2011 EHEC Germany outbreak in which BG7 was used to get the first annotations right the next day after the first entero-hemorrhagic E. coli genome sequences were made publicly available. The suitability of BG7 for genome annotation has been proved for Illumina, 454, Ion Torrent, and PacBio sequencing technologies. Besides, thanks to its plasticity, our system could be very easily adapted to work with new technologies in the future. PMID:23185310
Rapid Evolution of Ovarian-Biased Genes in the Yellow Fever Mosquito (Aedes aegypti).

PubMed

Whittle, Carrie A; Extavour, Cassandra G

2017-08-01

Males and females exhibit highly dimorphic phenotypes, particularly in their gonads, which is believed to be driven largely by differential gene expression. Typically, the protein sequences of genes upregulated in males, or male-biased genes, evolve rapidly as compared to female-biased and unbiased genes. To date, the specific study of gonad-biased genes remains uncommon in metazoans. Here, we identified and studied a total of 2927, 2013, and 4449 coding sequences (CDS) with ovary-biased, testis-biased, and unbiased expression, respectively, in the yellow fever mosquito Aedes aegypti The results showed that ovary-biased and unbiased CDS had higher nonsynonymous to synonymous substitution rates (dN/dS) and lower optimal codon usage (those codons that promote efficient translation) than testis-biased genes. Further, we observed higher dN/dS in ovary-biased genes than in testis-biased genes, even for genes coexpressed in nonsexual (embryo) tissues. Ovary-specific genes evolved exceptionally fast, as compared to testis- or embryo-specific genes, and exhibited higher frequency of positive selection. Genes with ovary expression were preferentially involved in olfactory binding and reception. We hypothesize that at least two potential mechanisms could explain rapid evolution of ovary-biased genes in this mosquito: (1) the evolutionary rate of ovary-biased genes may be accelerated by sexual selection (including female-female competition or male-mate choice) affecting olfactory genes during female swarming by males, and/or by adaptive evolution of olfactory signaling within the female reproductive system ( e.g. , sperm-ovary signaling); and/or (2) testis-biased genes may exhibit decelerated evolutionary rates due to the formation of mating plugs in the female after copulation, which limits male-male sperm competition. Copyright © 2017 by the Genetics Society of America.
Putative and unique gene sequence utilization for the design of species specific probes as modeled by Lactobacillus plantarum

USDA-ARS?s Scientific Manuscript database

The concept of utilizing putative and unique gene sequences for the design of species specific probes was tested. The abundance profile of assigned functions within the Lactobacillus plantarum genome was used for the identification of the putative and unique gene sequence, csh. The targeted gene (cs...
Depletion of Unwanted Nucleic Acid Templates by Selective Cleavage: LNAzymes, Catalytically Active Oligonucleotides Containing Locked Nucleic Acids, Open a New Window for Detecting Rare Microbial Community Members

PubMed Central

Dolinšek, Jan; Dorninger, Christiane; Lagkouvardos, Ilias; Wagner, Michael

2013-01-01

Many studies of molecular microbial ecology rely on the characterization of microbial communities by PCR amplification, cloning, sequencing, and phylogenetic analysis of genes encoding rRNAs or functional marker enzymes. However, if the established clone libraries are dominated by one or a few sequence types, the cloned diversity is difficult to analyze by random clone sequencing. Here we present a novel approach to deplete unwanted sequence types from complex nucleic acid mixtures prior to cloning and downstream analyses. It employs catalytically active oligonucleotides containing locked nucleic acids (LNAzymes) for the specific cleavage of selected RNA targets. When combined with in vitro transcription and reverse transcriptase PCR, this LNAzyme-based technique can be used with DNA or RNA extracts from microbial communities. The simultaneous application of more than one specific LNAzyme allows the concurrent depletion of different sequence types from the same nucleic acid preparation. This new method was evaluated with defined mixtures of cloned 16S rRNA genes and then used to identify accompanying bacteria in an enrichment culture dominated by the nitrite oxidizer “Candidatus Nitrospira defluvii.” In silico analysis revealed that the majority of publicly deposited rRNA-targeted oligonucleotide probes may be used as specific LNAzymes with no or only minor sequence modifications. This efficient and cost-effective approach will greatly facilitate tasks such as the identification of microbial symbionts in nucleic acid preparations dominated by plastid or mitochondrial rRNA genes from eukaryotic hosts, the detection of contaminants in microbial cultures, and the analysis of rare organisms in microbial communities of highly uneven composition. PMID:23263968
'Cold shock' increases the frequency of homology directed repair gene editing in induced pluripotent stem cells.

PubMed

Guo, Q; Mintier, G; Ma-Edmonds, M; Storton, D; Wang, X; Xiao, X; Kienzle, B; Zhao, D; Feder, John N

2018-02-01

Using CRISPR/Cas9 delivered as a RNA modality in conjunction with a lipid specifically formulated for large RNA molecules, we demonstrate that homology directed repair (HDR) rates between 20-40% can be achieved in induced pluripotent stem cells (iPSC). Furthermore, low HDR rates (between 1-20%) can be enhanced two- to ten-fold in both iPSCs and HEK293 cells by 'cold shocking' cells at 32 °C for 24-48 hours following transfection. This method can also increases the proportion of loci that have undergone complete sequence conversion across the donor sequence, or 'perfect HDR', as opposed to partial sequence conversion where nucleotides more distal to the CRISPR cut site are less efficiently incorporated ('partial HDR'). We demonstrate that the structure of the single-stranded DNA oligo donor can influence the fidelity of HDR, with oligos symmetric with respect to the CRISPR cleavage site and complementary to the target strand being more efficient at directing 'perfect HDR' compared to asymmetric non-target strand complementary oligos. Our protocol represents an efficient method for making CRISPR-mediated, specific DNA sequence changes within the genome that will facilitate the rapid generation of genetic models of human disease in iPSCs as well as other genome engineered cell lines.
Design of Protein Multi-specificity Using an Independent Sequence Search Reduces the Barrier to Low Energy Sequences

PubMed Central

Sevy, Alexander M.; Jacobs, Tim M.; Crowe, James E.; Meiler, Jens

2015-01-01

Computational protein design has found great success in engineering proteins for thermodynamic stability, binding specificity, or enzymatic activity in a ‘single state’ design (SSD) paradigm. Multi-specificity design (MSD), on the other hand, involves considering the stability of multiple protein states simultaneously. We have developed a novel MSD algorithm, which we refer to as REstrained CONvergence in multi-specificity design (RECON). The algorithm allows each state to adopt its own sequence throughout the design process rather than enforcing a single sequence on all states. Convergence to a single sequence is encouraged through an incrementally increasing convergence restraint for corresponding positions. Compared to MSD algorithms that enforce (constrain) an identical sequence on all states the energy landscape is simplified, which accelerates the search drastically. As a result, RECON can readily be used in simulations with a flexible protein backbone. We have benchmarked RECON on two design tasks. First, we designed antibodies derived from a common germline gene against their diverse targets to assess recovery of the germline, polyspecific sequence. Second, we design “promiscuous”, polyspecific proteins against all binding partners and measure recovery of the native sequence. We show that RECON is able to efficiently recover native-like, biologically relevant sequences in this diverse set of protein complexes. PMID:26147100
Genome analysis of multiple pathogenic isolates of Streptococcus agalactiae: Implications for the microbial “pan-genome”

PubMed Central

Tettelin, Hervé; Masignani, Vega; Cieslewicz, Michael J.; Donati, Claudio; Medini, Duccio; Ward, Naomi L.; Angiuoli, Samuel V.; Crabtree, Jonathan; Jones, Amanda L.; Durkin, A. Scott; DeBoy, Robert T.; Davidsen, Tanja M.; Mora, Marirosa; Scarselli, Maria; Margarit y Ros, Immaculada; Peterson, Jeremy D.; Hauser, Christopher R.; Sundaram, Jaideep P.; Nelson, William C.; Madupu, Ramana; Brinkac, Lauren M.; Dodson, Robert J.; Rosovitz, Mary J.; Sullivan, Steven A.; Daugherty, Sean C.; Haft, Daniel H.; Selengut, Jeremy; Gwinn, Michelle L.; Zhou, Liwei; Zafar, Nikhat; Khouri, Hoda; Radune, Diana; Dimitrov, George; Watkins, Kisha; O'Connor, Kevin J. B.; Smith, Shannon; Utterback, Teresa R.; White, Owen; Rubens, Craig E.; Grandi, Guido; Madoff, Lawrence C.; Kasper, Dennis L.; Telford, John L.; Wessels, Michael R.; Rappuoli, Rino; Fraser, Claire M.

2005-01-01

The development of efficient and inexpensive genome sequencing methods has revolutionized the study of human bacterial pathogens and improved vaccine design. Unfortunately, the sequence of a single genome does not reflect how genetic variability drives pathogenesis within a bacterial species and also limits genome-wide screens for vaccine candidates or for antimicrobial targets. We have generated the genomic sequence of six strains representing the five major disease-causing serotypes of Streptococcus agalactiae, the main cause of neonatal infection in humans. Analysis of these genomes and those available in databases showed that the S. agalactiae species can be described by a pan-genome consisting of a core genome shared by all isolates, accounting for ≈80% of any single genome, plus a dispensable genome consisting of partially shared and strain-specific genes. Mathematical extrapolation of the data suggests that the gene reservoir available for inclusion in the S. agalactiae pan-genome is vast and that unique genes will continue to be identified even after sequencing hundreds of genomes. PMID:16172379
“Agrolistic” transformation of plant cells: Integration of T-strands generated in planta

PubMed Central

Hansen, Geneviève; Chilton, Mary-Dell

1996-01-01

We describe a novel plant transformation technique, termed “agrolistic,” that combines the advantages of the Agrobacterium transformation system with the high efficiency of biolistic DNA delivery. Agrolistic transformation allows integration of the gene of interest without undesired vector sequence. The virulence genes virD1 and virD2 from Agrobacterium tumefaciens that are required in bacteria for excision of T-strands from the tumor-inducing plasmid were placed under the control of the CaMV35S promoter and codelivered with a target plasmid containing border sequences flanking the gene of interest. Transient expression assays in tobacco and in maize cells indicated that vir gene products caused strand-specific nicking in planta at the right border sequence, similar to VirD1/VirD2-catalyzed T-strand excision observed in Agrobacterium. Agrolistically transformed tobacco calli were obtained after codelivery of virD1 and virD2 genes together with a selectable marker flanked by border sequences. Some inserts exhibited right junctions with plant DNA that corresponded precisely to the sequence expected for T-DNA (portion of the tumor-inducing plasmid that is transferred to plant cells) insertion events. We designate these as “agrolistic” inserts, as distinguished from “biolistic” inserts. Both types of inserts were found in some transformed lines. The frequency of agrolistic inserts was 20% that of biolistic inserts. PMID:8962167
Multi-kilobase homozygous targeted gene replacement in human induced pluripotent stem cells.

PubMed

Byrne, Susan M; Ortiz, Luis; Mali, Prashant; Aach, John; Church, George M

2015-02-18

Sequence-specific nucleases such as TALEN and the CRISPR/Cas9 system have so far been used to disrupt, correct or insert transgenes at precise locations in mammalian genomes. We demonstrate efficient 'knock-in' targeted replacement of multi-kilobase genes in human induced pluripotent stem cells (iPSC). Using a model system replacing endogenous human genes with their mouse counterpart, we performed a comprehensive study of targeting vector design parameters for homologous recombination. A 2.7 kilobase (kb) homozygous gene replacement was achieved in up to 11% of iPSC without selection. The optimal homology arm length was around 2 kb, with homology length being especially critical on the arm not adjacent to the cut site. Homologous sequence inside the cut sites was detrimental to targeting efficiency, consistent with a synthesis-dependent strand annealing (SDSA) mechanism. Using two nuclease sites, we observed a high degree of gene excisions and inversions, which sometimes occurred more frequently than indel mutations. While homozygous deletions of 86 kb were achieved with up to 8% frequency, deletion frequencies were not solely a function of nuclease activity and deletion size. Our results analyzing the optimal parameters for targeting vector design will inform future gene targeting efforts involving multi-kilobase gene segments, particularly in human iPSC. © The Author(s) 2014. Published by Oxford University Press on behalf of Nucleic Acids Research.

Characterisation of a DNA sequence element that directs Dictyostelium stalk cell-specific gene expression.

PubMed

Ceccarelli, A; Zhukovskaya, N; Kawata, T; Bozzaro, S; Williams, J

2000-12-01

The ecmB gene of Dictyostelium is expressed at culmination both in the prestalk cells that enter the stalk tube and in ancillary stalk cell structures such as the basal disc. Stalk tube-specific expression is regulated by sequence elements within the cap-site proximal part of the promoter, the stalk tube (ST) promoter region. Dd-STATa, a member of the STAT transcription factor family, binds to elements present in the ST promoter-region and represses transcription prior to entry into the stalk tube. We have characterised an activatory DNA sequence element, that lies distal to the repressor elements and that is both necessary and sufficient for expression within the stalk tube. We have mapped this activator to a 28 nucleotide region (the 28-mer) within which we have identified a GA-containing sequence element that is required for efficient gene transcription. The Dd-STATa protein binds to the 28-mer in an in vitro binding assay, and binding is dependent upon the GA-containing sequence. However, the ecmB gene is expressed in a Dd-STATa null mutant, therefore Dd-STATa cannot be responsible for activating the 28-mer in vivo. Instead, we identified a distinct 28-mer binding activity in nuclear extracts from the Dd-STATa null mutant, the activity of this GA binding activity being largely masked in wild type extracts by the high affinity binding of the Dd-STATa protein. We suggest, that in addition to the long range repression exerted by binding to the two known repressor sites, Dd-STATa inhibits transcription by direct competition with this putative activator for binding to the GA sequence.
Generation of Newly Discovered Resistance Gene mcr-1 Knockout in Escherichia coli Using the CRISPR/Cas9 System.

PubMed

Sun, Lichang; He, Tao; Zhang, Lili; Pang, Maoda; Zhang, Qiaoyan; Zhou, Yan; Bao, Hongduo; Wang, Ran

2017-07-28

The mcr-1 gene is a new "superbug" gene discoverd in China in 2016 that makes bacteria highly resistant to the last-resort class of antibiotics. The mcr-1 gene raised serious concern about its possible global dissemination and spread. Here, we report a potential anti-resistant strategy using the CRISPR/Cas9-mediated approach that can efficiently induce mcr-1 gene knockout in Escherichia coli . Our findings suggested that using the CRISPR/Cas9 system to knock out the resistance gene mcr-1 might be a potential anti-resistant strategy. Bovine myeloid antimicrobial peptide-27 could help deliver plasmid pCas::mcr targeting specific DNA sequences of the mcr-1 gene into microbial populations.
A Novel Method for Gene-Specific Enhancement of Protein Translation by Targeting 5’UTRs of Selected Tumor Suppressors

PubMed Central

Master, Adam; Wójcicka, Anna; Giżewska, Kamilla; Popławski, Piotr; Williams, Graham R.; Nauman, Alicja

2016-01-01

Background Translational control is a mechanism of protein synthesis regulation emerging as an important target for new therapeutics. Naturally occurring microRNAs and synthetic small inhibitory RNAs (siRNAs) are the most recognized regulatory molecules acting via RNA interference. Surprisingly, recent studies have shown that interfering RNAs may also activate gene transcription via the newly discovered phenomenon of small RNA-induced gene activation (RNAa). Thus far, the small activating RNAs (saRNAs) have only been demonstrated as promoter-specific transcriptional activators. Findings We demonstrate that oligonucleotide-based trans-acting factors can also specifically enhance gene expression at the level of protein translation by acting at sequence-specific targets within the messenger RNA 5’-untranslated region (5’UTR). We designed a set of short synthetic oligonucleotides (dGoligos), specifically targeting alternatively spliced 5’UTRs in transcripts expressed from the THRB and CDKN2A suppressor genes. The in vitro translation efficiency of reporter constructs containing alternative TRβ1 5’UTRs was increased by up to more than 55-fold following exposure to specific dGoligos. Moreover, we found that the most folded 5’UTR has higher translational regulatory potential when compared to the weakly folded TRβ1 variant. This suggests such a strategy may be especially applied to enhance translation from relatively inactive transcripts containing long 5’UTRs of complex structure. Significance This report represents the first method for gene-specific translation enhancement using selective trans-acting factors designed to target specific 5’UTR cis-acting elements. This simple strategy may be developed further to complement other available methods for gene expression regulation including gene silencing. The dGoligo-mediated translation-enhancing approach has the potential to be transferred to increase the translation efficiency of any suitable target gene and may have future application in gene therapy strategies to enhance expression of proteins including tumor suppressors. PMID:27171412
Bacteroides fragilis mobilizable transposon Tn5520 requires a 71 base pair origin of transfer sequence and a single mobilization protein for relaxosome formation during conjugation.

PubMed

Vedantam, Gayatri; Knopf, Sarah; Hecht, David W

2006-01-01

Tn5520 is the smallest known bacterial mobilizable transposon and was isolated from an antibiotic resistant Bacteroides fragilis clinical isolate. When a conjugation apparatus is provided in trans, Tn5520 is mobilized (transferred) efficiently within, and from, both Bacteroides spp. and Escherichia coli. Only two genes are present on Tn5520; one encodes an integrase, and the other a multifunctional mobilization (Mob) protein BmpH. BmpH is essential for Tn5520 mobility. The focus of this study was to identify the Tn5520 origin of conjugative transfer (oriT) and to study BmpH-oriT binding. We delimited the functional Tn5520 oriT to a 71 bp sequence upstream of the bmpH gene. A plasmid vector harbouring this minimal 71 bp oriT was mobilized at the same frequency as that of intact Tn5520. The minimal oriT contains one 17 bp inverted repeat (IR) sequence. We constructed and tested multiple IR mutants and showed that the IR was essential in its entirety for mobilization. A nick site sequence (5'-GCTAC-3') was also identified within the minimal oriT; this sequence resembled nick sites found in plasmids of Gram positive origin. We further showed that mutation of a highly conserved GC dinucleotide in the nick site sequence completely abolished mobilization. We also purified BmpH and showed that it specifically bound a Tn5520 oriT fragment in electrophoretic mobility shift assays. We also identified non-nick site sequences within the minimal oriT that were essential for mobilization. We hypothesize that transposon-based single Mob protein systems may contribute to efficient gene dissemination from Bacteroides spp., because fewer DNA processing proteins are required for relaxosome formation.
Interactions of HIPPI, a molecular partner of Huntingtin interacting protein HIP1, with the specific motif present at the putative promoter sequence of the caspase-1, caspase-8 and caspase-10 genes.

PubMed

Majumder, P; Choudhury, A; Banerjee, M; Lahiri, A; Bhattacharyya, N P

2007-08-01

To investigate the mechanism of increased expression of caspase-1 caused by exogenous Hippi, observed earlier in HeLa and Neuro2A cells, in this work we identified a specific motif AAAGACATG (- 101 to - 93) at the caspase-1 gene upstream sequence where HIPPI could bind. Various mutations in this specific sequence compromised the interaction, showing the specificity of the interactions. In the luciferase reporter assay, when the reporter gene was driven by caspase-1 gene upstream sequences (- 151 to - 92) with the mutation G to T at position - 98, luciferase activity was decreased significantly in green fluorescent protein-Hippi-expressing HeLa cells in comparison to that obtained with the wild-type caspase-1 gene 60 bp upstream sequence, indicating the biological significance of such binding. It was observed that the C-terminal 'pseudo' death effector domain of HIPPI interacted with the 60 bp (- 151 to - 92) upstream sequence of the caspase-1 gene containing the motif. We further observed that expression of caspase-8 and caspase-10 was increased in green fluorescent protein-Hippi-expressing HeLa cells. In addition, HIPPI interacted in vitro with putative promoter sequences of these genes, containing a similar motif. In summary, we identified a novel function of HIPPI; it binds to specific upstream sequences of the caspase-1, caspase-8 and caspase-10 genes and alters the expression of the genes. This result showed the motif-specific interaction of HIPPI with DNA, and indicates that it could act as transcription regulator.
Polymerase chain reaction (PCR) amplification of a nucleoprotein gene sequence of infectious hematopoietic necrosis virus

USGS Publications Warehouse

Arakawa, C.K.; Deering, R.E.; Higman, K.H.; Oshima, K.H.; O'Hara, P.J.; Winton, J.R.

1990-01-01

The polymerase chain reaction [PCR) was used to amplify a portion of the nucleoprotein [NI gene of infectious hematopoietic necrosis virus (IHNV). Using a published sequence for the Round Butte isolate of IHNV, a pair of PCR pnmers was synthesized that spanned a 252 nucleotide region of the N gene from residue 319 to residue 570 of the open reading frame. This region included a 30 nucleotide target sequence for a synthetic oligonucleotide probe developed for detection of IHNV N gene messenger RNA. After 25 cycles of amplification of either messenger or genomic RNA, the PCR product (DNA) of the expected size was easily visible on agarose gels stained with ethidium bromide. The specificity of the amplified DNA was confirmed by Southern and dot-blot analysis using the biotinylated oligonucleotide probe. The PCR was able to amplify the N gene sequence of purified genomic RNA from isolates of IHNV representing 5 different electropherotypes. Using the IHNV primer set, no PCR product was obtained from viral hemorrhagic septicemia virus RNA, but 2 higher molecular weight products were synthesized from hirame rhabdovirus RNA that did not hybridize with the biotinylated probe. The PCR could be efficiently performed with all IHNV genomic RNA template concentrations tested (1 ng to 1 pg). The lowest level of sensitivity was not determined. The PCR was used to amplify RNA extracted from infected cell cultures and selected tissues of Infected rainbow trout. The combination of PCR and nucleic acid probe promises to provide a detection method for IHNV that is rapid, h~ghly specific, and sensitive.
CRISPR-cas System as a Genome Engineering Platform: Applications in Biomedicine and Biotechnology.

PubMed

Hashemi, Atieh

2018-01-01

Genome editing mediated by Clustered Regularly Interspaced Palindromic Repeats (CRISPR) and its associated proteins (Cas) has recently been considered to be used as efficient, rapid and site-specific tool in the modification of endogenous genes in biomedically important cell types and whole organisms. It has become a predictable and precise method of choice for genome engineering by specifying a 20-nt targeting sequence within its guide RNA. Firstly, this review aims to describe the biology of CRISPR system. Next, the applications of CRISPR-Cas9 in various ways, such as efficient generation of a wide variety of biomedically important cellular models as well as those of animals, modifying epigenomes, conducting genome-wide screens, gene therapy, labelling specific genomic loci in living cells, metabolic engineering of yeast and bacteria and endogenous gene expression regulation by an altered version of this system were reviewed. Copyright© Bentham Science Publishers; For any queries, please email at epub@benthamscience.org.
Typing of canine parvovirus isolates using mini-sequencing based single nucleotide polymorphism analysis.

PubMed

Naidu, Hariprasad; Subramanian, B Mohana; Chinchkar, Shankar Ramchandra; Sriraman, Rajan; Rana, Samir Kumar; Srinivasan, V A

2012-05-01

The antigenic types of canine parvovirus (CPV) are defined based on differences in the amino acids of the major capsid protein VP2. Type specificity is conferred by a limited number of amino acid changes and in particular by few nucleotide substitutions. PCR based methods are not particularly suitable for typing circulating variants which differ in a few specific nucleotide substitutions. Assays for determining SNPs can detect efficiently nucleotide substitutions and can thus be adapted to identify CPV types. In the present study, CPV typing was performed by single nucleotide extension using the mini-sequencing technique. A mini-sequencing signature was established for all the four CPV types (CPV2, 2a, 2b and 2c) and feline panleukopenia virus. The CPV typing using the mini-sequencing reaction was performed for 13 CPV field isolates and the two vaccine strains available in our repository. All the isolates had been typed earlier by full-length sequencing of the VP2 gene. The typing results obtained from mini-sequencing matched completely with that of sequencing. Typing could be achieved with less than 100 copies of standard plasmid DNA constructs or ≤10¹ FAID₅₀ of virus by mini-sequencing technique. The technique was also efficient for detecting multiple types in mixed infections. Copyright © 2012 Elsevier B.V. All rights reserved.
Virus-Clip: a fast and memory-efficient viral integration site detection tool at single-base resolution with annotation capability.

PubMed

Ho, Daniel W H; Sze, Karen M F; Ng, Irene O L

2015-08-28

Viral integration into the human genome upon infection is an important risk factor for various human malignancies. We developed viral integration site detection tool called Virus-Clip, which makes use of information extracted from soft-clipped sequencing reads to identify exact positions of human and virus breakpoints of integration events. With initial read alignment to virus reference genome and streamlined procedures, Virus-Clip delivers a simple, fast and memory-efficient solution to viral integration site detection. Moreover, it can also automatically annotate the integration events with the corresponding affected human genes. Virus-Clip has been verified using whole-transcriptome sequencing data and its detection was validated to have satisfactory sensitivity and specificity. Marked advancement in performance was detected, compared to existing tools. It is applicable to versatile types of data including whole-genome sequencing, whole-transcriptome sequencing, and targeted sequencing. Virus-Clip is available at http://web.hku.hk/~dwhho/Virus-Clip.zip.
Comprehensive Interrogation of Natural TALE DNA Binding Modules and Transcriptional Repressor Domains

PubMed Central

Cong, Le; Zhou, Ruhong; Kuo, Yu-chi; Cunniff, Margaret; Zhang, Feng

2012-01-01

Transcription activator-like effectors (TALE) are sequence-specific DNA binding proteins that harbor modular, repetitive DNA binding domains. TALEs have enabled the creation of customizable designer transcriptional factors and sequence-specific nucleases for genome engineering. Here we report two improvements of the TALE toolbox for achieving efficient activation and repression of endogenous gene expression in mammalian cells. We show that the naturally occurring repeat variable diresidue (RVD) Asn-His (NH) has high biological activity and specificity for guanine, a highly prevalent base in mammalian genomes. We also report an effective TALE transcriptional repressor architecture for targeted inhibition of transcription in mammalian cells. These findings will improve the precision and effectiveness of genome engineering that can be achieved using TALEs. PMID:22828628
Primary Airway Epithelial Cell Gene Editing Using CRISPR-Cas9.

PubMed

Everman, Jamie L; Rios, Cydney; Seibold, Max A

2018-01-01

The adaptation of the clustered regularly interspaced short palindromic repeats (CRISPR) and CRISPR associated endonuclease 9 (CRISPR-Cas9) machinery from prokaryotic organisms has resulted in a gene editing system that is highly versatile, easily constructed, and can be leveraged to generate human cells knocked out (KO) for a specific gene. While standard transfection techniques can be used for the introduction of CRISPR-Cas9 expression cassettes to many cell types, delivery by this method is not efficient in many primary cell types, including primary human airway epithelial cells (AECs). More efficient delivery in AECs can be achieved through lentiviral-mediated transduction, allowing the CRISPR-Cas9 system to be integrated into the genome of the cell, resulting in stable expression of the nuclease machinery and increasing editing rates. In parallel, advancements have been made in the culture, expansion, selection, and differentiation of AECs, which allow the robust generation of a bulk edited AEC population from transduced cells. Applying these methods, we detail here our latest protocol to generate mucociliary epithelial cultures knocked out for a specific gene from donor-isolated primary human basal airway epithelial cells. This protocol includes methods to: (1) design and generate lentivirus which targets a specific gene for KO with CRISPR-Cas9 machinery, (2) efficiently transduce AECs, (3) culture and select for a bulk edited AEC population, (4) molecularly screen AECs for Cas9 cutting and specific sequence edits, and (5) further expand and differentiate edited cells to a mucociliary airway epithelial culture. The AEC knockouts generated using this protocol provide an excellent primary cell model system with which to characterize the function of genes involved in airway dysfunction and disease.
SnipViz: a compact and lightweight web site widget for display and dissemination of multiple versions of gene and protein sequences.

PubMed

Jaschob, Daniel; Davis, Trisha N; Riffle, Michael

2014-07-23

As high throughput sequencing continues to grow more commonplace, the need to disseminate the resulting data via web applications continues to grow. Particularly, there is a need to disseminate multiple versions of related gene and protein sequences simultaneously--whether they represent alleles present in a single species, variations of the same gene among different strains, or homologs among separate species. Often this is accomplished by displaying all versions of the sequence at once in a manner that is not intuitive or space-efficient and does not facilitate human understanding of the data. Web-based applications needing to disseminate multiple versions of sequences would benefit from a drop-in module designed to effectively disseminate these data. SnipViz is a client-side software tool designed to disseminate multiple versions of related gene and protein sequences on web sites. SnipViz has a space-efficient, interactive, and dynamic interface for navigating, analyzing and visualizing sequence data. It is written using standard World Wide Web technologies (HTML, Javascript, and CSS) and is compatible with most web browsers. SnipViz is designed as a modular client-side web component and may be incorporated into virtually any web site and be implemented without any programming. SnipViz is a drop-in client-side module for web sites designed to efficiently visualize and disseminate gene and protein sequences. SnipViz is open source and is freely available at https://github.com/yeastrc/snipviz.
Vector for IS element entrapment and functional characterization based on turning on expression of distal promoterless genes.

PubMed

Szeverényi, I; Hodel, A; Arber, W; Olasz, F

1996-09-26

We constructed and characterized a novel trap vector for rapid isolation of insertion sequences. The strategy used for the isolation of IS elements is based on the ability of many IS elements to turn on the expression of otherwise silent genes distal to some sites of insertion. The simple transposition of an IS element can sometimes cause the constitutive expression of promoterless antibiotic resistance genes resulting in selectable phenotypes. The trap vector pAW1326 is based on a pBR322 replicon, it carries ampicillin and streptomycin resistance genes, and also silenced genes that confer chloramphenicol and kanamycin resistance once activated. The trap vector pAW1326 proved to be efficient and 85 percent of all isolated mutations were insertions. The majority of IS elements resident in the studied Escherichia coli strains tested became trapped, namely IS2, IS3, IS5, IS150, IS186 and Tn1000. We also encountered an insertion sequence, called IS10L/R-2, which is a hybrid of the two IS variants IS10L and IS10R. IS10L/R-2 is absent from most E. coli strains, but it is detectable in some strains such as JM109 which had been submitted to Tn10 mutagenesis. The distribution of the insertion sequences within the trap region was not random. Rather, the integration of chromosomal mobile genetic elements into the offered target sequence occurred in element-specific clusters. This is explained both by the target specificity and by the specific requirements for the activation of gene transcription by the DNA rearrangement. The employed trap vector pAW1326 proved to be useful for the isolation of mobile genetic elements, for a demonstration of their transposition activity as well as for the further characterization of some of the functional parameters of transposition.
New encoded single-indicator sequences based on physico-chemical parameters for efficient exon identification.

PubMed

Meher, J K; Meher, P K; Dash, G N; Raval, M K

2012-01-01

The first step in gene identification problem based on genomic signal processing is to convert character strings into numerical sequences. These numerical sequences are then analysed spectrally or using digital filtering techniques for the period-3 peaks, which are present in exons (coding areas) and absent in introns (non-coding areas). In this paper, we have shown that single-indicator sequences can be generated by encoding schemes based on physico-chemical properties. Two new methods are proposed for generating single-indicator sequences based on hydration energy and dipole moments. The proposed methods produce high peak at exon locations and effectively suppress false exons (intron regions having greater peak than exon regions) resulting in high discriminating factor, sensitivity and specificity.
Design and Evaluation of Illumina MiSeq-Compatible, 18S rRNA Gene-Specific Primers for Improved Characterization of Mixed Phototrophic Communities.

PubMed

Bradley, Ian M; Pinto, Ameet J; Guest, Jeremy S

2016-10-01

The use of high-throughput sequencing technologies with the 16S rRNA gene for characterization of bacterial and archaeal communities has become routine. However, the adoption of sequencing methods for eukaryotes has been slow, despite their significance to natural and engineered systems. There are large variations among the target genes used for amplicon sequencing, and for the 18S rRNA gene, there is no consensus on which hypervariable region provides the most suitable representation of diversity. Additionally, it is unclear how much PCR/sequencing bias affects the depiction of community structure using current primers. The present study amplified the V4 and V8-V9 regions from seven microalgal mock communities as well as eukaryotic communities from freshwater, coastal, and wastewater samples to examine the effect of PCR/sequencing bias on community structure and membership. We found that degeneracies on the 3' end of the current V4-specific primers impact read length and mean relative abundance. Furthermore, the PCR/sequencing error is markedly higher for GC-rich members than for communities with balanced GC content. Importantly, the V4 region failed to reliably capture 2 of the 12 mock community members, and the V8-V9 hypervariable region more accurately represents mean relative abundance and alpha and beta diversity. Overall, the V4 and V8-V9 regions show similar community representations over freshwater, coastal, and wastewater environments, but specific samples show markedly different communities. These results indicate that multiple primer sets may be advantageous for gaining a more complete understanding of community structure and highlight the importance of including mock communities composed of species of interest. The quantification of error associated with community representation by amplicon sequencing is a critical challenge that is often ignored. When target genes are amplified using currently available primers, differential amplification efficiencies result in inaccurate estimates of community structure. The extent to which amplification bias affects community representation and the accuracy with which different gene targets represent community structure are not known. As a result, there is no consensus on which region provides the most suitable representation of diversity for eukaryotes. This study determined the accuracy with which commonly used 18S rRNA gene primer sets represent community structure and identified particular biases related to PCR amplification and Illumina MiSeq sequencing in order to more accurately study eukaryotic microbial communities. Copyright © 2016, American Society for Microbiology. All Rights Reserved.
Development and validation of real-time PCR screening methods for detection of cry1A.105 and cry2Ab2 genes in genetically modified organisms.

PubMed

Dinon, Andréia Z; Prins, Theo W; van Dijk, Jeroen P; Arisi, Ana Carolina M; Scholtens, Ingrid M J; Kok, Esther J

2011-05-01

Primers and probes were developed for the element-specific detection of cry1A.105 and cry2Ab2 genes, based on their DNA sequence as present in GM maize MON89034. Cry genes are present in many genetically modified (GM) plants and they are important targets for developing GMO element-specific detection methods. Element-specific methods can be of use to screen for the presence of GMOs in food and feed supply chains. Moreover, a combination of GMO elements may indicate the potential presence of unapproved GMOs (UGMs). Primer-probe combinations were evaluated in terms of specificity, efficiency and limit of detection. Except for specificity, the complete experiment was performed in 9 PCR runs, on 9 different days and by testing 8 DNA concentrations. The results showed a high specificity and efficiency for cry1A.105 and cry2Ab2 detection. The limit of detection was between 0.05 and 0.01 ng DNA per PCR reaction for both assays. These data confirm the applicability of these new primer-probe combinations for element detection that can contribute to the screening for GM and UGM crops in food and feed samples.
An Optimal Bahadur-Efficient Method in Detection of Sparse Signals with Applications to Pathway Analysis in Sequencing Association Studies.

PubMed

Dai, Hongying; Wu, Guodong; Wu, Michael; Zhi, Degui

2016-01-01

Next-generation sequencing data pose a severe curse of dimensionality, complicating traditional "single marker-single trait" analysis. We propose a two-stage combined p-value method for pathway analysis. The first stage is at the gene level, where we integrate effects within a gene using the Sequence Kernel Association Test (SKAT). The second stage is at the pathway level, where we perform a correlated Lancaster procedure to detect joint effects from multiple genes within a pathway. We show that the Lancaster procedure is optimal in Bahadur efficiency among all combined p-value methods. The Bahadur efficiency,[Formula: see text], compares sample sizes among different statistical tests when signals become sparse in sequencing data, i.e. ε →0. The optimal Bahadur efficiency ensures that the Lancaster procedure asymptotically requires a minimal sample size to detect sparse signals ([Formula: see text]). The Lancaster procedure can also be applied to meta-analysis. Extensive empirical assessments of exome sequencing data show that the proposed method outperforms Gene Set Enrichment Analysis (GSEA). We applied the competitive Lancaster procedure to meta-analysis data generated by the Global Lipids Genetics Consortium to identify pathways significantly associated with high-density lipoprotein cholesterol, low-density lipoprotein cholesterol, triglycerides, and total cholesterol.
Genome sequencing and comparative genomics reveal a repertoire of putative pathogenicity genes in chilli anthracnose fungus Colletotrichum truncatum.

PubMed

Rao, Soumya; Nandineni, Madhusudan R

2017-01-01

Colletotrichum truncatum, a major fungal phytopathogen, causes the anthracnose disease on an economically important spice crop chilli (Capsicum annuum), resulting in huge economic losses in tropical and sub-tropical countries. It follows a subcuticular intramural infection strategy on chilli with a short, asymptomatic, endophytic phase, which contrasts with the intracellular hemibiotrophic lifestyle adopted by most of the Colletotrichum species. However, little is known about the molecular determinants and the mechanism of pathogenicity in this fungus. A high quality whole genome sequence and gene annotation based on transcriptome data of an Indian isolate of C. truncatum from chilli has been obtained. Analysis of the genome sequence revealed a rich repertoire of pathogenicity genes in C. truncatum encoding secreted proteins, effectors, plant cell wall degrading enzymes, secondary metabolism associated proteins, with potential roles in the host-specific infection strategy, placing it next only to the Fusarium species. The size of genome assembly, number of predicted genes and some of the functional categories were similar to other sequenced Colletotrichum species. The comparative genomic analyses with other species and related fungi identified some unique genes and certain highly expanded gene families of CAZymes, proteases and secondary metabolism associated genes in the genome of C. truncatum. The draft genome assembly and functional annotation of potential pathogenicity genes of C. truncatum provide an important genomic resource for understanding the biology and lifestyle of this important phytopathogen and will pave the way for designing efficient disease control regimens.
Genome sequencing and comparative genomics reveal a repertoire of putative pathogenicity genes in chilli anthracnose fungus Colletotrichum truncatum

PubMed Central

Rao, Soumya

2017-01-01

Colletotrichum truncatum, a major fungal phytopathogen, causes the anthracnose disease on an economically important spice crop chilli (Capsicum annuum), resulting in huge economic losses in tropical and sub-tropical countries. It follows a subcuticular intramural infection strategy on chilli with a short, asymptomatic, endophytic phase, which contrasts with the intracellular hemibiotrophic lifestyle adopted by most of the Colletotrichum species. However, little is known about the molecular determinants and the mechanism of pathogenicity in this fungus. A high quality whole genome sequence and gene annotation based on transcriptome data of an Indian isolate of C. truncatum from chilli has been obtained. Analysis of the genome sequence revealed a rich repertoire of pathogenicity genes in C. truncatum encoding secreted proteins, effectors, plant cell wall degrading enzymes, secondary metabolism associated proteins, with potential roles in the host-specific infection strategy, placing it next only to the Fusarium species. The size of genome assembly, number of predicted genes and some of the functional categories were similar to other sequenced Colletotrichum species. The comparative genomic analyses with other species and related fungi identified some unique genes and certain highly expanded gene families of CAZymes, proteases and secondary metabolism associated genes in the genome of C. truncatum. The draft genome assembly and functional annotation of potential pathogenicity genes of C. truncatum provide an important genomic resource for understanding the biology and lifestyle of this important phytopathogen and will pave the way for designing efficient disease control regimens. PMID:28846714
Complete genome sequence of enterohemorrhagic Escherichia coli O157:H7 and genomic comparison with a laboratory strain K-12.

PubMed

Hayashi, T; Makino, K; Ohnishi, M; Kurokawa, K; Ishii, K; Yokoyama, K; Han, C G; Ohtsubo, E; Nakayama, K; Murata, T; Tanaka, M; Tobe, T; Iida, T; Takami, H; Honda, T; Sasakawa, C; Ogasawara, N; Yasunaga, T; Kuhara, S; Shiba, T; Hattori, M; Shinagawa, H

2001-02-28

Escherichia coli O157:H7 is a major food-borne infectious pathogen that causes diarrhea, hemorrhagic colitis, and hemolytic uremic syndrome. Here we report the complete chromosome sequence of an O157:H7 strain isolated from the Sakai outbreak, and the results of genomic comparison with a benign laboratory strain, K-12 MG1655. The chromosome is 5.5 Mb in size, 859 Kb larger than that of K-12. We identified a 4.1-Mb sequence highly conserved between the two strains, which may represent the fundamental backbone of the E. coli chromosome. The remaining 1.4-Mb sequence comprises of O157:H7-specific sequences, most of which are horizontally transferred foreign DNAs. The predominant roles of bacteriophages in the emergence of O157:H7 is evident by the presence of 24 prophages and prophage-like elements that occupy more than half of the O157:H7-specific sequences. The O157:H7 chromosome encodes 1632 proteins and 20 tRNAs that are not present in K-12. Among these, at least 131 proteins are assumed to have virulence-related functions. Genome-wide codon usage analysis suggested that the O157:H7-specific tRNAs are involved in the efficient expression of the strain-specific genes. A complete set of the genes specific to O157:H7 presented here sheds new insight into the pathogenicity and the physiology of O157:H7, and will open a way to fully understand the molecular mechanisms underlying the O157:H7 infection.

Combining Single Strand Oligodeoxynucleotides and CRISPR/Cas9 to Correct Gene Mutations in β-Thalassemia-induced Pluripotent Stem Cells.

PubMed

Niu, Xiaohua; He, Wenyin; Song, Bing; Ou, Zhanhui; Fan, Di; Chen, Yuchang; Fan, Yong; Sun, Xiaofang

2016-08-05

β-Thalassemia (β-Thal) is one of the most common genetic diseases in the world. The generation of patient-specific β-Thal-induced pluripotent stem cells (iPSCs), correction of the disease-causing mutations in those cells, and then differentiation into hematopoietic stem cells offers a new therapeutic strategy for this disease. Here, we designed a CRISPR/Cas9 to specifically target the Homo sapiens hemoglobin β (HBB) gene CD41/42(-CTTT) mutation. We demonstrated that the combination of single strand oligodeoxynucleotides with CRISPR/Cas9 was capable of correcting the HBB gene CD41/42 mutation in β-Thal iPSCs. After applying a correction-specific PCR assay to purify the corrected clones followed by sequencing to confirm mutation correction, we verified that the purified clones retained full pluripotency and exhibited normal karyotyping. Additionally, whole-exome sequencing showed that the mutation load to the exomes was minimal after CRISPR/Cas9 targeting. Furthermore, the corrected iPSCs were selected for erythroblast differentiation and restored the expression of HBB protein compared with the parental iPSCs. This method provides an efficient and safe strategy to correct the HBB gene mutation in β-Thal iPSCs. © 2016 by The American Society for Biochemistry and Molecular Biology, Inc.
TypeLoader: A fast and efficient automated workflow for the annotation and submission of novel full-length HLA alleles.

PubMed

Surendranath, V; Albrecht, V; Hayhurst, J D; Schöne, B; Robinson, J; Marsh, S G E; Schmidt, A H; Lange, V

2017-07-01

Recent years have seen a rapid increase in the discovery of novel allelic variants of the human leukocyte antigen (HLA) genes. Commonly, only the exons encoding the peptide binding domains of novel HLA alleles are submitted. As a result, the IPD-IMGT/HLA Database lacks sequence information outside those regions for the majority of known alleles. This has implications for the application of the new sequencing technologies, which deliver sequence data often covering the complete gene. As these technologies simplify the characterization of the complete gene regions, it is desirable for novel alleles to be submitted as full-length sequences to the database. However, the manual annotation of full-length alleles and the generation of specific formats required by the sequence repositories is prone to error and time consuming. We have developed TypeLoader to address both these facets. With only the full-length sequence as a starting point, Typeloader performs automatic sequence annotation and subsequently handles all steps involved in preparing the specific formats for submission with very little manual intervention. TypeLoader is routinely used at the DKMS Life Science Lab and has aided in the successful submission of more than 900 novel HLA alleles as full-length sequences to the European Nucleotide Archive repository and the IPD-IMGT/HLA Database with a 95% reduction in the time spent on annotation and submission when compared with handling these processes manually. TypeLoader is implemented as a web application and can be easily installed and used on a standalone Linux desktop system or within a Linux client/server architecture. TypeLoader is downloadable from http://www.github.com/DKMS-LSL/typeloader. © 2017 John Wiley & Sons A/S. Published by John Wiley & Sons Ltd.
Rapid identification of lettuce seed germination mutants by bulked segregant analysis and whole genome sequencing.

PubMed

Huo, Heqiang; Henry, Isabelle M; Coppoolse, Eric R; Verhoef-Post, Miriam; Schut, Johan W; de Rooij, Han; Vogelaar, Aat; Joosen, Ronny V L; Woudenberg, Leo; Comai, Luca; Bradford, Kent J

2016-11-01

Lettuce (Lactuca sativa) seeds exhibit thermoinhibition, or failure to complete germination when imbibed at warm temperatures. Chemical mutagenesis was employed to develop lettuce lines that exhibit germination thermotolerance. Two independent thermotolerant lettuce seed mutant lines, TG01 and TG10, were generated through ethyl methanesulfonate mutagenesis. Genetic and physiological analyses indicated that these two mutations were allelic and recessive. To identify the causal gene(s), we applied bulked segregant analysis by whole genome sequencing. For each mutant, bulked DNA samples of segregating thermotolerant (mutant) seeds were sequenced and analyzed for homozygous single-nucleotide polymorphisms. Two independent candidate mutations were identified at different physical positions in the zeaxanthin epoxidase gene (ABSCISIC ACID DEFICIENT 1/ZEAXANTHIN EPOXIDASE, or ABA1/ZEP) in TG01 and TG10. The mutation in TG01 caused an amino acid replacement, whereas the mutation in TG10 resulted in alternative mRNA splicing. Endogenous abscisic acid contents were reduced in both mutants, and expression of the ABA1 gene from wild-type lettuce under its own promoter fully complemented the TG01 mutant. Conventional genetic mapping confirmed that the causal mutations were located near the ZEP/ABA1 gene, but the bulked segregant whole genome sequencing approach more efficiently identified the specific gene responsible for the phenotype. © 2016 The Authors The Plant Journal © 2016 John Wiley & Sons Ltd.
An efficient approach for cloning the dNDP-glucose synthase gene from actinomycetes and its application in Streptomyces spectabilis, a spectinomycin producer.

PubMed

Hyun, C; Kim, S S; Sohng, J K; Hahn, J; Kim, J; Suh, J

2000-02-01

Specifically designed PCR primers were applied to amplify a segment of dTDP-glucose synthase gene from six actinomycete strains. About 300-bp or 580-bp DNA fragments were obtained from all the organisms tested. By DNA sequence analysis, seven amplified fragments showed high homology with dTDP-glucose synthase genes that participate in the biosynthesis of secondary metabolites or in deoxy-sugar moieties in lipopolysaccharides. In addition, we have cloned a 45-kb region of DNA from Streptomyces spectabilis ATCC27741, a spectinomycin producer which contained the dTDP-glucose synthase and dTDP-glucose 4,6-dehydratase genes named spcD and spcE, respectively. The spcE gene was expressed in Escherichia coli and the activity was assayed in cell extracts. The enzyme showed substrate specificity only to dTDP-glucose.
[RS-1 enhanced the efficiency of CRISPR-Cas9 mediated knock-in of human lactoferrin].

PubMed

Zhou, Wenjun; Guo, Rihong; Deng, Mingtian; Wang, Feng; Zhang, Yanli

2017-08-25

This study aims to knock out the goat β-lactoglobulin (BLG) gene using CRISPR-Cas9 system and knock in human lactoferrin (hLF) at the BLG locus, and further study the effect of RAD51 stimulatory compound (RS-1) on homologous recombination efficiency. First, we designed an sgRNA targeting the first exon of goat BLG gene and constructed a co-expression vector pCas9-sgBLG. This sgRNA vector was then transfected into goat ear fibroblasts (GEFs), and the target region was examined by T7EN1 assay and sequencing. Second, we constructed a targeting vector pBHA-hLF-NIE including NEO and EGFP genes based on BLG gene locus. This targeting vector together with pCas9-sgBLG expression vector was co-transfected into GEFs. Transfected cells were then treated with 0, 5, 10 and 20 μmol/L RS-1 for 72 h to analyse the EGFP expression efficiency. Next, we used 800 μg/mL G418 to screen G418-resistent cell clones, and studied hLF site-specific knock-in cell clones by PCR and sequencing. The editing efficiency of sgBLG was between 25% and 31%. The EGFP expression efficiency indicated that the gene knock-in efficiency was improved by RS-1 in a dose-dependent manner, which could reach 3.5-fold compared to the control group. The percentage of positive cells with hLF knock-in was increased to 32.61% when 10 μmol/L RS-1 was used. However, when the concentration of RS-1 increased to 20 μmol/L, the percentage of positive cells decreased to 22.22% and resulted in an increase of senescent cell clone number. These results suggested that hLF knock-in and BLG knock-out in GEFs were achieved by using CRISPR/Cas9 system, and optimum concentration of RS-1 could improve knock-in efficiency, which provides a reference for efficiently obtaining gene knock-in cells using CRISPR/Cas9 in the future.
Conditional genomic rearrangement by designed meiotic recombination using VDE (PI-SceI) in yeast.

PubMed

Fukuda, Tomoyuki; Ohya, Yoshikazu; Ohta, Kunihiro

2007-10-01

Meiotic recombination plays critical roles in the acquisition of genetic diversity and has been utilized for conventional breeding of livestock and crops. The frequency of meiotic recombination is normally low, and is extremely low in regions called "recombination cold domains". Here, we describe a new and highly efficient method to modulate yeast meiotic gene rearrangements using VDE (PI-SceI), an intein-encoded endonuclease that causes an efficient unidirectional meiotic gene conversion at its recognition sequence (VRS). We designed universal targeting vectors, by use of which the strain that inserts the VRS at a desired site is acquired. Meiotic induction of the strains provided unidirectional gene conversions and frequent genetic rearrangements of flanking genes with little impact on cell viability. This system thus opens the way for the designed modulation of meiotic gene rearrangements, regardless of recombinational activity of chromosomal domains. Finally, the VDE-VRS system enabled us to conduct meiosis-specific conditional knockout of genes where VDE-initiated gene conversion disrupts the target gene during meiosis, serving as a novel approach to examine the functions of genes during germination of resultant spores.
Delimiting regulatory sequences of the Drosophila melanogaster Ddc gene.

PubMed Central

Hirsh, J; Morgan, B A; Scholnick, S B

1986-01-01

We delimited sequences necessary for in vivo expression of the Drosophila melanogaster dopa decarboxylase gene Ddc. The expression of in vitro-altered genes was assayed following germ line integration via P-element vectors. Sequences between -209 and -24 were necessary for normally regulated expression, although genes lacking these sequences could be expressed at 10 to 50% of wild-type levels at specific developmental times. These genes showed components of normal developmental expression, which suggests that they retain some regulatory elements. All Ddc genes lacking the normal immediate 5'-flanking sequences were grossly deficient in larval central nervous system expression. Thus, this upstream region must contain at least one element necessary for this expression. A mutated Ddc gene without a normal TATA boxlike sequence used the normal RNA start points, indicating that this sequences is not required for start point specificity. Images PMID:3099170
[Gene transfer agent--a novel and widespread occurrence mechanism of gene exchange in ocean-a review].

PubMed

Cai, Haiyuan

2012-01-01

Gene Transfer Agent (GTA) particles are released by bacteria and resemble small, tailed bacteriophages. GTA particles contain small, random pieces of host DNA rather than GTA structural genes or a phage genome. Gene transfer mediated by GTA is efficient and species specific based on knowledge of currently best studied GTAs produced by 4 anaerobes. Genome sequencing projects have revealed a remarkable distribution of GTA gene clusters in the genomes of marine bacterioplankton, implying GTA may be an important mechanism for horizontal gene transfer in ocean. On basis of characterization of the 4 best studied GTAs, this review described GTAs released by numerically dominant marine bacteria, discussed their properties that were important for horizontal gene transfer in ocean, and gave future perspectives to advance GTA research.
Integration of promoters, inverted repeat sequences and proteomic data into a model for high silencing efficiency of coeliac disease related gliadins in bread wheat

PubMed Central

2013-01-01

Background Wheat gluten has unique nutritional and technological characteristics, but is also a major trigger of allergies and intolerances. One of the most severe diseases caused by gluten is coeliac disease. The peptides produced in the digestive tract by the incomplete digestion of gluten proteins trigger the disease. The majority of the epitopes responsible reside in the gliadin fraction of gluten. The location of the multiple gliadin genes in blocks has to date complicated their elimination by classical breeding techniques or by the use of biotechnological tools. As an approach to silence multiple gliadin genes we have produced 38 transgenic lines of bread wheat containing combinations of two endosperm-specific promoters and three different inverted repeat sequences to silence three fractions of gliadins by RNA interference. Results The effects of the RNA interference constructs on the content of the gluten proteins, total protein and starch, thousand seed weights and SDSS quality tests of flour were analyzed in these transgenic lines in two consecutive years. The characteristics of the inverted repeat sequences were the main factor that determined the efficiency of silencing. The promoter used had less influence on silencing, although a synergy in silencing efficiency was observed when the two promoters were used simultaneously. Genotype and the environment also influenced silencing efficiency. Conclusions We conclude that to obtain wheat lines with an optimum reduction of toxic gluten epitopes one needs to take into account the factors of inverted repeat sequences design, promoter choice and also the wheat background used. PMID:24044767
Development of chromosome-specific markers with high polymorphism for allotetraploid cotton based on genome-wide characterization of simple sequence repeats in diploid cottons (Gossypium arboreum L. and Gossypium raimondii Ulbrich).

PubMed

Lu, Cairui; Zou, Changsong; Zhang, Youping; Yu, Daoqian; Cheng, Hailiang; Jiang, Pengfei; Yang, Wencui; Wang, Qiaolian; Feng, Xiaoxu; Prosper, Mtawa Andrew; Guo, Xiaoping; Song, Guoli

2015-02-06

Tetraploid cotton contains two sets of homologous chromosomes, the At- and Dt-subgenomes. Consequently, many markers in cotton were mapped to multiple positions during linkage genetic map construction, posing a challenge to anchoring linkage groups and mapping economically-important genes to particular chromosomes. Chromosome-specific markers could solve this problem. Recently, the genomes of two diploid species were sequenced whose progenitors were putative contributors of the At- and Dt-subgenomes to tetraploid cotton. These sequences provide a powerful tool for developing chromosome-specific markers given the high level of synteny among tetraploid and diploid cotton genomes. In this study, simple sequence repeats (SSRs) on each chromosome in the two diploid genomes were characterized. Chromosome-specific SSRs were developed by comparative analysis and proved to distinguish chromosomes. A total of 200,744 and 142,409 SSRs were detected on the 13 chromosomes of Gossypium arboreum L. and Gossypium raimondii Ulbrich, respectively. Chromosome-specific SSRs were obtained by comparing SSR flanking sequences from each chromosome with those from the other 25 chromosomes. The average was 7,996 per chromosome. To confirm their chromosome specificity, these SSRs were used to distinguish two homologous chromosomes in tetraploid cotton through linkage group construction. The chromosome-specific SSRs and previously-reported chromosome markers were grouped together, and no marker mapped to another homologous chromosome, proving that the chromosome-specific SSRs were unique and could distinguish homologous chromosomes in tetraploid cotton. Because longer dinucleotide AT-rich repeats were the most polymorphic in previous reports, the SSRs on each chromosome were sorted by motif type and repeat length for convenient selection. The primer sequences of all chromosome-specific SSRs were also made publicly available. Chromosome-specific SSRs are efficient tools for chromosome identification by anchoring linkage groups to particular chromosomes during genetic mapping and are especially useful in mapping of qualitative-trait genes or quantitative trait loci with just a few markers. The SSRs reported here will facilitate a number of genetic and genomic studies in cotton, including construction of high-density genetic maps, positional gene cloning, fingerprinting, and genetic diversity and comparative evolutionary analyses among Gossypium species.
Abundance of Dioxygenase Genes Similar to Ralstonia sp. Strain U2 nagAc Is Correlated with Naphthalene Concentrations in Coal Tar-Contaminated Freshwater Sediments

PubMed Central

Dionisi, Hebe M.; Chewning, Christopher S.; Morgan, Katherine H.; Menn, Fu-Min; Easter, James P.; Sayler, Gary S.

2004-01-01

We designed a real-time PCR assay able to recognize dioxygenase large-subunit gene sequences with more than 90% similarity to the Ralstonia sp. strain U2 nagAc gene (nagAc-like gene sequences) in order to study the importance of organisms carrying these genes in the biodegradation of naphthalene. Sequencing of PCR products indicated that this real-time PCR assay was specific and able to detect a variety of nagAc-like gene sequences. One to 100 ng of contaminated-sediment total DNA in 25-μl reaction mixtures produced an amplification efficiency of 0.97 without evident PCR inhibition. The assay was applied to surficial freshwater sediment samples obtained in or in close proximity to a coal tar-contaminated Superfund site. Naphthalene concentrations in the analyzed samples varied between 0.18 and 106 mg/kg of dry weight sediment. The assay for nagAc-like sequences indicated the presence of (4.1 ± 0.7) × 103 to (2.9 ± 0.3) × 105 copies of nagAc-like dioxygenase genes per μg of DNA extracted from sediment samples. These values corresponded to (1.2 ± 0.6) × 105 to (5.4 ± 0.4) × 107 copies of this target per g of dry weight sediment when losses of DNA during extraction were taken into account. There was a positive correlation between naphthalene concentrations and nagAc-like gene copies per microgram of DNA (r = 0.89) and per gram of dry weight sediment (r = 0.77). These results provide evidence of the ecological significance of organisms carrying nagAc-like genes in the biodegradation of naphthalene. PMID:15240274
Whole exome sequencing frequently detects a monogenic cause in early onset nephrolithiasis and nephrocalcinosis.

PubMed

Daga, Ankana; Majmundar, Amar J; Braun, Daniela A; Gee, Heon Yung; Lawson, Jennifer A; Shril, Shirlee; Jobst-Schwan, Tilman; Vivante, Asaf; Schapiro, David; Tan, Weizhen; Warejko, Jillian K; Widmeier, Eugen; Nelson, Caleb P; Fathy, Hanan M; Gucev, Zoran; Soliman, Neveen A; Hashmi, Seema; Halbritter, Jan; Halty, Margarita; Kari, Jameela A; El-Desoky, Sherif; Ferguson, Michael A; Somers, Michael J G; Traum, Avram Z; Stein, Deborah R; Daouk, Ghaleb H; Rodig, Nancy M; Katz, Avi; Hanna, Christian; Schwaderer, Andrew L; Sayer, John A; Wassner, Ari J; Mane, Shrikant; Lifton, Richard P; Milosevic, Danko; Tasic, Velibor; Baum, Michelle A; Hildebrandt, Friedhelm

2018-01-01

The incidence of nephrolithiasis continues to rise. Previously, we showed that a monogenic cause could be detected in 11.4% of individuals with adult-onset nephrolithiasis or nephrocalcinosis and in 16.7-20.8% of individuals with onset before 18 years of age, using gene panel sequencing of 30 genes known to cause nephrolithiasis/nephrocalcinosis. To overcome the limitations of panel sequencing, we utilized whole exome sequencing in 51 families, who presented before age 25 years with at least one renal stone or with a renal ultrasound finding of nephrocalcinosis to identify the underlying molecular genetic cause of disease. In 15 of 51 families, we detected a monogenic causative mutation by whole exome sequencing. A mutation in seven recessive genes (AGXT, ATP6V1B1, CLDN16, CLDN19, GRHPR, SLC3A1, SLC12A1), in one dominant gene (SLC9A3R1), and in one gene (SLC34A1) with both recessive and dominant inheritance was detected. Seven of the 19 different mutations were not previously described as disease-causing. In one family, a causative mutation in one of 117 genes that may represent phenocopies of nephrolithiasis-causing genes was detected. In nine of 15 families, the genetic diagnosis may have specific implications for stone management and prevention. Several factors that correlated with the higher detection rate in our cohort were younger age at onset of nephrolithiasis/nephrocalcinosis, presence of multiple affected members in a family, and presence of consanguinity. Thus, we established whole exome sequencing as an efficient approach toward a molecular genetic diagnosis in individuals with nephrolithiasis/nephrocalcinosis who manifest before age 25 years. Copyright © 2017 International Society of Nephrology. Published by Elsevier Inc. All rights reserved.
Draft sequencing and comparative genomics of Xylella fastidiosa strains reveal novel biological insights.

PubMed

Bhattacharyya, Anamitra; Stilwagen, Stephanie; Reznik, Gary; Feil, Helene; Feil, William S; Anderson, Iain; Bernal, Axel; D'Souza, Mark; Ivanova, Natalia; Kapatral, Vinayak; Larsen, Niels; Los, Tamara; Lykidis, Athanasios; Selkov, Eugene; Walunas, Theresa L; Purcell, Alexander; Edwards, Rob A; Hawkins, Trevor; Haselkorn, Robert; Overbeek, Ross; Kyrpides, Nikos C; Predki, Paul F

2002-10-01

Draft sequencing is a rapid and efficient method for determining the near-complete sequence of microbial genomes. Here we report a comparative analysis of one complete and two draft genome sequences of the phytopathogenic bacterium, Xylella fastidiosa, which causes serious disease in plants, including citrus, almond, and oleander. We present highlights of an in silico analysis based on a comparison of reconstructions of core biological subsystems. Cellular pathway reconstructions have been used to identify a small number of genes, which are likely to reside within the draft genomes but are not captured in the draft assembly. These represented only a small fraction of all genes and were predominantly large and small ribosomal subunit protein components. By using this approach, some of the inherent limitations of draft sequence can be significantly reduced. Despite the incomplete nature of the draft genomes, it is possible to identify several phage-related genes, which appear to be absent from the draft genomes and not the result of insufficient sequence sampling. This region may therefore identify potential host-specific functions. Based on this first functional reconstruction of a phytopathogenic microbe, we spotlight an unusual respiration machinery as a potential target for biological control. We also predicted and developed a new defined growth medium for Xylella.
Sex determination of ovine embryos by SRY and amelogenin (AMEL) genes using maternal circulating cell free DNA.

PubMed

Saberivand, Adel; Ahsan, Sima

2016-01-01

Simple and precise methods for sex determination in animals are a pre-requisite for a number of applications in animal production and forensics. Some of the existing methods depend only on the detection of Y-chromosome specific sequences. However, the detection of Y and X-chromosome specific sequences is advantageous. In the present study the accuracy of sex determination by SRY (sex-determining region Y) and AMEL (Amelogenin) gene detection was assessed using a polymerase chain reaction (PCR) of DNA extracted from free fetal cells in maternal blood, which is noninvasive for fetus and easier to collect. The PCR amplification of SRY primers produced a single band of 171bp from ewes bearing a male fetus, whereas no band was amplified from the DNA extracted from ewes pregnant to a female fetus. Moreover, two bands of 182 and 242bp in male and a single band of 242 in female fetuses were produced by AMEL gene primers in the PCR reaction. Using this technique 100% of samples were successfully sexed, excluding twins. In conclusion, we demonstrated that sex determination using DNA of free fetal cells in maternal plasma is efficient using both SRY and AMEL gene sequences. It also is evident that this method is not suitable for sex determination of twin pregnancies. Copyright © 2015 Elsevier B.V. All rights reserved.
A dual selection based, targeted gene replacement tool for Magnaporthe grisea and Fusarium oxysporum.

PubMed

Khang, Chang Hyun; Park, Sook-Young; Lee, Yong-Hwan; Kang, Seogchan

2005-06-01

Rapid progress in fungal genome sequencing presents many new opportunities for functional genomic analysis of fungal biology through the systematic mutagenesis of the genes identified through sequencing. However, the lack of efficient tools for targeted gene replacement is a limiting factor for fungal functional genomics, as it often necessitates the screening of a large number of transformants to identify the desired mutant. We developed an efficient method of gene replacement and evaluated factors affecting the efficiency of this method using two plant pathogenic fungi, Magnaporthe grisea and Fusarium oxysporum. This method is based on Agrobacterium tumefaciens-mediated transformation with a mutant allele of the target gene flanked by the herpes simplex virus thymidine kinase (HSVtk) gene as a conditional negative selection marker against ectopic transformants. The HSVtk gene product converts 5-fluoro-2'-deoxyuridine to a compound toxic to diverse fungi. Because ectopic transformants express HSVtk, while gene replacement mutants lack HSVtk, growing transformants on a medium amended with 5-fluoro-2'-deoxyuridine facilitates the identification of targeted mutants by counter-selecting against ectopic transformants. In addition to M. grisea and F. oxysporum, the method and associated vectors are likely to be applicable to manipulating genes in a broad spectrum of fungi, thus potentially serving as an efficient, universal functional genomic tool for harnessing the growing body of fungal genome sequence data to study fungal biology.
Sequence and structure-specific elements of HERG mRNA determine channel synthesis and trafficking efficiency

PubMed Central

Sroubek, Jakub; Krishnan, Yamini; McDonald, Thomas V.

2013-01-01

Human ether-á-gogo-related gene (HERG) encodes a potassium channel that is highly susceptible to deleterious mutations resulting in susceptibility to fatal cardiac arrhythmias. Most mutations adversely affect HERG channel assembly and trafficking. Why the channel is so vulnerable to missense mutations is not well understood. Since nothing is known of how mRNA structural elements factor in channel processing, we synthesized a codon-modified HERG cDNA (HERG-CM) where the codons were synonymously changed to reduce GC content, secondary structure, and rare codon usage. HERG-CM produced typical IKr-like currents; however, channel synthesis and processing were markedly different. Translation efficiency was reduced for HERG-CM, as determined by heterologous expression, in vitro translation, and polysomal profiling. Trafficking efficiency to the cell surface was greatly enhanced, as assayed by immunofluorescence, subcellular fractionation, and surface labeling. Chimeras of HERG-NT/CM indicated that trafficking efficiency was largely dependent on 5′ sequences, while translation efficiency involved multiple areas. These results suggest that HERG translation and trafficking rates are independently governed by noncoding information in various regions of the mRNA molecule. Noncoding information embedded within the mRNA may play a role in the pathogenesis of hereditary arrhythmia syndromes and could provide an avenue for targeted therapeutics.—Sroubek, J., Krishnan, Y., McDonald, T V. Sequence- and structure-specific elements of HERG mRNA determine channel synthesis and trafficking efficiency. PMID:23608144
An automatic and efficient pipeline for disease gene identification through utilizing family-based sequencing data.

PubMed

Song, Dandan; Li, Ning; Liao, Lejian

2015-01-01

Due to the generation of enormous amounts of data at both lower costs as well as in shorter times, whole-exome sequencing technologies provide dramatic opportunities for identifying disease genes implicated in Mendelian disorders. Since upwards of thousands genomic variants can be sequenced in each exome, it is challenging to filter pathogenic variants in protein coding regions and reduce the number of missing true variants. Therefore, an automatic and efficient pipeline for finding disease variants in Mendelian disorders is designed by exploiting a combination of variants filtering steps to analyze the family-based exome sequencing approach. Recent studies on the Freeman-Sheldon disease are revisited and show that the proposed method outperforms other existing candidate gene identification methods.
Initial description of primate-specific cystine-knot Prometheus genes and differential gene expansions of D-dopachrome tautomerase genes

PubMed Central

Premzl, Marko

2015-01-01

Using eutherian comparative genomic analysis protocol and public genomic sequence data sets, the present work attempted to update and revise two gene data sets. The most comprehensive third party annotation gene data sets of eutherian adenohypophysis cystine-knot genes (128 complete coding sequences), and d-dopachrome tautomerases and macrophage migration inhibitory factor genes (30 complete coding sequences) were annotated. For example, the present study first described primate-specific cystine-knot Prometheus genes, as well as differential gene expansions of D-dopachrome tautomerase genes. Furthermore, new frameworks of future experiments of two eutherian gene data sets were proposed. PMID:25941635
NETWORK ASSISTED ANALYSIS TO REVEAL THE GENETIC BASIS OF AUTISM1

PubMed Central

Liu, Li; Lei, Jing; Roeder, Kathryn

2016-01-01

While studies show that autism is highly heritable, the nature of the genetic basis of this disorder remains illusive. Based on the idea that highly correlated genes are functionally interrelated and more likely to affect risk, we develop a novel statistical tool to find more potentially autism risk genes by combining the genetic association scores with gene co-expression in specific brain regions and periods of development. The gene dependence network is estimated using a novel partial neighborhood selection (PNS) algorithm, where node specific properties are incorporated into network estimation for improved statistical and computational efficiency. Then we adopt a hidden Markov random field (HMRF) model to combine the estimated network and the genetic association scores in a systematic manner. The proposed modeling framework can be naturally extended to incorporate additional structural information concerning the dependence between genes. Using currently available genetic association data from whole exome sequencing studies and brain gene expression levels, the proposed algorithm successfully identified 333 genes that plausibly affect autism risk. PMID:27134692
Efficient gusA transient expression in Porphyra yezoensis protoplasts mediated by endogenous beta-tubulin flanking sequences

NASA Astrophysics Data System (ADS)

Gong, Qianhong; Yu, Wengong; Dai, Jixun; Liu, Hongquan; Xu, Rifu; Guan, Huashi; Pan, Kehou

2007-01-01

Endogenous tubulin promoter has been widely used for expressing foreign genes in green algae, but the efficiency and feasibility of endogenous tubulin promoter in the economically important Porphyra yezoensis (Rhodophyta) are unknown. In this study, the flanking sequences of beta-tubulin gene from P. yezoensis were amplified and two transient expression vectors were constructed to determine their transcription promoting feasibility for foreign gene gusA. The testing vector pATubGUS was constructed by inserting 5'-and 3'-flanking regions ( Tub5' and Tub3') up-and down-stream of β-glucuronidase (GUS) gene ( gusA), respectively, into pA, a derivative of pCAT®3-enhancer vector. The control construct, pAGUSTub3, contains only gusA and Tub3'. These constructs were electroporated into P. yezoensis protoplasts and the GUS activities were quantitatively analyzed by spectrometry. The results demonstrated that gusA gene was efficiently expressed in P. yezoensis protoplasts under the regulation of 5'-flanking sequence of the beta-tubulin gene. More interestingly, the pATubGUS produced stronger GUS activity in P. yezoensis protoplasts when compared to the result from pBI221, in which the gusA gene was directed by a constitutive CaMV 35S promoter. The data suggest that the integration of P. yezoensis protoplast and its endogenous beta-tubulin flanking sequences is a potential novel system for foreign gene expression.

Genomics insights into different cellobiose hydrolysis activities in two Trichoderma hamatum strains.

PubMed

Cheng, Peng; Liu, Bo; Su, Yi; Hu, Yao; Hong, Yahui; Yi, Xinxin; Chen, Lei; Su, Shengying; Chu, Jeffrey S C; Chen, Nansheng; Xiong, Xingyao

2017-04-19

Efficient biomass bioconversion is a promising solution to alternative energy resources and environmental issues associated with lignocellulosic wastes. The Trichoderma species of cellulolytic fungi have strong cellulose-degrading capability, and their cellulase systems have been extensively studied. Currently, a major limitation of Trichoderma strains is their low production of β-glucosidases. We isolated two Trichoderma hamatum strains YYH13 and YYH16 with drastically different cellulose degrading efficiencies. YYH13 has higher cellobiose-hydrolyzing efficiency. To understand mechanisms underlying such differences, we sequenced the genomes of YYH13 and YYH16, which are essentially identical (38.93 and 38.92 Mb, respectively) and are similar to that of the T. hamatum strain GD12. Using GeneMark-ES, we annotated 11,316 and 11,755 protein-coding genes in YYH13 and YYH16, respectively. Comparative analysis identified 13 functionally important genes in YYH13 under positive selection. Through examining orthologous relationships, we identified 172,655, and 320 genome-specific genes in YYH13, YYH16, and GD12, respectively. We found 15 protease families that show differences between YYH13 and YYH16. Enzymatic tests showed that exoglucanase, endoglucanase, and β-glucosidase activities were higher in YYH13 than YYH16. Additionally, YYH13 contains 10 families of carbohydrate-active enzymes, including GH1, GH3, GH18, GH35, and GH55 families of chitinases, glucosidases, galactosidases, and glucanases, which are subject to stronger positive selection pressure. Furthermore, we found that the β-glucosidase gene (YYH1311079) and pGEX-KG/YYH1311079 bacterial expression vector may provide valuable insight for designing β-glucosidase with higher cellobiose-hydrolyzing efficiencies. This study suggests that the YYH13 strain of T. hamatum has the potential to serve as a model organism for producing cellulase because of its strong ability to efficiently degrade cellulosic biomass. The genome sequences of YYH13 and YYH16 represents a valuable resource for studying efficient production of biofuels.
Phage T4 SegB protein is a homing endonuclease required for the preferred inheritance of T4 tRNA gene region occurring in co-infection with a related phage.

PubMed

Brok-Volchanskaya, Vera S; Kadyrov, Farid A; Sivogrivov, Dmitry E; Kolosov, Peter M; Sokolov, Andrey S; Shlyapnikov, Michael G; Kryukov, Valentine M; Granovsky, Igor E

2008-04-01

Homing endonucleases initiate nonreciprocal transfer of DNA segments containing their own genes and the flanking sequences by cleaving the recipient DNA. Bacteriophage T4 segB gene, which is located in a cluster of tRNA genes, encodes a protein of unknown function, homologous to homing endonucleases of the GIY-YIG family. We demonstrate that SegB protein is a site-specific endonuclease, which produces mostly 3' 2-nt protruding ends at its DNA cleavage site. Analysis of SegB cleavage sites suggests that SegB recognizes a 27-bp sequence. It contains 11-bp conserved sequence, which corresponds to a conserved motif of tRNA TpsiC stem-loop, whereas the remainder of the recognition site is rather degenerate. T4-related phages T2L, RB1 and RB3 contain tRNA gene regions that are homologous to that of phage T4 but lack segB gene and several tRNA genes. In co-infections of phages T4 and T2L, segB gene is inherited with nearly 100% of efficiency. The preferred inheritance depends absolutely on the segB gene integrity and is accompanied by the loss of the T2L tRNA gene region markers. We suggest that SegB is a homing endonuclease that functions to ensure spreading of its own gene and the surrounding tRNA genes among T4-related phages.
mRNA localization to the mitochondrial surface allows the efficient translocation inside the organelle of a nuclear recoded ATP6 protein

PubMed Central

Kaltimbacher, Valérie; Bonnet, Crystel; Lecoeuvre, Gaëlle; Forster, Valérie; Sahel, José-Alain; Corral-Debrinski, Marisol

2006-01-01

As previously established in yeast, two sequences within mRNAs are responsible for their specific localization to the mitochondrial surface—the region coding for the mitochondrial targeting sequence and the 3′UTR. This phenomenon is conserved in human cells. Therefore, we decided to use mRNA localization as a tool to address to mitochondria, a protein that is not normally imported. For this purpose, we associated a nuclear recoded ATP6 gene with the mitochondrial targeting sequence and the 3′UTR of the nuclear SOD2 gene, which mRNA exclusively localizes to the mitochondrial surface in HeLa cells. The ATP6 gene is naturally located into the organelle and encodes a highly hydrophobic protein of the respiratory chain complex V. In this study, we demonstrated that hybrid ATP6 mRNAs, as the endogenous SOD2 mRNA, localize to the mitochondrial surface in human cells. Remarkably, fusion proteins localize to mitochondria in vivo. Indeed, ATP6 precursors synthesized in the cytoplasm were imported into mitochondria in a highly efficient way, especially when both the MTS and the 3′UTR of the SOD2 gene were associated with the re-engineered ATP6 gene. Hence, these data indicate that mRNA targeting to the mitochondrial surface represents an attractive strategy for allowing the mitochondrial import of proteins originally encoded by the mitochondrial genome without any amino acid change in the protein that could interfere with its biologic activity. PMID:16751614
Engineering Synthetic Gene Circuits in Living Cells with CRISPR Technology.

PubMed

Jusiak, Barbara; Cleto, Sara; Perez-Piñera, Pablo; Lu, Timothy K

2016-07-01

One of the goals of synthetic biology is to build regulatory circuits that control cell behavior, for both basic research purposes and biomedical applications. The ability to build transcriptional regulatory devices depends on the availability of programmable, sequence-specific, and effective synthetic transcription factors (TFs). The prokaryotic clustered regularly interspaced short palindromic repeat (CRISPR) system, recently harnessed for transcriptional regulation in various heterologous host cells, offers unprecedented ease in designing synthetic TFs. We review how CRISPR can be used to build synthetic gene circuits and discuss recent advances in CRISPR-mediated gene regulation that offer the potential to build increasingly complex, programmable, and efficient gene circuits in the future. Copyright © 2016. Published by Elsevier Ltd.
Improvement and Optimization of Two Engineered Phage Resistance Mechanisms in Lactococcus lactis

PubMed Central

McGrath, Stephen; Fitzgerald, Gerald F.; van Sinderen, Douwe

2001-01-01

Homologous replication module genes were identified for four P335 type phages. DNA sequence analysis revealed that all four phages exhibited more than 90% DNA homology for at least two genes, designated rep2009 and orf17. One of these genes, rep2009, codes for a putative replisome organizer protein and contains an assumed origin of phage DNA replication (ori2009), which was identical for all four phages. DNA fragments representing the ori2009 sequence confer a phage-encoded resistance (Per) phenotype on lactococcal hosts when they are supplied on a high-copy-number vector. Furthermore, cloning multiple copies of the ori2009 sequence was found to increase the effectiveness of the Per phenotype conferred. A number of antisense plasmids targeting specific genes of the replication module were constructed. Two separate plasmids targeting rep2009 and orf17 were found to efficiently inhibit proliferation of all four phages by interfering with intracellular phage DNA replication. These results represent two highly effective strategies for inhibiting bacteriophage proliferation, and they also identify a novel gene, orf17, which appears to be important for phage DNA replication. Furthermore, these results indicate that although the actual mechanisms of DNA replication are very similar, if not identical, for all four phages, expression of the replication genes is significantly different in each case. PMID:11157223
Characterization of the temperate phage vB_RleM_PPF1 and its site-specific integration into the Rhizobium leguminosarum F1 genome.

PubMed

Halmillawewa, Anupama P; Restrepo-Córdoba, Marcela; Perry, Benjamin J; Yost, Christopher K; Hynes, Michael F

2016-02-01

Bacteriophages may play an important role in regulating population size and diversity of the root nodule symbiont Rhizobium leguminosarum, as well as participating in horizontal gene transfer. Although phages that infect this species have been isolated in the past, our knowledge of their molecular biology, and especially of genome composition, is extremely limited, and this lack of information impacts on the ability to assess phage population dynamics and limits potential agricultural applications of rhizobiophages. To help address this deficit in available sequence and biological information, the complete genome sequence of the Myoviridae temperate phage PPF1 that infects R. leguminosarum biovar viciae strain F1 was determined. The genome is 54,506 bp in length with an average G+C content of 61.9 %. The genome contains 94 putative open reading frames (ORFs) and 74.5 % of these predicted ORFs share homology at the protein level with previously reported sequences in the database. However, putative functions could only be assigned to 25.5 % (24 ORFs) of the predicted genes. PPF1 was capable of efficiently lysogenizing its rhizobial host R. leguminosarum F1. The site-specific recombination system of the phage targets an integration site that lies within a putative tRNA-Pro (CGG) gene in R. leguminosarum F1. Upon integration, the phage is capable of restoring the disrupted tRNA gene, owing to the 50 bp homologous sequence (att core region) it shares with its rhizobial host genome. Phage PPF1 is the first temperate phage infecting members of the genus Rhizobium for which a complete genome sequence, as well as other biological data such as the integration site, is available.
The immediate upstream region of the 5′-UTR from the AUG start codon has a pronounced effect on the translational efficiency in Arabidopsis thaliana

PubMed Central

Kim, Younghyun; Lee, Goeun; Jeon, Eunhyun; Sohn, Eun ju; Lee, Yongjik; Kang, Hyangju; Lee, Dong wook; Kim, Dae Heon; Hwang, Inhwan

2014-01-01

The nucleotide sequence around the translational initiation site is an important cis-acting element for post-transcriptional regulation. However, it has not been fully understood how the sequence context at the 5′-untranslated region (5′-UTR) affects the translational efficiency of individual mRNAs. In this study, we provide evidence that the 5′-UTRs of Arabidopsis genes showing a great difference in the nucleotide sequence vary greatly in translational efficiency with more than a 200-fold difference. Of the four types of nucleotides, the A residue was the most favourable nucleotide from positions −1 to −21 of the 5′-UTRs in Arabidopsis genes. In particular, the A residue in the 5′-UTR from positions −1 to −5 was required for a high-level translational efficiency. In contrast, the T residue in the 5′-UTR from positions −1 to −5 was the least favourable nucleotide in translational efficiency. Furthermore, the effect of the sequence context in the −1 to −21 region of the 5′-UTR was conserved in different plant species. Based on these observations, we propose that the sequence context immediately upstream of the AUG initiation codon plays a crucial role in determining the translational efficiency of plant genes. PMID:24084084
Long-term functional adeno-associated virus-microdystrophin expression in the dystrophic CXMDj dog.

PubMed

Koo, Taeyoung; Okada, Takashi; Athanasopoulos, Takis; Foster, Helen; Takeda, Shin'ichi; Dickson, George

2011-09-01

Duchenne muscular dystrophy (DMD) is a severe, inherited, muscle-wasting disorder caused by mutations in the dystrophin gene. Preclinical studies of adeno-associated virus gene therapy for DMD have been described in mouse and dog models of this disease. However, low and transient expression of microdystrophin in dystrophic dogs and a lack of long-term microdystrophin expression associated with a CD8(+) T-cell response in DMD patients suggests that the development of improved microdystrophin genes and delivery strategies is essential for successful clinical trials in DMD patients. We have previously shown the efficiency of mRNA sequence optimization of mouse microdystrophin in ameliorating the pathology of dystrophic mdx mice. In the present study, we generated adeno-associated virus (AAV)2/8 vectors expressing an mRNA sequence-optimized canine microdystrophin under the control of a muscle-specific promoter and injected intramuscularly into a single canine X-linked muscular dystrophy (CXMDj) dog. Expression of stable and high levels of microdystrophin was observed along with an association of the dystrophin-associated protein complex in intramuscularly injected muscles of a CXMDj dog for at least 8 weeks without immune responses. Treated muscles were highly protected from dystrophic damage, with reduced levels of myofiber permeability and central nucleation. The data obtained in the present study suggest that the use of canine-specific and mRNA sequence-optimized microdystrophin genes in conjunction with a muscle-specific promoter results in high and stable levels of microdystrophin expression in a canine model of DMD. This approach will potentially allow the reduction of dosage and contribute towards the development of a safe and effective AAV gene therapy clinical trial protocol for DMD. Copyright © 2011 John Wiley & Sons, Ltd.
Simultaneous and Sequential Integration by Cre/loxP Site-Specific Recombination in Saccharomyces cerevisiae.

PubMed

Choi, Ho-Jung; Kim, Yeon-Hee

2018-05-28

A Cre/ loxP -δ-integration system was developed to allow sequential and simultaneous integration of a multiple gene expression cassette in Saccharomyces cerevisiae . To allow repeated integrations, the reusable Candida glabrata MARKER ( CgMARKER ) carrying loxP sequences was used, and the integrated CgMARKER was efficiently removed by inducing Cre recombinase. The XYLP and XYLB genes encoding endoxylanase and β-xylosidase, respectively, were used as model genes for xylan metabolism in this system, and the copy number of these genes was increased to 15.8 and 16.9 copies/cell, respectively, by repeated integration. This integration system is a promising approach for the easy construction of yeast strains with enhanced metabolic pathways through multicopy gene expression.
Random Splicing of Several Exons Caused by a Single Base Change in the Target Exon of CRISPR/Cas9 Mediated Gene Knockout.

PubMed

Kapahnke, Marcel; Banning, Antje; Tikkanen, Ritva

2016-12-14

The clustered regularly interspaced short palindromic repeats (CRISPR)-associated sequence 9 (CRISPR/Cas9) system is widely used for genome editing purposes as it facilitates an efficient knockout of a specific gene in, e.g. cultured cells. Targeted double-strand breaks are introduced to the target sequence of the guide RNAs, which activates the cellular DNA repair mechanism for non-homologous-end-joining, resulting in unprecise repair and introduction of small deletions or insertions. Due to this, sequence alterations in the coding region of the target gene frequently cause frame-shift mutations, facilitating degradation of the mRNA. We here show that such CRISPR/Cas9-mediated alterations in the target exon may also result in altered splicing of the respective pre-mRNA, most likely due to mutations of splice-regulatory sequences. Using the human FLOT-1 gene as an example, we demonstrate that such altered splicing products also give rise to aberrant protein products. These may potentially function as dominant-negative proteins and thus interfere with the interpretation of the data generated with these cell lines. Since most researchers only control the consequences of CRISPR knockout at genomic and protein level, our data should encourage to also check the alterations at the mRNA level.
Construction of transformed, cultured silkworm cells and transgenic silkworm using the site-specific integrase system from phage φC31.

PubMed

Yin, Yajuan; Cao, Guangli; Xue, Renyu; Gong, Chengliang

2014-10-01

The Streptomyces bacteriophage, φC31, uses a site-specific integrase enzyme to perform efficient recombination. The recombination system uses specific sequences to integrate exogenous DNA from the phage into a host. The sequences are known as the attP site in the phage and the attB site in the host. The system can be used as a genetic manipulation tool. In this study it has been applied to the transformation of cultured BmN cells and the construction of transgenic Bombyx mori individuals. A plasmid, pSK-attB/Pie1-EGFP/Zeo-PASV40, containing a cassette designed to express a egfp-zeocin fusion gene, was co-transfected into cultured BmN cells with a helper plasmid, pSK-Pie1/NLS-Int/NSL. Expression of the egfp-zeocin fusion gene was driven by an ie-1 promoter, downstream of a φC31 attB site. The helper plasmid encoded the φC31 integrase enzyme, which was flanked by two nuclear localization signals. Expression of the egfp-zeocin fusion gene could be observed in transformed cells. The two plasmids were also transferred into silkworm eggs to obtain transgenic silkworms. Successful integration of the fusion gene was indicated by the detection of green fluorescence, which was emitted by the silkworms. Nucleotide sequence analysis demonstrated that the attB site had been cut, to allow recombination between the attB and endogenous pseudo attP sites in the cultured silkworm cells and silkworm individuals.
Fluorescent signatures for variable DNA sequences

PubMed Central

Rice, John E.; Reis, Arthur H.; Rice, Lisa M.; Carver-Brown, Rachel K.; Wangh, Lawrence J.

2012-01-01

Life abounds with genetic variations writ in sequences that are often only a few hundred nucleotides long. Rapid detection of these variations for identification of genetic diseases, pathogens and organisms has become the mainstay of molecular science and medicine. This report describes a new, highly informative closed-tube polymerase chain reaction (PCR) strategy for analysis of both known and unknown sequence variations. It combines efficient quantitative amplification of single-stranded DNA targets through LATE-PCR with sets of Lights-On/Lights-Off probes that hybridize to their target sequences over a broad temperature range. Contiguous pairs of Lights-On/Lights-Off probes of the same fluorescent color are used to scan hundreds of nucleotides for the presence of mutations. Sets of probes in different colors can be combined in the same tube to analyze even longer single-stranded targets. Each set of hybridized Lights-On/Lights-Off probes generates a composite fluorescent contour, which is mathematically converted to a sequence-specific fluorescent signature. The versatility and broad utility of this new technology is illustrated in this report by characterization of variant sequences in three different DNA targets: the rpoB gene of Mycobacterium tuberculosis, a sequence in the mitochondrial cytochrome C oxidase subunit 1 gene of nematodes and the V3 hypervariable region of the bacterial 16 s ribosomal RNA gene. We anticipate widespread use of these technologies for diagnostics, species identification and basic research. PMID:22879378
Implementation of next-generation sequencing for molecular diagnosis of hereditary breast and ovarian cancer highlights its genetic heterogeneity.

PubMed

Pinto, Pedro; Paulo, Paula; Santos, Catarina; Rocha, Patrícia; Pinto, Carla; Veiga, Isabel; Pinheiro, Manuela; Peixoto, Ana; Teixeira, Manuel R

2016-09-01

Molecular diagnosis of hereditary breast and ovarian cancer (HBOC) by standard methodologies has been limited to the BRCA1 and BRCA2 genes. With the recent development of new sequencing methodologies, the speed and efficiency of DNA testing have dramatically improved. The aim of this work was to validate the use of next-generation sequencing (NGS) for the detection of BRCA1/BRCA2 point mutations in a diagnostic setting and to study the role of other genes associated with HBOC in Portuguese families. A cohort of 94 high-risk families was included in the study, and they were initially screened for the two common founder mutations with variant-specific methods. Fourteen index patients were shown to carry the Portuguese founder mutation BRCA2 c.156_157insAlu, and the remaining 80 were analyzed in parallel by Sanger sequencing for the BRCA1/BRCA2 genes and by NGS for a panel of 17 genes that have been described as involved in predisposition to breast and/or ovarian cancer. A total of 506 variants in the BRCA1/BRCA2 genes were detected by both methodologies, with a 100 % concordance between them. This strategy allowed the detection of a total of 39 deleterious mutations in the 94 index patients, namely 10 in BRCA1 (25.6 %), 21 in BRCA2 (53.8 %), four in PALB2 (10.3 %), two in ATM (5.1 %), one in CHEK2 (2.6 %), and one in TP53 (2.6 %), with 20.5 % of the deleterious mutations being found in genes other than BRCA1/BRCA2. These results demonstrate the efficiency of NGS for the detection of BRCA1/BRCA2 point mutations and highlight the genetic heterogeneity of HBOC.
Deep sampling of the Palomero maize transcriptome by a high throughput strategy of pyrosequencing.

PubMed

Vega-Arreguín, Julio C; Ibarra-Laclette, Enrique; Jiménez-Moraila, Beatriz; Martínez, Octavio; Vielle-Calzada, Jean Philippe; Herrera-Estrella, Luis; Herrera-Estrella, Alfredo

2009-07-06

In-depth sequencing analysis has not been able to determine the overall complexity of transcriptional activity of a plant organ or tissue sample. In some cases, deep parallel sequencing of Expressed Sequence Tags (ESTs), although not yet optimized for the sequencing of cDNAs, has represented an efficient procedure for validating gene prediction and estimating overall gene coverage. This approach could be very valuable for complex plant genomes. In addition, little emphasis has been given to efforts aiming at an estimation of the overall transcriptional universe found in a multicellular organism at a specific developmental stage. To explore, in depth, the transcriptional diversity in an ancient maize landrace, we developed a protocol to optimize the sequencing of cDNAs and performed 4 consecutive GS20-454 pyrosequencing runs of a cDNA library obtained from 2 week-old Palomero Toluqueño maize plants. The protocol reported here allowed obtaining over 90% of informative sequences. These GS20-454 runs generated over 1.5 Million reads, representing the largest amount of sequences reported from a single plant cDNA library. A collection of 367,391 quality-filtered reads (30.09 Mb) from a single run was sufficient to identify transcripts corresponding to 34% of public maize ESTs databases; total sequences generated after 4 filtered runs increased this coverage to 50%. Comparisons of all 1.5 Million reads to the Maize Assembled Genomic Islands (MAGIs) provided evidence for the transcriptional activity of 11% of MAGIs. We estimate that 5.67% (86,069 sequences) do not align with public ESTs or annotated genes, potentially representing new maize transcripts. Following the assembly of 74.4% of the reads in 65,493 contigs, real-time PCR of selected genes confirmed a predicted correlation between the abundance of GS20-454 sequences and corresponding levels of gene expression. A protocol was developed that significantly increases the number, length and quality of cDNA reads using massive 454 parallel sequencing. We show that recurrent 454 pyrosequencing of a single cDNA sample is necessary to attain a thorough representation of the transcriptional universe present in maize, that can also be used to estimate transcript abundance of specific genes. This data suggests that the molecular and functional diversity contained in the vast native landraces remains to be explored, and that large-scale transcriptional sequencing of a presumed ancestor of the modern maize varieties represents a valuable approach to characterize the functional diversity of maize for future agricultural and evolutionary studies.
Sequenced sorghum mutant library- an efficient platform for discovery of causal gene mutations

USDA-ARS?s Scientific Manuscript database

Ethyl methanesulfonate (EMS) efficiently generates high-density mutations in genomes. We applied whole-genome sequencing to 256 phenotyped mutant lines of sorghum (Sorghum bicolor L. Moench) to 16x coverage. Comparisons with the reference sequence revealed >1.8 million canonical EMS-induced G/C to A...
Inducible Transgenic Models of BRCA1 Function

DTIC Science & Technology

1999-10-01

inducible expression vectors were created to conditionally express four different hammerhead ribozymes designed to specifically cleave the Brca...transcript. Hammerhead ribozymes are catalytic RNAs that efficiently cleave RNA and thereby down- regulate gene expression. Hammerhead ribozymes can cleave...any RNA containing its 5’-UH-3’ consensus sequence where U can be replaced by a C, and H=C, U or A. Hammerhead ribozymes effectively and selectively
Biotechnological applications of mobile group II introns and their reverse transcriptases: gene targeting, RNA-seq, and non-coding RNA analysis.

PubMed

Enyeart, Peter J; Mohr, Georg; Ellington, Andrew D; Lambowitz, Alan M

2014-01-13

Mobile group II introns are bacterial retrotransposons that combine the activities of an autocatalytic intron RNA (a ribozyme) and an intron-encoded reverse transcriptase to insert site-specifically into DNA. They recognize DNA target sites largely by base pairing of sequences within the intron RNA and achieve high DNA target specificity by using the ribozyme active site to couple correct base pairing to RNA-catalyzed intron integration. Algorithms have been developed to program the DNA target site specificity of several mobile group II introns, allowing them to be made into 'targetrons.' Targetrons function for gene targeting in a wide variety of bacteria and typically integrate at efficiencies high enough to be screened easily by colony PCR, without the need for selectable markers. Targetrons have found wide application in microbiological research, enabling gene targeting and genetic engineering of bacteria that had been intractable to other methods. Recently, a thermostable targetron has been developed for use in bacterial thermophiles, and new methods have been developed for using targetrons to position recombinase recognition sites, enabling large-scale genome-editing operations, such as deletions, inversions, insertions, and 'cut-and-pastes' (that is, translocation of large DNA segments), in a wide range of bacteria at high efficiency. Using targetrons in eukaryotes presents challenges due to the difficulties of nuclear localization and sub-optimal magnesium concentrations, although supplementation with magnesium can increase integration efficiency, and directed evolution is being employed to overcome these barriers. Finally, spurred by new methods for expressing group II intron reverse transcriptases that yield large amounts of highly active protein, thermostable group II intron reverse transcriptases from bacterial thermophiles are being used as research tools for a variety of applications, including qRT-PCR and next-generation RNA sequencing (RNA-seq). The high processivity and fidelity of group II intron reverse transcriptases along with their novel template-switching activity, which can directly link RNA-seq adaptor sequences to cDNAs during reverse transcription, open new approaches for RNA-seq and the identification and profiling of non-coding RNAs, with potentially wide applications in research and biotechnology.
The pig X and Y Chromosomes: structure, sequence, and evolution

PubMed Central

Skinner, Benjamin M.; Sargent, Carole A.; Churcher, Carol; Hunt, Toby; Herrero, Javier; Loveland, Jane E.; Dunn, Matt; Louzada, Sandra; Fu, Beiyuan; Chow, William; Gilbert, James; Austin-Guest, Siobhan; Beal, Kathryn; Carvalho-Silva, Denise; Cheng, William; Gordon, Daria; Grafham, Darren; Hardy, Matt; Harley, Jo; Hauser, Heidi; Howden, Philip; Howe, Kerstin; Lachani, Kim; Ellis, Peter J.I.; Kelly, Daniel; Kerry, Giselle; Kerwin, James; Ng, Bee Ling; Threadgold, Glen; Wileman, Thomas; Wood, Jonathan M.D.; Yang, Fengtang; Harrow, Jen; Affara, Nabeel A.; Tyler-Smith, Chris

2016-01-01

We have generated an improved assembly and gene annotation of the pig X Chromosome, and a first draft assembly of the pig Y Chromosome, by sequencing BAC and fosmid clones from Duroc animals and incorporating information from optical mapping and fiber-FISH. The X Chromosome carries 1033 annotated genes, 690 of which are protein coding. Gene order closely matches that found in primates (including humans) and carnivores (including cats and dogs), which is inferred to be ancestral. Nevertheless, several protein-coding genes present on the human X Chromosome were absent from the pig, and 38 pig-specific X-chromosomal genes were annotated, 22 of which were olfactory receptors. The pig Y-specific Chromosome sequence generated here comprises 30 megabases (Mb). A 15-Mb subset of this sequence was assembled, revealing two clusters of male-specific low copy number genes, separated by an ampliconic region including the HSFY gene family, which together make up most of the short arm. Both clusters contain palindromes with high sequence identity, presumably maintained by gene conversion. Many of the ancestral X-related genes previously reported in at least one mammalian Y Chromosome are represented either as active genes or partial sequences. This sequencing project has allowed us to identify genes—both single copy and amplified—on the pig Y Chromosome, to compare the pig X and Y Chromosomes for homologous sequences, and thereby to reveal mechanisms underlying pig X and Y Chromosome evolution. PMID:26560630
Gene silencing efficiency and INF-β induction effects of splicing miRNA 155-based artificial miRNA with pre-miRNA stem-loop structures.

PubMed

Sin, Onsam; Mabiala, Prudence; Liu, Ye; Sun, Ying; Hu, Tao; Liu, Qingzhen; Guo, Deyin

2012-02-01

Artificial microRNA (miRNA) expression vectors have been developed and used for RNA interference. The secondary structure of artificial miRNA is important for RNA interference efficacy. We designed two groups of six artificial splicing miRNA 155-based miRNAs (SM155-based miRNAs) with the same target in the coding region or 3' UTR of a target gene and studied their RNA silencing efficiency and interferon β (IFN-β) induction effects. SM155-based miRNA with a mismatch at the +1 position and a bulge at the +11, +12 positions in a miRNA precursor stem-loop structure showed the highest gene silencing efficiency and lowest IFN-β induction effect (increased IFN-β mRNA level by 10% in both target cases), regardless of the specificity of the target sequence, suggesting that pSM155-based miRNA with this design could be a valuable miRNA expression vector.
Prediction of EST functional relationships via literature mining with user-specified parameters.

PubMed

Wang, Hei-Chia; Huang, Tian-Hsiang

2009-04-01

The massive amount of expressed sequence tags (ESTs) gathered over recent years has triggered great interest in efficient applications for genomic research. In particular, EST functional relationships can be used to determine a possible gene network for biological processes of interest. In recent years, many researchers have tried to determine EST functional relationships by analyzing the biological literature. However, it has been challenging to find efficient prediction methods. Moreover, an annotated EST is usually associated with many functions, so successful methods must be able to distinguish between relevant and irrelevant functions based on user specifications. This paper proposes a method to discover functional relationships between ESTs of interest by analyzing literature from the Medical Literature Analysis and Retrieval System Online, with user-specified parameters for selecting keywords. This method performs better than the multiple kernel documents method in setting up a specific threshold for gathering materials. The method is also able to uncover known functional relationships, as shown by a comparison with the Kyoto Encyclopedia of Genes and Genomes database. The reliable EST relationships predicted by the proposed method can help to construct gene networks for specific biological functions of interest.

An Efficient Strategy Combining SSR Markers- and Advanced QTL-seq-driven QTL Mapping Unravels Candidate Genes Regulating Grain Weight in Rice

PubMed Central

Daware, Anurag; Das, Sweta; Srivastava, Rishi; Badoni, Saurabh; Singh, Ashok K.; Agarwal, Pinky; Parida, Swarup K.; Tyagi, Akhilesh K.

2016-01-01

Development and use of genome-wide informative simple sequence repeat (SSR) markers and novel integrated genomic strategies are vital to drive genomics-assisted breeding applications and for efficient dissection of quantitative trait loci (QTLs) underlying complex traits in rice. The present study developed 6244 genome-wide informative SSR markers exhibiting in silico fragment length polymorphism based on repeat-unit variations among genomic sequences of 11 indica, japonica, aus, and wild rice accessions. These markers were mapped on diverse coding and non-coding sequence components of known cloned/candidate genes annotated from 12 chromosomes and revealed a much higher amplification (97%) and polymorphic potential (88%) along with wider genetic/functional diversity level (16–74% with a mean 53%) especially among accessions belonging to indica cultivar group, suggesting their utility in large-scale genomics-assisted breeding applications in rice. A high-density 3791 SSR markers-anchored genetic linkage map (IR 64 × Sonasal) spanning 2060 cM total map-length with an average inter-marker distance of 0.54 cM was generated. This reference genetic map identified six major genomic regions harboring robust QTLs (31% combined phenotypic variation explained with a 5.7–8.7 LOD) governing grain weight on six rice chromosomes. One strong grain weight major QTL region (OsqGW5.1) was narrowed-down by integrating traditional QTL mapping with high-resolution QTL region-specific integrated SSR and single nucleotide polymorphism markers-based QTL-seq analysis and differential expression profiling. This led us to delineate two natural allelic variants in two known cis-regulatory elements (RAV1AAT and CARGCW8GAT) of glycosyl hydrolase and serine carboxypeptidase genes exhibiting pronounced seed-specific differential regulation in low (Sonasal) and high (IR 64) grain weight mapping parental accessions. Our genome-wide SSR marker resource (polymorphic within/between diverse cultivar groups) and integrated genomic strategy can efficiently scan functionally relevant potential molecular tags (markers, candidate genes and alleles) regulating complex agronomic traits (grain weight) and expedite marker-assisted genetic enhancement in rice. PMID:27833617
Bacillus anthracis genome organization in light of whole transcriptome sequencing

DOE Office of Scientific and Technical Information (OSTI.GOV)

Martin, Jeffrey; Zhu, Wenhan; Passalacqua, Karla D.

2010-03-22

Emerging knowledge of whole prokaryotic transcriptomes could validate a number of theoretical concepts introduced in the early days of genomics. What are the rules connecting gene expression levels with sequence determinants such as quantitative scores of promoters and terminators? Are translation efficiency measures, e.g. codon adaptation index and RBS score related to gene expression? We used the whole transcriptome shotgun sequencing of a bacterial pathogen Bacillus anthracis to assess correlation of gene expression level with promoter, terminator and RBS scores, codon adaptation index, as well as with a new measure of gene translational efficiency, average translation speed. We compared computationalmore » predictions of operon topologies with the transcript borders inferred from RNA-Seq reads. Transcriptome mapping may also improve existing gene annotation. Upon assessment of accuracy of current annotation of protein-coding genes in the B. anthracis genome we have shown that the transcriptome data indicate existence of more than a hundred genes missing in the annotation though predicted by an ab initio gene finder. Interestingly, we observed that many pseudogenes possess not only a sequence with detectable coding potential but also promoters that maintain transcriptional activity.« less
Genome-wide identification of allele-specific expression (ASE) in response to Marek's disease virus infection using next generation sequencing.

PubMed

Maceachern, Sean; Muir, William M; Crosby, Seth; Cheng, Hans H

2011-06-03

Marek's disease (MD), a T cell lymphoma induced by the highly oncogenic α-herpesvirus Marek's disease virus (MDV), is the main chronic infectious disease concern threatening the poultry industry. Enhancing genetic resistance to MD in commercial poultry is an attractive method to augment MD vaccines, which is currently the control method of choice. In order to optimally implement this control strategy through marker-assisted selection (MAS) and to gain biological information, it is necessary to identify specific genes that influence MD incidence. A genome-wide screen for allele-specific expression (ASE) in response to MDV infection was conducted. The highly inbred ADOL chicken lines 6 (MD resistant) and 7 (MD susceptible) were inter-mated in reciprocal crosses and half of the progeny challenged with MDV. Splenic RNA pools at a single time after infection for each treatment group point were generated, sequenced using a next generation sequencer, then analyzed for allele-specific expression (ASE). To validate and extend the results, Illumina GoldenGate assays for selected cSNPs were developed and used on all RNA samples from all 6 time points following MDV challenge. RNA sequencing resulted in 11-13+ million mappable reads per treatment group, 1.7+ Gb total sequence, and 22,655 high-confidence cSNPs. Analysis of these cSNPs revealed that 5360 cSNPs in 3773 genes exhibited statistically significant allelic imbalance. Of the 1536 GoldenGate assays, 1465 were successfully scored with all but 19 exhibiting evidence for allelic imbalance. ASE is an efficient method to identify potentially all or most of the genes influencing this complex trait. The identified cSNPs can be further evaluated in resource populations to determine their allelic direction and size of effect on genetic resistance to MD as well as being directly implemented in genomic selection programs. The described method, although demonstrated in inbred chicken lines, is applicable to all traits in any diploid species, and should prove to be a simple method to identify the majority of genes controlling any complex trait.
Isolation and functional characterization of TIF-IB, a factor that confers promoter specificity to mouse RNA polymerase I.

PubMed

Schnapp, A; Clos, J; Hädelt, W; Schreck, R; Cvekl, A; Grummt, I

1990-03-25

The murine ribosomal gene promoter contains two cis-acting control elements which operate in concert to promote efficient and accurate transcription initiation by RNA polymerase I. The start site proximal core element which is indispensable for promoter recognition by RNA polymerase I (pol I) encompasses sequences from position -39 to -1. An upstream control element (UCE) which is located between nucleotides -142 and -112 stimulates the efficiency of transcription initiation both in vivo and in vitro. Here we report the isolation and functional characterization of a specific rDNA binding protein, the transcription initiation factor TIF-IB, which specifically interacts with the core region of the mouse ribosomal RNA gene promoter. Highly purified TIF-IB complements transcriptional activity in the presence of two other essential initiation factors TIF-IA and TIF-IC. We demonstrate that the binding efficiency of purified TIF-IB to the core promoter is strongly enhanced by the presence in cis of the UCE. This positive effect of upstream sequences on TIF-IB binding is observed throughout the purification procedure suggesting that the synergistic action of the two distant promoter elements is not mediated by a protein different from TIF-IB. Increasing the distance between both control elements still facilitates stable factor binding but eliminates transcriptional activation. The results demonstrate that TIF-IB binding to the rDNA promoter is an essential early step in the assembly of a functional transcription initiation complex. The subsequent interaction of TIF-IB with other auxiliary transcription initiation factors, however, requires the correct spacing between the UCE and the core promoter element.
The residual repair capacity of xeroderma pigmentosum complementation group C fibroblasts is highly specific for transcriptionally active DNA.

PubMed Central

Venema, J; van Hoffen, A; Natarajan, A T; van Zeeland, A A; Mullenders, L H

1990-01-01

We have measured removal of pyrimidine dimers in defined DNA sequences in confluent and actively growing normal human and xeroderma pigmentosum complementation group C (XP-C) fibroblasts exposed to 10 J/m2 UV-irradiation. In normal fibroblasts 45% and 90% of the dimers are removed from the transcriptionally active adenosine deaminase (ADA) gene within 4 and 24 hours after irradiation respectively. Equal repair efficiencies are found in fragments located entirely within the transcription unit or partly in the 3' flanking region of the ADA gene. The rate and extent of dimer removal from the dihydrofolate reductase (DHFR) gene is very similar to that of the ADA gene. Repair of the transcriptionally inactive 754 locus is less efficient: 18% and 52% of the dimers are removed within 4 and 24 hours respectively. In spite of the limited overall repair capacity, confluent XP-C fibroblasts efficiently remove dimers from the ADA and DHFR genes: about 90% and 50% within 24 hours respectively. The 3' end of the ADA gene is repaired as efficiently as in normal human fibroblasts, but less efficient repair occurs in DNA fragments located in the DHFR gene and at the 5' end of the ADA gene. Repair of the inactive 754 locus does not exceed the very slow rate of dimer removal from the genome overall. Confluent and actively growing XP-C cells show similar efficiencies of repair of the ADA, DHFR and 754 genes. Our findings suggest the existence of two independently operating pathways directed towards repair of pyrimidine dimers in either active or inactive chromatin. XP-C cells have lost the capacity to repair inactive chromatin, but are still able to repair active chromatin. Images PMID:2308842
MYO7A and USH2A gene sequence variants in Italian patients with Usher syndrome.

PubMed

Sodi, Andrea; Mariottini, Alessandro; Passerini, Ilaria; Murro, Vittoria; Tachyla, Iryna; Bianchi, Benedetta; Menchini, Ugo; Torricelli, Francesca

2014-01-01

To analyze the spectrum of sequence variants in the MYO7A and USH2A genes in a group of Italian patients affected by Usher syndrome (USH). Thirty-six Italian patients with a diagnosis of USH were recruited. They received a standard ophthalmologic examination, visual field testing, optical coherence tomography (OCT) scan, and electrophysiological tests. Fluorescein angiography and fundus autofluorescence imaging were performed in selected cases. All the patients underwent an audiologic examination for the 0.25-8,000 Hz frequencies. Vestibular function was evaluated with specific tests. DNA samples were analyzed for sequence variants of the MYO7A gene (for USH1) and the USH2A gene (for USH2) with direct sequencing techniques. A few patients were analyzed for both genes. In the MYO7A gene, ten missense variants were found; three patients were compound heterozygous, and two were homozygous. Thirty-four USH2A gene variants were detected, including eight missense variants, nine nonsense variants, six splicing variants, and 11 duplications/deletions; 19 patients were compound heterozygous, and three were homozygous. Four MYO7A and 17 USH2A variants have already been described in the literature. Among the novel mutations there are four USH2A large deletions, detected with multiplex ligation dependent probe amplification (MLPA) technology. Two potentially pathogenic variants were found in 27 patients (75%). Affected patients showed variable clinical pictures without a clear genotype-phenotype correlation. Ten variants in the MYO7A gene and 34 variants in the USH2A gene were detected in Italian patients with USH at a high detection rate. A selective analysis of these genes may be valuable for molecular analysis, combining diagnostic efficiency with little time wastage and less resource consumption.
Targeting vector construction through recombineering.

PubMed

Malureanu, Liviu A

2011-01-01

Gene targeting in mouse embryonic stem cells is an essential, yet still very expensive and highly time-consuming, tool and method to study gene function at the organismal level or to create mouse models of human diseases. Conventional cloning-based methods have been largely used for generating targeting vectors, but are hampered by a number of limiting factors, including the variety and location of restriction enzymes in the gene locus of interest, the specific PCR amplification of repetitive DNA sequences, and cloning of large DNA fragments. Recombineering is a technique that exploits the highly efficient homologous recombination function encoded by λ phage in Escherichia coli. Bacteriophage-based recombination can recombine homologous sequences as short as 30-50 bases, allowing manipulations such as insertion, deletion, or mutation of virtually any genomic region. The large availability of mouse genomic bacterial artificial chromosome (BAC) libraries covering most of the genome facilitates the retrieval of genomic DNA sequences from the bacterial chromosomes through recombineering. This chapter describes a successfully applied protocol and aims to be a detailed guide through the steps of generation of targeting vectors through recombineering.
Integrated genome sequence and linkage map of physic nut (Jatropha curcas L.), a biodiesel plant.

PubMed

Wu, Pingzhi; Zhou, Changpin; Cheng, Shifeng; Wu, Zhenying; Lu, Wenjia; Han, Jinli; Chen, Yanbo; Chen, Yan; Ni, Peixiang; Wang, Ying; Xu, Xun; Huang, Ying; Song, Chi; Wang, Zhiwen; Shi, Nan; Zhang, Xudong; Fang, Xiaohua; Yang, Qing; Jiang, Huawu; Chen, Yaping; Li, Meiru; Wang, Ying; Chen, Fan; Wang, Jun; Wu, Guojiang

2015-03-01

The family Euphorbiaceae includes some of the most efficient biomass accumulators. Whole genome sequencing and the development of genetic maps of these species are important components in molecular breeding and genetic improvement. Here we report the draft genome of physic nut (Jatropha curcas L.), a biodiesel plant. The assembled genome has a total length of 320.5 Mbp and contains 27,172 putative protein-coding genes. We established a linkage map containing 1208 markers and anchored the genome assembly (81.7%) to this map to produce 11 pseudochromosomes. After gene family clustering, 15,268 families were identified, of which 13,887 existed in the castor bean genome. Analysis of the genome highlighted specific expansion and contraction of a number of gene families during the evolution of this species, including the ribosome-inactivating proteins and oil biosynthesis pathway enzymes. The genomic sequence and linkage map provide a valuable resource not only for fundamental and applied research on physic nut but also for evolutionary and comparative genomics analysis, particularly in the Euphorbiaceae. © 2015 The Authors The Plant Journal © 2015 John Wiley & Sons Ltd.
RNA interference as a key to knockdown overexpressed cyclooxygenase-2 gene in tumour cells

PubMed Central

Strillacci, A; Griffoni, C; Spisni, E; Manara, M C; Tomasi, V

2006-01-01

Silencing those genes that are overexpressed in cancer and contribute to the survival and progression of tumour cells is the aim of several researches. Cyclooxygenase-2 (COX-2) is one of the most intensively studied genes since it is overexpressed in most tumours, mainly in colon cancer. The use of specific COX-2 inhibitors to treat colon cancer has generated great enthusiasm. Yet, the side effects of some inhibitors emerging during long-term treatment have caused much concern. Genes silencing by RNA interference (RNAi) has led to new directions in the field of experimental oncology. In this study, we detected sequences directed against COX-2 mRNA, that potently downregulate COX-2 gene expression and inhibit phorbol 12-myristate 13-acetate-induced angiogenesis in vitro in a specific, nontoxic manner. Moreover, we found that the insertion of a specific cassette carrying anti-COX-2 short hairpin RNA sequence into a viral vector (pSUPER.retro) greatly increased silencing potency in a colon cancer cell line (HT29) without activating any interferon response. Phenotypically, COX-2 deficient HT29 cells showed a significant impairment of their in vitro malignant behaviour. Thus, the retroviral approach enhancing COX-2 knockdown, mediated by RNAi, proved to be an useful tool to better understand the role of COX-2 in colon cancer. Furthermore, the higher infection efficiency we observed in tumour cells, if compared to normal endothelial cells, may disclose the possibility to specifically treat tumour cells without impairing endothelial COX-2 activity. PMID:16622456
Gfi1-Cre knock-in mouse line: A tool for inner ear hair cell-specific gene deletion

PubMed Central

Yang, Hua; Gan, Jean; Xie, Xiaoling; Deng, Min; Feng, Liang; Chen, Xiaowei; Gao, Zhiqiang; Gan, Lin

2010-01-01

Summary Gfi1encodes a zinc-finger transcription factor essential for the development and maintenance of haematopoiesis and the inner ear. In mouse inner ear, Gfi1 expression is confined to hair cells during development and in adulthood. To construct a genetic tool for inner ear hair cell-specific gene deletion, we generated a Gfi1-Cre mouse line by knocking-in Cre coding sequences into the Gfi1 locus and inactivating the endogenous Gfi1. The specificity and efficiency of Gfi1-Cre recombinase-mediated recombination in the developing inner ear was revealed through the expression of the conditional R26R-lacZ reporter gene. The onset of lacZ expression in the Gfi1Cre/+ inner ear was first detected at E13.5 in the vestibule and at E15.5 in the cochlea, coinciding with the generation of hair cells. Throughout inner ear development, lacZ expression was detected only in hair cells. Thus, Gfi1-Cre knock-in mouse line provides a useful tool for gene manipulations specifically in inner ear hair cells. PMID:20533399
A systematic comparison of error correction enzymes by next-generation sequencing

DOE Office of Scientific and Technical Information (OSTI.GOV)

Lubock, Nathan B.; Zhang, Di; Sidore, Angus M.

Gene synthesis, the process of assembling genelength fragments from shorter groups of oligonucleotides (oligos), is becoming an increasingly important tool in molecular and synthetic biology. The length, quality and cost of gene synthesis are limited by errors produced during oligo synthesis and subsequent assembly. Enzymatic error correction methods are cost-effective means to ameliorate errors in gene synthesis. Previous analyses of these methods relied on cloning and Sanger sequencing to evaluate their efficiencies, limiting quantitative assessment. Here, we develop a method to quantify errors in synthetic DNA by next-generation sequencing. We analyzed errors in model gene assemblies and systematically compared sixmore » different error correction enzymes across 11 conditions. We find that ErrASE and T7 Endonuclease I are the most effective at decreasing average error rates (up to 5.8-fold relative to the input), whereas MutS is the best for increasing the number of perfect assemblies (up to 25.2-fold). We are able to quantify differential specificities such as ErrASE preferentially corrects C/G transversions whereas T7 Endonuclease I preferentially corrects A/T transversions. More generally, this experimental and computational pipeline is a fast, scalable and extensible way to analyze errors in gene assemblies, to profile error correction methods, and to benchmark DNA synthesis methods.« less
A systematic comparison of error correction enzymes by next-generation sequencing

DOE PAGES

Lubock, Nathan B.; Zhang, Di; Sidore, Angus M.; ...

2017-08-01

Gene synthesis, the process of assembling genelength fragments from shorter groups of oligonucleotides (oligos), is becoming an increasingly important tool in molecular and synthetic biology. The length, quality and cost of gene synthesis are limited by errors produced during oligo synthesis and subsequent assembly. Enzymatic error correction methods are cost-effective means to ameliorate errors in gene synthesis. Previous analyses of these methods relied on cloning and Sanger sequencing to evaluate their efficiencies, limiting quantitative assessment. Here, we develop a method to quantify errors in synthetic DNA by next-generation sequencing. We analyzed errors in model gene assemblies and systematically compared sixmore » different error correction enzymes across 11 conditions. We find that ErrASE and T7 Endonuclease I are the most effective at decreasing average error rates (up to 5.8-fold relative to the input), whereas MutS is the best for increasing the number of perfect assemblies (up to 25.2-fold). We are able to quantify differential specificities such as ErrASE preferentially corrects C/G transversions whereas T7 Endonuclease I preferentially corrects A/T transversions. More generally, this experimental and computational pipeline is a fast, scalable and extensible way to analyze errors in gene assemblies, to profile error correction methods, and to benchmark DNA synthesis methods.« less
Genome-wide localization and expression profiling establish Sp2 as a sequence-specific transcription factor regulating vitally important genes

PubMed Central

Terrados, Gloria; Finkernagel, Florian; Stielow, Bastian; Sadic, Dennis; Neubert, Juliane; Herdt, Olga; Krause, Michael; Scharfe, Maren; Jarek, Michael; Suske, Guntram

2012-01-01

The transcription factor Sp2 is essential for early mouse development and for proliferation of mouse embryonic fibroblasts in culture. Yet its mechanisms of action and its target genes are largely unknown. In this study, we have combined RNA interference, in vitro DNA binding, chromatin immunoprecipitation sequencing and global gene-expression profiling to investigate the role of Sp2 for cellular functions, to define target sites and to identify genes regulated by Sp2. We show that Sp2 is important for cellular proliferation that it binds to GC-boxes and occupies proximal promoters of genes essential for vital cellular processes including gene expression, replication, metabolism and signalling. Moreover, we identified important key target genes and cellular pathways that are directly regulated by Sp2. Most significantly, Sp2 binds and activates numerous sequence-specific transcription factor and co-activator genes, and represses the whole battery of cholesterol synthesis genes. Our results establish Sp2 as a sequence-specific regulator of vitally important genes. PMID:22684502
Genotyping microarray: Mutation screening in Spanish families with autosomal dominant retinitis pigmentosa

PubMed Central

García-Hoyos, María; Cortón, Marta; Ávila-Fernández, Almudena; Riveiro-Álvarez, Rosa; Giménez, Ascensión; Hernan, Inma; Carballo, Miguel; Ayuso, Carmen

2012-01-01

Purpose Presently, 22 genes have been described in association with autosomal dominant retinitis pigmentosa (adRP); however, they explain only 50% of all cases, making genetic diagnosis of this disease difficult and costly. The aim of this study was to evaluate a specific genotyping microarray for its application to the molecular diagnosis of adRP in Spanish patients. Methods We analyzed 139 unrelated Spanish families with adRP. Samples were studied by using a genotyping microarray (adRP). All mutations found were further confirmed with automatic sequencing. Rhodopsin (RHO) sequencing was performed in all negative samples for the genotyping microarray. Results The adRP genotyping microarray detected the mutation associated with the disease in 20 of the 139 families with adRP. As in other populations, RHO was found to be the most frequently mutated gene in these families (7.9% of the microarray genotyped families). The rate of false positives (microarray results not confirmed with sequencing) and false negatives (mutations in RHO detected with sequencing but not with the genotyping microarray) were established, and high levels of analytical sensitivity (95%) and specificity (100%) were found. Diagnostic accuracy was 15.1%. Conclusions The adRP genotyping microarray is a quick, cost-efficient first step in the molecular diagnosis of Spanish patients with adRP. PMID:22736939
Virus-specific DNA sequences present in cells which carry the herpes simplex virus thymidine kinase gene.

PubMed

Minson, A C; Darby, G K; Wildy, P

1979-11-01

Two independently derived cell lines which carry the herpes simplex type 2 thymidine kinase gene have been examined for the presence of HSV-2-specific DNA sequences. Both cell lines contained 1 to 3 copies per cell of a sequence lying within map co-ordinates 0.2 to 0.4 of the HSV-2 genome. Revertant cells, which contained no detectable thymidine kinase, did not contain this DNA sequence. The failure of EcoR1-restricted HSV-2 DNA to act as a donor of the thymidine kinase gene in transformation experiments suggests that the gene lies close to the EcoR1 restriction site within this sequence at a map position of approx. 0.3. The HSV-2 kinase gene is therefore approximately co-linear with the HSV-1 gene.
The Co-regulation Data Harvester: Automating gene annotation starting from a transcriptome database

NASA Astrophysics Data System (ADS)

Tsypin, Lev M.; Turkewitz, Aaron P.

Identifying co-regulated genes provides a useful approach for defining pathway-specific machinery in an organism. To be efficient, this approach relies on thorough genome annotation, a process much slower than genome sequencing per se. Tetrahymena thermophila, a unicellular eukaryote, has been a useful model organism and has a fully sequenced but sparsely annotated genome. One important resource for studying this organism has been an online transcriptomic database. We have developed an automated approach to gene annotation in the context of transcriptome data in T. thermophila, called the Co-regulation Data Harvester (CDH). Beginning with a gene of interest, the CDH identifies co-regulated genes by accessing the Tetrahymena transcriptome database. It then identifies their closely related genes (orthologs) in other organisms by using reciprocal BLAST searches. Finally, it collates the annotations of those orthologs' functions, which provides the user with information to help predict the cellular role of the initial query. The CDH, which is freely available, represents a powerful new tool for analyzing cell biological pathways in Tetrahymena. Moreover, to the extent that genes and pathways are conserved between organisms, the inferences obtained via the CDH should be relevant, and can be explored, in many other systems.
Elucidating the role of highly homologous Nicotiana benthamiana ubiquitin E2 gene family members in plant immunity through an improved virus-induced gene silencing approach.

PubMed

Zhou, Bangjun; Zeng, Lirong

2017-01-01

Virus-induced gene silencing (VIGS) has been used in many plant species as an attractive post transcriptional gene silencing (PTGS) method for studying gene function either individually or at large-scale in a high-throughput manner. However, the specificity and efficiency for knocking down members of a highly homologous gene family have remained to date a significant challenge in VIGS due to silencing of off-targets. Here we present an improved method for the selection and evaluation of gene fragments used for VIGS to specifically and efficiently knock down members of a highly homologous gene family. Using this method, we knocked down twelve and four members, respectively of group III of the gene family encoding ubiquitin-conjugating enzymes (E2) in Nicotiana benthamiana . Assays using these VIGS-treated plants revealed that the group III E2s are essential for plant development, plant immunity-associated reactive oxygen species (ROS) production, expression of the gene NbRbohB that is required for ROS production, and suppression of immunity-associated programmed cell death (PCD) by AvrPtoB, an effector protein of the bacterial pathogen Pseudomons syringae . Moreover, functional redundancy for plant development and ROS production was found to exist among members of group III E2s. We have found that employment of a gene fragment as short as approximately 70 base pairs (bp) that contains at least three mismatched nucleotides to other genes within any 21-bp sequences prevents silencing of off-target(s) in VIGS. This improved approach in the selection and evaluation of gene fragments allows for specific and efficient knocking down of highly homologous members of a gene family. Using this approach, we implicated N. benthamiana group III E2s in plant development, immunity-associated ROS production, and suppression of multiple immunity-associated PCD by AvrPtoB. We also unraveled functional redundancy among group III members in their requirement for plant development and plant immunity-associated ROS production.
Electrotransformation and expression of bacterial genes encoding hygromycin phosphotransferase and beta-galactosidase in the pathogenic fungus Histoplasma capsulatum.

PubMed

Woods, J P; Heinecke, E L; Goldman, W E

1998-04-01

We developed an efficient electrotransformation system for the pathogenic fungus Histoplasma capsulatum and used it to examine the effects of features of the transforming DNA on transformation efficiency and fate of the transforming DNA and to demonstrate fungal expression of two recombinant Escherichia coli genes, hph and lacZ. Linearized DNA and plasmids containing Histoplasma telomeric sequences showed the greatest transformation efficiencies, while the plasmid vector had no significant effect, nor did the derivation of the selectable URA5 marker (native Histoplasma gene or a heterologous Podospora anserina gene). Electrotransformation resulted in more frequent multimerization, other modification, or possibly chromosomal integration of transforming telomeric plasmids when saturating amounts of DNA were used, but this effect was not observed with smaller amounts of transforming DNA. We developed another selection system using a hygromycin B resistance marker from plasmid pAN7-1, consisting of the E. coli hph gene flanked by Aspergillus nidulans promoter and terminator sequences. Much of the heterologous fungal sequences could be removed without compromising function in H. capsulatum, allowing construction of a substantially smaller effective marker fragment. Transformation efficiency increased when nonselective conditions were maintained for a time after electrotransformation before selection with the protein synthesis inhibitor hygromycin B was imposed. Finally, we constructed a readily detectable and quantifiable reporter gene by fusing Histoplasma URA5 with E. coli lacZ, resulting in expression of functional beta-galactosidase in H. capsulatum. Demonstration of expression of bacterial genes as effective selectable markers and reporters, together with a highly efficient electrotransformation system, provide valuable approaches for molecular genetic analysis and manipulation of H. capsulatum, which have proven useful for examination of targeted gene disruption, regulated gene expression, and potential virulence determinants in this fungus.
Efficient gene editing in Corynebacterium glutamicum using the CRISPR/Cas9 system.

PubMed

Peng, Feng; Wang, Xinyue; Sun, Yang; Dong, Guibin; Yang, Yankun; Liu, Xiuxia; Bai, Zhonghu

2017-11-14

Corynebacterium glutamicum (C. glutamicum) has traditionally been used as a microbial cell factory for the industrial production of many amino acids and other industrially important commodities. C. glutamicum has recently been established as a host for recombinant protein expression; however, some intrinsic disadvantages could be improved by genetic modification. Gene editing techniques, such as deletion, insertion, or replacement, are important tools for modifying chromosomes. In this research, we report a CRISPR/Cas9 system in C. glutamicum for rapid and efficient genome editing, including gene deletion and insertion. The system consists of two plasmids: one containing a target-specific guide RNA and a homologous sequence to a target gene, the other expressing Cas9 protein. With high efficiency (up to 100%), this system was used to disrupt the porB, mepA, clpX and Ncgl0911 genes, which affect the ability to express proteins. The porB- and mepA-deletion strains had enhanced expression of green fluorescent protein, compared with the wild-type stain. This system can also be used to engineer point mutations and gene insertions. In this study, we adapted the CRISPR/Cas9 system from S. pyogens to gene deletion, point mutations and insertion in C. glutamicum. Compared with published genome modification methods, methods based on the CRISPR/Cas9 system can rapidly and efficiently achieve genome editing. Our research provides a powerful tool for facilitating the study of gene function, metabolic pathways, and enhanced productivity in C. glutamicum.
Phage T4 SegB protein is a homing endonuclease required for the preferred inheritance of T4 tRNA gene region occurring in co-infection with a related phage

PubMed Central

Brok-Volchanskaya, Vera S.; Kadyrov, Farid A.; Sivogrivov, Dmitry E.; Kolosov, Peter M.; Sokolov, Andrey S.; Shlyapnikov, Michael G.; Kryukov, Valentine M.; Granovsky, Igor E.

2008-01-01

Homing endonucleases initiate nonreciprocal transfer of DNA segments containing their own genes and the flanking sequences by cleaving the recipient DNA. Bacteriophage T4 segB gene, which is located in a cluster of tRNA genes, encodes a protein of unknown function, homologous to homing endonucleases of the GIY-YIG family. We demonstrate that SegB protein is a site-specific endonuclease, which produces mostly 3′ 2-nt protruding ends at its DNA cleavage site. Analysis of SegB cleavage sites suggests that SegB recognizes a 27-bp sequence. It contains 11-bp conserved sequence, which corresponds to a conserved motif of tRNA TψC stem-loop, whereas the remainder of the recognition site is rather degenerate. T4-related phages T2L, RB1 and RB3 contain tRNA gene regions that are homologous to that of phage T4 but lack segB gene and several tRNA genes. In co-infections of phages T4 and T2L, segB gene is inherited with nearly 100% of efficiency. The preferred inheritance depends absolutely on the segB gene integrity and is accompanied by the loss of the T2L tRNA gene region markers. We suggest that SegB is a homing endonuclease that functions to ensure spreading of its own gene and the surrounding tRNA genes among T4-related phages. PMID:18281701

Implementing targeted region capture sequencing for the clinical detection of Alagille syndrome: An efficient and cost‑effective method.

PubMed

Huang, Tianhong; Yang, Guilin; Dang, Xiao; Ao, Feijian; Li, Jiankang; He, Yizhou; Tang, Qiyuan; He, Qing

2017-11-01

Alagille syndrome (AGS) is a highly variable, autosomal dominant disease that affects multiple structures including the liver, heart, eyes, bones and face. Targeted region capture sequencing focuses on a panel of known pathogenic genes and provides a rapid, cost‑effective and accurate method for molecular diagnosis. In a Chinese family, this method was used on the proband and Sanger sequencing was applied to validate the candidate mutation. A de novo heterozygous mutation (c.3254_3255insT p.Leu1085PhefsX24) of the jagged 1 gene was identified as the potential disease‑causing gene mutation. In conclusion, the present study suggested that target region capture sequencing is an efficient, reliable and accurate approach for the clinical diagnosis of AGS. Furthermore, these results expand on the understanding of the pathogenesis of AGS.
mCAL: A New Approach for Versatile Multiplex Action of Cas9 Using One sgRNA and Loci Flanked by a Programmed Target Sequence.

PubMed

Finnigan, Gregory C; Thorner, Jeremy

2016-07-07

Genome editing exploiting CRISPR/Cas9 has been adopted widely in academia and in the biotechnology industry to manipulate DNA sequences in diverse organisms. Molecular engineering of Cas9 itself and its guide RNA, and the strategies for using them, have increased efficiency, optimized specificity, reduced inappropriate off-target effects, and introduced modifications for performing other functions (transcriptional regulation, high-resolution imaging, protein recruitment, and high-throughput screening). Moreover, Cas9 has the ability to multiplex, i.e., to act at different genomic targets within the same nucleus. Currently, however, introducing concurrent changes at multiple loci involves: (i) identification of appropriate genomic sites, especially the availability of suitable PAM sequences; (ii) the design, construction, and expression of multiple sgRNA directed against those sites; (iii) potential difficulties in altering essential genes; and (iv) lingering concerns about "off-target" effects. We have devised a new approach that circumvents these drawbacks, as we demonstrate here using the yeast Saccharomyces cerevisiae First, any gene(s) of interest are flanked upstream and downstream with a single unique target sequence that does not normally exist in the genome. Thereafter, expression of one sgRNA and cotransformation with appropriate PCR fragments permits concomitant Cas9-mediated alteration of multiple genes (both essential and nonessential). The system we developed also allows for maintenance of the integrated, inducible Cas9-expression cassette or its simultaneous scarless excision. Our scheme-dubbed mCAL for " M: ultiplexing of C: as9 at A: rtificial L: oci"-can be applied to any organism in which the CRISPR/Cas9 methodology is currently being utilized. In principle, it can be applied to install synthetic sequences into the genome, to generate genomic libraries, and to program strains or cell lines so that they can be conveniently (and repeatedly) manipulated at multiple loci with extremely high efficiency. Copyright © 2016 Finnigan and Thorner.
Engineered external guide sequences are highly effective in inhibiting gene expression and replication of hepatitis B virus in cultured cells.

PubMed

Zhang, Zhigang; Vu, Gia-Phong; Gong, Hao; Xia, Chuan; Chen, Yuan-Chuan; Liu, Fenyong; Wu, Jianguo; Lu, Sangwei

2013-01-01

External guide sequences (EGSs) are RNA molecules that consist of a sequence complementary to a target mRNA and recruit intracellular ribonuclease P (RNase P), a tRNA processing enzyme, for specific degradation of the target mRNA. We have previously used an in vitro selection procedure to generate EGS variants that efficiently induce human RNase P to cleave a target mRNA in vitro. In this study, we constructed EGSs from a variant to target the overlapping region of the S mRNA, pre-S/L mRNA, and pregenomic RNA (pgRNA) of hepatitis B virus (HBV), which are essential for viral replication and infection. The EGS variant was about 50-fold more efficient in inducing human RNase P to cleave the mRNA in vitro than the EGS derived from a natural tRNA. Following Salmonella-mediated gene delivery, the EGSs were expressed in cultured HBV-carrying cells. A reduction of about 97% and 75% in the level of HBV RNAs and proteins and an inhibition of about 6,000- and 130-fold in the levels of capsid-associated HBV DNA were observed in cells treated with Salmonella vectors carrying the expression cassette for the variant and the tRNA-derived EGS, respectively. Our study provides direct evidence that the EGS variant is more effective in blocking HBV gene expression and DNA replication than the tRNA-derived EGS. Furthermore, these results demonstrate the feasibility of developing Salmonella-mediated gene delivery of highly active EGS RNA variants as a novel approach for gene-targeting applications such as anti-HBV therapy.
Isolation and identification of gene-specific microRNAs.

PubMed

Lin, Shi-Lung; Chang, Donald C; Ying, Shao-Yao

2006-01-01

Prediction of microRNA (miRNA) candidates using computer programming has identified hundreds and hundreds of genomic hairpin sequences, of which, the functions remain to be determined. Because direct transfection of hairpin-like miRNA precursors (pre)-miRNAs in mammalian cells is not always sufficient to trigger effective RNA-induced gene-silencing complex (RISC) assembly, a key step for RNA interference (RNAi)-related gene silencing, we developed an intronic miRNA-expressing system to overcome this problem, and successfully increased the efficiency and effectiveness of miRNA-associated RNAi induction in vitro and in vivo. By insertion of a hairpin-like pre-miRNA structure into the intron region of a gene, this intronic miRNA biogenesis system has been found to depend on a coupled interaction of nascent precursor messenger RNA transcription and intron excision within a specific nuclear region proximal to genomic perichromatin fibrils. The intronic miRNA was transcribed by RNA type II polymerases, coexpressed with a primary gene transcript, and excised out of its encoding gene transcript by intracellular RNA splicing and processing mechanisms. Currently, some ribonuclease III endonucleases have been found to be involved in the processing of spliced introns and probably facilitating the intronic miRNA maturation. Using this miRNA-expressing system, we have shown for the first time that the intron-derived miRNAs were able to induce strong RNAi effects in not only human and mouse cells but also zebrafish, chicken embryos, and adult mice. Based on the strand complementarity between the designed miRNA and its target gene sequence, we have also developed a miRNA isolation protocol to purify and identify the mature miRNAs generated by the intronic miRNA-expressing system. Several intronic miRNA identities and structures are currently confirmed to be active in vitro and in vivo. According to this proof- of-principle method, we now have the knowledge to design pre-miRNA inserts that are more efficient and effective for the intronic miRNA-expressing system.
Isolation and identification of gene-specific microRNAs.

PubMed

Lin, Shi-Lung; Chang, Donald C; Ying, Shao-Yao

2013-01-01

Computer programming has identified hundreds of genomic hairpin sequences, many with functions remain to be determined. Because direct transfection of hairpin-like miRNA precursors (pre)-miRNAs in mammalian cells is not always sufficient to trigger effective RNA-induced gene silencing complex (RISC) assembly, a key step for RNA interference (RNAi)-related gene silencing, we developed an intronic miRNA-expressing system to overcome this problem by inserting a hairpin-like pre-miRNA structure into the intron region of a gene and successfully increased the efficiency and effectiveness of miRNA-associated RNAi induction in vitro and in vivo. This intronic miRNA biogenesis has been found to depend on a coupled interaction of nascent precursor messenger RNA transcription and intron excision within a specific nuclear region proximal to genomic perichromatin fibrils. The intronic miRNA was transcribed by RNA type II polymerases, coexpressed with a primary gene transcript, and excised out of its encoding gene transcript by intracellular RNA splicing and processing mechanisms. Currently, some ribonuclease III endonucleases have been found to be involved in the processing of spliced introns and probably facilitating the intronic miRNA maturation. Using this miRNA generation system, we have shown for the first time that the intron-derived miRNAs were able to induce strong RNAi effects in not only human and mouse cells but also zebrafishes, chicken embryos, and adult mice. We have also developed an miRNA isolation protocol, based on the complementarity between the designed miRNA and its target gene sequence, to purify and identify the mature miRNAs generated by the intronic miRNA-expressing system. Several intronic miRNA identities and structures are currently confirmed to be active in vitro and in vivo. According to this proven-of-principle method, we now have full knowledge to design pre-miRNA inserts that are more efficient and effective for the intronic miRNA-expressing systems.
Resistance (R) Genes: Applications and Prospects for Plant Biotechnology and Breeding.

PubMed

Pandolfi, Valesca; Neto, Jose Ribamar Costa Ferreira; da Silva, Manasses Daniel; Amorim, Lidiane Lindinalva Barbosa; Wanderley-Nogueira, Ana Carolina; de Oliveira Silva, Roberta Lane; Kido, Ederson Akio; Crovella, Sergio; Iseppon, Ana Maria Benko

2017-01-01

The discovery of novel plant resistance (R) genes (including their homologs and analogs) opened interesting possibilities for controlling plant diseases caused by several pathogens. However, due to environmental pressure and high selection operated by pathogens, several crop plants have lost specificity, broad-spectrum or durability of resistance. On the other hand, the advances in plant genome sequencing and biotechnological approaches, combined with the increasing knowledge on Rgenes have provided new insights on their applications for plant genetic breeding, allowing the identification and implementation of novel and efficient strategies that enhance or optimize their use for efficiently controlling plant diseases. The present review focuses on main perspectives of application of R-genes and its co-players for the acquisition of resistance to pathogens in cultivated plants, with emphasis on biotechnological inferences, including transgenesis, cisgenesis, directed mutagenesis and gene editing, with examples of success and challenges to be faced. Copyright© Bentham Science Publishers; For any queries, please email at epub@benthamscience.org.
Efficient sequence-specific isolation of DNA fragments and chromatin by in vitro enChIP technology using recombinant CRISPR ribonucleoproteins.

PubMed

Fujita, Toshitsugu; Yuno, Miyuki; Fujii, Hodaka

2016-04-01

The clustered regularly interspaced short palindromic repeats (CRISPR) system is widely used for various biological applications, including genome editing. We developed engineered DNA-binding molecule-mediated chromatin immunoprecipitation (enChIP) using CRISPR to isolate target genomic regions from cells for their biochemical characterization. In this study, we developed 'in vitro enChIP' using recombinant CRISPR ribonucleoproteins (RNPs) to isolate target genomic regions. in vitro enChIP has the great advantage over conventional enChIP of not requiring expression of CRISPR complexes in cells. We first showed that in vitro enChIP using recombinant CRISPR RNPs can be used to isolate target DNA from mixtures of purified DNA in a sequence-specific manner. In addition, we showed that this technology can be used to efficiently isolate target genomic regions, while retaining their intracellular molecular interactions, with negligible contamination from irrelevant genomic regions. Thus, in vitro enChIP technology is of potential use for sequence-specific isolation of DNA, as well as for identification of molecules interacting with genomic regions of interest in vivo in combination with downstream analysis. © 2016 The Authors. Genes to Cells published by Molecular Biology Society of Japan and John Wiley & Sons Australia, Ltd.
Comparative transcriptomics of Entelegyne spiders (Araneae, Entelegynae), with emphasis on molecular evolution of orphan genes.

PubMed

Carlson, David E; Hedin, Marshal

2017-01-01

Next-generation sequencing technology is rapidly transforming the landscape of evolutionary biology, and has become a cost-effective and efficient means of collecting exome information for non-model organisms. Due to their taxonomic diversity, production of interesting venom and silk proteins, and the relative scarcity of existing genomic resources, spiders in particular are excellent targets for next-generation sequencing (NGS) methods. In this study, the transcriptomes of six entelegyne spider species from three genera (Cicurina travisae, C. vibora, Habronattus signatus, H. ustulatus, Nesticus bishopi, and N. cooperi) were sequenced and de novo assembled. Each assembly was assessed for quality and completeness and functionally annotated using gene ontology information. Approximately 100 transcripts with evidence of homology to venom proteins were discovered. After identifying more than 3,000 putatively orthologous genes across all six taxa, we used comparative analyses to identify 24 instances of positively selected genes. In addition, between ~ 550 and 1,100 unique orphan genes were found in each genus. These unique, uncharacterized genes exhibited elevated rates of amino acid substitution, potentially consistent with lineage-specific adaptive evolution. The data generated for this study represent a valuable resource for future phylogenetic and molecular evolutionary research, and our results provide new insight into the forces driving genome evolution in taxa that span the root of entelegyne spider phylogeny.
Knock-in strategy at 3'-end of Crx gene by CRISPR/Cas9 system shows the gene expression profiles during human photoreceptor differentiation.

PubMed

Homma, Kohei; Usui, Sumiko; Kaneda, Makoto

2017-03-01

Fluorescent reporter gene knock-in induced pluripotent stem cell (iPSC) lines have been used to evaluate the efficiency of differentiation into specific cell lineages. Here, we report a knock-in strategy for the generation of human iPSC reporter lines in which a 2A peptide sequence and a red fluorescent protein (E2-Crimson) gene were inserted at the termination codon of the cone-rod homeobox (Crx) gene, a photoreceptor-specific transcriptional factor gene. The knock-in iPSC lines were differentiated into fluorescence-expressing cells in 3D retinal differentiation culture, and the fluorescent cells also expressed Crx specifically in the nucleus. We found that the fluorescence intensity was positively correlated with the expression levels of Crx mRNA and that fluorescent cells expressed rod photoreceptor-specific genes in the later stage of differentiation. Finally, we treated the fluorescent cells with DAPT, a Notch inhibitor, and found that DAPT-enhanced retinal differentiation was associated with up-regulation of Crx, Otx2 and NeuroD1, and down-regulation of Hes5 and Ngn2. These suggest that this knock-in strategy at the 3'-end of the target gene, combined with the 2A peptide linked to fluorescent proteins, offers a useful tool for labeling specific cell lineages or monitoring expression of any marker genes without affecting the function of the target gene. © 2017 Molecular Biology Society of Japan and John Wiley & Sons Australia, Ltd.
Using the TIGR gene index databases for biological discovery.

PubMed

Lee, Yuandan; Quackenbush, John

2003-11-01

The TIGR Gene Index web pages provide access to analyses of ESTs and gene sequences for nearly 60 species, as well as a number of resources derived from these. Each species-specific database is presented using a common format with a homepage. A variety of methods exist that allow users to search each species-specific database. Methods implemented currently include nucleotide or protein sequence queries using WU-BLAST, text-based searches using various sequence identifiers, searches by gene, tissue and library name, and searches using functional classes through Gene Ontology assignments. This protocol provides guidance for using the Gene Index Databases to extract information.
Recombination–deletion between homologous cassettes in retrovirus is suppressed via a strategy of degenerate codon substitution

PubMed Central

Im, Eung Jun; Bais, Anthony J; Yang, Wen; Ma, Qiangzhong; Guo, Xiuyang; Sepe, Steven M; Junghans, Richard P

2014-01-01

Transduction and expression procedures in gene therapy protocols may optimally transfer more than a single gene to correct a defect and/or transmit new functions to recipient cells or organisms. This may be accomplished by transduction with two (or more) vectors, or, more efficiently, in a single vector. Occasionally, it may be useful to coexpress homologous genes or chimeric proteins with regions of shared homology. Retroviridae include the dominant vector systems for gene transfer (e.g., gamma-retro and lentiviruses) and are capable of such multigene expression. However, these same viruses are known for efficient recombination–deletion when domains are duplicated within the viral genome. This problem can be averted by resorting to two-vector strategies (two-chain two-vector), but at a penalty to cost, convenience, and efficiency. Employing a chimeric antigen receptor system as an example, we confirm that coexpression of two genes with homologous domains in a single gamma-retroviral vector (two-chain single-vector) leads to recombination–deletion between repeated sequences, excising the equivalent of one of the chimeric antigen receptors. Here, we show that a degenerate codon substitution strategy in the two-chain single-vector format efficiently suppressed intravector deletional loss with rescue of balanced gene coexpression by minimizing sequence homology between repeated domains and preserving the final protein sequence. PMID:25419532
PanGEA: identification of allele specific gene expression using the 454 technology.

PubMed

Kofler, Robert; Teixeira Torres, Tatiana; Lelley, Tamas; Schlötterer, Christian

2009-05-14

Next generation sequencing technologies hold great potential for many biological questions. While mainly used for genomic sequencing, they are also very promising for gene expression profiling. Sequencing of cDNA does not only provide an estimate of the absolute expression level, it can also be used for the identification of allele specific gene expression. We developed PanGEA, a tool which enables a fast and user-friendly analysis of allele specific gene expression using the 454 technology. PanGEA allows mapping of 454-ESTs to genes or whole genomes, displaying gene expression profiles, identification of SNPs and the quantification of allele specific gene expression. The intuitive GUI of PanGEA facilitates a flexible and interactive analysis of the data. PanGEA additionally implements a modification of the Smith-Waterman algorithm which deals with incorrect estimates of homopolymer length as occuring in the 454 technology To our knowledge, PanGEA is the first tool which facilitates the identification of allele specific gene expression. PanGEA is distributed under the Mozilla Public License and available at: http://www.kofler.or.at/bioinformatics/PanGEA
PanGEA: Identification of allele specific gene expression using the 454 technology

PubMed Central

Kofler, Robert; Teixeira Torres, Tatiana; Lelley, Tamas; Schlötterer, Christian

2009-01-01

Background Next generation sequencing technologies hold great potential for many biological questions. While mainly used for genomic sequencing, they are also very promising for gene expression profiling. Sequencing of cDNA does not only provide an estimate of the absolute expression level, it can also be used for the identification of allele specific gene expression. Results We developed PanGEA, a tool which enables a fast and user-friendly analysis of allele specific gene expression using the 454 technology. PanGEA allows mapping of 454-ESTs to genes or whole genomes, displaying gene expression profiles, identification of SNPs and the quantification of allele specific gene expression. The intuitive GUI of PanGEA facilitates a flexible and interactive analysis of the data. PanGEA additionally implements a modification of the Smith-Waterman algorithm which deals with incorrect estimates of homopolymer length as occuring in the 454 technology Conclusion To our knowledge, PanGEA is the first tool which facilitates the identification of allele specific gene expression. PanGEA is distributed under the Mozilla Public License and available at: PMID:19442283
Exploiting CRISPR-Cas nucleases to produce sequence-specific antimicrobials.

PubMed

Bikard, David; Euler, Chad W; Jiang, Wenyan; Nussenzweig, Philip M; Goldberg, Gregory W; Duportet, Xavier; Fischetti, Vincent A; Marraffini, Luciano A

2014-11-01

Antibiotics target conserved bacterial cellular pathways or growth functions and therefore cannot selectively kill specific members of a complex microbial population. Here, we develop programmable, sequence-specific antimicrobials using the RNA-guided nuclease Cas9 (refs.1,2) delivered by a bacteriophage. We show that Cas9, reprogrammed to target virulence genes, kills virulent, but not avirulent, Staphylococcus aureus. Reprogramming the nuclease to target antibiotic resistance genes destroys staphylococcal plasmids that harbor antibiotic resistance genes and immunizes avirulent staphylococci to prevent the spread of plasmid-borne resistance genes. We also show that CRISPR-Cas9 antimicrobials function in vivo to kill S. aureus in a mouse skin colonization model. This technology creates opportunities to manipulate complex bacterial populations in a sequence-specific manner.
Avian acute leukemia viruses MC29 and MH2 share specific RNA sequences: Evidence for a second class of transforming genes

PubMed Central

Duesberg, Peter H.; Vogt, Peter K.

1979-01-01

The genome of the defective avian tumor virus MH2 was identified as a RNA of 5.7 kilobases by its presence in different MH2-helper virus complexes and its absence from pure helper virus, by its unique fingerprint pattern of RNase T1-resistant (T1) oligonucleotides that differed from those of two helper virus RNAs, and by its structural analogy to the RNA of MC29, another avian acute leukemia virus. Two sets of sequences were distinguished in MH2 RNA: 66% hybridized with DNA complementary to helper-independent avian tumor viruses, termed group-specific, and 34% were specific. The percentage of specific sequences is considered a minimal estimate because the MH2 RNA used was about 30% contaminated by helper virus RNA. No sequences related to the transforming src gene of avian sarcoma viruses were found in MH2. MH2 shared three large T1 oligonucleotides with MC29, two of which could also be isolated from a RNase A- and T1-resistant hybrid formed between MH2 RNA and MC29 specific cDNA. These oligonucleotides belong to a group of six that define the specific segment of MC29 RNA described previously. The group-specific sequences of MH2 and MC29 RNA shared only the two smallest out of about 20 T1 oligonucleotides associated with MH2 RNA. It is concluded that the specific sequences of MH2 and MC29 are related, and it is proposed that they are necessary for, or identical with, the onc genes of these viruses. These sequences would define a related class of transforming genes in avian tumor viruses that differs from the src genes of avian sarcoma viruses. Images PMID:221900
Rapid diversification of five Oryza AA genomes associated with rice adaptation.

PubMed

Zhang, Qun-Jie; Zhu, Ting; Xia, En-Hua; Shi, Chao; Liu, Yun-Long; Zhang, Yun; Liu, Yuan; Jiang, Wen-Kai; Zhao, You-Jie; Mao, Shu-Yan; Zhang, Li-Ping; Huang, Hui; Jiao, Jun-Ying; Xu, Ping-Zhen; Yao, Qiu-Yang; Zeng, Fan-Chun; Yang, Li-Li; Gao, Ju; Tao, Da-Yun; Wang, Yue-Ju; Bennetzen, Jeffrey L; Gao, Li-Zhi

2014-11-18

Comparative genomic analyses among closely related species can greatly enhance our understanding of plant gene and genome evolution. We report de novo-assembled AA-genome sequences for Oryza nivara, Oryza glaberrima, Oryza barthii, Oryza glumaepatula, and Oryza meridionalis. Our analyses reveal massive levels of genomic structural variation, including segmental duplication and rapid gene family turnover, with particularly high instability in defense-related genes. We show, on a genomic scale, how lineage-specific expansion or contraction of gene families has led to their morphological and reproductive diversification, thus enlightening the evolutionary process of speciation and adaptation. Despite strong purifying selective pressures on most Oryza genes, we documented a large number of positively selected genes, especially those genes involved in flower development, reproduction, and resistance-related processes. These diversifying genes are expected to have played key roles in adaptations to their ecological niches in Asia, South America, Africa and Australia. Extensive variation in noncoding RNA gene numbers, function enrichment, and rates of sequence divergence might also help account for the different genetic adaptations of these rice species. Collectively, these resources provide new opportunities for evolutionary genomics, numerous insights into recent speciation, a valuable database of functional variation for crop improvement, and tools for efficient conservation of wild rice germplasm.
Rapid diversification of five Oryza AA genomes associated with rice adaptation

PubMed Central

Zhang, Qun-Jie; Zhu, Ting; Xia, En-Hua; Shi, Chao; Liu, Yun-Long; Zhang, Yun; Liu, Yuan; Jiang, Wen-Kai; Zhao, You-Jie; Mao, Shu-Yan; Zhang, Li-Ping; Huang, Hui; Jiao, Jun-Ying; Xu, Ping-Zhen; Yao, Qiu-Yang; Zeng, Fan-Chun; Yang, Li-Li; Gao, Ju; Tao, Da-Yun; Wang, Yue-Ju; Bennetzen, Jeffrey L.; Gao, Li-Zhi

2014-01-01

Comparative genomic analyses among closely related species can greatly enhance our understanding of plant gene and genome evolution. We report de novo-assembled AA-genome sequences for Oryza nivara, Oryza glaberrima, Oryza barthii, Oryza glumaepatula, and Oryza meridionalis. Our analyses reveal massive levels of genomic structural variation, including segmental duplication and rapid gene family turnover, with particularly high instability in defense-related genes. We show, on a genomic scale, how lineage-specific expansion or contraction of gene families has led to their morphological and reproductive diversification, thus enlightening the evolutionary process of speciation and adaptation. Despite strong purifying selective pressures on most Oryza genes, we documented a large number of positively selected genes, especially those genes involved in flower development, reproduction, and resistance-related processes. These diversifying genes are expected to have played key roles in adaptations to their ecological niches in Asia, South America, Africa and Australia. Extensive variation in noncoding RNA gene numbers, function enrichment, and rates of sequence divergence might also help account for the different genetic adaptations of these rice species. Collectively, these resources provide new opportunities for evolutionary genomics, numerous insights into recent speciation, a valuable database of functional variation for crop improvement, and tools for efficient conservation of wild rice germplasm. PMID:25368197
Stratification of co-evolving genomic groups using ranked phylogenetic profiles

PubMed Central

Freilich, Shiri; Goldovsky, Leon; Gottlieb, Assaf; Blanc, Eric; Tsoka, Sophia; Ouzounis, Christos A

2009-01-01

Background Previous methods of detecting the taxonomic origins of arbitrary sequence collections, with a significant impact to genome analysis and in particular metagenomics, have primarily focused on compositional features of genomes. The evolutionary patterns of phylogenetic distribution of genes or proteins, represented by phylogenetic profiles, provide an alternative approach for the detection of taxonomic origins, but typically suffer from low accuracy. Herein, we present rank-BLAST, a novel approach for the assignment of protein sequences into genomic groups of the same taxonomic origin, based on the ranking order of phylogenetic profiles of target genes or proteins across the reference database. Results The rank-BLAST approach is validated by computing the phylogenetic profiles of all sequences for five distinct microbial species of varying degrees of phylogenetic proximity, against a reference database of 243 fully sequenced genomes. The approach - a combination of sequence searches, statistical estimation and clustering - analyses the degree of sequence divergence between sets of protein sequences and allows the classification of protein sequences according to the species of origin with high accuracy, allowing taxonomic classification of 64% of the proteins studied. In most cases, a main cluster is detected, representing the corresponding species. Secondary, functionally distinct and species-specific clusters exhibit different patterns of phylogenetic distribution, thus flagging gene groups of interest. Detailed analyses of such cases are provided as examples. Conclusion Our results indicate that the rank-BLAST approach can capture the taxonomic origins of sequence collections in an accurate and efficient manner. The approach can be useful both for the analysis of genome evolution and the detection of species groups in metagenomics samples. PMID:19860884
Mapping Second Chromosome Mutations to Defined Genomic Regions in Drosophila melanogaster

PubMed Central

Kahsai, Lily; Cook, Kevin R.

2017-01-01

Hundreds of Drosophila melanogaster stocks are currently maintained at the Bloomington Drosophila Stock Center with mutations that have not been associated with sequence-defined genes. They have been preserved because they have interesting loss-of-function phenotypes. The experimental value of these mutations would be increased by tying them to specific genomic intervals so that geneticists can more easily associate them with annotated genes. Here, we report the mapping of 85 second chromosome complementation groups in the Bloomington collection to specific, small clusters of contiguous genes or individual genes in the sequenced genome. This information should prove valuable to Drosophila geneticists interested in processes associated with particular phenotypes and those searching for mutations affecting specific sequence-defined genes. PMID:29066472
Sequence-Based Discovery Demonstrates That Fixed Light Chain Human Transgenic Rats Produce a Diverse Repertoire of Antigen-Specific Antibodies.

PubMed

Harris, Katherine E; Aldred, Shelley Force; Davison, Laura M; Ogana, Heather Anne N; Boudreau, Andrew; Brüggemann, Marianne; Osborn, Michael; Ma, Biao; Buelow, Benjamin; Clarke, Starlynn C; Dang, Kevin H; Iyer, Suhasini; Jorgensen, Brett; Pham, Duy T; Pratap, Payal P; Rangaswamy, Udaya S; Schellenberger, Ute; van Schooten, Wim C; Ugamraj, Harshad S; Vafa, Omid; Buelow, Roland; Trinklein, Nathan D

2018-01-01

We created a novel transgenic rat that expresses human antibodies comprising a diverse repertoire of heavy chains with a single common rearranged kappa light chain (IgKV3-15-JK1). This fixed light chain animal, called OmniFlic, presents a unique system for human therapeutic antibody discovery and a model to study heavy chain repertoire diversity in the context of a constant light chain. The purpose of this study was to analyze heavy chain variable gene usage, clonotype diversity, and to describe the sequence characteristics of antigen-specific monoclonal antibodies (mAbs) isolated from immunized OmniFlic animals. Using next-generation sequencing antibody repertoire analysis, we measured heavy chain variable gene usage and the diversity of clonotypes present in the lymph node germinal centers of 75 OmniFlic rats immunized with 9 different protein antigens. Furthermore, we expressed 2,560 unique heavy chain sequences sampled from a diverse set of clonotypes as fixed light chain antibody proteins and measured their binding to antigen by ELISA. Finally, we measured patterns and overall levels of somatic hypermutation in the full B-cell repertoire and in the 2,560 mAbs tested for binding. The results demonstrate that OmniFlic animals produce an abundance of antigen-specific antibodies with heavy chain clonotype diversity that is similar to what has been described with unrestricted light chain use in mammals. In addition, we show that sequence-based discovery is a highly effective and efficient way to identify a large number of diverse monoclonal antibodies to a protein target of interest.

Digital gene expression analysis of the zebra finch genome

PubMed Central

2010-01-01

Background In order to understand patterns of adaptation and molecular evolution it is important to quantify both variation in gene expression and nucleotide sequence divergence. Gene expression profiling in non-model organisms has recently been facilitated by the advent of massively parallel sequencing technology. Here we investigate tissue specific gene expression patterns in the zebra finch (Taeniopygia guttata) with special emphasis on the genes of the major histocompatibility complex (MHC). Results Almost 2 million 454-sequencing reads from cDNA of six different tissues were assembled and analysed. A total of 11,793 zebra finch transcripts were represented in this EST data, indicating a transcriptome coverage of about 65%. There was a positive correlation between the tissue specificity of gene expression and non-synonymous to synonymous nucleotide substitution ratio of genes, suggesting that genes with a specialised function are evolving at a higher rate (or with less constraint) than genes with a more general function. In line with this, there was also a negative correlation between overall expression levels and expression specificity of contigs. We found evidence for expression of 10 different genes related to the MHC. MHC genes showed relatively tissue specific expression levels and were in general primarily expressed in spleen. Several MHC genes, including MHC class I also showed expression in brain. Furthermore, for all genes with highest levels of expression in spleen there was an overrepresentation of several gene ontology terms related to immune function. Conclusions Our study highlights the usefulness of next-generation sequence data for quantifying gene expression in the genome as a whole as well as in specific candidate genes. Overall, the data show predicted patterns of gene expression profiles and molecular evolution in the zebra finch genome. Expression of MHC genes in particular, corresponds well with expression patterns in other vertebrates. PMID:20359325
The contribution of 700,000 ORF sequence tags to the definition of the human transcriptome

PubMed Central

Camargo, Anamaria A.; Samaia, Helena P. B.; Dias-Neto, Emmanuel; Simão, Daniel F.; Migotto, Italo A.; Briones, Marcelo R. S.; Costa, Fernando F.; Aparecida Nagai, Maria; Verjovski-Almeida, Sergio; Zago, Marco A.; Andrade, Luis Eduardo C.; Carrer, Helaine; El-Dorry, Hamza F. A.; Espreafico, Enilza M.; Habr-Gama, Angelita; Giannella-Neto, Daniel; Goldman, Gustavo H.; Gruber, Arthur; Hackel, Christine; Kimura, Edna T.; Maciel, Rui M. B.; Marie, Suely K. N.; Martins, Elizabeth A. L.; Nóbrega, Marina P.; Paçó-Larson, Maria Luisa; Pardini, Maria Inês M. C.; Pereira, Gonçalo G.; Pesquero, João Bosco; Rodrigues, Vanderlei; Rogatto, Silvia R.; da Silva, Ismael D. C. G.; Sogayar, Mari C.; Sonati, Maria de Fátima; Tajara, Eloiza H.; Valentini, Sandro R.; Alberto, Fernando L.; Amaral, Maria Elisabete J.; Aneas, Ivy; Arnaldi, Liliane A. T.; de Assis, Angela M.; Bengtson, Mário Henrique; Bergamo, Nadia Aparecida; Bombonato, Vanessa; de Camargo, Maria E. R.; Canevari, Renata A.; Carraro, Dirce M.; Cerutti, Janete M.; Corrêa, Maria Lucia C.; Corrêa, Rosana F. R.; Costa, Maria Cristina R.; Curcio, Cyntia; Hokama, Paula O. M.; Ferreira, Ari J. S.; Furuzawa, Gilberto K.; Gushiken, Tsieko; Ho, Paulo L.; Kimura, Elza; Krieger, José E.; Leite, Luciana C. C.; Majumder, Paromita; Marins, Mozart; Marques, Everaldo R.; Melo, Analy S. A.; Melo, Monica; Mestriner, Carlos Alberto; Miracca, Elisabete C.; Miranda, Daniela C.; Nascimento, Ana Lucia T. O.; Nóbrega, Francisco G.; Ojopi, Élida P. B.; Pandolfi, José Rodrigo C.; Pessoa, Luciana G.; Prevedel, Aline C.; Rahal, Paula; Rainho, Claudia A.; Reis, Eduardo M. R.; Ribeiro, Marcelo L.; da Rós, Nancy; de Sá, Renata G.; Sales, Magaly M.; Sant'anna, Simone Cristina; dos Santos, Mariana L.; da Silva, Aline M.; da Silva, Neusa P.; Silva, Wilson A.; da Silveira, Rosana A.; Sousa, Josane F.; Stecconi, Daniella; Tsukumo, Fernando; Valente, Valéria; Soares, Fernando; Moreira, Eloisa S.; Nunes, Diana N.; Correa, Ricardo G.; Zalcberg, Heloisa; Carvalho, Alex F.; Reis, Luis F. L.; Brentani, Ricardo R.; Simpson, Andrew J. G.; de Souza, Sandro J.

2001-01-01

Open reading frame expressed sequences tags (ORESTES) differ from conventional ESTs by providing sequence data from the central protein coding portion of transcripts. We generated a total of 696,745 ORESTES sequences from 24 human tissues and used a subset of the data that correspond to a set of 15,095 full-length mRNAs as a means of assessing the efficiency of the strategy and its potential contribution to the definition of the human transcriptome. We estimate that ORESTES sampled over 80% of all highly and moderately expressed, and between 40% and 50% of rarely expressed, human genes. In our most thoroughly sequenced tissue, the breast, the 130,000 ORESTES generated are derived from transcripts from an estimated 70% of all genes expressed in that tissue, with an equally efficient representation of both highly and poorly expressed genes. In this respect, we find that the capacity of the ORESTES strategy both for gene discovery and shotgun transcript sequence generation significantly exceeds that of conventional ESTs. The distribution of ORESTES is such that many human transcripts are now represented by a scaffold of partial sequences distributed along the length of each gene product. The experimental joining of the scaffold components, by reverse transcription–PCR, represents a direct route to transcript finishing that may represent a useful alternative to full-length cDNA cloning. PMID:11593022
The contribution of 700,000 ORF sequence tags to the definition of the human transcriptome.

PubMed

Camargo, A A; Samaia, H P; Dias-Neto, E; Simão, D F; Migotto, I A; Briones, M R; Costa, F F; Nagai, M A; Verjovski-Almeida, S; Zago, M A; Andrade, L E; Carrer, H; El-Dorry, H F; Espreafico, E M; Habr-Gama, A; Giannella-Neto, D; Goldman, G H; Gruber, A; Hackel, C; Kimura, E T; Maciel, R M; Marie, S K; Martins, E A; Nobrega, M P; Paco-Larson, M L; Pardini, M I; Pereira, G G; Pesquero, J B; Rodrigues, V; Rogatto, S R; da Silva, I D; Sogayar, M C; Sonati, M F; Tajara, E H; Valentini, S R; Alberto, F L; Amaral, M E; Aneas, I; Arnaldi, L A; de Assis, A M; Bengtson, M H; Bergamo, N A; Bombonato, V; de Camargo, M E; Canevari, R A; Carraro, D M; Cerutti, J M; Correa, M L; Correa, R F; Costa, M C; Curcio, C; Hokama, P O; Ferreira, A J; Furuzawa, G K; Gushiken, T; Ho, P L; Kimura, E; Krieger, J E; Leite, L C; Majumder, P; Marins, M; Marques, E R; Melo, A S; Melo, M B; Mestriner, C A; Miracca, E C; Miranda, D C; Nascimento, A L; Nobrega, F G; Ojopi, E P; Pandolfi, J R; Pessoa, L G; Prevedel, A C; Rahal, P; Rainho, C A; Reis, E M; Ribeiro, M L; da Ros, N; de Sa, R G; Sales, M M; Sant'anna, S C; dos Santos, M L; da Silva, A M; da Silva, N P; Silva, W A; da Silveira, R A; Sousa, J F; Stecconi, D; Tsukumo, F; Valente, V; Soares, F; Moreira, E S; Nunes, D N; Correa, R G; Zalcberg, H; Carvalho, A F; Reis, L F; Brentani, R R; Simpson, A J; de Souza, S J; Melo, M

2001-10-09

Open reading frame expressed sequences tags (ORESTES) differ from conventional ESTs by providing sequence data from the central protein coding portion of transcripts. We generated a total of 696,745 ORESTES sequences from 24 human tissues and used a subset of the data that correspond to a set of 15,095 full-length mRNAs as a means of assessing the efficiency of the strategy and its potential contribution to the definition of the human transcriptome. We estimate that ORESTES sampled over 80% of all highly and moderately expressed, and between 40% and 50% of rarely expressed, human genes. In our most thoroughly sequenced tissue, the breast, the 130,000 ORESTES generated are derived from transcripts from an estimated 70% of all genes expressed in that tissue, with an equally efficient representation of both highly and poorly expressed genes. In this respect, we find that the capacity of the ORESTES strategy both for gene discovery and shotgun transcript sequence generation significantly exceeds that of conventional ESTs. The distribution of ORESTES is such that many human transcripts are now represented by a scaffold of partial sequences distributed along the length of each gene product. The experimental joining of the scaffold components, by reverse transcription-PCR, represents a direct route to transcript finishing that may represent a useful alternative to full-length cDNA cloning.
KONAGAbase: a genomic and transcriptomic database for the diamondback moth, Plutella xylostella.

PubMed

Jouraku, Akiya; Yamamoto, Kimiko; Kuwazaki, Seigo; Urio, Masahiro; Suetsugu, Yoshitaka; Narukawa, Junko; Miyamoto, Kazuhisa; Kurita, Kanako; Kanamori, Hiroyuki; Katayose, Yuichi; Matsumoto, Takashi; Noda, Hiroaki

2013-07-09

The diamondback moth (DBM), Plutella xylostella, is one of the most harmful insect pests for crucifer crops worldwide. DBM has rapidly evolved high resistance to most conventional insecticides such as pyrethroids, organophosphates, fipronil, spinosad, Bacillus thuringiensis, and diamides. Therefore, it is important to develop genomic and transcriptomic DBM resources for analysis of genes related to insecticide resistance, both to clarify the mechanism of resistance of DBM and to facilitate the development of insecticides with a novel mode of action for more effective and environmentally less harmful insecticide rotation. To contribute to this goal, we developed KONAGAbase, a genomic and transcriptomic database for DBM (KONAGA is the Japanese word for DBM). KONAGAbase provides (1) transcriptomic sequences of 37,340 ESTs/mRNAs and 147,370 RNA-seq contigs which were clustered and assembled into 84,570 unigenes (30,695 contigs, 50,548 pseudo singletons, and 3,327 singletons); and (2) genomic sequences of 88,530 WGS contigs with 246,244 degenerate contigs and 106,455 singletons from which 6,310 de novo identified repeat sequences and 34,890 predicted gene-coding sequences were extracted. The unigenes and predicted gene-coding sequences were clustered and 32,800 representative sequences were extracted as a comprehensive putative gene set. These sequences were annotated with BLAST descriptions, Gene Ontology (GO) terms, and Pfam descriptions, respectively. KONAGAbase contains rich graphical user interface (GUI)-based web interfaces for easy and efficient searching, browsing, and downloading sequences and annotation data. Five useful search interfaces consisting of BLAST search, keyword search, BLAST result-based search, GO tree-based search, and genome browser are provided. KONAGAbase is publicly available from our website (http://dbm.dna.affrc.go.jp/px/) through standard web browsers. KONAGAbase provides DBM comprehensive transcriptomic and draft genomic sequences with useful annotation information with easy-to-use web interfaces, which helps researchers to efficiently search for target sequences such as insect resistance-related genes. KONAGAbase will be continuously updated and additional genomic/transcriptomic resources and analysis tools will be provided for further efficient analysis of the mechanism of insecticide resistance and the development of effective insecticides with a novel mode of action for DBM.
Digital transcriptome analysis of putative sex-determination genes in papaya (Carica papaya).

PubMed

Urasaki, Naoya; Tarora, Kazuhiko; Shudo, Ayano; Ueno, Hiroki; Tamaki, Moritoshi; Miyagi, Norimichi; Adaniya, Shinichi; Matsumura, Hideo

2012-01-01

Papaya (Carica papaya) is a trioecious plant species that has male, female and hermaphrodite flowers on different plants. The primitive sex chromosomes genetically determine the sex of the papaya. Although draft sequences of the papaya genome are already available, the genes for sex determination have not been identified, likely due to the complicated structure of its sex-chromosome sequences. To identify the candidate genes for sex determination, we conducted a transcriptome analysis of flower samples from male, female and hermaphrodite plants using high-throughput SuperSAGE for digital gene expression analysis. Among the short sequence tags obtained from the transcripts, 312 unique tags were specifically mapped to the primitive sex chromosome (X or Y(h)) sequences. An annotation analysis revealed that retroelements are the most abundant sequences observed in the genes corresponding to these tags. The majority of tags on the sex chromosomes were located on the X chromosome, and only 30 tags were commonly mapped to both the X and Y(h) chromosome, implying a loss of many genes on the Y(h) chromosome. Nevertheless, candidate Y(h) chromosome-specific female determination genes, including a MADS-box gene, were identified. Information on these sex chromosome-specific expressed genes will help elucidating sex determination in the papaya.
Digital Transcriptome Analysis of Putative Sex-Determination Genes in Papaya (Carica papaya)

PubMed Central

Urasaki, Naoya; Tarora, Kazuhiko; Shudo, Ayano; Ueno, Hiroki; Tamaki, Moritoshi; Miyagi, Norimichi; Adaniya, Shinichi; Matsumura, Hideo

2012-01-01

Papaya (Carica papaya) is a trioecious plant species that has male, female and hermaphrodite flowers on different plants. The primitive sex chromosomes genetically determine the sex of the papaya. Although draft sequences of the papaya genome are already available, the genes for sex determination have not been identified, likely due to the complicated structure of its sex-chromosome sequences. To identify the candidate genes for sex determination, we conducted a transcriptome analysis of flower samples from male, female and hermaphrodite plants using high-throughput SuperSAGE for digital gene expression analysis. Among the short sequence tags obtained from the transcripts, 312 unique tags were specifically mapped to the primitive sex chromosome (X or Yh) sequences. An annotation analysis revealed that retroelements are the most abundant sequences observed in the genes corresponding to these tags. The majority of tags on the sex chromosomes were located on the X chromosome, and only 30 tags were commonly mapped to both the X and Yh chromosome, implying a loss of many genes on the Yh chromosome. Nevertheless, candidate Yh chromosome-specific female determination genes, including a MADS-box gene, were identified. Information on these sex chromosome-specific expressed genes will help elucidating sex determination in the papaya. PMID:22815863
The Relation of Codon Bias to Tissue-Specific Gene Expression in Arabidopsis thaliana

PubMed Central

Camiolo, Salvatore; Farina, Lorenzo; Porceddu, Andrea

2012-01-01

The codon composition of coding sequences plays an important role in the regulation of gene expression. Herein, we report systematic differences in the usage of synonymous codons among Arabidopsis thaliana genes that are expressed specifically in distinct tissues. Although we observed that both regionally and transcriptionally associated mutational biases were associated significantly with codon bias, they could not explain the observed differences fully. Similarly, given that transcript abundances did not account for the differences in codon usage, it is unlikely that selection for translational efficiency can account exclusively for the observed codon bias. Thus, we considered the possible evolution of codon bias as an adaptive response to the different abundances of tRNAs in different tissues. Our analysis demonstrated that in some cases, codon usage in genes that were expressed in a broad range of tissues was influenced primarily by the tissue in which the gene was expressed maximally. On the basis of this finding we propose that genes that are expressed in certain tissues might show a tissue-specific compositional signature in relation to codon usage. These findings might have implications for the design of transgenes in relation to optimizing their expression. PMID:22865738
Hormone-induced modifications of the chromatin structure surrounding upstream regulatory regions conserved between the mouse and rabbit whey acidic protein genes.

PubMed Central

Millot, Benjamin; Montoliu, Lluís; Fontaine, Marie-Louise; Mata, Teresa; Devinoy, Eve

2003-01-01

The upstream regulatory regions of the mouse and rabbit whey acidic protein (WAP) genes have been used extensively to target the efficient expression of foreign genes into the mammary gland of transgenic animals. Therefore both regions have been studied to elucidate fully the mechanisms controlling WAP gene expression. Three DNase I-hypersensitive sites (HSS0, HSS1 and HSS2) have been described upstream of the rabbit WAP gene in the lactating mammary gland and correspond to important regulatory regions. These sites are surrounded by variable chromatin structures during mammary-gland development. In the present study, we describe the upstream sequence of the mouse WAP gene. Analysis of genomic sequences shows that the mouse WAP gene is situated between two widely expressed genes (Cpr2 and Ramp3). We show that the hypersensitive sites found upstream of the rabbit WAP gene are also detected in the mouse WAP gene. Further, they encompass functional signal transducer and activator of transcription 5-binding sites, as has been observed in the rabbit. A new hypersensitive site (HSS3), not specific to the mammary gland, was mapped 8 kb upstream of the rabbit WAP gene. Unlike the three HSSs described above, HSS3 is also detected in the liver, but similar to HSS1, it does not depend on lactogenic hormone treatments during cell culture. The region surrounding HSS3 encompasses a potential matrix attachment region, which is also conserved upstream of the mouse WAP gene and contains a functional transcription factor Ets-1 (E26 transformation-specific-1)-binding site. Finally, we demonstrate for the first time that variations in the chromatin structure are dependent on prolactin alone. PMID:12580766
Gene therapy for prostate cancer: where are we now?

PubMed

Steiner, M S; Gingrich, J R

2000-10-01

The ability to recombine specifically and alter DNA sequences followed by techniques to transfer these sequences or even whole genes into normal and diseased cells has revolutionized medical research and ushered the clinicians of today into the age of gene therapy. We provide urologists a review of relevant background information, outline current treatment strategies and clinical trials, and delineate current challenges facing the field of gene therapy for advanced prostate cancer. We comprehensively reviewed the literature, including PubMed and recent abstract proceedings from national meetings, relevant to gene therapy and advanced prostate cancer. We selected for review literature representative of the principal scientific background for current gene therapy strategies and National Institutes of Health Recombinant DNA Advisory Committee approved clinical trials. Current prostate cancer gene therapy strategies include correcting aberrant gene expression, exploiting programmed cell death pathways, targeting critical cell biological functions, introducing toxic or cell lytic suicide genes, enhancing the immune system antitumor response and combining treatment with conventional cytotoxic chemotherapy or radiation therapy. Many challenges lie ahead for gene therapy, including improving DNA transfer efficiency to cells locally and at distant sites, enhancing levels of gene expression and overcoming immune responses that limit the time that genes are expressed. Nevertheless, despite these current challenges it is almost certain that gene therapy will be part of the urological armamentarium against prostate cancer in this century.
A family of long intergenic non-coding RNA genes in human chromosomal region 22q11.2 carry a DNA translocation breakpoint/AT-rich sequence

PubMed Central

2018-01-01

FAM230C, a long intergenic non-coding RNA (lincRNA) gene in human chromosome 13 (chr13) is a member of lincRNA genes termed family with sequence similarity 230. An analysis using bioinformatics search tools and alignment programs was undertaken to determine properties of FAM230C and its related genes. Results reveal that the DNA translocation element, the Translocation Breakpoint Type A (TBTA) sequence, which consists of satellite DNA, Alu elements, and AT-rich sequences is embedded in the FAM230C gene. Eight lincRNA genes related to FAM230C also carry the TBTA sequences. These genes were formed from a large segment of the 3’ half of the FAM230C sequence duplicated in chr22, and are specifically in regions of low copy repeats (LCR22)s, in or close to the 22q.11.2 region. 22q11.2 is a chromosomal segment that undergoes a high rate of DNA translocation and is prone to genetic deletions. FAM230C-related genes present in other chromosomes do not carry the TBTA motif and were formed from the 5’ half region of the FAM230C sequence. These findings identify a high specificity in lincRNA gene formation by gene sequence duplication in different chromosomes. PMID:29668722
The Draft Genome Sequence of a Novel High-Efficient Butanol-Producing Bacterium Clostridium Diolis Strain WST.

PubMed

Chen, Chaoyang; Sun, Chongran; Wu, Yi-Rui

2018-03-21

A wild-type solventogenic strain Clostridium diolis WST, isolated from mangrove sediments, was characterized to produce high amount of butanol and acetone with negligible level of ethanol and acids from glucose via a unique acetone-butanol (AB) fermentation pathway. Through the genomic sequencing, the assembled draft genome of strain WST is calculated to be 5.85 Mb with a GC content of 29.69% and contains 5263 genes that contribute to the annotation of 5049 protein-coding sequences. Within these annotated genes, the butanol dehydrogenase gene (bdh) was determined to be in a higher amount from strain WST compared to other Clostridial strains, which is positively related to its high-efficient production of butanol. Therefore, we present a draft genome sequence analysis of strain WST in this article that should facilitate to further understand the solventogenic mechanism of this special microorganism.
[Correlation of codon biases and potential secondary structures with mRNA translation efficiency in unicellular organisms].

PubMed

Vladimirov, N V; Likhoshvaĭ, V A; Matushkin, Iu G

2007-01-01

Gene expression is known to correlate with degree of codon bias in many unicellular organisms. However, such correlation is absent in some organisms. Recently we demonstrated that inverted complementary repeats within coding DNA sequence must be considered for proper estimation of translation efficiency, since they may form secondary structures that obstruct ribosome movement. We have developed a program for estimation of potential coding DNA sequence expression in defined unicellular organism using its genome sequence. The program computes elongation efficiency index. Computation is based on estimation of coding DNA sequence elongation efficiency, taking into account three key factors: codon bias, average number of inverted complementary repeats, and free energy of potential stem-loop structures formed by the repeats. The influence of these factors on translation is numerically estimated. An optimal proportion of these factors is computed for each organism individually. Quantitative translational characteristics of 384 unicellular organisms (351 bacteria, 28 archaea, 5 eukaryota) have been computed using their annotated genomes from NCBI GenBank. Five potential evolutionary strategies of translational optimization have been determined among studied organisms. A considerable difference of preferred translational strategies between Bacteria and Archaea has been revealed. Significant correlations between elongation efficiency index and gene expression levels have been shown for two organisms (S. cerevisiae and H. pylori) using available microarray data. The proposed method allows to estimate numerically the coding DNA sequence translation efficiency and to optimize nucleotide composition of heterologous genes in unicellular organisms. http://www.mgs.bionet.nsc.ru/mgs/programs/eei-calculator/.
Oligo/Polynucleotide-Based Gene Modification: Strategies and Therapeutic Potential

PubMed Central

Sargent, R. Geoffrey; Kim, Soya

2011-01-01

Oligonucleotide- and polynucleotide-based gene modification strategies were developed as an alternative to transgene-based and classical gene targeting-based gene therapy approaches for treatment of genetic disorders. Unlike the transgene-based strategies, oligo/polynucleotide gene targeting approaches maintain gene integrity and the relationship between the protein coding and gene-specific regulatory sequences. Oligo/polynucleotide-based gene modification also has several advantages over classical vector-based homologous recombination approaches. These include essentially complete homology to the target sequence and the potential to rapidly engineer patient-specific oligo/polynucleotide gene modification reagents. Several oligo/polynucleotide-based approaches have been shown to successfully mediate sequence-specific modification of genomic DNA in mammalian cells. The strategies involve the use of polynucleotide small DNA fragments, triplex-forming oligonucleotides, and single-stranded oligodeoxynucleotides to mediate homologous exchange. The primary focus of this review will be on the mechanistic aspects of the small fragment homologous replacement, triplex-forming oligonucleotide-mediated, and single-stranded oligodeoxynucleotide-mediated gene modification strategies as it relates to their therapeutic potential. PMID:21417933
Cloning, sequencing and characterization of lipase genes from a polyhydroxyalkanoate- (PHA-) synthesizing Pseudomonas resinovorans

USDA-ARS?s Scientific Manuscript database

Lipase (lip) and lipase-specific foldase (lif) genes of a biodegradable polyhydroxyalkanoate- (PHA-) synthesizing Pseudomonas resinovorans NRRL B-2649 were cloned using primers based on consensus sequences, followed by PCR-based genome walking. Sequence analyses showed a putative Lip gene-product (...
Adenovirus EIIA early promoter: transcriptional control elements and induction by the viral pre-early EIA gene, which appears to be sequence independent.

PubMed Central

Murthy, S C; Bhat, G P; Thimmappaya, B

1985-01-01

A molecular dissection of the adenovirus EIIA early (E) promoter was undertaken to study the sequence elements required for transcription and to examine the nucleotide sequences, if any, specific for its trans-activation by the viral pre-early EIA gene product. A chimeric gene in which the EIIA-E promoter region fused to the coding sequences of the bacterial chloramphenicol acetyltransferase (CAT) gene was used in transient assays to identify the transcriptional control regions. Deletion mapping studies revealed that the upstream DNA sequences up to -86 were sufficient for the optimal basal level transcription in HeLa cells and also for the EIA-induced transcription. A series of linker-scanning (LS) mutants were constructed to precisely identify the nucleotide sequences that control transcription. Analysis of these LS mutants allowed us to identify two regions of the promoter that are critical for the EIIA-E transcription. These regions are located between -29 and -21 (region I) and between -82 and -66 (region II). Mutations in region I affected initiation and appeared functionally similar to the "TATA" sequence of the commonly studied promoters. To examine whether or not the EIIA-E promoter contained DNA sequences specific for the trans-activation by the EIA, the LS mutants were analyzed in a cotransfection assay containing a plasmid carrying the EIA gene. CAT activity of all of the LS mutants was induced by the EIA gene in this assay, suggesting that the induction of transcription of the EIIA-E promoter by the EIA gene is not sequence-specific. Images PMID:3857577
Enhanced green fluorescent protein (egfp) gene expression in Tetraselmis subcordiformis chloroplast with endogenous regulators.

PubMed

Cui, Yulin; Zhao, Jialin; Hou, Shichang; Qin, Song

2016-05-01

On the basis of fundamental genetic transformation technologies, the goal of this study was to optimize Tetraselmis subcordiformis chloroplast transformation through the use of endogenous regulators. The genes rrn16S, rbcL, psbA, and psbC are commonly highly expressed in chloroplasts, and the regulators of these genes are often used in chloroplast transformation. For lack of a known chloroplast genome sequence, the genome-walking method was used here to obtain full sequences of T. subcordiformis endogenous regulators. The resulting regulators, including three promoters, two terminators, and a ribosome combination sequence, were inserted into the previously constructed plasmid pPSC-R, with the egfp gene included as a reporter gene, and five chloroplast expression vectors prepared. These vectors were successfully transformed into T. subcordiformis by particle bombardment and the efficiency of each vector tested by assessing EGFP fluorescence via microscopy. The results showed that these vectors exhibited higher efficiency than the former vector pPSC-G carrying exogenous regulators, and the vector pRFA with Prrn, psbA-5'RE, and TpsbA showed the highest efficiency. This research provides a set of effective endogenous regulators for T. subcordiformis and will facilitate future fundamental studies of this alga.
Improved bioactivity of G-rich triplex-forming oligonucleotides containing modified guanine bases

PubMed Central

Rogers, Faye A; Lloyd, Janice A; Tiwari, Meetu Kaushik

2014-01-01

Triplex structures generated by sequence-specific triplex-forming oligonucleotides (TFOs) have proven to be promising tools for gene targeting strategies. In addition, triplex technology has been highly utilized to study the molecular mechanisms of DNA repair, recombination and mutagenesis. However, triplex formation utilizing guanine-rich oligonucleotides as third strands can be inhibited by potassium-induced self-association resulting in G-quadruplex formation. We report here that guanine-rich TFOs partially substituted with 8-aza-7-deaza-guanine (PPG) have improved target site binding in potassium compared with TFOs containing the natural guanine base. We designed PPG-substituted TFOs to bind to a polypurine sequence in the supFG1 reporter gene. The binding efficiency of PPG-substituted TFOs to the target sequence was analyzed using electrophoresis mobility gel shift assays. We have determined that in the presence of potassium, the non-substituted TFO, AG30 did not bind to its target sequence, however binding was observed with the PPG-substituted AG30 under conditions with up to 140 mM KCl. The PPG-TFOs were able to maintain their ability to induce genomic modifications as measured by an assay for gene-targeted mutagenesis. In addition, these compounds were capable of triplex-induced DNA double strand breaks, which resulted in activation of apoptosis. PMID:25483840
Genome sequence analysis of a flocculant-producing bacterium, Paenibacillus shenyangensis.

PubMed

Fu, Lili; Jiang, Binhui; Liu, Jinliang; Zhao, Xin; Liu, Qian; Hu, Xiaomin

2016-03-01

To explore the metabolic process of Paenibacillus shenyangensis that is an efficient bioflocculant-producing bacterium. The biosynthesis mechanism of bioflocculation was used to enrich the genome of Paenibacillus shenyangensis and provide a basis for molecular genetics and functional genomics analyses. According to the analysis of de novo assembly, a total of 5,501,467 bp clean reads were generated, and were assembled into 92 contigs. 4800 unigenes were predicted of which 4393 were annotated showing a specific gene function in the NCBI-Nr database. 3423 genes were found in the database of cluster of orthologous groups. Among the 168 Kyoto Encyclopedia of Genes and Genomes database, cell growth and metabolism were the main biological processes, and a potential metabolic pathway was predicted from glucose to exopolysaccharide within the starch and sucrose metabolism pathway. By using the high-throughput sequencing technology, we provide a genome analysis of Paenibacillus shenyangensis that predicts the main metabolic processes and a potential pathway of exopolysaccharide biosynthesis.
RiboMaker: computational design of conformation-based riboregulation.

PubMed

Rodrigo, Guillermo; Jaramillo, Alfonso

2014-09-01

The ability to engineer control systems of gene expression is instrumental for synthetic biology. Thus, bioinformatic methods that assist such engineering are appealing because they can guide the sequence design and prevent costly experimental screening. In particular, RNA is an ideal substrate to de novo design regulators of protein expression by following sequence-to-function models. We have implemented a novel algorithm, RiboMaker, aimed at the computational, automated design of bacterial riboregulation. RiboMaker reads the sequence and structure specifications, which codify for a gene regulatory behaviour, and optimizes the sequences of a small regulatory RNA and a 5'-untranslated region for an efficient intermolecular interaction. To this end, it implements an evolutionary design strategy, where random mutations are selected according to a physicochemical model based on free energies. The resulting sequences can then be tested experimentally, providing a new tool for synthetic biology, and also for investigating the riboregulation principles in natural systems. Web server is available at http://ribomaker.jaramillolab.org/. Source code, instructions and examples are freely available for download at http://sourceforge.net/projects/ribomaker/. © The Author 2014. Published by Oxford University Press. All rights reserved. For Permissions, please e-mail: journals.permissions@oup.com.
Novel mechanism and factor for regulation by HIV-1 Tat.

PubMed Central

Zhou, Q; Sharp, P A

1995-01-01

Tat regulation of human immunodeficiency virus (HIV) transcription is unique because of its specificity for an RNA target, TAR, and its ability to increase the efficiency of elongation by polymerase. A reconstituted reaction that is Tat-specific and TAR-dependent for activation of HIV transcription has been used to identify and partially purify a cellular activity that is required for trans-activation by Tat, but not by other activators. In the reaction, Tat stimulates the efficiency of elongation by polymerase, whereas Sp1 and other DNA sequence-specific transcription factors activate the rate of initiation. Furthermore, while TATA binding protein (TBP)-associated factors (TAFs) in the TFIID complex are required for activation by transcription factors, they are dispensable for Tat function. Thus, Tat acts through a novel mechanism, which is mediated by a specific host cellular factor, to stimulate HIV-1 gene expression. Images PMID:7835343

Deep-sea vent phage DNA polymerase specifically initiates DNA synthesis in the absence of primers.

PubMed

Zhu, Bin; Wang, Longfei; Mitsunobu, Hitoshi; Lu, Xueling; Hernandez, Alfredo J; Yoshida-Takashima, Yukari; Nunoura, Takuro; Tabor, Stanley; Richardson, Charles C

2017-03-21

A DNA polymerase is encoded by the deep-sea vent phage NrS-1. NrS-1 has a unique genome organization containing genes that are predicted to encode a helicase and a single-stranded DNA (ssDNA)-binding protein. The gene for an unknown protein shares weak homology with the bifunctional primase-polymerases (prim-pols) from archaeal plasmids but is missing the zinc-binding domain typically found in primases. We show that this gene product has efficient DNA polymerase activity and is processive in DNA synthesis in the presence of the NrS-1 helicase and ssDNA-binding protein. Remarkably, this NrS-1 DNA polymerase initiates DNA synthesis from a specific template DNA sequence in the absence of any primer. The de novo DNA polymerase activity resides in the N-terminal domain of the protein, whereas the C-terminal domain enhances DNA binding.
Development of sequence-specific antimicrobials based on programmable CRISPR-Cas nucleases

PubMed Central

Bikard, David; Euler, Chad; Jiang, Wenyan; Nussenzweig, Philip M.; Goldberg, Gregory W.; Duportet, Xavier; Fischetti, Vincent A.; Marraffini, Luciano A.

2014-01-01

Antibiotics target conserved bacterial cellular pathways or growth functions and therefore cannot selectively kill specific members of a complex microbial population. Here, we develop programmable, sequence-specific antimicrobials using the RNA-guided nuclease Cas91, 2 delivered by a bacteriophage. We show that Cas9 re-programmed to target virulence genes kills virulent, but not avirulent, Staphylococcus aureus. Re-programming the nuclease to target antibiotic resistance genes destroys staphylococcal plasmids that harbor antibiotic resistance genes3, 4 and immunizes avirulent staphylococci to prevent the spread of plasmid-borne resistance genes. We also demonstrate the approach in vivo, showing its efficacy against S. aureus in a mouse skin colonization model. This new technology creates opportunities to manipulate complex bacterial populations in a sequence-specific manner. PMID:25282355
Associations between single nucleotide polymorphisms in multiple candidate genes and body weight in rabbits

PubMed Central

El-Sabrout, Karim; Aggag, Sarah A.

2017-01-01

Aim: In this study, we examined parts of six growth genes (growth hormone [GH], melanocortin 4 receptor [MC4R], growth hormone receptor [GHR], phosphorglycerate mutase [PGAM], myostatin [MSTN], and fibroblast growth factor [FGF]) as specific primers for two rabbit lines (V-line, Alexandria) using nucleotide sequence analysis, to investigate association between detecting single nucleotide polymorphism (SNP) of these genes and body weight (BW) at market. Materials and Methods: Each line kits were grouped into high and low weight rabbits to identify DNA markers useful for association studies with high BW. DNA from blood samples of each group was extracted to amplify the six growth genes. SNP technique was used to study the associate polymorphism in the six growth genes and marketing BW (at 63 days) in the two rabbit lines. The purified polymerase chain reaction products were sequenced in those had the highest and lowest BW in each line. Results: Alignment of sequence data from each group revealed the following SNPs: At nucleotide 23 (A-C) and nucleotide 35 (T-G) in MC4R gene (sense mutation) of Alexandria and V-line high BW. Furthermore, we detected the following SNPs variation between the two lines: A SNP (T-C) at nucleotide 27 was identified by MC4R gene (sense mutation) and another one (A-C) at nucleotide 14 was identified by GHR gene (nonsense mutation) of Alexandria line. The results of individual BW at market (63 days) indicated that Alexandria rabbits had significantly higher BW compared with V-line rabbits. MC4R polymorphism showed significant association with high BW in rabbits. Conclusion: The results of polymorphism demonstrate the possibility to detect an association between BW in rabbits and the efficiency of the used primers to predict through the genetic specificity using the SNP of MC4R. PMID:28246458
The rapid evolution of molecular genetic diagnostics in neuromuscular diseases.

PubMed

Volk, Alexander E; Kubisch, Christian

2017-10-01

The development of massively parallel sequencing (MPS) has revolutionized molecular genetic diagnostics in monogenic disorders. The present review gives a brief overview of different MPS-based approaches used in clinical diagnostics of neuromuscular disorders (NMDs) and highlights their advantages and limitations. MPS-based approaches like gene panel sequencing, (whole) exome sequencing, (whole) genome sequencing, and RNA sequencing have been used to identify the genetic cause in NMDs. Although gene panel sequencing has evolved as a standard test for heterogeneous diseases, it is still debated, mainly because of financial issues and unsolved problems of variant interpretation, whether genome sequencing (and to a lesser extent also exome sequencing) of single patients can already be regarded as routine diagnostics. However, it has been shown that the inclusion of parents and additional family members often leads to a substantial increase in the diagnostic yield in exome-wide/genome-wide MPS approaches. In addition, MPS-based RNA sequencing just enters the research and diagnostic scene. Next-generation sequencing increasingly enables the detection of the genetic cause in highly heterogeneous diseases like NMDs in an efficient and affordable way. Gene panel sequencing and family-based exome sequencing have been proven as potent and cost-efficient diagnostic tools. Although clinical validation and interpretation of genome sequencing is still challenging, diagnostic RNA sequencing represents a promising tool to bypass some hurdles of diagnostics using genomic DNA.
[Development of specific and degenerated primers to CesA genes encoding flax (Linum usitatissimum L.) cellulose synthase].

PubMed

Grushetskaia, Z E; Lemesh, V A; Khotyleva, L V

2010-01-01

Cellulose synthase catalytic subunit genes, CesA, have been discovered in several higher plant species, and it has been shown that the CesA gene family has multiple members. HVR2 fragment of these genes determine the class specificity of the CESA protein and its participation in the primary or secondary cell wall synthesis. The aim of this study was development of specific and degenerated primers to flax CesA gene fragments leading to obtaining the class specific HVR2 region of the gene. Two pairs of specific primers to the certain fragments of CesA-1 and CesA-6 genes and one pair of degenerated primers to HVR2 region of all flax CesA genes were developed basing on comparison of six CesA EST sequences of flax and full cDNA sequences of Arabidopsis, poplar, maize and cotton plants, obtained from GenBank. After amplification of flax cDNA, the bands of expected size were detected (201 and 300 b.p. for the CesA-1 and CesA-6, and 600 b.p. for the HVR2 region of CesA respectively). The developed markers can be used for cloning and sequencing of flax CesA genes, identifying their number in flax genome, tissue and stage specificity.
Genome sequence of the lager brewing yeast, an interspecies hybrid.

PubMed

Nakao, Yoshihiro; Kanamori, Takeshi; Itoh, Takehiko; Kodama, Yukiko; Rainieri, Sandra; Nakamura, Norihisa; Shimonaga, Tomoko; Hattori, Masahira; Ashikari, Toshihiko

2009-04-01

This work presents the genome sequencing of the lager brewing yeast (Saccharomyces pastorianus) Weihenstephan 34/70, a strain widely used in lager beer brewing. The 25 Mb genome comprises two nuclear sub-genomes originating from Saccharomyces cerevisiae and Saccharomyces bayanus and one circular mitochondrial genome originating from S. bayanus. Thirty-six different types of chromosomes were found including eight chromosomes with translocations between the two sub-genomes, whose breakpoints are within the orthologous open reading frames. Several gene loci responsible for typical lager brewing yeast characteristics such as maltotriose uptake and sulfite production have been increased in number by chromosomal rearrangements. Despite an overall high degree of conservation of the synteny with S. cerevisiae and S. bayanus, the syntenies were not well conserved in the sub-telomeric regions that contain lager brewing yeast characteristic and specific genes. Deletion of larger chromosomal regions, a massive unilateral decrease of the ribosomal DNA cluster and bilateral truncations of over 60 genes reflect a post-hybridization evolution process. Truncations and deletions of less efficient maltose and maltotriose uptake genes may indicate the result of adaptation to brewing. The genome sequence of this interspecies hybrid yeast provides a new tool for better understanding of lager brewing yeast behavior in industrial beer production.
Genome Sequence of the Lager Brewing Yeast, an Interspecies Hybrid

PubMed Central

Nakao, Yoshihiro; Kanamori, Takeshi; Itoh, Takehiko; Kodama, Yukiko; Rainieri, Sandra; Nakamura, Norihisa; Shimonaga, Tomoko; Hattori, Masahira; Ashikari, Toshihiko

2009-01-01

This work presents the genome sequencing of the lager brewing yeast (Saccharomyces pastorianus) Weihenstephan 34/70, a strain widely used in lager beer brewing. The 25 Mb genome comprises two nuclear sub-genomes originating from Saccharomyces cerevisiae and Saccharomyces bayanus and one circular mitochondrial genome originating from S. bayanus. Thirty-six different types of chromosomes were found including eight chromosomes with translocations between the two sub-genomes, whose breakpoints are within the orthologous open reading frames. Several gene loci responsible for typical lager brewing yeast characteristics such as maltotriose uptake and sulfite production have been increased in number by chromosomal rearrangements. Despite an overall high degree of conservation of the synteny with S. cerevisiae and S. bayanus, the syntenies were not well conserved in the sub-telomeric regions that contain lager brewing yeast characteristic and specific genes. Deletion of larger chromosomal regions, a massive unilateral decrease of the ribosomal DNA cluster and bilateral truncations of over 60 genes reflect a post-hybridization evolution process. Truncations and deletions of less efficient maltose and maltotriose uptake genes may indicate the result of adaptation to brewing. The genome sequence of this interspecies hybrid yeast provides a new tool for better understanding of lager brewing yeast behavior in industrial beer production. PMID:19261625
Complete cDNA sequence and amino acid analysis of a bovine ribonuclease K6 gene.

PubMed

Pietrowski, D; Förster, M

2000-01-01

The complete cDNA sequence of a ribonuclease k6 gene of Bos Taurus has been determined. It codes for a protein with 154 amino acids and contains the invariant cysteine, histidine and lysine residues as well as the characteristic motifs specific to ribonuclease active sites. The deduced protein sequence is 27 residues longer than other known ribonucleases k6 and shows amino acids exchanges which could reflect a strain specificity or polymorphism within the bovine genome. Based on sequence similarity we have termed the identified gene bovine ribonuclease k6 b (brk6b).
Novel Method for High-Throughput Full-Length IGHV-D-J Sequencing of the Immune Repertoire from Bulk B-Cells with Single-Cell Resolution.

PubMed

Vergani, Stefano; Korsunsky, Ilya; Mazzarello, Andrea Nicola; Ferrer, Gerardo; Chiorazzi, Nicholas; Bagnara, Davide

2017-01-01

Efficient and accurate high-throughput DNA sequencing of the adaptive immune receptor repertoire (AIRR) is necessary to study immune diversity in healthy subjects and disease-related conditions. The high complexity and diversity of the AIRR coupled with the limited amount of starting material, which can compromise identification of the full biological diversity makes such sequencing particularly challenging. AIRR sequencing protocols often fail to fully capture the sampled AIRR diversity, especially for samples containing restricted numbers of B lymphocytes. Here, we describe a library preparation method for immunoglobulin sequencing that results in an exhaustive full-length repertoire where virtually every sampled B-cell is sequenced. This maximizes the likelihood of identifying and quantifying the entire IGHV-D-J repertoire of a sample, including the detection of rearrangements present in only one cell in the starting population. The methodology establishes the importance of circumventing genetic material dilution in the preamplification phases and incorporates the use of certain described concepts: (1) balancing the starting material amount and depth of sequencing, (2) avoiding IGHV gene-specific amplification, and (3) using Unique Molecular Identifier. Together, this methodology is highly efficient, in particular for detecting rare rearrangements in the sampled population and when only a limited amount of starting material is available.
TargetLink, a new method for identifying the endogenous target set of a specific microRNA in intact living cells.

PubMed

Xu, Yan; Chen, Yan; Li, Daliang; Liu, Qing; Xuan, Zhenyu; Li, Wen-Hong

2017-02-01

MicroRNAs are small non-coding RNAs acting as posttranscriptional repressors of gene expression. Identifying mRNA targets of a given miRNA remains an outstanding challenge in the field. We have developed a new experimental approach, TargetLink, that applied locked nucleic acid (LNA) as the affinity probe to enrich target genes of a specific microRNA in intact cells. TargetLink also consists a rigorous and systematic data analysis pipeline to identify target genes by comparing LNA-enriched sequences between experimental and control samples. Using miR-21 as a test microRNA, we identified 12 target genes of miR-21 in a human colorectal cancer cell by this approach. The majority of the identified targets interacted with miR-21 via imperfect seed pairing. Target validation confirmed that miR-21 repressed the expression of the identified targets. The cellular abundance of the identified miR-21 target transcripts varied over a wide range, with some targets expressed at a rather low level, confirming that both abundant and rare transcripts are susceptible to regulation by microRNAs, and that TargetLink is an efficient approach for identifying the target set of a specific microRNA in intact cells. C20orf111, one of the novel targets identified by TargetLink, was found to reside in the nuclear speckle and to be reliably repressed by miR-21 through the interaction at its coding sequence.
Cloning and expression of the gene for bacteriophage T7 RNA polymerase

DOEpatents

Studier, F.W.; Davanloo, P.; Rosenberg, A.H.

1984-03-30

This application describes a means to clone a functional gene for bacteriophage T7 RNA polymerase. Active T7 RNA polymerase is produced from the cloned gene, and a plasmid has been constructed that can produce the active enzyme in large amounts. T7 RNA polymerase transcribes DNA very efficiently and is highly selective for a relatively long promoter sequence. This enzyme is useful for synthesizing large amounts of RNA in vivo or in vitro, and is capable of producing a single RNA selectively from a complex mixture of DNAs. The procedure used to obtain a clone of the T7 RNA polymerase gene can be applied to other T7-like phages to obtain clones that produce RNA polymerases having different promoter specificities, different bacterial hosts, or other desirable properties.
Enrichment of individual KIR2DL4 sequences from genomic DNA using long-template PCR and allele-specific hybridization to magnetic bead-bound oligonucleotide probes.

PubMed

Roberts, C H; Turino, C; Madrigal, J A; Marsh, S G E

2007-06-01

DNA enrichment by allele-specific hybridization (DEASH) was used as a means to isolate individual alleles of the killer cell immunoglobulin-like receptor (KIR2DL4) gene from heterozygous genomic DNA. Using long-template polymerase chain reaction (LT-PCR), the complete KIR2DL4 gene was amplified from a cell line that had previously been characterized for its KIR gene content by PCR using sequence-specific primers (PCR-SSP). The whole gene amplicons were sequenced and we identified two heterozygous positions in accordance with the predictions of the PCR-SSP. The amplicons were then hybridized to allele-specific, biotinylated oligonucleotide probes and through binding to streptavidin-coated beads, the targeted alleles were enriched. A second PCR amplified only the exonic regions of the enriched allele, and these were then sequenced in full. We show DEASH to be capable of enriching single alleles from a heterozygous PCR product, and through sequencing the enriched DNA, we are able to produce complete coding sequences of the KIR2DL4 alleles in accordance with the typing predicted by PCR-SSP.
From famine to feast? Selecting nuclear DNA sequence loci for plant species-level phylogeny reconstruction

PubMed Central

Hughes, Colin E; Eastwood, Ruth J; Donovan Bailey, C

2005-01-01

Phylogenetic analyses of DNA sequences have prompted spectacular progress in assembling the Tree of Life. However, progress in constructing phylogenies among closely related species, at least for plants, has been less encouraging. We show that for plants, the rapid accumulation of DNA characters at higher taxonomic levels has not been matched by conventional sequence loci at the species level, leaving a lack of well-resolved gene trees that is hindering investigations of many fundamental questions in plant evolutionary biology. The most popular approach to address this problem has been to use low-copy nuclear genes as a source of DNA sequence data. However, this has had limited success because levels of variation among nuclear intron sequences across groups of closely related species are extremely variable and generally lower than conventionally used loci, and because no universally useful low-copy nuclear DNA sequence loci have been developed. This suggests that solutions will, for the most part, be lineage-specific, prompting a move away from ‘universal’ gene thinking for species-level phylogenetics. The benefits and limitations of alternative approaches to locate more variable nuclear loci are discussed and the potential of anonymous non-genic nuclear loci is highlighted. Given the virtually unlimited number of loci that can be generated using these new approaches, it is clear that effective screening will be critical for efficient selection of the most informative loci. Strategies for screening are outlined. PMID:16553318
Secretome Characterization and Correlation Analysis Reveal Putative Pathogenicity Mechanisms and Identify Candidate Avirulence Genes in the Wheat Stripe Rust Fungus Puccinia striiformis f. sp. tritici.

PubMed

Xia, Chongjing; Wang, Meinan; Cornejo, Omar E; Jiwan, Derick A; See, Deven R; Chen, Xianming

2017-01-01

Stripe (yellow) rust, caused by Puccinia striiformis f. sp. tritici ( Pst ), is one of the most destructive diseases of wheat worldwide. Planting resistant cultivars is an effective way to control this disease, but race-specific resistance can be overcome quickly due to the rapid evolving Pst population. Studying the pathogenicity mechanisms is critical for understanding how Pst virulence changes and how to develop wheat cultivars with durable resistance to stripe rust. We re-sequenced 7 Pst isolates and included additional 7 previously sequenced isolates to represent balanced virulence/avirulence profiles for several avirulence loci in seretome analyses. We observed an uneven distribution of heterozygosity among the isolates. Secretome comparison of Pst with other rust fungi identified a large portion of species-specific secreted proteins, suggesting that they may have specific roles when interacting with the wheat host. Thirty-two effectors of Pst were identified from its secretome. We identified candidates for Avr genes corresponding to six Yr genes by correlating polymorphisms for effector genes to the virulence/avirulence profiles of the 14 Pst isolates. The putative AvYr76 was present in the avirulent isolates, but absent in the virulent isolates, suggesting that deleting the coding region of the candidate avirulence gene has produced races virulent to resistance gene Yr76 . We conclude that incorporating avirulence/virulence phenotypes into correlation analysis with variations in genomic structure and secretome, particularly presence/absence polymorphisms of effectors, is an efficient way to identify candidate Avr genes in Pst . The candidate effector genes provide a rich resource for further studies to determine the evolutionary history of Pst populations and the co-evolutionary arms race between Pst and wheat. The Avr candidates identified in this study will lead to cloning avirulence genes in Pst , which will enable us to understand molecular mechanisms underlying Pst -wheat interactions, to determine the effectiveness of resistance genes and further to develop durable resistance to stripe rust.
Insertion and deletion mutagenesis of the human cytomegalovirus genome

DOE Office of Scientific and Technical Information (OSTI.GOV)

Spaete, R.R.; Mocarski, E.S.

1987-10-01

Studies on human cytomegalovirus (CMV) have been limited by a paucity of molecular genetic techniques available for manipulating the viral genome. The authors have developed methods for site-specific insertion and deletion mutagenesis of CMV utilizing a modified Escherichia coli lacZ gene as a genetic marker. The lacZ gene was placed under the control of the major ..beta.. gene regulatory signals and inserted into the viral genome by homologous recombination, disrupting one of two copies of this ..beta.. gene within the L-component repeats of CMV DNA. They observed high-level expression of ..beta..-galactosidase by the recombinant in a temporally authentic manner, withmore » levels of this enzyme approaching 1% of total protein in infected cells. Thus, CMV is an efficient vector for high-level expression of foreign gene products in human cells. Using back selection of lacZ-deficient virus in the presence of the chromogenic substrate 5-bromo-4-chloro-3-indolyl ..beta..-D-galactoside, they generated random endpoint deletion mutants. Analysis of these mutant revealed that CMV DNA sequences flanking the insert had been removed, thereby establishing this approach as a means of determining whether sequences flanking a lacZ insertion are dispensable for viral growth. In an initial test of the methods, they have shown that 7800 base pairs of one copy of L-component repeat sequences can be deleted without affecting viral growth in human fibroblasts.« less
Optimizing sgRNA structure to improve CRISPR-Cas9 knockout efficiency.

PubMed

Dang, Ying; Jia, Gengxiang; Choi, Jennie; Ma, Hongming; Anaya, Edgar; Ye, Chunting; Shankar, Premlata; Wu, Haoquan

2015-12-15

Single-guide RNA (sgRNA) is one of the two key components of the clustered regularly interspaced short palindromic repeats (CRISPR)-Cas9 genome-editing system. The current commonly used sgRNA structure has a shortened duplex compared with the native bacterial CRISPR RNA (crRNA)-transactivating crRNA (tracrRNA) duplex and contains a continuous sequence of thymines, which is the pause signal for RNA polymerase III and thus could potentially reduce transcription efficiency. Here, we systematically investigate the effect of these two elements on knockout efficiency and showed that modifying the sgRNA structure by extending the duplex length and mutating the fourth thymine of the continuous sequence of thymines to cytosine or guanine significantly, and sometimes dramatically, improves knockout efficiency in cells. In addition, the optimized sgRNA structure also significantly increases the efficiency of more challenging genome-editing procedures, such as gene deletion, which is important for inducing a loss of function in non-coding genes. By a systematic investigation of sgRNA structure we find that extending the duplex by approximately 5 bp combined with mutating the continuous sequence of thymines at position 4 to cytosine or guanine significantly increases gene knockout efficiency in CRISPR-Cas9-based genome editing experiments.
Molecular Characterization and Expression of a Phytase Gene from the Thermophilic Fungus Thermomyces lanuginosus

PubMed Central

Berka, Randy M.; Rey, Michael W.; Brown, Kimberly M.; Byun, Tony; Klotz, Alan V.

1998-01-01

The phyA gene encoding an extracellular phytase from the thermophilic fungus Thermomyces lanuginosus was cloned and heterologously expressed, and the recombinant gene product was biochemically characterized. The phyA gene encodes a primary translation product (PhyA) of 475 amino acids (aa) which includes a putative signal peptide (23 aa) and propeptide (10 aa). The deduced amino acid sequence of PhyA has limited sequence identity (ca. 47%) with Aspergillus niger phytase. The phyA gene was inserted into an expression vector under transcriptional control of the Fusarium oxysporum trypsin gene promoter and used to transform a Fusarium venenatum recipient strain. The secreted recombinant phytase protein was enzymatically active between pHs 3 and 7.5, with a specific activity of 110 μmol of inorganic phosphate released per min per mg of protein at pH 6 and 37°C. The Thermomyces phytase retained activity at assay temperatures up to 75°C and demonstrated superior catalytic efficiency to any known fungal phytase at 65°C (the temperature optimum). Comparison of this new Thermomyces catalyst with the well-known Aspergillus niger phytase reveals other favorable properties for the enzyme derived from the thermophilic gene donor, including catalytic activity over an expanded pH range. PMID:9797301
HuMiChip: Development of a Functional Gene Array for the Study of Human Microbiomes

DOE Office of Scientific and Technical Information (OSTI.GOV)

Tu, Q.; Deng, Ye; Lin, Lu

Microbiomes play very important roles in terms of nutrition, health and disease by interacting with their hosts. Based on sequence data currently available in public domains, we have developed a functional gene array to monitor both organismal and functional gene profiles of normal microbiota in human and mouse hosts, and such an array is called human and mouse microbiota array, HMM-Chip. First, seed sequences were identified from KEGG databases, and used to construct a seed database (seedDB) containing 136 gene families in 19 metabolic pathways closely related to human and mouse microbiomes. Second, a mother database (motherDB) was constructed withmore » 81 genomes of bacterial strains with 54 from gut and 27 from oral environments, and 16 metagenomes, and used for selection of genes and probe design. Gene prediction was performed by Glimmer3 for bacterial genomes, and by the Metagene program for metagenomes. In total, 228,240 and 801,599 genes were identified for bacterial genomes and metagenomes, respectively. Then the motherDB was searched against the seedDB using the HMMer program, and gene sequences in the motherDB that were highly homologous with seed sequences in the seedDB were used for probe design by the CommOligo software. Different degrees of specific probes, including gene-specific, inclusive and exclusive group-specific probes were selected. All candidate probes were checked against the motherDB and NCBI databases for specificity. Finally, 7,763 probes covering 91.2percent (12,601 out of 13,814) HMMer confirmed sequences from 75 bacterial genomes and 16 metagenomes were selected. This developed HMM-Chip is able to detect the diversity and abundance of functional genes, the gene expression of microbial communities, and potentially, the interactions of microorganisms and their hosts.« less
EHMT2 directs DNA methylation for efficient gene silencing in mouse embryos

PubMed Central

Auclair, Ghislain; Borgel, Julie; Sanz, Lionel A.; Vallet, Judith; Guibert, Sylvain; Dumas, Michael; Cavelier, Patricia; Girardot, Michael; Forné, Thierry; Feil, Robert; Weber, Michael

2016-01-01

The extent to which histone modifying enzymes contribute to DNA methylation in mammals remains unclear. Previous studies suggested a link between the lysine methyltransferase EHMT2 (also known as G9A and KMT1C) and DNA methylation in the mouse. Here, we used a model of knockout mice to explore the role of EHMT2 in DNA methylation during mouse embryogenesis. The Ehmt2 gene is expressed in epiblast cells but is dispensable for global DNA methylation in embryogenesis. In contrast, EHMT2 regulates DNA methylation at specific sequences that include CpG-rich promoters of germline-specific genes. These loci are bound by EHMT2 in embryonic cells, are marked by H3K9 dimethylation, and have strongly reduced DNA methylation in Ehmt2−/− embryos. EHMT2 also plays a role in the maintenance of germline-derived DNA methylation at one imprinted locus, the Slc38a4 gene. Finally, we show that DNA methylation is instrumental for EHMT2-mediated gene silencing in embryogenesis. Our findings identify EHMT2 as a critical factor that facilitates repressive DNA methylation at specific genomic loci during mammalian development. PMID:26576615
Comparative genomics approach to detecting split-coding regions in a low-coverage genome: lessons from the chimaera Callorhinchus milii (Holocephali, Chondrichthyes).

PubMed

Dessimoz, Christophe; Zoller, Stefan; Manousaki, Tereza; Qiu, Huan; Meyer, Axel; Kuraku, Shigehiro

2011-09-01

Recent development of deep sequencing technologies has facilitated de novo genome sequencing projects, now conducted even by individual laboratories. However, this will yield more and more genome sequences that are not well assembled, and will hinder thorough annotation when no closely related reference genome is available. One of the challenging issues is the identification of protein-coding sequences split into multiple unassembled genomic segments, which can confound orthology assignment and various laboratory experiments requiring the identification of individual genes. In this study, using the genome of a cartilaginous fish, Callorhinchus milii, as test case, we performed gene prediction using a model specifically trained for this genome. We implemented an algorithm, designated ESPRIT, to identify possible linkages between multiple protein-coding portions derived from a single genomic locus split into multiple unassembled genomic segments. We developed a validation framework based on an artificially fragmented human genome, improvements between early and recent mouse genome assemblies, comparison with experimentally validated sequences from GenBank, and phylogenetic analyses. Our strategy provided insights into practical solutions for efficient annotation of only partially sequenced (low-coverage) genomes. To our knowledge, our study is the first formulation of a method to link unassembled genomic segments based on proteomes of relatively distantly related species as references.

Comparative genomics approach to detecting split-coding regions in a low-coverage genome: lessons from the chimaera Callorhinchus milii (Holocephali, Chondrichthyes)

PubMed Central

Zoller, Stefan; Manousaki, Tereza; Qiu, Huan; Meyer, Axel; Kuraku, Shigehiro

2011-01-01

Recent development of deep sequencing technologies has facilitated de novo genome sequencing projects, now conducted even by individual laboratories. However, this will yield more and more genome sequences that are not well assembled, and will hinder thorough annotation when no closely related reference genome is available. One of the challenging issues is the identification of protein-coding sequences split into multiple unassembled genomic segments, which can confound orthology assignment and various laboratory experiments requiring the identification of individual genes. In this study, using the genome of a cartilaginous fish, Callorhinchus milii, as test case, we performed gene prediction using a model specifically trained for this genome. We implemented an algorithm, designated ESPRIT, to identify possible linkages between multiple protein-coding portions derived from a single genomic locus split into multiple unassembled genomic segments. We developed a validation framework based on an artificially fragmented human genome, improvements between early and recent mouse genome assemblies, comparison with experimentally validated sequences from GenBank, and phylogenetic analyses. Our strategy provided insights into practical solutions for efficient annotation of only partially sequenced (low-coverage) genomes. To our knowledge, our study is the first formulation of a method to link unassembled genomic segments based on proteomes of relatively distantly related species as references. PMID:21712341
RNA interference (RNAI) as a tool to engineer high nutritional value in chicory (Chicorium intybus).

PubMed

Asad, M

2006-01-01

The major component of chicory (Chicorium intybus) root is inulin, which is a polymer of fructose. Inulin production from chicory is hampered by the enzyme fructan 1-exohydrolase (1-FEH) that degrades inulin and limits its yield. Increased FEH activity results in massive breakdown of fructan and production of Fructose and inulo-n-oses. The latter phenomena are to be avoided for industrial fructan production. RNA silencing, which is termed post-transcriptional gene silencing (PTGS) in plants, is an RNA degradation process through sequence specific nucleotide interactions induced by double-stranded RNA. For genetic improvement of crop plants, RNAi has advantages over antisense-mediated gene silencing and co-suppression, in terms of its efficiency and stability. We are generating a transgenic chicory plants with suppressed FEH (exohydrolas) genes using RNAi resulting in supressed inulin degradation. A small but important part of the construct is a sequence unique for the target gene (exons) or genes,which were cloned. The hairpin constructs were made and chicory was transformed by Agrobacterium tumifaciense, strain (C58C1). The transgenics should be select and check by means of molecular techniques.
Differential DNA methylation and transcription profiles in date palm roots exposed to salinity

PubMed Central

Al-Harrasi, Ibtisam; Al-Yahyai, Rashid

2018-01-01

As a salt-adaptive plant, the date palm (Phoenix dactylifera L.) requires a suitable mechanism to adapt to the stress of saline soils. There is growing evidence that DNA methylation plays an important role in regulating gene expression in response to abiotic stresses, including salinity. Thus, the present study sought to examine the differential methylation status that occurs in the date palm genome when plants are exposed to salinity, and to identify salinity responsive genes that are regulated by DNA methylation. To achieve these, whole-genome bisulfite sequencing (WGBS) was employed and mRNA was sequenced from salinity-treated and untreated roots. The WGBS analysis included 324,987,795 and 317,056,091 total reads of the control and the salinity-treated samples, respectively. The analysis covered about 81% of the total genomic DNA with about 40% of mapping efficiency of the sequenced reads and an average read depth of 17-fold coverage per DNA strand, and with a bisulfite conversion rate of around 99%. The level of methylation within the differentially methylated regions (DMRs) was significantly (p < 0.05, FDR ≤ 0.05) increased in response to salinity specifically at the mCHG and mCHH sequence contexts. Consistently, the mass spectrometry and the enzyme-linked immunosorbent assay (ELISA) showed that there was a significant (p < 0.05) increase in the global DNA methylation in response to salinity. mRNA sequencing revealed the presence of 6,405 differentially regulated genes with a significant value (p < 0.001, FDR ≤ 0.05) in response to salinity. Integration of high-resolution methylome and transcriptome analyses revealed a negative correlation between mCG methylation located within the promoters and the gene expression, while a positive correlation was noticed between mCHG/mCHH methylation rations and gene expression specifically when plants grew under control conditions. Therefore, the methylome and transcriptome relationships vary based on the methylated sequence context, the methylated region within the gene, the protein-coding ability of the gene, and the salinity treatment. These results provide insights into interplay among DNA methylation and gene expression, and highlight the effect of salinity on the nature of this relationship, which may involve other genetic and epigenetic players under salt stress conditions. The results obtained from this project provide the first draft map of the differential methylome and transcriptome of date palm when exposed to an abiotic stress. PMID:29352281
Differential DNA methylation and transcription profiles in date palm roots exposed to salinity.

PubMed

Al-Harrasi, Ibtisam; Al-Yahyai, Rashid; Yaish, Mahmoud W

2018-01-01

As a salt-adaptive plant, the date palm (Phoenix dactylifera L.) requires a suitable mechanism to adapt to the stress of saline soils. There is growing evidence that DNA methylation plays an important role in regulating gene expression in response to abiotic stresses, including salinity. Thus, the present study sought to examine the differential methylation status that occurs in the date palm genome when plants are exposed to salinity, and to identify salinity responsive genes that are regulated by DNA methylation. To achieve these, whole-genome bisulfite sequencing (WGBS) was employed and mRNA was sequenced from salinity-treated and untreated roots. The WGBS analysis included 324,987,795 and 317,056,091 total reads of the control and the salinity-treated samples, respectively. The analysis covered about 81% of the total genomic DNA with about 40% of mapping efficiency of the sequenced reads and an average read depth of 17-fold coverage per DNA strand, and with a bisulfite conversion rate of around 99%. The level of methylation within the differentially methylated regions (DMRs) was significantly (p < 0.05, FDR ≤ 0.05) increased in response to salinity specifically at the mCHG and mCHH sequence contexts. Consistently, the mass spectrometry and the enzyme-linked immunosorbent assay (ELISA) showed that there was a significant (p < 0.05) increase in the global DNA methylation in response to salinity. mRNA sequencing revealed the presence of 6,405 differentially regulated genes with a significant value (p < 0.001, FDR ≤ 0.05) in response to salinity. Integration of high-resolution methylome and transcriptome analyses revealed a negative correlation between mCG methylation located within the promoters and the gene expression, while a positive correlation was noticed between mCHG/mCHH methylation rations and gene expression specifically when plants grew under control conditions. Therefore, the methylome and transcriptome relationships vary based on the methylated sequence context, the methylated region within the gene, the protein-coding ability of the gene, and the salinity treatment. These results provide insights into interplay among DNA methylation and gene expression, and highlight the effect of salinity on the nature of this relationship, which may involve other genetic and epigenetic players under salt stress conditions. The results obtained from this project provide the first draft map of the differential methylome and transcriptome of date palm when exposed to an abiotic stress.
Identification, characterization and expression analysis of lineage-specific genes within sweet orange (Citrus sinensis).

PubMed

Xu, Yuantao; Wu, Guizhi; Hao, Baohai; Chen, Lingling; Deng, Xiuxin; Xu, Qiang

2015-11-23

With the availability of rapidly increasing number of genome and transcriptome sequences, lineage-specific genes (LSGs) can be identified and characterized. Like other conserved functional genes, LSGs play important roles in biological evolution and functions. Two set of citrus LSGs, 296 citrus-specific genes (CSGs) and 1039 orphan genes specific to sweet orange, were identified by comparative analysis between the sweet orange genome sequences and 41 genomes and 273 transcriptomes. With the two sets of genes, gene structure and gene expression pattern were investigated. On average, both the CSGs and orphan genes have fewer exons, shorter gene length and higher GC content when compared with those evolutionarily conserved genes (ECs). Expression profiling indicated that most of the LSGs expressed in various tissues of sweet orange and some of them exhibited distinct temporal and spatial expression patterns. Particularly, the orphan genes were preferentially expressed in callus, which is an important pluripotent tissue of citrus. Besides, part of the CSGs and orphan genes expressed responsive to abiotic stress, indicating their potential functions during interaction with environment. This study identified and characterized two sets of LSGs in citrus, dissected their sequence features and expression patterns, and provided valuable clues for future functional analysis of the LSGs in sweet orange.
Selection of a DNA barcode for Nectriaceae from fungal whole-genomes.

PubMed

Zeng, Zhaoqing; Zhao, Peng; Luo, Jing; Zhuang, Wenying; Yu, Zhihe

2012-01-01

A DNA barcode is a short segment of sequence that is able to distinguish species. A barcode must ideally contain enough variation to distinguish every individual species and be easily obtained. Fungi of Nectriaceae are economically important and show high species diversity. To establish a standard DNA barcode for this group of fungi, the genomes of Neurospora crassa and 30 other filamentous fungi were compared. The expect value was treated as a criterion to recognize homologous sequences. Four candidate markers, Hsp90, AAC, CDC48, and EF3, were tested for their feasibility as barcodes in the identification of 34 well-established species belonging to 13 genera of Nectriaceae. Two hundred and fifteen sequences were analyzed. Intra- and inter-specific variations and the success rate of PCR amplification and sequencing were considered as important criteria for estimation of the candidate markers. Ultimately, the partial EF3 gene met the requirements for a good DNA barcode: No overlap was found between the intra- and inter-specific pairwise distances. The smallest inter-specific distance of EF3 gene was 3.19%, while the largest intra-specific distance was 1.79%. In addition, there was a high success rate in PCR and sequencing for this gene (96.3%). CDC48 showed sufficiently high sequence variation among species, but the PCR and sequencing success rate was 84% using a single pair of primers. Although the Hsp90 and AAC genes had higher PCR and sequencing success rates (96.3% and 97.5%, respectively), overlapping occurred between the intra- and inter-specific variations, which could lead to misidentification. Therefore, we propose the EF3 gene as a possible DNA barcode for the nectriaceous fungi.
RGAugury: a pipeline for genome-wide prediction of resistance gene analogs (RGAs) in plants.

PubMed

Li, Pingchuan; Quan, Xiande; Jia, Gaofeng; Xiao, Jin; Cloutier, Sylvie; You, Frank M

2016-11-02

Resistance gene analogs (RGAs), such as NBS-encoding proteins, receptor-like protein kinases (RLKs) and receptor-like proteins (RLPs), are potential R-genes that contain specific conserved domains and motifs. Thus, RGAs can be predicted based on their conserved structural features using bioinformatics tools. Computer programs have been developed for the identification of individual domains and motifs from the protein sequences of RGAs but none offer a systematic assessment of the different types of RGAs. A user-friendly and efficient pipeline is needed for large-scale genome-wide RGA predictions of the growing number of sequenced plant genomes. An integrative pipeline, named RGAugury, was developed to automate RGA prediction. The pipeline first identifies RGA-related protein domains and motifs, namely nucleotide binding site (NB-ARC), leucine rich repeat (LRR), transmembrane (TM), serine/threonine and tyrosine kinase (STTK), lysin motif (LysM), coiled-coil (CC) and Toll/Interleukin-1 receptor (TIR). RGA candidates are identified and classified into four major families based on the presence of combinations of these RGA domains and motifs: NBS-encoding, TM-CC, and membrane associated RLP and RLK. All time-consuming analyses of the pipeline are paralleled to improve performance. The pipeline was evaluated using the well-annotated Arabidopsis genome. A total of 98.5, 85.2, and 100 % of the reported NBS-encoding genes, membrane associated RLPs and RLKs were validated, respectively. The pipeline was also successfully applied to predict RGAs for 50 sequenced plant genomes. A user-friendly web interface was implemented to ease command line operations, facilitate visualization and simplify result management for multiple datasets. RGAugury is an efficiently integrative bioinformatics tool for large scale genome-wide identification of RGAs. It is freely available at Bitbucket: https://bitbucket.org/yaanlpc/rgaugury .
Global mapping of DNA conformational flexibility on Saccharomyces cerevisiae.

PubMed

Menconi, Giulia; Bedini, Andrea; Barale, Roberto; Sbrana, Isabella

2015-04-01

In this study we provide the first comprehensive map of DNA conformational flexibility in Saccharomyces cerevisiae complete genome. Flexibility plays a key role in DNA supercoiling and DNA/protein binding, regulating DNA transcription, replication or repair. Specific interest in flexibility analysis concerns its relationship with human genome instability. Enrichment in flexible sequences has been detected in unstable regions of human genome defined fragile sites, where genes map and carry frequent deletions and rearrangements in cancer. Flexible sequences have been suggested to be the determinants of fragile gene proneness to breakage; however, their actual role and properties remain elusive. Our in silico analysis carried out genome-wide via the StabFlex algorithm, shows the conserved presence of highly flexible regions in budding yeast genome as well as in genomes of other Saccharomyces sensu stricto species. Flexibile peaks in S. cerevisiae identify 175 ORFs mapping on their 3'UTR, a region affecting mRNA translation, localization and stability. (TA)n repeats of different extension shape the central structure of peaks and co-localize with polyadenylation efficiency element (EE) signals. ORFs with flexible peaks share common features. Transcripts are characterized by decreased half-life: this is considered peculiar of genes involved in regulatory systems with high turnover; consistently, their function affects biological processes such as cell cycle regulation or stress response. Our findings support the functional importance of flexibility peaks, suggesting that the flexible sequence may be derived by an expansion of canonical TAYRTA polyadenylation efficiency element. The flexible (TA)n repeat amplification could be the outcome of an evolutionary neofunctionalization leading to a differential 3'-end processing and expression regulation in genes with peculiar function. Our study provides a new support to the functional role of flexibility in genomes and a strategy for its characterization inside human fragile sites.
Global Mapping of DNA Conformational Flexibility on Saccharomyces cerevisiae

PubMed Central

Menconi, Giulia; Bedini, Andrea; Barale, Roberto; Sbrana, Isabella

2015-01-01

In this study we provide the first comprehensive map of DNA conformational flexibility in Saccharomyces cerevisiae complete genome. Flexibility plays a key role in DNA supercoiling and DNA/protein binding, regulating DNA transcription, replication or repair. Specific interest in flexibility analysis concerns its relationship with human genome instability. Enrichment in flexible sequences has been detected in unstable regions of human genome defined fragile sites, where genes map and carry frequent deletions and rearrangements in cancer. Flexible sequences have been suggested to be the determinants of fragile gene proneness to breakage; however, their actual role and properties remain elusive. Our in silico analysis carried out genome-wide via the StabFlex algorithm, shows the conserved presence of highly flexible regions in budding yeast genome as well as in genomes of other Saccharomyces sensu stricto species. Flexibile peaks in S. cerevisiae identify 175 ORFs mapping on their 3’UTR, a region affecting mRNA translation, localization and stability. (TA)n repeats of different extension shape the central structure of peaks and co-localize with polyadenylation efficiency element (EE) signals. ORFs with flexible peaks share common features. Transcripts are characterized by decreased half-life: this is considered peculiar of genes involved in regulatory systems with high turnover; consistently, their function affects biological processes such as cell cycle regulation or stress response. Our findings support the functional importance of flexibility peaks, suggesting that the flexible sequence may be derived by an expansion of canonical TAYRTA polyadenylation efficiency element. The flexible (TA)n repeat amplification could be the outcome of an evolutionary neofunctionalization leading to a differential 3’-end processing and expression regulation in genes with peculiar function. Our study provides a new support to the functional role of flexibility in genomes and a strategy for its characterization inside human fragile sites. PMID:25860149
Exome sequencing and arrayCGH detection of gene sequence and copy number variation between ILS and ISS mouse strains.

PubMed

Dumas, Laura; Dickens, C Michael; Anderson, Nathan; Davis, Jonathan; Bennett, Beth; Radcliffe, Richard A; Sikela, James M

2014-06-01

It has been well documented that genetic factors can influence predisposition to develop alcoholism. While the underlying genomic changes may be of several types, two of the most common and disease associated are copy number variations (CNVs) and sequence alterations of protein coding regions. The goal of this study was to identify CNVs and single-nucleotide polymorphisms that occur in gene coding regions that may play a role in influencing the risk of an individual developing alcoholism. Toward this end, two mouse strains were used that have been selectively bred based on their differential sensitivity to alcohol: the Inbred long sleep (ILS) and Inbred short sleep (ISS) mouse strains. Differences in initial response to alcohol have been linked to risk for alcoholism, and the ILS/ISS strains are used to investigate the genetics of initial sensitivity to alcohol. Array comparative genomic hybridization (arrayCGH) and exome sequencing were conducted to identify CNVs and gene coding sequence differences, respectively, between ILS and ISS mice. Mouse arrayCGH was performed using catalog Agilent 1 × 244 k mouse arrays. Subsequently, exome sequencing was carried out using an Illumina HiSeq 2000 instrument. ArrayCGH detected 74 CNVs that were strain-specific (38 ILS/36 ISS), including several ISS-specific deletions that contained genes implicated in brain function and neurotransmitter release. Among several interesting coding variations detected by exome sequencing was the gain of a premature stop codon in the alpha-amylase 2B (AMY2B) gene specifically in the ILS strain. In total, exome sequencing detected 2,597 and 1,768 strain-specific exonic gene variants in the ILS and ISS mice, respectively. This study represents the most comprehensive and detailed genomic comparison of ILS and ISS mouse strains to date. The two complementary genome-wide approaches identified strain-specific CNVs and gene coding sequence variations that should provide strong candidates to contribute to the alcohol-related phenotypic differences associated with these strains.
Cloning, expression, and sequence analysis of the Bacillus methanolicus C1 methanol dehydrogenase gene.

PubMed Central

de Vries, G E; Arfman, N; Terpstra, P; Dijkhuizen, L

1992-01-01

The gene (mdh) coding for methanol dehydrogenase (MDH) of thermotolerant, methylotroph Bacillus methanolicus C1 has been cloned and sequenced. The deduced amino acid sequence of the mdh gene exhibited similarity to those of five other alcohol dehydrogenase (type III) enzymes, which are distinct from the long-chain zinc-containing (type I) or short-chain zinc-lacking (type II) enzymes. Highly efficient expression of the mdh gene in Escherichia coli was probably driven from its own promoter sequence. After purification of MDH from E. coli, the kinetic and biochemical properties of the enzyme were investigated. The physiological effect of MDH synthesis in E. coli and the role of conserved sequence patterns in type III alcohol dehydrogenases have been analyzed and are discussed. Images PMID:1644761
High-resolution melting (HRM) assay for the detection of recurrent BRCA1/BRCA2 germline mutations in Tunisian breast/ovarian cancer families.

PubMed

Riahi, Aouatef; Kharrat, Maher; Lariani, Imen; Chaabouni-Bouhamed, Habiba

2014-12-01

Germline deleterious mutations in the BRCA1/BRCA2 genes are associated with an increased risk for the development of breast and ovarian cancer. Given the large size of these genes the detection of such mutations represents a considerable technical challenge. Therefore, the development of cost-effective and rapid methods to identify these mutations became a necessity. High resolution melting analysis (HRM) is a rapid and efficient technique extensively employed as high-throughput mutation scanning method. The purpose of our study was to assess the specificity and sensitivity of HRM for BRCA1 and BRCA2 genes scanning. As a first step we estimate the ability of HRM for detection mutations in a set of 21 heterozygous samples harboring 8 different known BRCA1/BRCA2 variations, all samples had been preliminarily investigated by direct sequencing, and then we performed a blinded analysis by HRM in a set of 68 further sporadic samples of unknown genotype. All tested heterozygous BRCA1/BRCA2 variants were easily identified. However the HRM assay revealed further alteration that we initially had not searched (one unclassified variant). Furthermore, sequencing confirmed all the HRM detected mutations in the set of unknown samples, including homozygous changes, indicating that in this cohort, with the optimized assays, the mutations detections sensitivity and specificity were 100 %. HRM is a simple, rapid and efficient scanning method for known and unknown BRCA1/BRCA2 germline mutations. Consequently the method will allow for the economical screening of recurrent mutations in Tunisian population.
Assessment of tropism and effectiveness of new primate-derived hybrid recombinant AAV serotypes in the mouse and primate retina.

PubMed

Charbel Issa, Peter; De Silva, Samantha R; Lipinski, Daniel M; Singh, Mandeep S; Mouravlev, Alexandre; You, Qisheng; Barnard, Alun R; Hankins, Mark W; During, Matthew J; Maclaren, Robert E

2013-01-01

Adeno-associated viral vectors (AAV) have been shown to be safe in the treatment of retinal degenerations in clinical trials. Thus, improving the efficiency of viral gene delivery has become increasingly important to increase the success of clinical trials. In this study, structural domains of different rAAV serotypes isolated from primate brain were combined to create novel hybrid recombinant AAV serotypes, rAAV2/rec2 and rAAV2/rec3. The efficacy of these novel serotypes were assessed in wild type mice and in two models of retinal degeneration (the Abca4(-/-) mouse which is a model for Stargardt disease and in the Pde6b(rd1/rd1) mouse) in vivo, in primate tissue ex-vivo, and in the human-derived SH-SY5Y cell line, using an identical AAV2 expression cassette. We show that these novel hybrid serotypes can transduce retinal tissue in mice and primates efficiently, although no more than AAV2/2 and rAAV2/5 serotypes. Transduction efficiency appeared lower in the Abca4(-/-) mouse compared to wild type with all vectors tested, suggesting an effect of specific retinal diseases on the efficiency of gene delivery. Shuffling of AAV capsid domains may have clinical applications for patients who develop T-cell immune responses following AAV gene therapy, as specific peptide antigen sequences could be substituted using this technique prior to vector re-treatments.
Quantitative statistical analysis of cis-regulatory sequences in ABA/VP1- and CBF/DREB1-regulated genes of Arabidopsis.

PubMed

Suzuki, Masaharu; Ketterling, Matthew G; McCarty, Donald R

2005-09-01

We have developed a simple quantitative computational approach for objective analysis of cis-regulatory sequences in promoters of coregulated genes. The program, designated MotifFinder, identifies oligo sequences that are overrepresented in promoters of coregulated genes. We used this approach to analyze promoter sequences of Viviparous1 (VP1)/abscisic acid (ABA)-regulated genes and cold-regulated genes, respectively, of Arabidopsis (Arabidopsis thaliana). We detected significantly enriched sequences in up-regulated genes but not in down-regulated genes. This result suggests that gene activation but not repression is mediated by specific and common sequence elements in promoters. The enriched motifs include several known cis-regulatory sequences as well as previously unidentified motifs. With respect to known cis-elements, we dissected the flanking nucleotides of the core sequences of Sph element, ABA response elements (ABREs), and the C repeat/dehydration-responsive element. This analysis identified the motif variants that may correlate with qualitative and quantitative differences in gene expression. While both VP1 and cold responses are mediated in part by ABA signaling via ABREs, these responses correlate with unique ABRE variants distinguished by nucleotides flanking the ACGT core. ABRE and Sph motifs are tightly associated uniquely in the coregulated set of genes showing a strict dependence on VP1 and ABA signaling. Finally, analysis of distribution of the enriched sequences revealed a striking concentration of enriched motifs in a proximal 200-base region of VP1/ABA and cold-regulated promoters. Overall, each class of coregulated genes possesses a discrete set of the enriched motifs with unique distributions in their promoters that may account for the specificity of gene regulation.
[The progress and prospect of application of genetic testing technology-based gene detection technology in the diagnosis and treatment of hereditary cancer].

PubMed

He, J X; Jiang, Y F

2017-08-06

Hereditary cancer is caused by specific pathogenic gene mutations. Early detection and early intervention are the most effective ways to prevent and control hereditary cancer. High-throughput sequencing based genetic testing technology (NGS) breaks through the restrictions of pedigree analysis, provide a convenient and efficient method to detect and diagnose hereditary cancer. Here, we introduce the mechanism of hereditary cancer, summarize, discuss and prospect the application of NGS and other genetic tests in the diagnosis of hereditary retinoblastoma, hereditary breast and ovarian cancer syndrome, hereditary colorectal cancer and other complex and rare hereditary tumors.
Identification of an ICP27-responsive element in the coding region of a herpes simplex virus type 1 late gene.

PubMed

Sedlackova, Lenka; Perkins, Keith D; Meyer, Julia; Strain, Anna K; Goldman, Oksana; Rice, Stephen A

2010-03-01

During productive herpes simplex virus type 1 (HSV-1) infection, a subset of viral delayed-early (DE) and late (L) genes require the immediate-early (IE) protein ICP27 for their expression. However, the cis-acting regulatory sequences in DE and L genes that mediate their specific induction by ICP27 are unknown. One viral L gene that is highly dependent on ICP27 is that encoding glycoprotein C (gC). We previously demonstrated that this gene is posttranscriptionally transactivated by ICP27 in a plasmid cotransfection assay. Based on our past results, we hypothesized that the gC gene possesses a cis-acting inhibitory sequence and that ICP27 overcomes the effects of this sequence to enable efficient gC expression. To test this model, we systematically deleted sequences from the body of the gC gene and tested the resulting constructs for expression. In so doing, we identified a 258-bp "silencing element" (SE) in the 5' portion of the gC coding region. When present, the SE inhibits gC mRNA accumulation from a transiently transfected gC gene, unless ICP27 is present. Moreover, the SE can be transferred to another HSV-1 gene, where it inhibits mRNA accumulation in the absence of ICP27 and confers high-level expression in the presence of ICP27. Thus, for the first time, an ICP27-responsive sequence has been identified in a physiologically relevant ICP27 target gene. To see if the SE functions during viral infection, we engineered HSV-1 recombinants that lack the SE, either in a wild-type (WT) or ICP27-null genetic background. In an ICP27-null background, deletion of the SE led to ICP27-independent expression of the gC gene, demonstrating that the SE functions during viral infection. Surprisingly, the ICP27-independent gC expression seen with the mutant occurred even in the absence of viral DNA synthesis, indicating that the SE helps to regulate the tight DNA replication-dependent expression of gC.
Occurrence of diverse alkane hydroxylase alkB genes in indigenous oil-degrading bacteria of Baltic Sea surface water.

PubMed

Viggor, Signe; Jõesaar, Merike; Vedler, Eve; Kiiker, Riinu; Pärnpuu, Liis; Heinaru, Ain

2015-12-30

Formation of specific oil degrading bacterial communities in diesel fuel, crude oil, heptane and hexadecane supplemented microcosms of the Baltic Sea surface water samples was revealed. The 475 sequences from constructed alkane hydroxylase alkB gene clone libraries were grouped into 30 OPFs. The two largest groups were most similar to Pedobacter sp. (245 from 475) and Limnobacter sp. (112 from 475) alkB gene sequences. From 56 alkane-degrading bacterial strains 41 belonged to the Pseudomonas spp. and 8 to the Rhodococcus spp. having redundant alkB genes. Together 68 alkB gene sequences were identified. These genes grouped into 20 OPFs, half of them being specific only to the isolated strains. Altogether 543 diverse alkB genes were characterized in the brackish Baltic Sea water; some of them representing novel lineages having very low sequence identities with corresponding genes of the reference strains. Copyright © 2015 Elsevier Ltd. All rights reserved.
Developmental expression of a regulatory gene is programmed at the level of splicing.

PubMed Central

Chou, T B; Zachar, Z; Bingham, P M

1987-01-01

We report sequence and transcript structures for a 6191-base chromosomal segment containing the presumptive regulatory gene from Drosophila, suppressor-of-white-apricot [su(wa)]. Our results indicate that su(wa) expression is controlled by regulating occurrence of specific splices. Seven introns are removed from the su(wa) primary transcript during precellular blastoderm development. The sequence of this mature RNA indicates that it is a conventional messenger RNA. In contrast, after cellular blastoderm the first two of these introns cease to be efficiently removed. The mature RNAs resulting from this failure to remove the first two introns have structures quite unexpected of mRNAs. We propose that postcellular blastoderm su(wa) expression is repressed by preventing splices necessary to produce a functional mRNA. Implications and mechanisms are discussed. Images Fig. 2. Fig. 3. Fig. 4. Fig. 5. PMID:2832151
Genome-scale engineering of Saccharomyces cerevisiae with single-nucleotide precision.

PubMed

Bao, Zehua; HamediRad, Mohammad; Xue, Pu; Xiao, Han; Tasan, Ipek; Chao, Ran; Liang, Jing; Zhao, Huimin

2018-07-01

We developed a CRISPR-Cas9- and homology-directed-repair-assisted genome-scale engineering method named CHAnGE that can rapidly output tens of thousands of specific genetic variants in yeast. More than 98% of target sequences were efficiently edited with an average frequency of 82%. We validate the single-nucleotide resolution genome-editing capability of this technology by creating a genome-wide gene disruption collection and apply our method to improve tolerance to growth inhibitors.
SMARTIV: combined sequence and structure de-novo motif discovery for in-vivo RNA binding data.

PubMed

Polishchuk, Maya; Paz, Inbal; Yakhini, Zohar; Mandel-Gutfreund, Yael

2018-05-25

Gene expression regulation is highly dependent on binding of RNA-binding proteins (RBPs) to their RNA targets. Growing evidence supports the notion that both RNA primary sequence and its local secondary structure play a role in specific Protein-RNA recognition and binding. Despite the great advance in high-throughput experimental methods for identifying sequence targets of RBPs, predicting the specific sequence and structure binding preferences of RBPs remains a major challenge. We present a novel webserver, SMARTIV, designed for discovering and visualizing combined RNA sequence and structure motifs from high-throughput RNA-binding data, generated from in-vivo experiments. The uniqueness of SMARTIV is that it predicts motifs from enriched k-mers that combine information from ranked RNA sequences and their predicted secondary structure, obtained using various folding methods. Consequently, SMARTIV generates Position Weight Matrices (PWMs) in a combined sequence and structure alphabet with assigned P-values. SMARTIV concisely represents the sequence and structure motif content as a single graphical logo, which is informative and easy for visual perception. SMARTIV was examined extensively on a variety of high-throughput binding experiments for RBPs from different families, generated from different technologies, showing consistent and accurate results. Finally, SMARTIV is a user-friendly webserver, highly efficient in run-time and freely accessible via http://smartiv.technion.ac.il/.

Evidence for ribosomal frameshifting and a novel overlapping gene in the genomes of insect-specific flaviviruses

DOE Office of Scientific and Technical Information (OSTI.GOV)

Firth, Andrew E., E-mail: a.firth@ucc.i; Blitvich, Bradley J., E-mail: blitvich@iastate.ed; Wills, Norma M., E-mail: nwills@genetics.utah.ed

2010-03-30

Flaviviruses have a positive-sense, single-stranded RNA genome of approx11 kb, encoding a large polyprotein that is cleaved to produce approx10 mature proteins. Cell fusing agent virus, Kamiti River virus, Culex flavivirus and several recently discovered flaviviruses have no known vertebrate host and apparently infect only insects. We present compelling bioinformatic evidence for a 253-295 codon overlapping gene (designated fifo) conserved throughout these insect-specific flaviviruses and immunofluorescent detection of its product. Fifo overlaps the NS2A/NS2B coding sequence in the - 1/+ 2 reading frame and is most likely expressed as a trans-frame fusion protein via ribosomal frameshifting at a conserved GGAUUUYmore » slippery heptanucleotide with 3'-adjacent RNA secondary structure (which stimulates efficient frameshifting in vitro). The discovery bears striking parallels to the recently discovered ribosomal frameshifting site in the NS2A coding sequence of the Japanese encephalitis serogroup of flaviviruses and suggests that programmed ribosomal frameshifting may be more widespread in flaviviruses than currently realized.« less
Evaluation and rational design of guide RNAs for efficient CRISPR/Cas9-mediated mutagenesis in Ciona

PubMed Central

Gandhi, Shashank; Haeussler, Maximilian; Razy-Krajka, Florian; Christiaen, Lionel; Stolfi, Alberto

2017-01-01

The CRISPR/Cas9 system has emerged as an important tool for various genome engineering applications. A current obstacle to high throughput applications of CRISPR/Cas9 is the imprecise prediction of highly active single guide RNAs (sgRNAs). We previously implemented the CRISPR/Cas9 system to induce tissue-specific mutations in the tunicate Ciona. In the present study, we designed and tested 83 single guide RNA (sgRNA) vectors targeting 23 genes expressed in the cardiopharyngeal progenitors and surrounding tissues of Ciona embryo. Using high-throughput sequencing of mutagenized alleles, we identified guide sequences that correlate with sgRNA mutagenesis activity and used this information for the rational design of all possible sgRNAs targeting the Ciona transcriptome. We also describe a one-step cloning-free protocol for the assembly of sgRNA expression cassettes. These cassettes can be directly electroporated as unpurified PCR products into Ciona embryos for sgRNA expression in vivo, resulting in high frequency of CRISPR/Cas9-mediated mutagenesis in somatic cells of electroporated embryos. We found a strong correlation between the frequency of an Ebf loss-of-function phenotype and the mutagenesis efficacies of individual Ebf-targeting sgRNAs tested using this method. We anticipate that our approach can be scaled up to systematically design and deliver highly efficient sgRNAs for the tissue-specific investigation of gene functions in Ciona. PMID:28341547
Ubiquitous and gene-specific regulatory 5' sequences in a sea urchin histone DNA clone coding for histone protein variants.

PubMed Central

Busslinger, M; Portmann, R; Irminger, J C; Birnstiel, M L

1980-01-01

The DNA sequences of the entire structural H4, H3, H2A and H2B genes and of their 5' flanking regions have been determined in the histone DNA clone h19 of the sea urchin Psammechinus miliaris. In clone h19 the polarity of transcription and the relative arrangement of the histone genes is identical to that in clone h22 of the same species. The histone proteins encoded by h19 DNA differ in their primary structure from those encoded by clone h22 and have been compared to histone protein sequences of other sea urchin species as well as other eukaryotes. A comparative analysis of the 5' flanking DNA sequences of the structural histone genes in both clones revealed four ubiquitous sequence motifs; a pentameric element GATCC, followed at short distance by the Hogness box GTATAAATAG, a conserved sequence PyCATTCPu, in or near which the 5' ends of the mRNAs map in h22 DNA and lastly a sequence A, containing the initiation codon. These sequences are also found, sometimes in modified version, in front of other eukaryotic genes transcribed by polymerase II. When prelude sequences of isocoding histone genes in clone h19 and h22 are compared areas of homology are seen to extend beyond the ubiquitous sequence motifs towards the divergent AT-rich spacer and terminate between approximately 140 and 240 nucleotides away from the structural gene. These prelude regions contain quite large conservative sequence blocks which are specific for each type of histone genes. Images PMID:7443547
An att site-based recombination reporter system for genome engineering and synthetic DNA assembly.

PubMed

Bland, Michael J; Ducos-Galand, Magaly; Val, Marie-Eve; Mazel, Didier

2017-07-14

Direct manipulation of the genome is a widespread technique for genetic studies and synthetic biology applications. The tyrosine and serine site-specific recombination systems of bacteriophages HK022 and ΦC31 are widely used for stable directional exchange and relocation of DNA sequences, making them valuable tools in these contexts. We have developed site-specific recombination tools that allow the direct selection of recombination events by embedding the attB site from each system within the β-lactamase resistance coding sequence (bla). The HK and ΦC31 tools were developed by placing the attB sites from each system into the signal peptide cleavage site coding sequence of bla. All possible open reading frames (ORFs) were inserted and tested for recombination efficiency and bla activity. Efficient recombination was observed for all tested ORFs (3 for HK, 6 for ΦC31) as shown through a cointegrate formation assay. The bla gene with the embedded attB site was functional for eight of the nine constructs tested. The HK/ΦC31 att-bla system offers a simple way to directly select recombination events, thus enhancing the use of site-specific recombination systems for carrying out precise, large-scale DNA manipulation, and adding useful tools to the genetics toolbox. We further show the power and flexibility of bla to be used as a reporter for recombination.
Progressive engineering of a homing endonuclease genome editing reagent for the murine X-linked immunodeficiency locus

PubMed Central

Wang, Yupeng; Khan, Iram F.; Boissel, Sandrine; Jarjour, Jordan; Pangallo, Joseph; Thyme, Summer; Baker, David; Scharenberg, Andrew M.; Rawlings, David J.

2014-01-01

LAGLIDADG homing endonucleases (LHEs) are compact endonucleases with 20–22 bp recognition sites, and thus are ideal scaffolds for engineering site-specific DNA cleavage enzymes for genome editing applications. Here, we describe a general approach to LHE engineering that combines rational design with directed evolution, using a yeast surface display high-throughput cleavage selection. This approach was employed to alter the binding and cleavage specificity of the I-Anil LHE to recognize a mutation in the mouse Bruton tyrosine kinase (Btk) gene causative for mouse X-linked immunodeficiency (XID)—a model of human X-linked agammaglobulinemia (XLA). The required re-targeting of I-AniI involved progressive resculpting of the DNA contact interface to accommodate nine base differences from the native cleavage sequence. The enzyme emerging from the progressive engineering process was specific for the XID mutant allele versus the wild-type (WT) allele, and exhibited activity equivalent to WT I-AniI in vitro and in cellulo reporter assays. Fusion of the enzyme to a site-specific DNA binding domain of transcription activator-like effector (TALE) resulted in a further enhancement of gene editing efficiency. These results illustrate the potential of LHE enzymes as specific and efficient tools for therapeutic genome engineering. PMID:24682825
De novo Assembly of the Indo-Pacific Humpback Dolphin Leucocyte Transcriptome to Identify Putative Genes Involved in the Aquatic Adaptation and Immune Response

PubMed Central

Xia, Jia; Yang, Lili; Chen, Jialin; Wu, Yuping; Yi, Meisheng

2013-01-01

Background The Indo-Pacific humpback dolphin (Sousa chinensis), a marine mammal species inhabited in the waters of Southeast Asia, South Africa and Australia, has attracted much attention because of the dramatic decline in population size in the past decades, which raises the concern of extinction. So far, this species is poorly characterized at molecular level due to little sequence information available in public databases. Recent advances in large-scale RNA sequencing provide an efficient approach to generate abundant sequences for functional genomic analyses in the species with un-sequenced genomes. Principal Findings We performed a de novo assembly of the Indo-Pacific humpback dolphin leucocyte transcriptome by Illumina sequencing. 108,751 high quality sequences from 47,840,388 paired-end reads were generated, and 48,868 and 46,587 unigenes were functionally annotated by BLAST search against the NCBI non-redundant and Swiss-Prot protein databases (E-value<10−5), respectively. In total, 16,467 unigenes were clustered into 25 functional categories by searching against the COG database, and BLAST2GO search assigned 37,976 unigenes to 61 GO terms. In addition, 36,345 unigenes were grouped into 258 KEGG pathways. We also identified 9,906 simple sequence repeats and 3,681 putative single nucleotide polymorphisms as potential molecular markers in our assembled sequences. A large number of unigenes were predicted to be involved in immune response, and many genes were predicted to be relevant to adaptive evolution and cetacean-specific traits. Conclusion This study represented the first transcriptome analysis of the Indo-Pacific humpback dolphin, an endangered species. The de novo transcriptome analysis of the unique transcripts will provide valuable sequence information for discovery of new genes, characterization of gene expression, investigation of various pathways and adaptive evolution, as well as identification of genetic markers. PMID:24015242
De novo assembly of the Indo-Pacific humpback dolphin leucocyte transcriptome to identify putative genes involved in the aquatic adaptation and immune response.

PubMed

Gui, Duan; Jia, Kuntong; Xia, Jia; Yang, Lili; Chen, Jialin; Wu, Yuping; Yi, Meisheng

2013-01-01

The Indo-Pacific humpback dolphin (Sousa chinensis), a marine mammal species inhabited in the waters of Southeast Asia, South Africa and Australia, has attracted much attention because of the dramatic decline in population size in the past decades, which raises the concern of extinction. So far, this species is poorly characterized at molecular level due to little sequence information available in public databases. Recent advances in large-scale RNA sequencing provide an efficient approach to generate abundant sequences for functional genomic analyses in the species with un-sequenced genomes. We performed a de novo assembly of the Indo-Pacific humpback dolphin leucocyte transcriptome by Illumina sequencing. 108,751 high quality sequences from 47,840,388 paired-end reads were generated, and 48,868 and 46,587 unigenes were functionally annotated by BLAST search against the NCBI non-redundant and Swiss-Prot protein databases (E-value<10(-5)), respectively. In total, 16,467 unigenes were clustered into 25 functional categories by searching against the COG database, and BLAST2GO search assigned 37,976 unigenes to 61 GO terms. In addition, 36,345 unigenes were grouped into 258 KEGG pathways. We also identified 9,906 simple sequence repeats and 3,681 putative single nucleotide polymorphisms as potential molecular markers in our assembled sequences. A large number of unigenes were predicted to be involved in immune response, and many genes were predicted to be relevant to adaptive evolution and cetacean-specific traits. This study represented the first transcriptome analysis of the Indo-Pacific humpback dolphin, an endangered species. The de novo transcriptome analysis of the unique transcripts will provide valuable sequence information for discovery of new genes, characterization of gene expression, investigation of various pathways and adaptive evolution, as well as identification of genetic markers.
IDP-ASE: haplotyping and quantifying allele-specific expression at the gene and gene isoform level by hybrid sequencing

PubMed Central

Deonovic, Benjamin; Wang, Yunhao; Weirather, Jason; Wang, Xiu-Jie; Au, Kin Fai

2017-01-01

Abstract Allele-specific expression (ASE) is a fundamental problem in studying gene regulation and diploid transcriptome profiles, with two key challenges: (i) haplotyping and (ii) estimation of ASE at the gene isoform level. Existing ASE analysis methods are limited by a dependence on haplotyping from laborious experiments or extra genome/family trio data. In addition, there is a lack of methods for gene isoform level ASE analysis. We developed a tool, IDP-ASE, for full ASE analysis. By innovative integration of Third Generation Sequencing (TGS) long reads with Second Generation Sequencing (SGS) short reads, the accuracy of haplotyping and ASE quantification at the gene and gene isoform level was greatly improved as demonstrated by the gold standard data GM12878 data and semi-simulation data. In addition to methodology development, applications of IDP-ASE to human embryonic stem cells and breast cancer cells indicate that the imbalance of ASE and non-uniformity of gene isoform ASE is widespread, including tumorigenesis relevant genes and pluripotency markers. These results show that gene isoform expression and allele-specific expression cooperate to provide high diversity and complexity of gene regulation and expression, highlighting the importance of studying ASE at the gene isoform level. Our study provides a robust bioinformatics solution to understand ASE using RNA sequencing data only. PMID:27899656
Genetic determinants restricting the reassortment of heterologous NSP2 genes into the simian rotavirus SA11 genome.

PubMed

Mingo, Rebecca; Zhang, Shu; Long, Courtney P; LaConte, Leslie E W; McDonald, Sarah M

2017-08-24

Rotaviruses (RVs) can evolve through the process of reassortment, whereby the 11 double-stranded RNA genome segments are exchanged among strains during co-infection. However, reassortment is limited in cases where the genes or encoded proteins of co-infecting strains are functionally incompatible. In this study, we employed a helper virus-based reverse genetics system to identify NSP2 gene regions that correlate with restricted reassortment into simian RV strain SA11. We show that SA11 reassortants with NSP2 genes from human RV strains Wa or DS-1 were efficiently rescued and exhibit no detectable replication defects. However, we could not rescue an SA11 reassortant with a human RV strain AU-1 NSP2 gene, which differs from that of SA11 by 186 nucleotides (36 amino acids). To map restriction determinants, we engineered viruses to contain chimeric NSP2 genes in which specific regions of AU-1 sequence were substituted with SA11 sequence. We show that a region spanning AU-1 NSP2 gene nucleotides 784-820 is critical for the observed restriction; yet additional determinants reside in other gene regions. In silico and in vitro analyses were used to predict how the 784-820 region may impact NSP2 gene/protein function, thereby informing an understanding of the reassortment restriction mechanism.
The siRNA Non-seed Region and Its Target Sequences Are Auxiliary Determinants of Off-Target Effects.

PubMed

Kamola, Piotr J; Nakano, Yuko; Takahashi, Tomoko; Wilson, Paul A; Ui-Tei, Kumiko

2015-12-01

RNA interference (RNAi) is a powerful tool for post-transcriptional gene silencing. However, the siRNA guide strand may bind unintended off-target transcripts via partial sequence complementarity by a mechanism closely mirroring micro RNA (miRNA) silencing. To better understand these off-target effects, we investigated the correlation between sequence features within various subsections of siRNA guide strands, and its corresponding target sequences, with off-target activities. Our results confirm previous reports that strength of base-pairing in the siRNA seed region is the primary factor determining the efficiency of off-target silencing. However, the degree of downregulation of off-target transcripts with shared seed sequence is not necessarily similar, suggesting that there are additional auxiliary factors that influence the silencing potential. Here, we demonstrate that both the melting temperature (Tm) in a subsection of siRNA non-seed region, and the GC contents of its corresponding target sequences, are negatively correlated with the efficiency of off-target effect. Analysis of experimentally validated miRNA targets demonstrated a similar trend, indicating a putative conserved mechanistic feature of seed region-dependent targeting mechanism. These observations may prove useful as parameters for off-target prediction algorithms and improve siRNA 'specificity' design rules.
[New perspectives on molecular and genic therapies in Down syndrome].

PubMed

Delabar, Jean Maurice

2010-04-01

Trisomy 21 was first described as a syndrome in the middle of the nineteenth century and associated to a chromosomic anomaly one hundred years later: the most salient feature of this syndrome is a mental retardation of variable intensity. Molecular mapping and DNA sequencing have allowed identifying the gene content of chromosome 21. Molecular quantitative analyses indicated that trisomy is inducing an overexpression for a large part of the triplicated genes and deregulates also pathways involving non HSA21 genes. Together with the physiological description of murine models overexpressing orthologous genes, these data have allowed to elaborate hypotheses on the cause of cognitive impairment. From these hypotheses and using murine models it is now possible to assess the efficiency of various therapeutic strategies. This paper reviews these new perspectives starting from the strategies targeting the level of HSA21 RNAs or HSA21 proteins; then it describes methods targeting activities either of proteins involved in cell cycle pathways or of proteins controlling the synaptic plasticity. It is promising that strategies targeting specific genes or specific pathways are already giving positive results.
Repairing the sickle cell mutation. I. Specific covalent binding of a photoreactive third strand to the mutated base pair.

PubMed

Broitman, S; Amosova, O; Dolinnaya, N G; Fresco, J R

1999-07-30

A DNA third strand with a 3'-psoralen substituent was designed to form a triplex with the sequence downstream of the T.A mutant base pair of the human sickle cell beta-globin gene. Triplex-mediated psoralen modification of the mutant T residue was sought as an approach to gene repair. The 24-nucleotide purine-rich target sequence switches from one strand to the other and has four pyrimidine interruptions. Therefore, a third strand sequence favorable to two triplex motifs was used, one parallel and the other antiparallel to it. To cope with the pyrimidine interruptions, which weaken third strand binding, 5-methylcytosine and 5-propynyluracil were used in the third strand. Further, a six residue "hook" complementary to an overhang of a linear duplex target was added to the 5'-end of the third strand via a T(4) linker. In binding to the overhang by Watson-Crick pairing, the hook facilitates triplex formation. This third strand also binds specifically to the target within a supercoiled plasmid. The psoralen moiety at the 3'-end of the third strand forms photoadducts to the targeted T with high efficiency. Such monoadducts are known to preferentially trigger reversion of the mutation by DNA repair enzymes.
Suppressive subtractive hybridization approach revealed differential expression of hypersensitive response and reactive oxygen species production genes in tea (Camellia sinensis (L.) O. Kuntze) leaves during Pestalotiopsis thea infection.

PubMed

Senthilkumar, Palanisamy; Thirugnanasambantham, Krishnaraj; Mandal, Abul Kalam Azad

2012-12-01

Tea (Camellia sinensis (L.) O. Kuntze) is an economically important plant cultivated for its leaves. Infection of Pestalotiopsis theae in leaves causes gray blight disease and enormous loss to the tea industry. We used suppressive subtractive hybridization (SSH) technique to unravel the differential gene expression pattern during gray blight disease development in tea. Complementary DNA from P. theae-infected and uninfected leaves of disease tolerant cultivar UPASI-10 was used as tester and driver populations respectively. Subtraction efficiency was confirmed by comparing abundance of β-actin gene. A total of 377 and 720 clones with insert size >250 bp from forward and reverse library respectively were sequenced and analyzed. Basic Local Alignment Search Tool analysis revealed 17 sequences in forward SSH library have high degree of similarity with disease and hypersensitive response related genes and 20 sequences with hypothetical proteins while in reverse SSH library, 23 sequences have high degree of similarity with disease and stress response-related genes and 15 sequences with hypothetical proteins. Functional analysis indicated unknown (61 and 59 %) or hypothetical functions (23 and 18 %) for most of the differentially regulated genes in forward and reverse SSH library, respectively, while others have important role in different cellular activities. Majority of the upregulated genes are related to hypersensitive response and reactive oxygen species production. Based on these expressed sequence tag data, putative role of differentially expressed genes were discussed in relation to disease. We also demonstrated the efficiency of SSH as a tool in enriching gray blight disease related up- and downregulated genes in tea. The present study revealed that many genes related to disease resistance were suppressed during P. theae infection and enhancing these genes by the application of inducers may impart better disease tolerance to the plants.
Efficient genomic correction methods in human iPS cells using CRISPR-Cas9 system.

PubMed

Li, Hongmei Lisa; Gee, Peter; Ishida, Kentaro; Hotta, Akitsu

2016-05-15

Precise gene correction using the CRISPR-Cas9 system in human iPS cells holds great promise for various applications, such as the study of gene functions, disease modeling, and gene therapy. In this review article, we summarize methods for effective editing of genomic sequences of iPS cells based on our experiences correcting dystrophin gene mutations with the CRISPR-Cas9 system. Designing specific sgRNAs as well as having efficient transfection methods and proper detection assays to assess genomic cleavage activities are critical for successful genome editing in iPS cells. In addition, because iPS cells are fragile by nature when dissociated into single cells, a step-by-step confirmation during the cell recovery process is recommended to obtain an adequate number of genome-edited iPS cell clones. We hope that the techniques described here will be useful for researchers from diverse backgrounds who would like to perform genome editing in iPS cells. Copyright © 2015 The Authors. Published by Elsevier Inc. All rights reserved.
Cross-genome map based dissection of a nitrogen use efficiency ortho-metaQTL in bread wheat unravels concerted cereal genome evolution.

PubMed

Quraishi, Umar Masood; Abrouk, Michael; Murat, Florent; Pont, Caroline; Foucrier, Séverine; Desmaizieres, Gregory; Confolent, Carole; Rivière, Nathalie; Charmet, Gilles; Paux, Etienne; Murigneux, Alain; Guerreiro, Laurent; Lafarge, Stéphane; Le Gouis, Jacques; Feuillet, Catherine; Salse, Jerome

2011-03-01

Monitoring nitrogen use efficiency (NUE) in plants is becoming essential to maintain yield while reducing fertilizer usage. Optimized NUE application in major crops is essential for long-term sustainability of agriculture production. Here, we report the precise identification of 11 major chromosomal regions controlling NUE in wheat that co-localise with key developmental genes such as Ppd (photoperiod sensitivity), Vrn (vernalization requirement), Rht (reduced height) and can be considered as robust markers from a molecular breeding perspective. Physical mapping, sequencing, annotation and candidate gene validation of an NUE metaQTL on wheat chromosome 3B allowed us to propose that a glutamate synthase (GoGAT) gene that is conserved structurally and functionally at orthologous positions in rice, sorghum and maize genomes may contribute to NUE in wheat and other cereals. We propose an evolutionary model for the NUE locus in cereals from a common ancestral region, involving species specific shuffling events such as gene deletion, inversion, transposition and the invasion of repetitive elements. © 2011 The Authors. The Plant Journal © 2011 Blackwell Publishing Ltd.
The landscape of sex-differential transcriptome and its consequent selection in human adults.

PubMed

Gershoni, Moran; Pietrokovski, Shmuel

2017-02-07

The prevalence of several human morbid phenotypes is sometimes much higher than intuitively expected. This can directly arise from the presence of two sexes, male and female, in one species. Men and women have almost identical genomes but are distinctly dimorphic, with dissimilar disease susceptibilities. Sexually dimorphic traits mainly result from differential expression of genes present in both sexes. Such genes can be subject to different, and even opposing, selection constraints in the two sexes. This can impact human evolution by differential selection on mutations with dissimilar effects on the two sexes. We comprehensively mapped human sex-differential genetic architecture across 53 tissues. Analyzing available RNA-sequencing data from 544 adults revealed thousands of genes differentially expressed in the reproductive tracts and tissues common to both sexes. Sex-differential genes are related to various biological systems, and suggest new insights into the pathophysiology of diverse human diseases. We also identified a significant association between sex-specific gene transcription and reduced selection efficiency and accumulation of deleterious mutations, which might affect the prevalence of different traits and diseases. Interestingly, many of the sex-specific genes that also undergo reduced selection efficiency are essential for successful reproduction in men or women. This seeming paradox might partially explain the high incidence of human infertility. This work provides a comprehensive overview of the sex-differential transcriptome and its importance to human evolution and human physiology in health and in disease.
Ion Torrent sequencing as a tool for mutation discovery in the flax (Linum usitatissimum L.) genome.

PubMed

Galindo-González, Leonardo; Pinzón-Latorre, David; Bergen, Erik A; Jensen, Dustin C; Deyholos, Michael K

2015-01-01

Detection of induced mutations is valuable for inferring gene function and for developing novel germplasm for crop improvement. Many reverse genetics approaches have been developed to identify mutations in genes of interest within a mutagenized population, including some approaches that rely on next-generation sequencing (e.g. exome capture, whole genome resequencing). As an alternative to these genome or exome-scale methods, we sought to develop a scalable and efficient method for detection of induced mutations that could be applied to a small number of target genes, using Ion Torrent technology. We developed this method in flax (Linum usitatissimum), to demonstrate its utility in a crop species. We used an amplicon-based approach in which DNA samples from an ethyl methanesulfonate (EMS)-mutagenized population were pooled and used as template in PCR reactions to amplify a region of each gene of interest. Barcodes were incorporated during PCR, and the pooled amplicons were sequenced using an Ion Torrent PGM. A pilot experiment with known SNPs showed that they could be detected at a frequency > 0.3% within the pools. We then selected eight genes for which we wanted to discover novel mutations, and applied our approach to screen 768 individuals from the EMS population, using either the Ion 314 or Ion 316 chips. Out of 29 potential mutations identified after processing the NGS reads, 16 mutations were confirmed using Sanger sequencing. The methodology presented here demonstrates the utility of Ion Torrent technology in detecting mutation variants in specific genome regions for large populations of a species such as flax. The methodology could be scaled-up to test >100 genes using the higher capacity chips now available from Ion Torrent.
Bivalve-specific gene expansion in the pearl oyster genome: implications of adaptation to a sessile lifestyle.

PubMed

Takeuchi, Takeshi; Koyanagi, Ryo; Gyoja, Fuki; Kanda, Miyuki; Hisata, Kanako; Fujie, Manabu; Goto, Hiroki; Yamasaki, Shinichi; Nagai, Kiyohito; Morino, Yoshiaki; Miyamoto, Hiroshi; Endo, Kazuyoshi; Endo, Hirotoshi; Nagasawa, Hiromichi; Kinoshita, Shigeharu; Asakawa, Shuichi; Watabe, Shugo; Satoh, Noriyuki; Kawashima, Takeshi

2016-01-01

Bivalve molluscs have flourished in marine environments, and many species constitute important aquatic resources. Recently, whole genome sequences from two bivalves, the pearl oyster, Pinctada fucata, and the Pacific oyster, Crassostrea gigas, have been decoded, making it possible to compare genomic sequences among molluscs, and to explore general and lineage-specific genetic features and trends in bivalves. In order to improve the quality of sequence data for these purposes, we have updated the entire P. fucata genome assembly. We present a new genome assembly of the pearl oyster, Pinctada fucata (version 2.0). To update the assembly, we conducted additional sequencing, obtaining accumulated sequence data amounting to 193× the P. fucata genome. Sequence redundancy in contigs that was caused by heterozygosity was removed in silico, which significantly improved subsequent scaffolding. Gene model version 2.0 was generated with the aid of manual gene annotations supplied by the P. fucata research community. Comparison of mollusc and other bilaterian genomes shows that gene arrangements of Hox, ParaHox, and Wnt clusters in the P. fucata genome are similar to those of other molluscs. Like the Pacific oyster, P. fucata possesses many genes involved in environmental responses and in immune defense. Phylogenetic analyses of heat shock protein70 and C1q domain-containing protein families indicate that extensive expansion of genes occurred independently in each lineage. Several gene duplication events prior to the split between the pearl oyster and the Pacific oyster are also evident. In addition, a number of tandem duplications of genes that encode shell matrix proteins are also well characterized in the P. fucata genome. Both the Pinctada and Crassostrea lineages have expanded specific gene families in a lineage-specific manner. Frequent duplication of genes responsible for shell formation in the P. fucata genome explains the diversity of mollusc shell structures. These duplications reveal dynamic genome evolution to forge the complex physiology that enables bivalves to employ a sessile lifestyle in the intertidal zone.
Efficient mutation identification in zebrafish by microarray capturing and next generation sequencing.

PubMed

Bontems, Franck; Baerlocher, Loic; Mehenni, Sabrina; Bahechar, Ilham; Farinelli, Laurent; Dosch, Roland

2011-02-18

Fish models like medaka, stickleback or zebrafish provide a valuable resource to study vertebrate genes. However, finding genetic variants e.g. mutations in the genome is still arduous. Here we used a combination of microarray capturing and next generation sequencing to identify the affected gene in the mozartkugelp11cv (mzlp11cv) mutant zebrafish. We discovered a 31-bp deletion in macf1 demonstrating the potential of this technique to efficiently isolate mutations in a vertebrate genome. Copyright © 2011 Elsevier Inc. All rights reserved.
Enhanced protective efficacy against tuberculosis provided by a recombinant urease deficient BCG expressing heat shock protein 70-major membrane protein-II having PEST sequence.

PubMed

Tsukamoto, Yumiko; Maeda, Yumi; Tamura, Toshiki; Mukai, Tetsu; Mitarai, Satoshi; Yamamoto, Saburo; Makino, Masahiko

2016-12-07

Enhancement of the T cell-stimulating ability of Mycobacterium bovis BCG (BCG) is necessary to develop an effective tuberculosis vaccine. For this purpose, we introduced the PEST-HSP70-major membrane protein-II (MMPII)-PEST fusion gene into ureC-gene depleted recombinant (r) BCG to produce BCG-PEST. The PEST sequence is involved in the proteasomal processing of antigens. BCG-PEST secreted the PEST-HSP70-MMPII-PEST fusion protein and more efficiently activated human monocyte-derived dendritic cells (DCs) in terms of phenotypic changes and cytokine productions than an empty-vector-introduced BCG or HSP70-MMPII gene-introduced ureC gene-depleted BCG (BCG-DHTM). Autologous human naïve CD8 + T cells and naïve CD4 + T cells were effectively activated by BCG-PEST and produced IFN-γ in an antigen-specific manner through DCs. These T cell activations were closely associated with phagosomal maturation and intraproteasomal protein degradation in antigen-presenting cells. Furthermore, BCG-PEST produced long-lasting memory-type T cells in C57BL/6 mice more efficiently than control rBCGs. Moreover, a single subcutaneous injection of BCG-PEST more effectively reduced the multiplication of subsequent aerosol-challenged Mycobacterium tuberculosis of the standard H37Rv strain and clinically isolated Beijing strain in the lungs than control rBCGs. The vaccination effect of BCG-PEST lasted for at least 6months. These results indicate that BCG-PEST may be able to efficiently control the spread of tuberculosis in human. Copyright Â© 2016 Elsevier Ltd. All rights reserved.

Host-Induced Gene Silencing of Rice Blast Fungus Magnaporthe oryzae Pathogenicity Genes Mediated by the Brome Mosaic Virus.

PubMed

Zhu, Lin; Zhu, Jian; Liu, Zhixue; Wang, Zhengyi; Zhou, Cheng; Wang, Hong

2017-09-26

Magnaporthe oryzae is a devastating plant pathogen, which has a detrimental impact on rice production worldwide. Despite its agronomical importance, some newly-emerging pathotypes often overcome race-specific disease resistance rapidly. It is thus desirable to develop a novel strategy for the long-lasting resistance of rice plants to ever-changing fungal pathogens. Brome mosaic virus (BMV)-induced RNA interference (RNAi) has emerged as a useful tool to study host-resistance genes for rice blast protection. Planta-generated silencing of targeted genes inside biotrophic pathogens can be achieved by expression of M. oryzae -derived gene fragments in the BMV-mediated gene silencing system, a technique termed host-induced gene silencing (HIGS). In this study, the effectiveness of BMV-mediated HIGS in M. oryzae was examined by targeting three predicted pathogenicity genes, MoABC1, MoMAC1 and MoPMK1 . Systemic generation of fungal gene-specific small interfering RNA (siRNA) molecules induced by inoculation of BMV viral vectors inhibited disease development and reduced the transcription of targeted fungal genes after subsequent M. oryzae inoculation. Combined introduction of fungal gene sequences in sense and antisense orientation mediated by the BMV silencing vectors significantly enhanced the efficiency of this host-generated trans-specific RNAi, implying that these fungal genes played crucial roles in pathogenicity. Collectively, our results indicated that BMV-HIGS system was a great strategy for protecting host plants against the invasion of pathogenic fungi.
Experience of targeted Usher exome sequencing as a clinical test

PubMed Central

Besnard, Thomas; García-García, Gema; Baux, David; Vaché, Christel; Faugère, Valérie; Larrieu, Lise; Léonard, Susana; Millan, Jose M; Malcolm, Sue; Claustres, Mireille; Roux, Anne-Françoise

2014-01-01

We show that massively parallel targeted sequencing of 19 genes provides a new and reliable strategy for molecular diagnosis of Usher syndrome (USH) and nonsyndromic deafness, particularly appropriate for these disorders characterized by a high clinical and genetic heterogeneity and a complex structure of several of the genes involved. A series of 71 patients including Usher patients previously screened by Sanger sequencing plus newly referred patients was studied. Ninety-eight percent of the variants previously identified by Sanger sequencing were found by next-generation sequencing (NGS). NGS proved to be efficient as it offers analysis of all relevant genes which is laborious to reach with Sanger sequencing. Among the 13 newly referred Usher patients, both mutations in the same gene were identified in 77% of cases (10 patients) and one candidate pathogenic variant in two additional patients. This work can be considered as pilot for implementing NGS for genetically heterogeneous diseases in clinical service. PMID:24498627
Development of ITS sequence based molecular marker to distinguish, Tribulus terrestris L. (Zygophyllaceae) from its adulterants.

PubMed

Balasubramani, Subramani Paranthaman; Murugan, Ramar; Ravikumar, Kaliamoorthy; Venkatasubramanian, Padma

2010-09-01

Tribulus terrestris L. (Zygophyllaceae) is one of the highly traded raw drugs and also used as a stimulative food additive in Europe and USA. While, Ayurvedic Pharmacopoeia of India recognizes T. terrestris as Goksura, Tribulus lanuginosus and T. subramanyamii are also traded by the same name raising issues of quality control. The nuclear ribosomal RNA genes and ITS (internal transcribed spacer) sequence were used to develop species-specific DNA markers. The species-specific markers efficiently amplified 295bp for T. terrestris (TT1F and TT1R), 300bp for T. lanuginosus (TL1F and TL1R) and 214bp for T. subramanyamii (TS1F and TS1R). These DNA markers can be used to distinguish T. terrestris from its adulterants. Copyright (c) 2010 Elsevier B.V. All rights reserved.
Fluorescent in situ hybridisation to amphioxus chromosomes.

PubMed

Castro, Luis Filipe Costa; Holland, Peter William Harold

2002-12-01

We describe an efficient protocol for mapping genes and other DNA sequences to amphioxus chromosomes using fluorescent in situ hybridisation. We apply this method to identify the number and location of ribosomal DNA gene clusters and telomere sequences in metaphase spreads of Branchiostoma floridae. We also describe how the locations of two single copy genes can be mapped relative to each other, and demonstrate this by mapping an amphioxus Pax gene relative to a homologue of the Notch gene. These methods have great potential for performing comparative genomics between amphioxus and vertebrates.
Cloning-free CRISPR

PubMed Central

Arbab, Mandana; Srinivasan, Sharanya; Hashimoto, Tatsunori; Geijsen, Niels; Sherwood, Richard I.

2015-01-01

Summary We present self-cloning CRISPR/Cas9 (scCRISPR), a technology that allows for CRISPR/Cas9-mediated genomic mutation and site-specific knockin transgene creation within several hours by circumventing the need to clone a site-specific single-guide RNA (sgRNA) or knockin homology construct for each target locus. We introduce a self-cleaving palindromic sgRNA plasmid and a short double-stranded DNA sequence encoding the desired locus-specific sgRNA into target cells, allowing them to produce a locus-specific sgRNA plasmid through homologous recombination. scCRISPR enables efficient generation of gene knockouts (∼88% mutation rate) at approximately one-sixth the cost of plasmid-based sgRNA construction with only 2 hr of preparation for each targeted site. Additionally, we demonstrate efficient site-specific knockin of GFP transgenes without any plasmid cloning or genome-integrated selection cassette in mouse and human embryonic stem cells (2%–4% knockin rate) through PCR-based addition of short homology arms. scCRISPR substantially lowers the bar on mouse and human transgenesis. PMID:26527385
De novo Transcriptome Assembly of Common Wild Rice (Oryza rufipogon Griff.) and Discovery of Drought-Response Genes in Root Tissue Based on Transcriptomic Data.

PubMed

Tian, Xin-Jie; Long, Yan; Wang, Jiao; Zhang, Jing-Wen; Wang, Yan-Yan; Li, Wei-Min; Peng, Yu-Fa; Yuan, Qian-Hua; Pei, Xin-Wu

2015-01-01

The perennial O. rufipogon (common wild rice), which is considered to be the ancestor of Asian cultivated rice species, contains many useful genetic resources, including drought resistance genes. However, few studies have identified the drought resistance and tissue-specific genes in common wild rice. In this study, transcriptome sequencing libraries were constructed, including drought-treated roots (DR) and control leaves (CL) and roots (CR). Using Illumina sequencing technology, we generated 16.75 million bases of high-quality sequence data for common wild rice and conducted de novo assembly and annotation of genes without prior genome information. These reads were assembled into 119,332 unigenes with an average length of 715 bp. A total of 88,813 distinct sequences (74.42% of unigenes) significantly matched known genes in the NCBI NT database. Differentially expressed gene (DEG) analysis showed that 3617 genes were up-regulated and 4171 genes were down-regulated in the CR library compared with the CL library. Among the DEGs, 535 genes were expressed in roots but not in shoots. A similar comparison between the DR and CR libraries showed that 1393 genes were up-regulated and 315 genes were down-regulated in the DR library compared with the CR library. Finally, 37 genes that were specifically expressed in roots were screened after comparing the DEGs identified in the above-described analyses. This study provides a transcriptome sequence resource for common wild rice plants and establishes a digital gene expression profile of wild rice plants under drought conditions using the assembled transcriptome data as a reference. Several tissue-specific and drought-stress-related candidate genes were identified, representing a fully characterized transcriptome and providing a valuable resource for genetic and genomic studies in plants.
Assessment of clinical analytical sensitivity and specificity of next-generation sequencing for detection of simple and complex mutations.

PubMed

Chin, Ephrem L H; da Silva, Cristina; Hegde, Madhuri

2013-02-19

Detecting mutations in disease genes by full gene sequence analysis is common in clinical diagnostic laboratories. Sanger dideoxy terminator sequencing allows for rapid development and implementation of sequencing assays in the clinical laboratory, but it has limited throughput, and due to cost constraints, only allows analysis of one or at most a few genes in a patient. Next-generation sequencing (NGS), on the other hand, has evolved rapidly, although to date it has mainly been used for large-scale genome sequencing projects and is beginning to be used in the clinical diagnostic testing. One advantage of NGS is that many genes can be analyzed easily at the same time, allowing for mutation detection when there are many possible causative genes for a specific phenotype. In addition, regions of a gene typically not tested for mutations, like deep intronic and promoter mutations, can also be detected. Here we use 20 previously characterized Sanger-sequenced positive controls in disease-causing genes to demonstrate the utility of NGS in a clinical setting using standard PCR based amplification to assess the analytical sensitivity and specificity of the technology for detecting all previously characterized changes (mutations and benign SNPs). The positive controls chosen for validation range from simple substitution mutations to complex deletion and insertion mutations occurring in autosomal dominant and recessive disorders. The NGS data was 100% concordant with the Sanger sequencing data identifying all 119 previously identified changes in the 20 samples. We have demonstrated that NGS technology is ready to be deployed in clinical laboratories. However, NGS and associated technologies are evolving, and clinical laboratories will need to invest significantly in staff and infrastructure to build the necessary foundation for success.
Local gene silencing in plants via synthetic dsRNA and carrier peptide.

PubMed

Numata, Keiji; Ohtani, Misato; Yoshizumi, Takeshi; Demura, Taku; Kodama, Yutaka

2014-10-01

Quick and facile transient RNA interference (RNAi) is one of the most valuable plant biotechnologies for analysing plant gene functions. To establish a novel double-strand RNA (dsRNA) delivery system for plants, we developed an ionic complex of synthetic dsRNA with a carrier peptide in which a cell-penetrating peptide is fused with a polycation sequence as a gene carrier. The dsRNA-peptide complex is 100-300 nm in diameter and positively charged. Infiltration of the complex into intact leaf cells of Arabidopsis thaliana successfully induced rapid and efficient down-regulation of exogenous and endogenous genes such as yellow fluorescent protein and chalcone synthase. The present method realizes quick and local gene silencing in specific tissues and/or organs in plants. © 2014 The Authors. Plant Biotechnology Journal published by Society for Experimental Biology and The Association of Applied Biologists and John Wiley & Sons Ltd.
BuD, a helix–loop–helix DNA-binding domain for genome modification

PubMed Central

Stella, Stefano; Molina, Rafael; López-Méndez, Blanca; Juillerat, Alexandre; Bertonati, Claudia; Daboussi, Fayza; Campos-Olivas, Ramon; Duchateau, Phillippe; Montoya, Guillermo

2014-01-01

DNA editing offers new possibilities in synthetic biology and biomedicine for modulation or modification of cellular functions to organisms. However, inaccuracy in this process may lead to genome damage. To address this important problem, a strategy allowing specific gene modification has been achieved through the addition, removal or exchange of DNA sequences using customized proteins and the endogenous DNA-repair machinery. Therefore, the engineering of specific protein–DNA interactions in protein scaffolds is key to providing ‘toolkits’ for precise genome modification or regulation of gene expression. In a search for putative DNA-binding domains, BurrH, a protein that recognizes a 19 bp DNA target, was identified. Here, its apo and DNA-bound crystal structures are reported, revealing a central region containing 19 repeats of a helix–loop–helix modular domain (BurrH domain; BuD), which identifies the DNA target by a single residue-to-nucleotide code, thus facilitating its redesign for gene targeting. New DNA-binding specificities have been engineered in this template, showing that BuD-derived nucleases (BuDNs) induce high levels of gene targeting in a locus of the human haemoglobin β (HBB) gene close to mutations responsible for sickle-cell anaemia. Hence, the unique combination of high efficiency and specificity of the BuD arrays can push forward diverse genome-modification approaches for cell or organism redesign, opening new avenues for gene editing. PMID:25004980
Identification of upstream and intragenic regulatory elements that confer cell-type-restricted and differentiation-specific expression on the muscle creatine kinase gene

DOE Office of Scientific and Technical Information (OSTI.GOV)

Sternberg, E.A.; Spizz, G.; Perry, W.M.

1988-07-01

Terminal differentiation of skeletal myobalsts is accompanied by induction of a series of tissue-specific gene products, which includes the muscle isoenzymte of creatine kinase (MCK). To begin to define the sequences and signals involved in MCK regulation in developing muscle cells, the mouse MCK gene has been isolated. Sequence analysis of 4,147 bases of DNA surrounding the transcription initiation site revealed several interesting structural features, some of which are common to other muscle-specific genes and to cellular and viral enhancers.
Identification of a high-efficiency baculovirus DNA replication origin that functions in insect and mammalian cells.

PubMed

Wu, Yueh-Lung; Wu, Carol-P; Huang, Yu-Hui; Huang, Sheng-Ping; Lo, Huei-Ru; Chang, Hao-Shuo; Lin, Pi-Hsiu; Wu, Ming-Cheng; Chang, Chia-Jung; Chao, Yu-Chan

2014-11-01

The p143 gene from Autographa californica multinucleocapsid nucleopolyhedrovirus (AcMNPV) has been found to increase the expression of luciferase, which is driven by the polyhedrin gene promoter, in a plasmid with virus coinfection. Further study indicated that this is due to the presence of a replication origin (ori) in the coding region of this gene. Transient DNA replication assays showed that a specific fragment of the p143 coding sequence, p143-3, underwent virus-dependent DNA replication in Spodoptera frugiperda IPLB-Sf-21 (Sf-21) cells. Deletion analysis of the p143-3 fragment showed that subfragment p143-3.2a contained the essential sequence of this putative ori. Sequence analysis of this region revealed a unique distribution of imperfect palindromes with high AT contents. No sequence homology or similarity between p143-3.2a and any other known ori was detected, suggesting that it is a novel baculovirus ori. Further study showed that the p143-3.2a ori can replicate more efficiently in infected Sf-21 cells than baculovirus homologous regions (hrs), the major baculovirus ori, or non-hr oris during virus replication. Previously, hr on its own was unable to replicate in mammalian cells, and for mammalian viral oris, viral proteins are generally required for their proper replication in host cells. However, the p143-3.2a ori was, surprisingly, found to function as an efficient ori in mammalian cells without the need for any viral proteins. We conclude that p143 contains a unique sequence that can function as an ori to enhance gene expression in not only insect cells but also mammalian cells. Baculovirus DNA replication relies on both hr and non-hr oris; however, so far very little is known about the latter oris. Here we have identified a new non-hr ori, the p143 ori, which resides in the coding region of p143. By developing a novel DNA replication-enhanced reporter system, we have identified and located the core region required for the p143 ori. This ori contains a large number of imperfect inverted repeats and is the most active ori in the viral genome during virus infection in insect cells. We also found that it is a unique ori that can replicate in mammalian cells without the assistance of baculovirus gene products. The identification of this ori should contribute to a better understanding of baculovirus DNA replication. Also, this ori is very useful in assisting with gene expression in mammalian cells. Copyright © 2014, American Society for Microbiology. All Rights Reserved.
Preparation of anti-mouse caspase-12 mRNA hammerhead ribozyme and identification of its activity in vitro

PubMed Central

Jiang, Shan; Xie, Qing; Zhang, Wei; Zhou, Xia-Qiu; Jin, You-Xin

2005-01-01

AIM: To prepare and identify specific anti-mouse caspase-12 hammerhead ribozymes in vitro, in order to select a more effective ribozyme against mouse caspase-12 as a potential tool to rescue cells from endoplasmic reticulum stress induced apoptosis. METHODS: Two hammerhead ribozymes directed separately against 138 and 218 site of nucleotide of mouse caspase-12 mRNA were designed by computer software, and their DNA sequences were synthesized. The synthesized ribozymes were cloned into an eukaryotic expression vector-neorpBSKU6 and embedded in U6 SnRNA context for further study. Mouse caspase-12 gene segment was cloned into PGEM-T vector under the control of T7 RNA polymerase promoter (containing gene sequence from positions nt 41 to nt 894) as target. In vitro transcription both the ribozymes and target utilize T7 promoter. The target was labeled with [α-32P]UTP, while ribozymes were not labeled. After gel purification the RNAs were dissolved in RNase free water. Ribozyme and target were incubated for 90 min at 37°C in reaction buffer (40 mmol/L Tris-HCL, pH 7.5, 10 mmol/L Mg2+). Molar ratio of ribozyme vs target was 30:1. Samples were analyzed on 6% PAGE (containing 8 mol/L urea). RESULTS: Both caspase-12 and ribozyme gene sequences were successfully cloned into expression vector confirmed by sequencing. Ribozymes and caspase-12 mRNA were obtained by in vitro transcription. Cleavage experiment showed that in a physiological similar condition (37°C, pH 7.5), Rz138 and Rz218 both cleaved targets at predicted sites, for Rz138 the cleavage efficiency was about 100%, for Rz218 the value was 36.66%. CONCLUSION: Rz138 prepared in vitro can site specific cleave mouse caspase-12 mRNA with an excellent efficiency. It shows a potential to suppress the expression of caspase-12 in vivo, thus provided a new way to protect cells from ER stress induced apoptosis. PMID:15996037
RNA therapeutics targeting osteoclast-mediated excessive bone resorption

PubMed Central

Wang, Yuwei; Grainger, David W

2011-01-01

RNA interference (RNAi) is a sequence-specific post-transcriptional gene silencing technique developed with dramatically increasing utility for both scientific and therapeutic purposes. Short interfering RNA (siRNA) is currently exploited to regulate protein expression relevant to many therapeutic applications, and commonly used as a tool for elucidating disease-associated genes. Osteoporosis and their associated osteoporotic fragility fractures in both men and women are rapidly becoming a global healthcare crisis as average life expectancy increases worldwide. New therapeutics are needed for this increasing patient population. This review describes the diversity of molecular targets suitable for RNAi-based gene knock-down in osteoclasts to control osteoclast-mediated excessive bone resorption. We identify strategies for developing targeted siRNA delivery and efficient gene silencing, and describe opportunities and challenges of introducing siRNA as a therapeutic approach to hard and connective tissue disorders. PMID:21945356
Rapid generation of genetic diversity by multiplex CRISPR/Cas9 genome editing in rice.

PubMed

Shen, Lan; Hua, Yufeng; Fu, Yaping; Li, Jian; Liu, Qing; Jiao, Xiaozhen; Xin, Gaowei; Wang, Junjie; Wang, Xingchun; Yan, Changjie; Wang, Kejian

2017-05-01

The clustered regularly interspaced short palindromic repeats (CRISPR)-associated endonuclease 9 (CRISPR/Cas9) system has emerged as a promising technology for specific genome editing in many species. Here we constructed one vector targeting eight agronomic genes in rice using the CRISPR/Cas9 multiplex genome editing system. By subsequent genetic transformation and DNA sequencing, we found that the eight target genes have high mutation efficiencies in the T 0 generation. Both heterozygous and homozygous mutations of all editing genes were obtained in T 0 plants. In addition, homozygous sextuple, septuple, and octuple mutants were identified. As the abundant genotypes in T 0 transgenic plants, various phenotypes related to the editing genes were observed. The findings demonstrate the potential of the CRISPR/Cas9 system for rapid introduction of genetic diversity during crop breeding.
Serratia marcescens outbreak in a neonatal intensive care unit (NICU): new insights from next-generation sequencing applications.

PubMed

Martineau, Christine; Li, Xuejing; Lalancette, Cindy; Perreault, Thérèse; Fournier, Eric; Tremblay, Julien; Gonzales, Milagros; Yergeau, Étienne; Quach, Caroline

2018-06-13

Serratia marcescens is an environmental bacterium commonly associated with outbreaks in neonatal intensive care units (NICU). Investigation of S. marcescens outbreaks requires efficient recovery and typing of clinical and environmental isolates. In this study, we described how the use of next-generation sequencing applications, such as bacterial whole-genome sequencing (WGS) and bacterial community profiling, could improve S. marcescens outbreak investigation. Phylogenomic links and potential antibiotic resistance genes and plasmids in S. marcescens isolates were investigated using WGS, while bacterial communities and relative abundances of Serratia in environmental samples were assessed using sequencing of bacterial phylogenetic marker genes (16S rRNA and gyrB genes). Typing results obtained using WGS for the ten S. marcescens isolates recovered during a NICU outbreak investigation were highly consistent with those from pulse-field gel electrophoresis (PFGE), the current gold standard typing method for this bacterium. WGS also allowed for the identification of genes associated with antibiotic resistance in all isolates, while no plasmid was detected. Sequencing of the 16S rRNA and gyrB genes both showed higher relative abundances of Serratia in environmental sampling sites that were in close contact with infected babies. Much lower relative abundances of Serratia were observed following disinfection of a room, indicating that the protocol used was efficient. Variations in the bacterial community composition and structure following room disinfection and between sampling sites were also identified through 16S rRNA gene sequencing. Globally, results from this study highlight the potential for next-generation sequencing tools to improve and facilitate outbreak investigation. Copyright © 2018 American Society for Microbiology.
Induction and maintenance of DNA methylation in plant promoter sequences by apple latent spherical virus-induced transcriptional gene silencing

PubMed Central

Kon, Tatsuya; Yoshikawa, Nobuyuki

2014-01-01

Apple latent spherical virus (ALSV) is an efficient virus-induced gene silencing vector in functional genomics analyses of a broad range of plant species. Here, an Agrobacterium-mediated inoculation (agroinoculation) system was developed for the ALSV vector, and virus-induced transcriptional gene silencing (VITGS) is described in plants infected with the ALSV vector. The cDNAs of ALSV RNA1 and RNA2 were inserted between the cauliflower mosaic virus 35S promoter and the NOS-T sequences in a binary vector pCAMBIA1300 to produce pCALSR1 and pCALSR2-XSB or pCALSR2-XSB/MN. When these vector constructs were agroinoculated into Nicotiana benthamiana plants with a construct expressing a viral silencing suppressor, the infection efficiency of the vectors was 100%. A recombinant ALSV vector carrying part of the 35S promoter sequence induced transcriptional gene silencing of the green fluorescent protein gene in a line of N. benthamiana plants, resulting in the disappearance of green fluorescence of infected plants. Bisulfite sequencing showed that cytosine residues at CG and CHG sites of the 35S promoter sequence were highly methylated in the silenced generation zero plants infected with the ALSV carrying the promoter sequence as well as in progeny. The ALSV-mediated VITGS state was inherited by progeny for multiple generations. In addition, induction of VITGS of an endogenous gene (chalcone synthase-A) was demonstrated in petunia plants infected with an ALSV vector carrying the native promoter sequence. These results suggest that ALSV-based vectors can be applied to study DNA methylation in plant genomes, and provide a useful tool for plant breeding via epigenetic modification. PMID:25426109
Minimal and Contributing Sequence Determinants of the cis-Acting Locus of Transfer (clt) of Streptomycete Plasmid pIJ101 Occur within an Intrinsically Curved Plasmid Region

PubMed Central

Ducote, Matthew J.; Prakash, Shubha; Pettis, Gregg S.

2000-01-01

Efficient interbacterial transfer of streptomycete plasmid pIJ101 requires the pIJ101 tra gene, as well as a cis-acting plasmid function known as clt. Here we show that the minimal pIJ101 clt locus consists of a sequence no greater than 54 bp in size that includes essential inverted-repeat and direct-repeat sequences and is located in close proximity to the 3′ end of the korB regulatory gene. Evidence that sequences extending beyond the minimal locus and into the korB open reading frame influence clt transfer function and demonstration that clt-korB sequences are intrinsically curved raise the possibility that higher-order structuring of DNA and protein within this plasmid region may be an inherent feature of efficient pIJ101 transfer. PMID:11073933
Minimal and contributing sequence determinants of the cis-acting locus of transfer (clt) of streptomycete plasmid pIJ101 occur within an intrinsically curved plasmid region.

PubMed

Ducote, M J; Prakash, S; Pettis, G S

2000-12-01

Efficient interbacterial transfer of streptomycete plasmid pIJ101 requires the pIJ101 tra gene, as well as a cis-acting plasmid function known as clt. Here we show that the minimal pIJ101 clt locus consists of a sequence no greater than 54 bp in size that includes essential inverted-repeat and direct-repeat sequences and is located in close proximity to the 3' end of the korB regulatory gene. Evidence that sequences extending beyond the minimal locus and into the korB open reading frame influence clt transfer function and demonstration that clt-korB sequences are intrinsically curved raise the possibility that higher-order structuring of DNA and protein within this plasmid region may be an inherent feature of efficient pIJ101 transfer.
Baculoviral delivery of CRISPR/Cas9 facilitates efficient genome editing in human cells

PubMed Central

Hindriksen, Sanne; Bramer, Arne J.; Truong, My Anh; Vromans, Martijn J. M.; Post, Jasmin B.; Verlaan-Klink, Ingrid; Snippert, Hugo J.; Lens, Susanne M. A.

2017-01-01

The CRISPR/Cas9 system is a highly effective tool for genome editing. Key to robust genome editing is the efficient delivery of the CRISPR/Cas9 machinery. Viral delivery systems are efficient vehicles for the transduction of foreign genes but commonly used viral vectors suffer from a limited capacity in the genetic information they can carry. Baculovirus however is capable of carrying large exogenous DNA fragments. Here we investigate the use of baculoviral vectors as a delivery vehicle for CRISPR/Cas9 based genome-editing tools. We demonstrate transduction of a panel of cell lines with Cas9 and an sgRNA sequence, which results in efficient knockout of all four targeted subunits of the chromosomal passenger complex (CPC). We further show that introduction of a homology directed repair template into the same CRISPR/Cas9 baculovirus facilitates introduction of specific point mutations and endogenous gene tags. Tagging of the CPC recruitment factor Haspin with the fluorescent reporter YFP allowed us to study its native localization as well as recruitment to the cohesin subunit Pds5B. PMID:28640891
Long-range comparison of human and mouse Sprr loci to identify conserved noncoding sequences involved in coordinate regulation

PubMed Central

Martin, Natalia; Patel, Satyakam; Segre, Julia A.

2004-01-01

Mammalian epidermis provides a permeability barrier between an organism and its environment. Under homeostatic conditions, epidermal cells produce structural proteins, which are cross-linked in an orderly fashion to form a cornified envelope (CE). However, under genetic or environmental stress, specific genes are induced to rapidly build a temporary barrier. Small proline-rich (SPRR) proteins are the primary constituents of the CE. Under stress the entire family of 14 Sprr genes is upregulated. The Sprr genes are clustered within the larger epidermal differentiation complex on mouse chromosome 3, human chromosome 1q21. The clustering of the Sprr genes and their upregulation under stress suggest that these genes may be coordinately regulated. To identify enhancer elements that regulate this stress response activation of the Sprr locus, we utilized bioinformatic tools and classical biochemical dissection. Long-range comparative sequence analysis identified conserved noncoding sequences (CNSs). Clusters of epidermal-specific DNaseI-hypersensitive sites (HSs) mapped to specific CNSs. Increased prevalence of these HSs in barrier-deficient epidermis provides in vivo evidence of the regulation of the Sprr locus by these conserved sequences. Individual components of these HSs were cloned, and one was shown to have strong enhancer activity specific to conditions when the Sprr genes are coordinately upregulated. PMID:15574822

Taxonomically Different Co-Microsymbionts of a Relict Legume, Oxytropis popoviana, Have Complementary Sets of Symbiotic Genes and Together Increase the Efficiency of Plant Nodulation.

PubMed

Safronova, Vera I; Belimov, Andrey A; Sazanova, Anna L; Chirak, Elizaveta R; Verkhozina, Alla V; Kuznetsova, Irina G; Andronov, Evgeny E; Puhalsky, Jan V; Tikhonovich, Igor A

2018-06-20

Ten rhizobial strains were isolated from root nodules of a relict legume Oxytropis popoviana Peschkova. For identification of the isolates, sequencing of rrs, the internal transcribed spacer region, and housekeeping genes recA, glnII, and rpoB was used. Nine fast-growing isolates were Mesorhizobium-related; eight strains were identified as M. japonicum and one isolate belonged to M. kowhaii. The only slow-growing isolate was identified as a Bradyrhizobium sp. Two strains, M. japonicum Opo-242 and Bradyrhizobium sp. strain Opo-243, were isolated from the same nodule. Symbiotic genes of these isolates were searched throughout the whole-genome sequences. The common nodABC genes and other symbiotic genes required for plant nodulation and nitrogen fixation were present in the isolate Opo-242. Strain Opo-243 did not contain the principal nod, nif, and fix genes; however, five genes (nodP, nodQ, nifL, nolK, and noeL) affecting the specificity of plant-rhizobia interactions but absent in isolate Opo-242 were detected. Strain Opo-243 could not induce nodules but significantly accelerated the root nodule formation after coinoculation with isolate Opo-242. Thus, we demonstrated that taxonomically different strains of the archaic symbiotic system can be co-microsymbionts infecting the same nodule and promoting the nodulation process due to complementary sets of symbiotic genes.
Parallel gene analysis with allele-specific padlock probes and tag microarrays

PubMed Central

Banér, Johan; Isaksson, Anders; Waldenström, Erik; Jarvius, Jonas; Landegren, Ulf; Nilsson, Mats

2003-01-01

Parallel, highly specific analysis methods are required to take advantage of the extensive information about DNA sequence variation and of expressed sequences. We present a scalable laboratory technique suitable to analyze numerous target sequences in multiplexed assays. Sets of padlock probes were applied to analyze single nucleotide variation directly in total genomic DNA or cDNA for parallel genotyping or gene expression analysis. All reacted probes were then co-amplified and identified by hybridization to a standard tag oligonucleotide array. The technique was illustrated by analyzing normal and pathogenic variation within the Wilson disease-related ATP7B gene, both at the level of DNA and RNA, using allele-specific padlock probes. PMID:12930977
Fusion primer and nested integrated PCR (FPNI-PCR): a new high-efficiency strategy for rapid chromosome walking or flanking sequence cloning

PubMed Central

2011-01-01

Background The advent of genomics-based technologies has revolutionized many fields of biological enquiry. However, chromosome walking or flanking sequence cloning is still a necessary and important procedure to determining gene structure. Such methods are used to identify T-DNA insertion sites and so are especially relevant for organisms where large T-DNA insertion libraries have been created, such as rice and Arabidopsis. The currently available methods for flanking sequence cloning, including the popular TAIL-PCR technique, are relatively laborious and slow. Results Here, we report a simple and effective fusion primer and nested integrated PCR method (FPNI-PCR) for the identification and cloning of unknown genomic regions flanked known sequences. In brief, a set of universal primers was designed that consisted of various 15-16 base arbitrary degenerate oligonucleotides. These arbitrary degenerate primers were fused to the 3' end of an adaptor oligonucleotide which provided a known sequence without degenerate nucleotides, thereby forming the fusion primers (FPs). These fusion primers are employed in the first step of an integrated nested PCR strategy which defines the overall FPNI-PCR protocol. In order to demonstrate the efficacy of this novel strategy, we have successfully used it to isolate multiple genomic sequences namely, 21 orthologs of genes in various species of Rosaceace, 4 MYB genes of Rosa rugosa, 3 promoters of transcription factors of Petunia hybrida, and 4 flanking sequences of T-DNA insertion sites in transgenic tobacco lines and 6 specific genes from sequenced genome of rice and Arabidopsis. Conclusions The successful amplification of target products through FPNI-PCR verified that this novel strategy is an effective, low cost and simple procedure. Furthermore, FPNI-PCR represents a more sensitive, rapid and accurate technique than the established TAIL-PCR and hiTAIL-PCR procedures. PMID:22093809
Shine-Dalgarno sequence enhances the efficiency of lacZ repression by artificial anti-lac antisense RNAs in Escherichia coli.

PubMed

Stefan, Alessandra; Schwarz, Flavio; Bressanin, Daniela; Hochkoeppler, Alejandro

2010-11-01

Silencing of the lacZ gene in Escherichia coli was attempted by means of the expression of antisense RNAs (asRNAs) in vivo. A short fragment of lacZ was cloned into the pBAD expression vector, in reverse orientation, using the EcoRI and PstI restriction sites. This construct (pBAD-Zcal1) was used to transform E. coli cells, and the antisense transcription was induced simply by adding arabinose to the culture medium. We demonstrated that the Zcal1 asRNA effectively silenced lacZ using β-galactosidase activity determinations, SDS-PAGE, and Western blotting. Because the concentration of the lac mRNA was always high in cells that expressed Zcal1, we hypothesize that this antisense acts by inhibiting messenger translation. Similar analyses, performed with a series of site-specific Zcal1 mutants, showed that the Shine-Dalgarno sequence, which is conferred by the pBAD vector, is an essential requisite for silencing competence. Indeed, the presence of the intact Shine-Dalgarno sequence positively affects asRNA stability and, hence, silencing effectiveness. Our observations will contribute to the understanding of the main determinants of silencing as exerted by asRNAs as well as provide useful support for the design of robust and efficient prokaryotic gene silencers. Copyright © 2010 The Society for Biotechnology, Japan. Published by Elsevier B.V. All rights reserved.
Gene expression promoted by the SV40 DNA targeting sequence and the hypoxia-responsive element under normoxia and hypoxia.

PubMed

Sacramento, C B; Moraes, J Z; Denapolis, P M A; Han, S W

2010-08-01

The main objective of the present study was to find suitable DNA-targeting sequences (DTS) for the construction of plasmid vectors to be used to treat ischemic diseases. The well-known Simian virus 40 nuclear DTS (SV40-DTS) and hypoxia-responsive element (HRE) sequences were used to construct plasmid vectors to express the human vascular endothelial growth factor gene (hVEGF). The rate of plasmid nuclear transport and consequent gene expression under normoxia (20% O2) and hypoxia (less than 5% O2) were determined. Plasmids containing the SV40-DTS or HRE sequences were constructed and used to transfect the A293T cell line (a human embryonic kidney cell line) in vitro and mouse skeletal muscle cells in vivo. Plasmid transport to the nucleus was monitored by real-time PCR, and the expression level of the hVEGF gene was measured by ELISA. The in vitro nuclear transport efficiency of the SV40-DTS plasmid was about 50% lower under hypoxia, while the HRE plasmid was about 50% higher under hypoxia. Quantitation of reporter gene expression in vitro and in vivo, under hypoxia and normoxia, confirmed that the SV40-DTS plasmid functioned better under normoxia, while the HRE plasmid was superior under hypoxia. These results indicate that the efficiency of gene expression by plasmids containing DNA binding sequences is affected by the concentration of oxygen in the medium.
ETS target genes: Identification of Egr1 as a target by RNA differential display and whole genome PCR techniques

PubMed Central

Robinson, Lois; Panayiotakis, Alexandra; Papas, Takis S.; Kola, Ismail; Seth, Arun

1997-01-01

ETS transcription factors play important roles in hematopoiesis, angiogenesis, and organogenesis during murine development. The ETS genes also have a role in neoplasia, for example in Ewing’s sarcomas and retrovirally induced cancers. The ETS genes encode transcription factors that bind to specific DNA sequences and activate transcription of various cellular and viral genes. To isolate novel ETS target genes, we used two approaches. In the first approach, we isolated genes by the RNA differential display technique. Previously, we have shown that the overexpression of ETS1 and ETS2 genes effects transformation of NIH 3T3 cells and specific transformants produce high levels of the ETS proteins. To isolate ETS1 and ETS2 responsive genes in these transformed cells, we prepared RNA from ETS1, ETS2 transformants, and normal NIH 3T3 cell lines and converted it into cDNA. This cDNA was amplified by PCR and displayed on sequencing gels. The differentially displayed bands were subcloned into plasmid vectors. By Northern blot analysis, several clones showed differential patterns of mRNA expression in the NIH 3T3-, ETS1-, and ETS2-expressing cell lines. Sixteen clones were analyzed by DNA sequence analysis, and 13 of them appeared to be unique because their DNA sequences did not match with any of the known genes present in the gene bank. Three known genes were found to be identical to the CArG box binding factor, phospholipase A2-activating protein, and early growth response 1 (Egr1) genes. In the second approach, to isolate ETS target promoters directly, we performed ETS1 binding with MboI-cleaved genomic DNA in the presence of a specific mAb followed by whole genome PCR. The immune complex-bound ETS binding sites containing DNA fragments were amplified and subcloned into pBluescript and subjected to DNA sequence and computer analysis. We found that, of a large number of clones isolated, 43 represented unique sequences not previously identified. Three clones turned out to contain regulatory sequences derived from human serglycin, preproapolipoprotein C II, and Egr1 genes. The ETS binding sites derived from these three regulatory sequences showed specific binding with recombinant ETS proteins. Of interest, Egr1 was identified by both of these techniques, suggesting strongly that it is indeed an ETS target gene. PMID:9207063
Rapid and accurate synthesis of TALE genes from synthetic oligonucleotides.

PubMed

Wang, Fenghua; Zhang, Hefei; Gao, Jingxia; Chen, Fengjiao; Chen, Sijie; Zhang, Cuizhen; Peng, Gang

2016-01-01

Custom synthesis of transcription activator-like effector (TALE) genes has relied upon plasmid libraries of pre-fabricated TALE-repeat monomers or oligomers. Here we describe a novel synthesis method that directly incorporates annealed synthetic oligonucleotides into the TALE-repeat units. Our approach utilizes iterative sets of oligonucleotides and a translational frame check strategy to ensure the high efficiency and accuracy of TALE-gene synthesis. TALE arrays of more than 20 repeats can be constructed, and the majority of the synthesized constructs have perfect sequences. In addition, this novel oligonucleotide-based method can readily accommodate design changes to the TALE repeats. We demonstrated an increased gene targeting efficiency against a genomic site containing a potentially methylated cytosine by incorporating non-conventional repeat variable di-residue (RVD) sequences.
Detection of nucleotide-specific CRISPR/Cas9 modified alleles using multiplex ligation detection

PubMed Central

KC, R.; Srivastava, A.; Wilkowski, J. M.; Richter, C. E.; Shavit, J. A.; Burke, D. T.; Bielas, S. L.

2016-01-01

CRISPR/Cas9 genome-editing has emerged as a powerful tool to create mutant alleles in model organisms. However, the precision with which these mutations are created has introduced a new set of complications for genotyping and colony management. Traditional gene-targeting approaches in many experimental organisms incorporated exogenous DNA and/or allele specific sequence that allow for genotyping strategies based on binary readout of PCR product amplification and size selection. In contrast, alleles created by non-homologous end-joining (NHEJ) repair of double-stranded DNA breaks generated by Cas9 are much less amenable to such strategies. Here we describe a novel genotyping strategy that is cost effective, sequence specific and allows for accurate and efficient multiplexing of small insertion-deletions and single-nucleotide variants characteristic of CRISPR/Cas9 edited alleles. We show that ligation detection reaction (LDR) can be used to generate products that are sequence specific and uniquely detected by product size and/or fluorescent tags. The method works independently of the model organism and will be useful for colony management as mutant alleles differing by a few nucleotides become more prevalent in experimental animal colonies. PMID:27557703
Phage display vectors for in vivo recombination of immunoglobulin heavy and light chain genes to make large combinatorial libraries.

PubMed

Tsurushita, N; Fu, H; Warren, C

1996-06-12

New phage display vectors for in vivo recombination of immunoglobulin (Ig) heavy (VH) and light (VL) chain variable genes, to make single-chain Fv fragments (scFv), were constructed. The VH and VL genes of monoclonal antibody (mAb) EP-5C7, which binds to both human E- and P-selectin, were cloned into a pUC19-derived plasmid vector, pCW93, and a pACYC184-derived phagemid vector, pCW99, respectively. Upon induction of Cre recombinase (phage P1 recombinase), the VH and VL genes were efficiently recombined into the same plasmid via the two loxP sites (phage P1 recombination sites), one located downstream from a VH gene in pCW93 and another upstream from a VL gene in pCW99. In the resulting phagemid, the loxP sequence also encodes a polypeptide linker connecting the VH and VL domains to form a scFv of EP-5C7. Whether expressed on the phage surface or as a soluble form, the EP-5C7 scFv showed specific binding to human E- and P-selectin. This phagemid vector system provides a way to recombine VH and VL gene libraries efficiently in vivo to make extremely large Ig combinatorial libraries.
Acquisition and evolution of plant pathogenesis-associated gene clusters and candidate determinants of tissue-specificity in xanthomonas.

PubMed

Lu, Hong; Patil, Prabhu; Van Sluys, Marie-Anne; White, Frank F; Ryan, Robert P; Dow, J Maxwell; Rabinowicz, Pablo; Salzberg, Steven L; Leach, Jan E; Sonti, Ramesh; Brendel, Volker; Bogdanove, Adam J

2008-01-01

Xanthomonas is a large genus of plant-associated and plant-pathogenic bacteria. Collectively, members cause diseases on over 392 plant species. Individually, they exhibit marked host- and tissue-specificity. The determinants of this specificity are unknown. To assess potential contributions to host- and tissue-specificity, pathogenesis-associated gene clusters were compared across genomes of eight Xanthomonas strains representing vascular or non-vascular pathogens of rice, brassicas, pepper and tomato, and citrus. The gum cluster for extracellular polysaccharide is conserved except for gumN and sequences downstream. The xcs and xps clusters for type II secretion are conserved, except in the rice pathogens, in which xcs is missing. In the otherwise conserved hrp cluster, sequences flanking the core genes for type III secretion vary with respect to insertion sequence element and putative effector gene content. Variation at the rpf (regulation of pathogenicity factors) cluster is more pronounced, though genes with established functional relevance are conserved. A cluster for synthesis of lipopolysaccharide varies highly, suggesting multiple horizontal gene transfers and reassortments, but this variation does not correlate with host- or tissue-specificity. Phylogenetic trees based on amino acid alignments of gum, xps, xcs, hrp, and rpf cluster products generally reflect strain phylogeny. However, amino acid residues at four positions correlate with tissue specificity, revealing hpaA and xpsD as candidate determinants. Examination of genome sequences of xanthomonads Xylella fastidiosa and Stenotrophomonas maltophilia revealed that the hrp, gum, and xcs clusters are recent acquisitions in the Xanthomonas lineage. Our results provide insight into the ancestral Xanthomonas genome and indicate that differentiation with respect to host- and tissue-specificity involved not major modifications or wholesale exchange of clusters, but subtle changes in a small number of genes or in non-coding sequences, and/or differences outside the clusters, potentially among regulatory targets or secretory substrates.
Genome Editing in Mice Using TALE Nucleases.

PubMed

Wefers, Benedikt; Brandl, Christina; Ortiz, Oskar; Wurst, Wolfgang; Kühn, Ralf

2016-01-01

Gene engineering for generating targeted mouse mutants is a key technology for biomedical research. Using TALENs as sequence-specific nucleases to induce targeted double-strand breaks, the mouse genome can be directly modified in zygotes in a single step without the need for embryonic stem cells. By embryo microinjection of TALEN mRNAs and targeting vectors, knockout and knock-in alleles can be generated fast and efficiently. In this chapter we provide protocols for the application of TALENs in mouse zygotes.
Mycoplasma CG- and GATC-specific DNA methyltransferases selectively and efficiently methylate the host genome and alter the epigenetic landscape in human cells

PubMed Central

Chernov, Andrei V; Reyes, Leticia; Xu, Zhenkang; Gonzalez, Beatriz; Golovko, Georgiy; Peterson, Scott; Perucho, Manuel; Fofanov, Yuriy; Strongin, Alex Y

2015-01-01

Aberrant DNA methylation is frequently observed in disease, including many cancer types, yet the underlying mechanisms remain unclear. Because germline and somatic mutations in the genes that are responsible for DNA methylation are infrequent in malignancies, additional mechanisms must be considered. Mycoplasmas spp., including Mycoplasma hyorhinis, efficiently colonize human cells and may serve as a vehicle for delivery of enzymatically active microbial proteins into the intracellular milieu. Here, we performed, for the first time, genome-wide and individual gene mapping of methylation marks generated by the M. hyorhinis CG- and GATC-specific DNA cytosine methyltransferases (MTases) in human cells. Our results demonstrated that, upon expression in human cells, MTases readily translocated to the cell nucleus. In the nucleus, MTases selectively and efficiently methylated the host genome at the DNA sequence sites free from pre-existing endogenous methylation, including those in a variety of cancer-associated genes. We also established that mycoplasma is widespread in colorectal cancers, suggesting that either the infection contributed to malignancy onset or, alternatively, that tumors provide a favorable environment for mycoplasma growth. In the human genome, ∼11% of GATC sites overlap with CGs (e.g., CGATmCG); therefore, the methylated status of these sites can be perpetuated by human DNMT1. Based on these results, we now suggest that the GATC-specific methylation represents a novel type of infection-specific epigenetic mark that originates in human cells with a previous exposure to infection. Overall, our findings unveil an entirely new panorama of interactions between the human microbiome and epigenome with a potential impact in disease etiology. PMID:25695131
Flanking sequence determination and event-specific detection of genetically modified wheat B73-6-1.

PubMed

Xu, Junyi; Cao, Jijuan; Cao, Dongmei; Zhao, Tongtong; Huang, Xin; Zhang, Piqiao; Luan, Fengxia

2013-05-01

In order to establish a specific identification method for genetically modified (GM) wheat, exogenous insert DNA and flanking sequence between exogenous fragment and recombinant chromosome of GM wheat B73-6-1 were successfully acquired by means of conventional polymerase chain reaction (PCR) and thermal asymmetric interlaced (TAIL)-PCR strategies. Newly acquired exogenous fragment covered the full-length sequence of transformed genes such as transformed plasmid and corresponding functional genes including marker uidA, herbicide-resistant bar, ubiquitin promoter, and high-molecular-weight gluten subunit. The flanking sequence between insert DNA revealed high similarity with Triticum turgidum A gene (GenBank: AY494981.1). A specific PCR detection method for GM wheat B73-6-1 was established on the basis of primers designed according to the flanking sequence. This specific PCR method was validated by GM wheat, GM corn, GM soybean, GM rice, and non-GM wheat. The specifically amplified target band was observed only in GM wheat B73-6-1. This method is of high specificity, high reproducibility, rapid identification, and excellent accuracy for the identification of GM wheat B73-6-1.
Epstein-Barr Virus Sequence Variation—Biology and Disease

PubMed Central

Tzellos, Stelios; Farrell, Paul J.

2012-01-01

Some key questions in Epstein-Barr virus (EBV) biology center on whether naturally occurring sequence differences in the virus affect infection or EBV associated diseases. Understanding the pattern of EBV sequence variation is also important for possible development of EBV vaccines. At present EBV isolates worldwide can be grouped into Type 1 and Type 2, a classification based on the EBNA2 gene sequence. Type 1 EBV is the most prevalent worldwide but Type 2 is common in parts of Africa. Type 1 transforms human B cells into lymphoblastoid cell lines much more efficiently than Type 2 EBV. Molecular mechanisms that may account for this difference in cell transformation are now becoming clearer. Advances in sequencing technology will greatly increase the amount of whole EBV genome data for EBV isolated from different parts of the world. Study of regional variation of EBV strains independent of the Type 1/Type 2 classification and systematic investigation of the relationship between viral strains, infection and disease will become possible. The recent discovery that specific mutation of the EBV EBNA3B gene may be linked to development of diffuse large B cell lymphoma illustrates the importance that mutations in the virus genome may have in infection and human disease. PMID:25436768
Accurate Sample Assignment in a Multiplexed, Ultrasensitive, High-Throughput Sequencing Assay for Minimal Residual Disease.

PubMed

Bartram, Jack; Mountjoy, Edward; Brooks, Tony; Hancock, Jeremy; Williamson, Helen; Wright, Gary; Moppett, John; Goulden, Nick; Hubank, Mike

2016-07-01

High-throughput sequencing (HTS) (next-generation sequencing) of the rearranged Ig and T-cell receptor genes promises to be less expensive and more sensitive than current methods of monitoring minimal residual disease (MRD) in patients with acute lymphoblastic leukemia. However, the adoption of new approaches by clinical laboratories requires careful evaluation of all potential sources of error and the development of strategies to ensure the highest accuracy. Timely and efficient clinical use of HTS platforms will depend on combining multiple samples (multiplexing) in each sequencing run. Here we examine the Ig heavy-chain gene HTS on the Illumina MiSeq platform for MRD. We identify errors associated with multiplexing that could potentially impact the accuracy of MRD analysis. We optimize a strategy that combines high-purity, sequence-optimized oligonucleotides, dual indexing, and an error-aware demultiplexing approach to minimize errors and maximize sensitivity. We present a probability-based, demultiplexing pipeline Error-Aware Demultiplexer that is suitable for all MiSeq strategies and accurately assigns samples to the correct identifier without excessive loss of data. Finally, using controls quantified by digital PCR, we show that HTS-MRD can accurately detect as few as 1 in 10(6) copies of specific leukemic MRD. Crown Copyright © 2016. Published by Elsevier Inc. All rights reserved.
Abundance and Genetic Diversity of Microbial Polygalacturonase and Pectate Lyase in the Sheep Rumen Ecosystem

PubMed Central

Wang, Yaru; Luo, Huiying; Huang, Huoqing; Shi, Pengjun; Bai, Yingguo; Yang, Peilong; Yao, Bin

2012-01-01

Background Efficient degradation of pectin in the rumen is necessary for plant-based feed utilization. The objective of this study was to characterize the diversity, abundance, and functions of pectinases from microorganisms in the sheep rumen. Methodology/Principal Findings A total of 103 unique fragments of polygalacturonase (PF00295) and pectate lyase (PF00544 and PF09492) genes were retrieved from microbial DNA in the rumen of a Small Tail Han sheep, and 66% of the sequences of these fragments had low identities (<65%) with known sequences. Phylogenetic tree building separated the PF00295, PF00544, and PF09492 sequences into five, three, and three clades, respectively. Cellulolytic and noncellulolytic Butyrivibrio, Prevotella, and Fibrobacter species were the major sources of the pectinases. The two most abundant pectate lyase genes were cloned, and their protein products, expressed in Escherichia coli, were characterized. Both enzymes probably act extracellularly as their nucleotide sequences contained signal sequences, and they had optimal activities at the ruminal physiological temperature and complementary pH-dependent activity profiles. Conclusion/Significance This study reveals the specificity, diversity, and abundance of pectinases in the rumen ecosystem and provides two additional ruminal pectinases for potential industrial use under physiological conditions. PMID:22815874
Isolation and characterization of a water stress-specific genomic gene, pwsi 18, from rice.

PubMed

Joshee, N; Kisaka, H; Kitagawa, Y

1998-01-01

One of the water stress-specific cDNA clones of rice characterised previously, wsi18, was selected for further study. The wsi18 gene can be induced by water stress conditions such as mannitol, NaCl, and dryness, but not by ABA, cold, or heat. A genomic clone for wsi18, pwsi18, contained about 1.7 kbp of the 5' upstream sequence, two introns, and the full coding sequence. The 5'-upstream sequence of pwsi18 contained putative cis-acting elements, namely an ABA-responsive element (ABRE), three G-boxes, three E-boxes, a MEF-2 sequence, four direct and two inverted repeats, and four sequences similar to DRE, which is involved in the dehydration response of Arabidopsis genes. The gusA reporter gene under the control of the pwsi18 promoter showed transient expression in response to water stress. Deletion of the downstream DRE-like sequence between the distal G-boxes-2 and -3 resulted in rather low GUS expression.
Factors affecting expression of the recF gene of Escherichia coli K-12.

PubMed

Sandler, S J; Clark, A J

1990-01-31

This report describes four factors which affect expression of the recF gene from strong upstream lambda promoters under temperature-sensitive cIAt2-encoded repressor control. The first factor was the long mRNA leader sequence consisting of the Escherichia coli dnaN gene and 95% of the dnaA gene and lambda bet, N (double amber) and 40% of the exo gene. When most of this DNA was deleted, RecF became detectable in maxicells. The second factor was the vector, pBEU28, a runaway replication plasmid. When we substituted pUC118 for pBEU28, RecF became detectable in whole cells by the Coomassie blue staining technique. The third factor was the efficiency of initiation of translation. We used site-directed mutagenesis to change the mRNA leader, ribosome-binding site and the 3 bp before and after the translational start codon. Monitoring the effect of these mutational changes by translational fusion to lacZ, we discovered that the efficiency of initiation of translation was increased 30-fold. Only an estimated two- or threefold increase in accumulated levels of RecF occurred, however. This led us to discover the fourth factor, namely sequences in the recF gene itself. These sequences reduce expression of the recF-lacZ fusion genes 100-fold. The sequences responsible for this decrease in expression occur in four regions in the N-terminal half of recF. Expression is reduced by some sequences at the transcriptional level and by others at the translational level.
Integration of Myeloblastosis Associated Virus proviral sequences occurs in the vicinity of genes encoding signaling proteins and regulators of cell proliferation

PubMed Central

Li, Chang Long; Coullin, Philippe; Bernheim, Alain; Joliot, Véronique; Auffray, Charles; Zoroob, Rima; Perbal, Bernard

2006-01-01

Aims Myeloblastosis Associated Virus type 1 (N) [MAV 1(N)] induces specifically nephroblastomas in 8–10 weeks when injected to newborn chicken. The MAV-induced nephroblastomas constitute a unique animal model of the pediatric Wilms' tumor. We have made use of three independent nephroblastomas that represent increasing tumor grades, to identify the host DNA regions in which MAV proviral sequences were integrated. METHODS Cellular sequences localized next to MAV-integration sites in the tumor DNAs were used to screen a Bacterial Artificial Chromosomes (BACs) library and isolate BACs containing about 150 kilobases of normal DNA corresponding to MAV integration regions (MIRs). These BACs were mapped on the chicken chromosomes by Fluorescent In Situ Hybridization (FISH) and used for molecular studies. Results The different MAV integration sites that were conserved after tumor cell selection identify genes involved in the control of cell signaling and proliferation. Syntenic fragments in human DNA contain genes whose products have been involved in normal and pathological kidney development, and several oncogenes responsible for tumorigenesis in human. Conclusion The identification of putative target genes for MAV provides important clues for the understanding of the MAV pathogenic potential. These studies identified ADAMTS1 as a gene upregulated in MAV-induced nephroblastoma and established that ccn3/nov is not a preferential site of integration for MAV as previously thought. The present results support our hypothesis that the highly efficient and specific MAV-induced tumorigenesis results from the alteration of multiple target genes in differentiating blastemal cells, some of which are required for the progression to highly aggressive stages. This study reinforces our previous conclusions that the MAV-induced nephroblastoma constitutes an excellent model in which to characterize new potential oncogenes and tumor suppressors involved in the establishment and maintenance of tumors. PMID:16403231
Genomic Characterization of Phenylalanine Ammonia Lyase Gene in Buckwheat

PubMed Central

Thiyagarajan, Karthikeyan; Vitali, Fabio; Tolaini, Valentina; Galeffi, Patrizia; Cantale, Cristina; Vikram, Prashant; Singh, Sukhwinder; De Rossi, Patrizia; Nobili, Chiara; Procacci, Silvia; Del Fiore, Antonella; Antonini, Alessandro; Presenti, Ombretta; Brunori, Andrea

2016-01-01

Phenylalanine Ammonia Lyase (PAL) gene which plays a key role in bio-synthesis of medicinally important compounds, Rutin/quercetin was sequence characterized for its efficient genomics application. These compounds possessing anti-diabetic and anti-cancer properties and are predominantly produced by Fagopyrum spp. In the present study, PAL gene was sequenced from three Fagopyrum spp. (F. tataricum, F. esculentum and F. dibotrys) and showed the presence of three SNPs and four insertion/deletions at intra and inter specific level. Among them, the potential SNP (position 949th bp G>C) with Parsimony Informative Site was selected and successfully utilised to individuate the zygosity/allelic variation of 16 F. tataricum varieties. Insertion mutations were identified in coding region, which resulted the change of a stretch of 39 amino acids on the putative protein. Our Study revealed that autogamous species (F. tataricum) has lower frequency of observed SNPs as compared to allogamous species (F. dibotrys and F. esculentum). The identified SNPs in F. tataricum didn’t result to amino acid change, while in other two species it caused both conservative and non-conservative variations. Consistent pattern of SNPs across the species revealed their phylogenetic importance. We found two groups of F. tataricum and one of them was closely related with F. dibotrys. Sequence characterization information of PAL gene reported in present investigation can be utilized in genetic improvement of buckwheat in reference to its medicinal value. PMID:26990297

Loci and candidate genes conferring resistance to soybean cyst nematode HG type 2.5.7.

PubMed

Zhao, Xue; Teng, Weili; Li, Yinghui; Liu, Dongyuan; Cao, Guanglu; Li, Dongmei; Qiu, Lijuan; Zheng, Hongkun; Han, Yingpeng; Li, Wenbin

2017-06-14

Soybean (Glycine max L. Merr.) cyst nematode (SCN, Heterodera glycines I,) is a major pest of soybean worldwide. The most effective strategy to control this pest involves the use of resistant cultivars. The aim of the present study was to investigate the genome-wide genetic architecture of resistance to SCN HG Type 2.5.7 (race 1) in landrace and elite cultivated soybeans. A total of 200 diverse soybean accessions were screened for resistance to SCN HG Type 2.5.7 and genotyped through sequencing using the Specific Locus Amplified Fragment Sequencing (SLAF-seq) approach with a 6.14-fold average sequencing depth. A total of 33,194 SNPs were identified with minor allele frequencies (MAF) over 4%, covering 97% of all the genotypes. Genome-wide association mapping (GWAS) revealed thirteen SNPs associated with resistance to SCN HG Type 2.5.7. These SNPs were distributed on five chromosomes (Chr), including Chr7, 8, 14, 15 and 18. Four SNPs were novel resistance loci and nine SNPs were located near known QTL. A total of 30 genes were identified as candidate genes underlying SCN resistance. A total of sixteen novel soybean accessions were identified with significant resistance to HG Type 2.5.7. The beneficial alleles and candidate genes identified by GWAS might be valuable for improving marker-assisted breeding efficiency and exploring the molecular mechanisms underlying SCN resistance.
EXONSAMPLER: a computer program for genome-wide and candidate gene exon sampling for targeted next-generation sequencing.

PubMed

Cosart, Ted; Beja-Pereira, Albano; Luikart, Gordon

2014-11-01

The computer program EXONSAMPLER automates the sampling of thousands of exon sequences from publicly available reference genome sequences and gene annotation databases. It was designed to provide exon sequences for the efficient, next-generation gene sequencing method called exon capture. The exon sequences can be sampled by a list of gene name abbreviations (e.g. IFNG, TLR1), or by sampling exons from genes spaced evenly across chromosomes. It provides a list of genomic coordinates (a bed file), as well as a set of sequences in fasta format. User-adjustable parameters for collecting exon sequences include a minimum and maximum acceptable exon length, maximum number of exonic base pairs (bp) to sample per gene, and maximum total bp for the entire collection. It allows for partial sampling of very large exons. It can preferentially sample upstream (5 prime) exons, downstream (3 prime) exons, both external exons, or all internal exons. It is written in the Python programming language using its free libraries. We describe the use of EXONSAMPLER to collect exon sequences from the domestic cow (Bos taurus) genome for the design of an exon-capture microarray to sequence exons from related species, including the zebu cow and wild bison. We collected ~10% of the exome (~3 million bp), including 155 candidate genes, and ~16,000 exons evenly spaced genomewide. We prioritized the collection of 5 prime exons to facilitate discovery and genotyping of SNPs near upstream gene regulatory DNA sequences, which control gene expression and are often under natural selection. © 2014 John Wiley & Sons Ltd.
In silico comparison of genomic regions containing genes coding for enzymes and transcription factors for the phenylpropanoid pathway in Phaseolus vulgaris L. and Glycine max L. Merr

PubMed Central

Reinprecht, Yarmilla; Yadegari, Zeinab; Perry, Gregory E.; Siddiqua, Mahbuba; Wright, Lori C.; McClean, Phillip E.; Pauls, K. Peter

2013-01-01

Legumes contain a variety of phytochemicals derived from the phenylpropanoid pathway that have important effects on human health as well as seed coat color, plant disease resistance and nodulation. However, the information about the genes involved in this important pathway is fragmentary in common bean (Phaseolus vulgaris L.). The objectives of this research were to isolate genes that function in and control the phenylpropanoid pathway in common bean, determine their genomic locations in silico in common bean and soybean, and analyze sequences of the 4CL gene family in two common bean genotypes. Sequences of phenylpropanoid pathway genes available for common bean or other plant species were aligned, and the conserved regions were used to design sequence-specific primers. The PCR products were cloned and sequenced and the gene sequences along with common bean gene-based (g) markers were BLASTed against the Glycine max v.1.0 genome and the P. vulgaris v.1.0 (Andean) early release genome. In addition, gene sequences were BLASTed against the OAC Rex (Mesoamerican) genome sequence assembly. In total, fragments of 46 structural and regulatory phenylpropanoid pathway genes were characterized in this way and placed in silico on common bean and soybean sequence maps. The maps contain over 250 common bean g and SSR (simple sequence repeat) markers and identify the positions of more than 60 additional phenylpropanoid pathway gene sequences, plus the putative locations of seed coat color genes. The majority of cloned phenylpropanoid pathway gene sequences were mapped to one location in the common bean genome but had two positions in soybean. The comparison of the genomic maps confirmed previous studies, which show that common bean and soybean share genomic regions, including those containing phenylpropanoid pathway gene sequences, with conserved synteny. Indels identified in the comparison of Andean and Mesoamerican common bean 4CL gene sequences might be used to develop inter-pool phenylpropanoid pathway gene-based markers. We anticipate that the information obtained by this study will simplify and accelerate selections of common bean with specific phenylpropanoid pathway alleles to increase the contents of beneficial phenylpropanoids in common bean and other legumes. PMID:24046770
An efficient strategy for producing a stable, replaceable, highly efficient transgene expression system in silkworm, Bombyx mori

PubMed Central

Long, Dingpei; Lu, Weijian; Zhang, Yuli; Bi, Lihui; Xiang, Zhonghuai; Zhao, Aichun

2015-01-01

We developed an efficient strategy that combines a method for the post-integration elimination of all transposon sequences, a site-specific recombination system, and an optimized fibroin H-chain expression system to produce a stable, replaceable, highly efficient transgene expression system in the silkworm (Bombyx mori) that overcomes the disadvantages of random insertion and post-integration instability of transposons. Here, we generated four different transgenic silkworm strains, and of one the transgenic strains, designated TS1-RgG2, with up to 16% (w/w) of the target protein in the cocoons, was selected. The subsequent elimination of all the transposon sequences from TS1-RgG2 was completed by the heat-shock-induced expression of the transposase in vivo. The resulting transgenic silkworm strain was designated TS3-g2 and contained only the attP-flanked optimized fibroin H-chain expression cassette in its genome. A phiC31/att-system-based recombinase-mediated cassette exchange (RMCE) method could be used to integrate other genes of interest into the same genome locus between the attP sites in TS3-g2. Controlling for position effects with phiC31-mediated RMCE will also allow the optimization of exogenous protein expression and fine gene function analyses in the silkworm. The strategy developed here is also applicable to other lepidopteran insects, to improve the ecological safety of transgenic strains in biocontrol programs. PMID:25739894
Site-specific DNA excision in transgenic rice with a cell-permeable cre recombinase.

PubMed

Cao, Ming-Xia; Huang, Jian-Qiu; Yao, Quan-Hong; Liu, Sheng-Jun; Wang, Cheng-Long; Wei, Zhi-Ming

2006-01-01

The removal of selected marker genes from transgenic plants is necessary to address biosafety concerns and to carry out further experiments with transgenic organisms. In the present study, the 12-amino-acid membrane translocation sequence (MTS) from the Kaposi fibroblast growth factor (FGF)-4 was used as a carrier to deliver enzymatically active Cre proteins into living plant cells, and to produce a site-specific DNA excision in transgenic rice plants. The process, which made cells permeable to Cre recombinase-mediated DNA recombination, circumvented the need to express Cre under spatiotemporal control and was proved to be a simple and efficient system to achieve marker-free transgenic plants. The ultimate aim of the present study is to develop commercial rice cultivars free from selected marker genes to hasten public acceptance of transgenic crops.
Identification of an ancestral resistance gene cluster involved in the coevolution process between Phaseolus vulgaris and its fungal pathogen Colletotrichum lindemuthianum.

PubMed

Geffroy, V; Sicard, D; de Oliveira, J C; Sévignac, M; Cohen, S; Gepts, P; Neema, C; Langin, T; Dron, M

1999-09-01

The recent cloning of plant resistance (R) genes and the sequencing of resistance gene clusters have shed light on the molecular evolution of R genes. However, up to now, no attempt has been made to correlate this molecular evolution with the host-pathogen coevolution process at the population level. Cross-inoculations were carried out between 26 strains of the fungal pathogen Colletotrichum lindemuthianum and 48 Phaseolus vulgaris plants collected in the three centers of diversity of the host species. A high level of diversity for resistance against the pathogen was revealed. Most of the resistance specificities were overcome in sympatric situations, indicating an adaptation of the pathogen to the local host. In contrast, plants were generally resistant to allopatric strains, suggesting that R genes that were efficient against exotic strains but had been overcome locally were maintained in the plant genome. These results indicated that coevolution processes between the two protagonists led to a differentiation for resistance in the three centers of diversity of the host. To improve our understanding of the molecular evolution of these different specificities, a recombinant inbred (RI) population derived from two representative genotypes of the Andean (JaloEEP558) and Mesoamerican (BAT93) gene pools was used to map anthracnose specificities. A gene cluster comprising both Andean (Co-y; Co-z) and Mesoamerican (Co-9) host resistance specificities was identified, suggesting that this locus existed prior to the separation of the two major gene pools of P. vulgaris. Molecular analysis revealed a high level of complexity at this locus. It harbors 11 restriction fragment length polymorphisms when R gene analog (RGA) clones are used. The relationship between the coevolution process and diversification of resistance specificities at resistance gene clusters is discussed.
Efficient Generation of Myostatin Knock-Out Sheep Using CRISPR/Cas9 Technology and Microinjection into Zygotes.

PubMed

Crispo, M; Mulet, A P; Tesson, L; Barrera, N; Cuadro, F; dos Santos-Neto, P C; Nguyen, T H; Crénéguy, A; Brusselle, L; Anegón, I; Menchaca, A

2015-01-01

While CRISPR/Cas9 technology has proven to be a valuable system to generate gene-targeted modified animals in several species, this tool has been scarcely reported in farm animals. Myostatin is encoded by MSTN gene involved in the inhibition of muscle differentiation and growth. We determined the efficiency of the CRISPR/Cas9 system to edit MSTN in sheep and generate knock-out (KO) animals with the aim to promote muscle development and body growth. We generated CRISPR/Cas9 mRNAs specific for ovine MSTN and microinjected them into the cytoplasm of ovine zygotes. When embryo development of CRISPR/Cas9 microinjected zygotes (n = 216) was compared with buffer injected embryos (n = 183) and non microinjected embryos (n = 173), cleavage rate was lower for both microinjected groups (P<0.05) and neither was affected by CRISPR/Cas9 content in the injected medium. Embryo development to blastocyst was not affected by microinjection and was similar among the experimental groups. From 20 embryos analyzed by Sanger sequencing, ten were mutant (heterozygous or mosaic; 50% efficiency). To obtain live MSTN KO lambs, 53 blastocysts produced after zygote CRISPR/Cas9 microinjection were transferred to 29 recipient females resulting in 65.5% (19/29) of pregnant ewes and 41.5% (22/53) of newborns. From 22 born lambs analyzed by T7EI and Sanger sequencing, ten showed indel mutations at MSTN gene. Eight showed mutations in both alleles and five of them were homozygous for indels generating out-of frame mutations that resulted in premature stop codons. Western blot analysis of homozygous KO founders confirmed the absence of myostatin, showing heavier body weight than wild type counterparts. In conclusion, our results demonstrate that CRISPR/Cas9 system was a very efficient tool to generate gene KO sheep. This technology is quick and easy to perform and less expensive than previous techniques, and can be applied to obtain genetically modified animal models of interest for biomedicine and livestock.
High-sensitivity HLA typing by Saturated Tiling Capture Sequencing (STC-Seq).

PubMed

Jiao, Yang; Li, Ran; Wu, Chao; Ding, Yibin; Liu, Yanning; Jia, Danmei; Wang, Lifeng; Xu, Xiang; Zhu, Jing; Zheng, Min; Jia, Junling

2018-01-15

Highly polymorphic human leukocyte antigen (HLA) genes are responsible for fine-tuning the adaptive immune system. High-resolution HLA typing is important for the treatment of autoimmune and infectious diseases. Additionally, it is routinely performed for identifying matched donors in transplantation medicine. Although many HLA typing approaches have been developed, the complexity, low-efficiency and high-cost of current HLA-typing assays limit their application in population-based high-throughput HLA typing for donors, which is required for creating large-scale databases for transplantation and precision medicine. Here, we present a cost-efficient Saturated Tiling Capture Sequencing (STC-Seq) approach to capturing 14 HLA class I and II genes. The highly efficient capture (an approximately 23,000-fold enrichment) of these genes allows for simplified allele calling. Tests on five genes (HLA-A/B/C/DRB1/DQB1) from 31 human samples and 351 datasets using STC-Seq showed results that were 98% consistent with the known two sets of digitals (field1 and field2) genotypes. Additionally, STC can capture genomic DNA fragments longer than 3 kb from HLA loci, making the library compatible with the third-generation sequencing. STC-Seq is a highly accurate and cost-efficient method for HLA typing which can be used to facilitate the establishment of population-based HLA databases for the precision and transplantation medicine.
Efficient Exploration of the Space of Reconciled Gene Trees

PubMed Central

Szöllősi, Gergely J.; Rosikiewicz, Wojciech; Boussau, Bastien; Tannier, Eric; Daubin, Vincent

2013-01-01

Gene trees record the combination of gene-level events, such as duplication, transfer and loss (DTL), and species-level events, such as speciation and extinction. Gene tree–species tree reconciliation methods model these processes by drawing gene trees into the species tree using a series of gene and species-level events. The reconstruction of gene trees based on sequence alone almost always involves choosing between statistically equivalent or weakly distinguishable relationships that could be much better resolved based on a putative species tree. To exploit this potential for accurate reconstruction of gene trees, the space of reconciled gene trees must be explored according to a joint model of sequence evolution and gene tree–species tree reconciliation. Here we present amalgamated likelihood estimation (ALE), a probabilistic approach to exhaustively explore all reconciled gene trees that can be amalgamated as a combination of clades observed in a sample of gene trees. We implement the ALE approach in the context of a reconciliation model (Szöllősi et al. 2013), which allows for the DTL of genes. We use ALE to efficiently approximate the sum of the joint likelihood over amalgamations and to find the reconciled gene tree that maximizes the joint likelihood among all such trees. We demonstrate using simulations that gene trees reconstructed using the joint likelihood are substantially more accurate than those reconstructed using sequence alone. Using realistic gene tree topologies, branch lengths, and alignment sizes, we demonstrate that ALE produces more accurate gene trees even if the model of sequence evolution is greatly simplified. Finally, examining 1099 gene families from 36 cyanobacterial genomes we find that joint likelihood-based inference results in a striking reduction in apparent phylogenetic discord, with respectively. 24%, 59%, and 46% reductions in the mean numbers of duplications, transfers, and losses per gene family. The open source implementation of ALE is available from https://github.com/ssolo/ALE.git. [amalgamation; gene tree reconciliation; gene tree reconstruction; lateral gene transfer; phylogeny.] PMID:23925510
Comparative description of ten transcriptomes of newly sequenced invertebrates and efficiency estimation of genomic sampling in non-model taxa

PubMed Central

2012-01-01

Introduction Traditionally, genomic or transcriptomic data have been restricted to a few model or emerging model organisms, and to a handful of species of medical and/or environmental importance. Next-generation sequencing techniques have the capability of yielding massive amounts of gene sequence data for virtually any species at a modest cost. Here we provide a comparative analysis of de novo assembled transcriptomic data for ten non-model species of previously understudied animal taxa. Results cDNA libraries of ten species belonging to five animal phyla (2 Annelida [including Sipuncula], 2 Arthropoda, 2 Mollusca, 2 Nemertea, and 2 Porifera) were sequenced in different batches with an Illumina Genome Analyzer II (read length 100 or 150 bp), rendering between ca. 25 and 52 million reads per species. Read thinning, trimming, and de novo assembly were performed under different parameters to optimize output. Between 67,423 and 207,559 contigs were obtained across the ten species, post-optimization. Of those, 9,069 to 25,681 contigs retrieved blast hits against the NCBI non-redundant database, and approximately 50% of these were assigned with Gene Ontology terms, covering all major categories, and with similar percentages in all species. Local blasts against our datasets, using selected genes from major signaling pathways and housekeeping genes, revealed high efficiency in gene recovery compared to available genomes of closely related species. Intriguingly, our transcriptomic datasets detected multiple paralogues in all phyla and in nearly all gene pathways, including housekeeping genes that are traditionally used in phylogenetic applications for their purported single-copy nature. Conclusions We generated the first study of comparative transcriptomics across multiple animal phyla (comparing two species per phylum in most cases), established the first Illumina-based transcriptomic datasets for sponge, nemertean, and sipunculan species, and generated a tractable catalogue of annotated genes (or gene fragments) and protein families for ten newly sequenced non-model organisms, some of commercial importance (i.e., Octopus vulgaris). These comprehensive sets of genes can be readily used for phylogenetic analysis, gene expression profiling, developmental analysis, and can also be a powerful resource for gene discovery. The characterization of the transcriptomes of such a diverse array of animal species permitted the comparison of sequencing depth, functional annotation, and efficiency of genomic sampling using the same pipelines, which proved to be similar for all considered species. In addition, the datasets revealed their potential as a resource for paralogue detection, a recurrent concern in various aspects of biological inquiry, including phylogenetics, molecular evolution, development, and cellular biochemistry. PMID:23190771
Targeted mutagenesis in cotton (Gossypium hirsutum L.) using the CRISPR/Cas9 system

PubMed Central

Chen, Xiugui; Lu, Xuke; Shu, Na; Wang, Shuai; Wang, Junjuan; Wang, Delong; Guo, Lixue; Ye, Wuwei

2017-01-01

The CRISPR (Clustered Regularly Interspaced Short Palindromic Repeats)/Cas9 system has been widely used for genome editing in various plants because of its simplicity, high efficiency and design flexibility. However, to our knowledge, there is no report on the application of CRISPR/Cas9-mediated targeted mutagenesis in cotton. Here, we report the genome editing and targeted mutagenesis in upland cotton (Gossypium hirsutum L., hereafter cotton) using the CRISPR/Cas9 system. We designed two guide RNAs to target distinct sites of the cotton Cloroplastos alterados 1 (GhCLA1) and vacuolar H+-pyrophosphatase (GhVP) genes. Mutations in these two genes were detected in cotton protoplasts. Most of the mutations were nucleotide substitutions, with one nucleotide insertion and one substitution found in GhCLA1 and one deletion found in GhVP in cotton protoplasts. Subsequently, the two vectors were transformed into cotton shoot apexes through Agrobacterium-mediated transformation, resulting in efficient target gene editing. Most of the mutations were nucleotide deletions, and the mutation efficiencies were 47.6–81.8% in transgenic cotton plants. Evaluation using restriction-enzyme-PCR assay and sequence analysis detected no off-target mutations. Our results indicated that the CRISPR/Cas9 system was an efficient and specific tool for targeted mutagenesis of the cotton genome. PMID:28287154
Targeted mutagenesis in cotton (Gossypium hirsutum L.) using the CRISPR/Cas9 system.

PubMed

Chen, Xiugui; Lu, Xuke; Shu, Na; Wang, Shuai; Wang, Junjuan; Wang, Delong; Guo, Lixue; Ye, Wuwei

2017-03-13

The CRISPR (Clustered Regularly Interspaced Short Palindromic Repeats)/Cas9 system has been widely used for genome editing in various plants because of its simplicity, high efficiency and design flexibility. However, to our knowledge, there is no report on the application of CRISPR/Cas9-mediated targeted mutagenesis in cotton. Here, we report the genome editing and targeted mutagenesis in upland cotton (Gossypium hirsutum L., hereafter cotton) using the CRISPR/Cas9 system. We designed two guide RNAs to target distinct sites of the cotton Cloroplastos alterados 1 (GhCLA1) and vacuolar H + -pyrophosphatase (GhVP) genes. Mutations in these two genes were detected in cotton protoplasts. Most of the mutations were nucleotide substitutions, with one nucleotide insertion and one substitution found in GhCLA1 and one deletion found in GhVP in cotton protoplasts. Subsequently, the two vectors were transformed into cotton shoot apexes through Agrobacterium-mediated transformation, resulting in efficient target gene editing. Most of the mutations were nucleotide deletions, and the mutation efficiencies were 47.6-81.8% in transgenic cotton plants. Evaluation using restriction-enzyme-PCR assay and sequence analysis detected no off-target mutations. Our results indicated that the CRISPR/Cas9 system was an efficient and specific tool for targeted mutagenesis of the cotton genome.
DROMPA: easy-to-handle peak calling and visualization software for the computational analysis and validation of ChIP-seq data.

PubMed

Nakato, Ryuichiro; Itoh, Tahehiko; Shirahige, Katsuhiko

2013-07-01

Chromatin immunoprecipitation with high-throughput sequencing (ChIP-seq) can identify genomic regions that bind proteins involved in various chromosomal functions. Although the development of next-generation sequencers offers the technology needed to identify these protein-binding sites, the analysis can be computationally challenging because sequencing data sometimes consist of >100 million reads/sample. Herein, we describe a cost-effective and time-efficient protocol that is generally applicable to ChIP-seq analysis; this protocol uses a novel peak-calling program termed DROMPA to identify peaks and an additional program, parse2wig, to preprocess read-map files. This two-step procedure drastically reduces computational time and memory requirements compared with other programs. DROMPA enables the identification of protein localization sites in repetitive sequences and efficiently identifies both broad and sharp protein localization peaks. Specifically, DROMPA outputs a protein-binding profile map in pdf or png format, which can be easily manipulated by users who have a limited background in bioinformatics. © 2013 The Authors Genes to Cells © 2013 by the Molecular Biology Society of Japan and Wiley Publishing Asia Pty Ltd.
In trans paired nicking triggers seamless genome editing without double-stranded DNA cutting.

PubMed

Chen, Xiaoyu; Janssen, Josephine M; Liu, Jin; Maggio, Ignazio; 't Jong, Anke E J; Mikkers, Harald M M; Gonçalves, Manuel A F V

2017-09-22

Precise genome editing involves homologous recombination between donor DNA and chromosomal sequences subjected to double-stranded DNA breaks made by programmable nucleases. Ideally, genome editing should be efficient, specific, and accurate. However, besides constituting potential translocation-initiating lesions, double-stranded DNA breaks (targeted or otherwise) are mostly repaired through unpredictable and mutagenic non-homologous recombination processes. Here, we report that the coordinated formation of paired single-stranded DNA breaks, or nicks, at donor plasmids and chromosomal target sites by RNA-guided nucleases based on CRISPR-Cas9 components, triggers seamless homology-directed gene targeting of large genetic payloads in human cells, including pluripotent stem cells. Importantly, in addition to significantly reducing the mutagenicity of the genome modification procedure, this in trans paired nicking strategy achieves multiplexed, single-step, gene targeting, and yields higher frequencies of accurately edited cells when compared to the standard double-stranded DNA break-dependent approach.CRISPR-Cas9-based gene editing involves double-strand breaks at target sequences, which are often repaired by mutagenic non-homologous end-joining. Here the authors use Cas9 nickases to generate coordinated single-strand breaks in donor and target DNA for precise homology-directed gene editing.
Efficient production of artificially designed gelatins with a Bacillus brevis system.

PubMed

Kajino, T; Takahashi, H; Hirai, M; Yamada, Y

2000-01-01

Artificially designed gelatins comprising tandemly repeated 30-amino-acid peptide units derived from human alphaI collagen were successfully produced with a Bacillus brevis system. The DNA encoding the peptide unit was synthesized by taking into consideration the codon usage of the host cells, but no clones having a tandemly repeated gene were obtained through the above-mentioned strategy. Minirepeat genes could be selected in vivo from a mixture of every possible sequence encoding an artificial gelatin by randomly ligating the mixed sequence unit and transforming it into Escherichia coli. Larger repeat genes constructed by connecting minirepeat genes obtained by in vivo selection were also stable in the expression host cells. Gelatins derived from the eight-unit and six-unit repeat genes were extracellularly produced at the level of 0.5 g/liter and easily purified by ammonium sulfate fractionation and anion-exchange chromatography. The purified artificial gelatins had the predicted N-terminal sequences and amino acid compositions and a solgel property similar to that of the native gelatin. These results suggest that the selection of a repeat unit sequence stable in an expression host is a shortcut for the efficient production of repetitive proteins and that it can conveniently be achieved by the in vivo selection method. This study revealed the possible industrial application of artificially designed repetitive proteins.
Generation of “LYmph Node Derived Antibody Libraries” (LYNDAL) for selecting fully human antibody fragments with therapeutic potential

PubMed Central

Diebolder, Philipp; Keller, Armin; Haase, Stephanie; Schlegelmilch, Anne; Kiefer, Jonathan D; Karimi, Tamana; Weber, Tobias; Moldenhauer, Gerhard; Kehm, Roland; Eis-Hübinger, Anna M; Jäger, Dirk; Federspil, Philippe A; Herold-Mende, Christel; Dyckhoff, Gerhard; Kontermann, Roland E; Arndt, Michaela AE; Krauss, Jürgen

2014-01-01

The development of efficient strategies for generating fully human monoclonal antibodies with unique functional properties that are exploitable for tailored therapeutic interventions remains a major challenge in the antibody technology field. Here, we present a methodology for recovering such antibodies from antigen-encountered human B cell repertoires. As the source for variable antibody genes, we cloned immunoglobulin G (IgG)-derived B cell repertoires from lymph nodes of 20 individuals undergoing surgery for head and neck cancer. Sequence analysis of unselected “LYmph Node Derived Antibody Libraries” (LYNDAL) revealed a naturally occurring distribution pattern of rearranged antibody sequences, representing all known variable gene families and most functional germline sequences. To demonstrate the feasibility for selecting antibodies with therapeutic potential from these repertoires, seven LYNDAL from donors with high serum titers against herpes simplex virus (HSV) were panned on recombinant glycoprotein B of HSV-1. Screening for specific binders delivered 34 single-chain variable fragments (scFvs) with unique sequences. Sequence analysis revealed extensive somatic hypermutation of enriched clones as a result of affinity maturation. Binding of scFvs to common glycoprotein B variants from HSV-1 and HSV-2 strains was highly specific, and the majority of analyzed antibody fragments bound to the target antigen with nanomolar affinity. From eight scFvs with HSV-neutralizing capacity in vitro, the most potent antibody neutralized 50% HSV-2 at 4.5 nM as a dimeric (scFv)2. We anticipate our approach to be useful for recovering fully human antibodies with therapeutic potential. PMID:24256717
Generation of “LYmph Node Derived Antibody Libraries” (LYNDAL) for selecting fully human antibody fragments with therapeutic potential.

PubMed

Diebolder, Philipp; Keller, Armin; Haase, Stephanie; Schlegelmilch, Anne; Kiefer, Jonathan D; Karimi, Tamana; Weber, Tobias; Moldenhauer, Gerhard; Kehm, Roland; Eis-Hübinger, Anna M; Jäger, Dirk; Federspil, Philippe A; Herold-Mende, Christel; Dyckhoff, Gerhard; Kontermann, Roland E; Arndt, Michaela A E; Krauss, Jürgen

2014-01-01

The development of efficient strategies for generating fully human monoclonal antibodies with unique functional properties that are exploitable for tailored therapeutic interventions remains a major challenge in the antibody technology field. Here, we present a methodology for recovering such antibodies from antigen-encountered human B cell repertoires. As the source for variable antibody genes, we cloned immunoglobulin G (IgG)-derived B cell repertoires from lymph nodes of 20 individuals undergoing surgery for head and neck cancer. Sequence analysis of unselected “LYmph Node Derived Antibody Libraries” (LYNDAL) revealed a naturally occurring distribution pattern of rearranged antibody sequences, representing all known variable gene families and most functional germline sequences. To demonstrate the feasibility for selecting antibodies with therapeutic potential from these repertoires, seven LYNDAL from donors with high serum titers against herpes simplex virus (HSV) were panned on recombinant glycoprotein B of HSV-1. Screening for specific binders delivered 34 single-chain variable fragments (scFvs) with unique sequences. Sequence analysis revealed extensive somatic hypermutation of enriched clones as a result of affinity maturation. Binding of scFvs to common glycoprotein B variants from HSV-1 and HSV-2 strains was highly specific, and the majority of analyzed antibody fragments bound to the target antigen with nanomolar affinity. From eight scFvs with HSV-neutralizing capacity in vitro,the most potent antibody neutralized 50% HSV-2 at 4.5 nM as a dimeric (scFv)2. We anticipate our approach to be useful for recovering fully human antibodies with therapeutic potential.
The HIP1 binding site is required for growth regulation of the dihydrofolate reductase gene promoter.

PubMed

Means, A L; Slansky, J E; McMahon, S L; Knuth, M W; Farnham, P J

1992-03-01

The transcription rate of the dihydrofolate reductase (DHFR) gene increases at the G1/S boundary of the proliferative cell cycle. Through analysis of transiently and stably transfected NIH 3T3 cells, we have now demonstrated that DHFR promoter sequences extending from -270 to +20 are sufficient to confer similar regulation on a reporter gene. Mutation of a protein binding site that spans sequences from -16 to +11 in the DHFR promoter resulted in loss of the transcriptional increase at the G1/S boundary. Purification of an activity from HeLa nuclear extract that binds to this region enriched for a 180-kDa polypeptide (HIP1). Using this HIP1 preparation, we have identified specific positions within the binding site that are critical for efficient protein-DNA interactions. An analysis of association and dissociation rates suggests that bound HIP1 protein can exchange rapidly with free protein. This rapid exchange may facilitate the burst of transcriptional activity from the DHFR promoter at the G1/S boundary.
The HIP1 binding site is required for growth regulation of the dihydrofolate reductase gene promoter.

PubMed Central

Means, A L; Slansky, J E; McMahon, S L; Knuth, M W; Farnham, P J

1992-01-01

The transcription rate of the dihydrofolate reductase (DHFR) gene increases at the G1/S boundary of the proliferative cell cycle. Through analysis of transiently and stably transfected NIH 3T3 cells, we have now demonstrated that DHFR promoter sequences extending from -270 to +20 are sufficient to confer similar regulation on a reporter gene. Mutation of a protein binding site that spans sequences from -16 to +11 in the DHFR promoter resulted in loss of the transcriptional increase at the G1/S boundary. Purification of an activity from HeLa nuclear extract that binds to this region enriched for a 180-kDa polypeptide (HIP1). Using this HIP1 preparation, we have identified specific positions within the binding site that are critical for efficient protein-DNA interactions. An analysis of association and dissociation rates suggests that bound HIP1 protein can exchange rapidly with free protein. This rapid exchange may facilitate the burst of transcriptional activity from the DHFR promoter at the G1/S boundary. Images PMID:1545788
In vivo gene correction with targeted sequence substitution through microhomology-mediated end joining.

PubMed

Shin, Jeong Hong; Jung, Soobin; Ramakrishna, Suresh; Kim, Hyongbum Henry; Lee, Junwon

2018-07-07

Genome editing technology using programmable nucleases has rapidly evolved in recent years. The primary mechanism to achieve precise integration of a transgene is mainly based on homology-directed repair (HDR). However, an HDR-based genome-editing approach is less efficient than non-homologous end-joining (NHEJ). Recently, a microhomology-mediated end-joining (MMEJ)-based transgene integration approach was developed, showing feasibility both in vitro and in vivo. We expanded this method to achieve targeted sequence substitution (TSS) of mutated sequences with normal sequences using double-guide RNAs (gRNAs), and a donor template flanking the microhomologies and target sequence of the gRNAs in vitro and in vivo. Our method could realize more efficient sequence substitution than the HDR-based method in vitro using a reporter cell line, and led to the survival of a hereditary tyrosinemia mouse model in vivo. The proposed MMEJ-based TSS approach could provide a novel therapeutic strategy, in addition to HDR, to achieve gene correction from a mutated sequence to a normal sequence. Copyright © 2018 Elsevier Inc. All rights reserved.

Laser assisted microdissection, an efficient technique to understand tissue specific gene expression patterns and functional genomics in plants.

PubMed

Gautam, Vibhav; Sarkar, Ananda K

2015-04-01

Laser assisted microdissection (LAM) is an advanced technology used to perform tissue or cell-specific expression profiling of genes and proteins, owing to its ability to isolate the desired tissue or cell type from a heterogeneous population. Due to the specificity and high efficiency acquired during its pioneering use in medical science, the LAM technique has quickly been adopted for use in many biological researches. Today, it has become a potent tool to address a wide range of questions in diverse field of plant biology. Beginning with comparative transcriptome analysis of different tissues such as reproductive parts, meristems, lateral organs, roots etc., LAM has also been extensively used in plant-pathogen interaction studies, proteomics, and metabolomics. In combination with next generation sequencing and proteomics analysis, LAM has opened up promising opportunities in the area of large scale functional studies in plants. Ever since the advent of this technique, significant improvements have been achieved in term of its instrumentation and method, which has made LAM a more efficient tool applicable in wider research areas. Here, we discuss the advancement of LAM technique with special emphasis on its methodology and highlight its scope in modern research areas of plant biology. Although we put emphasis on use of LAM in transcriptome studies, which is mostly used, we also discuss its recent application and scope in proteome and metabolome studies.
Streamlining and Large Ancestral Genomes in Archaea Inferred with a Phylogenetic Birth-and-Death Model

PubMed Central

Miklós, István

2009-01-01

Homologous genes originate from a common ancestor through vertical inheritance, duplication, or horizontal gene transfer. Entire homolog families spawned by a single ancestral gene can be identified across multiple genomes based on protein sequence similarity. The sequences, however, do not always reveal conclusively the history of large families. To study the evolution of complete gene repertoires, we propose here a mathematical framework that does not rely on resolved gene family histories. We show that so-called phylogenetic profiles, formed by family sizes across multiple genomes, are sufficient to infer principal evolutionary trends. The main novelty in our approach is an efficient algorithm to compute the likelihood of a phylogenetic profile in a model of birth-and-death processes acting on a phylogeny. We examine known gene families in 28 archaeal genomes using a probabilistic model that involves lineage- and family-specific components of gene acquisition, duplication, and loss. The model enables us to consider all possible histories when inferring statistics about archaeal evolution. According to our reconstruction, most lineages are characterized by a net loss of gene families. Major increases in gene repertoire have occurred only a few times. Our reconstruction underlines the importance of persistent streamlining processes in shaping genome composition in Archaea. It also suggests that early archaeal genomes were as complex as typical modern ones, and even show signs, in the case of the methanogenic ancestor, of an extremely large gene repertoire. PMID:19570746
Single sea urchin phagocytes express messages of a single sequence from the diverse Sp185/333 gene family in response to bacterial challenge.

PubMed

Majeske, Audrey J; Oren, Matan; Sacchi, Sandro; Smith, L Courtney

2014-12-01

Immune systems in animals rely on fast and efficient responses to a wide variety of pathogens. The Sp185/333 gene family in the purple sea urchin, Strongylocentrotus purpuratus, consists of an estimated 50 (±10) members per genome that share a basic gene structure but show high sequence diversity, primarily due to the mosaic appearance of short blocks of sequence called elements. The genes show significantly elevated expression in three subpopulations of phagocytes responding to marine bacteria. The encoded Sp185/333 proteins are highly diverse and have central effector functions in the immune system. In this study we report the Sp185/333 gene expression in single sea urchin phagocytes. Sea urchins challenged with heat-killed marine bacteria resulted in a typical increase in coelomocyte concentration within 24 h, which included an increased proportion of phagocytes expressing Sp185/333 proteins. Phagocyte fractions enriched from coelomocytes were used in limiting dilutions to obtain samples of single cells that were evaluated for Sp185/333 gene expression by nested RT-PCR. Amplicon sequences showed identical or nearly identical Sp185/333 amplicon sequences in single phagocytes with matches to six known Sp185/333 element patterns, including both common and rare element patterns. This suggested that single phagocytes show restricted expression from the Sp185/333 gene family and infers a diverse, flexible, and efficient response to pathogens. This type of expression pattern from a family of immune response genes in single cells has not been identified previously in other invertebrates. Copyright © 2014 by The American Association of Immunologists, Inc.
The Unique hmuY Gene Sequence as a Specific Marker of Porphyromonas gingivalis

PubMed Central

Mackiewicz, Paweł; Radwan-Oczko, Małgorzata; Kantorowicz, Małgorzata; Chomyszyn-Gajewska, Maria; Frąszczak, Magdalena; Bielecki, Marcin; Olczak, Mariusz; Olczak, Teresa

2013-01-01

Porphyromonas gingivalis, a major etiological agent of chronic periodontitis, acquires heme from host hemoproteins using the HmuY hemophore. The aim of this study was to develop a specific P. gingivalis marker based on a hmuY gene sequence. Subgingival samples were collected from 66 patients with chronic periodontitis and 40 healthy subjects and the entire hmuY gene was analyzed in positive samples. Phylogenetic analyses demonstrated that both the amino acid sequence of the HmuY protein and the nucleotide sequence of the hmuY gene are unique among P. gingivalis strains/isolates and show low identity to sequences found in other species (below 50 and 56%, respectively). In agreement with these findings, a set of hmuY gene-based primers and standard/real-time PCR with SYBR Green chemistry allowed us to specifically detect P. gingivalis in patients with chronic periodontitis (77.3%) and healthy subjects (20%), the latter possessing lower number of P. gingivalis cells and total bacterial cells. Isolates from healthy subjects possess the hmuY gene-based nucleotide sequence pattern occurring in W83/W50/A7436 (n = 4), 381/ATCC 33277 (n = 3) or TDC60 (n = 1) strains, whereas those from patients typically have TDC60 (n = 21), W83/W50/A7436 (n = 17) and 381/ATCC 33277 (n = 13) strains. We observed a significant correlation between periodontal index of risk of infectiousness (PIRI) and the presence/absence of P. gingivalis (regardless of the hmuY gene-based sequence pattern of the isolate identified [r = 0.43; P = 0.0002] and considering particular isolate pattern [r = 0.38; P = 0.0012]). In conclusion, we demonstrated that the hmuY gene sequence or its fragments may be used as one of the molecular markers of P. gingivalis. PMID:23844074
Bioinformatics analysis and detection of gelatinase encoded gene in Lysinibacillussphaericus

NASA Astrophysics Data System (ADS)

Repin, Rul Aisyah Mat; Mutalib, Sahilah Abdul; Shahimi, Safiyyah; Khalid, Rozida Mohd.; Ayob, Mohd. Khan; Bakar, Mohd. Faizal Abu; Isa, Mohd Noor Mat

2016-11-01

In this study, we performed bioinformatics analysis toward genome sequence of Lysinibacillussphaericus (L. sphaericus) to determine gene encoded for gelatinase. L. sphaericus was isolated from soil and gelatinase species-specific bacterium to porcine and bovine gelatin. This bacterium offers the possibility of enzymes production which is specific to both species of meat, respectively. The main focus of this research is to identify the gelatinase encoded gene within the bacteria of L. Sphaericus using bioinformatics analysis of partially sequence genome. From the research study, three candidate gene were identified which was, gelatinase candidate gene 1 (P1), NODE_71_length_93919_cov_158.931839_21 which containing 1563 base pair (bp) in size with 520 amino acids sequence; Secondly, gelatinase candidate gene 2 (P2), NODE_23_length_52851_cov_190.061386_17 which containing 1776 bp in size with 591 amino acids sequence; and Thirdly, gelatinase candidate gene 3 (P3), NODE_106_length_32943_cov_169.147919_8 containing 1701 bp in size with 566 amino acids sequence. Three pairs of oligonucleotide primers were designed and namely as, F1, R1, F2, R2, F3 and R3 were targeted short sequences of cDNA by PCR. The amplicons were reliably results in 1563 bp in size for candidate gene P1 and 1701 bp in size for candidate gene P3. Therefore, the results of bioinformatics analysis of L. Sphaericus resulting in gene encoded gelatinase were identified.
Pyicos: a versatile toolkit for the analysis of high-throughput sequencing data.

PubMed

Althammer, Sonja; González-Vallinas, Juan; Ballaré, Cecilia; Beato, Miguel; Eyras, Eduardo

2011-12-15

High-throughput sequencing (HTS) has revolutionized gene regulation studies and is now fundamental for the detection of protein-DNA and protein-RNA binding, as well as for measuring RNA expression. With increasing variety and sequencing depth of HTS datasets, the need for more flexible and memory-efficient tools to analyse them is growing. We describe Pyicos, a powerful toolkit for the analysis of mapped reads from diverse HTS experiments: ChIP-Seq, either punctuated or broad signals, CLIP-Seq and RNA-Seq. We prove the effectiveness of Pyicos to select for significant signals and show that its accuracy is comparable and sometimes superior to that of methods specifically designed for each particular type of experiment. Pyicos facilitates the analysis of a variety of HTS datatypes through its flexibility and memory efficiency, providing a useful framework for data integration into models of regulatory genomics. Open-source software, with tutorials and protocol files, is available at http://regulatorygenomics.upf.edu/pyicos or as a Galaxy server at http://regulatorygenomics.upf.edu/galaxy eduardo.eyras@upf.edu Supplementary data are available at Bioinformatics online.
Optimization of a gene electrotransfer procedure for efficient intradermal immunization with an hTERT-based DNA vaccine in mice

PubMed Central

Calvet, Christophe Y; Thalmensi, Jessie; Liard, Christelle; Pliquet, Elodie; Bestetti, Thomas; Huet, Thierry; Langlade-Demoyen, Pierre; Mir, Lluis M

2014-01-01

DNA vaccination consists in administering an antigen-encoding plasmid in order to trigger a specific immune response. This specific vaccine strategy is of particular interest to fight against various infectious diseases and cancer. Gene electrotransfer is the most efficient and safest non-viral gene transfer procedure and specific electrical parameters have been developed for several target tissues. Here, a gene electrotransfer protocol into the skin has been optimized in mice for efficient intradermal immunization against the well-known telomerase tumor antigen. First, the luciferase reporter gene was used to evaluate gene electrotransfer efficiency into the skin as a function of the electrical parameters and electrodes, either non-invasive or invasive. In a second time, these parameters were tested for their potency to generate specific cellular CD8 immune responses against telomerase epitopes. These CD8 T-cells were fully functional as they secreted IFNγ and were endowed with specific cytotoxic activity towards target cells. This simple and optimized procedure for efficient gene electrotransfer into the skin using the telomerase antigen is to be used in cancer patients for the phase 1 clinical evaluation of a therapeutic cancer DNA vaccine called INVAC-1. PMID:26015983
Functional dissection of the Hox protein Abdominal-B in Drosophila cell culture

DOE Office of Scientific and Technical Information (OSTI.GOV)

Zhai, Zongzhao; CellNetworks - Cluster of Excellence, Centre for Organismal Studies; Graduate School of Chinese Academy of Sciences, Beijing 100039

2011-11-04

Highlights: Black-Right-Pointing-Pointer ct340 CRM was identified to be the posterior spiracle enhancer of gene cut. Black-Right-Pointing-Pointer ct340 is under the direct transcriptional control of Hox protein Abd-B. Black-Right-Pointing-Pointer An efficient cloning system was developed to assay protein-DNA interaction. Black-Right-Pointing-Pointer New features of Abd-B dependent target gene regulation were detected. -- Abstract: Hox transcription factors regulate the morphogenesis along the anterior-posterior (A/P) body axis through the interaction with small cis-regulatory modules (CRMs) of their target gene, however so far very few Hox CRMs are known and have been analyzed in detail. In this study we have identified a new Hox CRM,more » ct340, which guides the expression of the cell type specification gene cut (ct) in the posterior spiracle under the direct control of the Hox protein Abdominal-B (Abd-B). Using the ct340 enhancer activity as readout, an efficient cloning system to generate VP16 activation domain fusion protein was developed to unambiguously test protein-DNA interaction in Drosophila cell culture. By functionally dissecting the Abd-B protein, new features of Abd-B dependent target gene regulation were detected. Due to its easy adaptability, this system can be generally used to map functional domains within sequence-specific transcriptional factors in Drosophila cell culture, and thus provide preliminary knowledge of the protein functional domain structure for further in vivo analysis.« less
Employment of Near Full-Length Ribosome Gene TA-Cloning and Primer-Blast to Detect Multiple Species in a Natural Complex Microbial Community Using Species-Specific Primers Designed with Their Genome Sequences.

PubMed

Zhang, Huimin; He, Hongkui; Yu, Xiujuan; Xu, Zhaohui; Zhang, Zhizhou

2016-11-01

It remains an unsolved problem to quantify a natural microbial community by rapidly and conveniently measuring multiple species with functional significance. Most widely used high throughput next-generation sequencing methods can only generate information mainly for genus-level taxonomic identification and quantification, and detection of multiple species in a complex microbial community is still heavily dependent on approaches based on near full-length ribosome RNA gene or genome sequence information. In this study, we used near full-length rRNA gene library sequencing plus Primer-Blast to design species-specific primers based on whole microbial genome sequences. The primers were intended to be specific at the species level within relevant microbial communities, i.e., a defined genomics background. The primers were tested with samples collected from the Daqu (also called fermentation starters) and pit mud of a traditional Chinese liquor production plant. Sixteen pairs of primers were found to be suitable for identification of individual species. Among them, seven pairs were chosen to measure the abundance of microbial species through quantitative PCR. The combination of near full-length ribosome RNA gene library sequencing and Primer-Blast may represent a broadly useful protocol to quantify multiple species in complex microbial population samples with species-specific primers.
m6A-Driver: Identifying Context-Specific mRNA m6A Methylation-Driven Gene Interaction Networks

PubMed Central

Zhang, Song-Yao; Zhang, Shao-Wu; Liu, Lian; Huang, Yufei

2016-01-01

As the most prevalent mammalian mRNA epigenetic modification, N6-methyladenosine (m6A) has been shown to possess important post-transcriptional regulatory functions. However, the regulatory mechanisms and functional circuits of m6A are still largely elusive. To help unveil the regulatory circuitry mediated by mRNA m6A methylation, we develop here m6A-Driver, an algorithm for predicting m6A-driven genes and associated networks, whose functional interactions are likely to be actively modulated by m6A methylation under a specific condition. Specifically, m6A-Driver integrates the PPI network and the predicted differential m6A methylation sites from methylated RNA immunoprecipitation sequencing (MeRIP-Seq) data using a Random Walk with Restart (RWR) algorithm and then builds a consensus m6A-driven network of m6A-driven genes. To evaluate the performance, we applied m6A-Driver to build the context-specific m6A-driven networks for 4 known m6A (de)methylases, i.e., FTO, METTL3, METTL14 and WTAP. Our results suggest that m6A-Driver can robustly and efficiently identify m6A-driven genes that are functionally more enriched and associated with higher degree of differential expression than differential m6A methylated genes. Pathway analysis of the constructed context-specific m6A-driven gene networks further revealed the regulatory circuitry underlying the dynamic interplays between the methyltransferases and demethylase at the epitranscriptomic layer of gene regulation. PMID:28027310
Pyramiding, alternating or mixing: comparative performances of deployment strategies of nematode resistance genes to promote plant resistance efficiency and durability.

PubMed

Djian-Caporalino, Caroline; Palloix, Alain; Fazari, Ariane; Marteu, Nathalie; Barbary, Arnaud; Abad, Pierre; Sage-Palloix, Anne-Marie; Mateille, Thierry; Risso, Sabine; Lanza, Roger; Taussig, Catherine; Castagnone-Sereno, Philippe

2014-02-22

Resistant cultivars are key elements for pathogen control and pesticide reduction, but their repeated use may lead to the emergence of virulent pathogen populations, able to overcome the resistance. Increased research efforts, mainly based on theoretical studies, explore spatio-temporal deployment strategies of resistance genes in order to maximize their durability. We evaluated experimentally three of these strategies to control root-knot nematodes: cultivar mixtures, alternating and pyramiding resistance genes, under controlled and field conditions over a 3-years period, assessing the efficiency and the durability of resistance in a protected crop rotation system with pepper as summer crop and lettuce as winter crop. The choice of the resistance gene and the genetic background in which it is introgressed, affected the frequency of resistance breakdown. The pyramiding of two different resistance genes in one genotype suppressed the emergence of virulent isolates. Alternating different resistance genes in rotation was also efficient to decrease virulent populations in fields due to the specificity of the virulence and the trapping effect of resistant plants. Mixing resistant cultivars together appeared as a less efficient strategy to control nematodes. This work provides experimental evidence that, in a cropping system with seasonal sequences of vegetable species, pyramiding or alternating resistance genes benefit yields in the long-term by increasing the durability of resistant cultivars and improving the long-term control of a soil-borne pest. To our knowledge, this result is the first one obtained for a plant-nematode interaction, which helps demonstrate the general applicability of such strategies for breeding and sustainable management of resistant cultivars against pathogens.
Pyramiding, alternating or mixing: comparative performances of deployment strategies of nematode resistance genes to promote plant resistance efficiency and durability

PubMed Central

2014-01-01

Background Resistant cultivars are key elements for pathogen control and pesticide reduction, but their repeated use may lead to the emergence of virulent pathogen populations, able to overcome the resistance. Increased research efforts, mainly based on theoretical studies, explore spatio-temporal deployment strategies of resistance genes in order to maximize their durability. We evaluated experimentally three of these strategies to control root-knot nematodes: cultivar mixtures, alternating and pyramiding resistance genes, under controlled and field conditions over a 3-years period, assessing the efficiency and the durability of resistance in a protected crop rotation system with pepper as summer crop and lettuce as winter crop. Results The choice of the resistance gene and the genetic background in which it is introgressed, affected the frequency of resistance breakdown. The pyramiding of two different resistance genes in one genotype suppressed the emergence of virulent isolates. Alternating different resistance genes in rotation was also efficient to decrease virulent populations in fields due to the specificity of the virulence and the trapping effect of resistant plants. Mixing resistant cultivars together appeared as a less efficient strategy to control nematodes. Conclusions This work provides experimental evidence that, in a cropping system with seasonal sequences of vegetable species, pyramiding or alternating resistance genes benefit yields in the long-term by increasing the durability of resistant cultivars and improving the long-term control of a soil-borne pest. To our knowledge, this result is the first one obtained for a plant-nematode interaction, which helps demonstrate the general applicability of such strategies for breeding and sustainable management of resistant cultivars against pathogens. PMID:24559060
Genotyping microarray (gene chip) for the ABCR (ABCA4) gene.

PubMed

Jaakson, K; Zernant, J; Külm, M; Hutchinson, A; Tonisson, N; Glavac, D; Ravnik-Glavac, M; Hawlina, M; Meltzer, M R; Caruso, R C; Testa, F; Maugeri, A; Hoyng, C B; Gouras, P; Simonelli, F; Lewis, R A; Lupski, J R; Cremers, F P M; Allikmets, R

2003-11-01

Genetic variation in the ABCR (ABCA4) gene has been associated with five distinct retinal phenotypes, including Stargardt disease/fundus flavimaculatus (STGD/FFM), cone-rod dystrophy (CRD), and age-related macular degeneration (AMD). Comparative genetic analyses of ABCR variation and diagnostics have been complicated by substantial allelic heterogeneity and by differences in screening methods. To overcome these limitations, we designed a genotyping microarray (gene chip) for ABCR that includes all approximately 400 disease-associated and other variants currently described, enabling simultaneous detection of all known ABCR variants. The ABCR genotyping microarray (the ABCR400 chip) was constructed by the arrayed primer extension (APEX) technology. Each sequence change in ABCR was included on the chip by synthesis and application of sequence-specific oligonucleotides. We validated the chip by screening 136 confirmed STGD patients and 96 healthy controls, each of whom we had analyzed previously by single strand conformation polymorphism (SSCP) technology and/or heteroduplex analysis. The microarray was >98% effective in determining the existing genetic variation and was comparable to direct sequencing in that it yielded many sequence changes undetected by SSCP. In STGD patient cohorts, the efficiency of the array to detect disease-associated alleles was between 54% and 78%, depending on the ethnic composition and degree of clinical and molecular characterization of a cohort. In addition, chip analysis suggested a high carrier frequency (up to 1:10) of ABCR variants in the general population. The ABCR genotyping microarray is a robust, cost-effective, and comprehensive screening tool for variation in one gene in which mutations are responsible for a substantial fraction of retinal disease. The ABCR chip is a prototype for the next generation of screening and diagnostic tools in ophthalmic genetics, bridging clinical and scientific research. Copyright 2003 Wiley-Liss, Inc.
Using Next Generation Sequencing for Multiplexed Trait-Linked Markers in Wheat

PubMed Central

Bernardo, Amy; Wang, Shan; St. Amand, Paul; Bai, Guihua

2015-01-01

With the advent of next generation sequencing (NGS) technologies, single nucleotide polymorphisms (SNPs) have become the major type of marker for genotyping in many crops. However, the availability of SNP markers for important traits of bread wheat ( Triticum aestivum L.) that can be effectively used in marker-assisted selection (MAS) is still limited and SNP assays for MAS are usually uniplex. A shift from uniplex to multiplex assays will allow the simultaneous analysis of multiple markers and increase MAS efficiency. We designed 33 locus-specific markers from SNP or indel-based marker sequences that linked to 20 different quantitative trait loci (QTL) or genes of agronomic importance in wheat and analyzed the amplicon sequences using an Ion Torrent Proton Sequencer and a custom allele detection pipeline to determine the genotypes of 24 selected germplasm accessions. Among the 33 markers, 27 were successfully multiplexed and 23 had 100% SNP call rates. Results from analysis of "kompetitive allele-specific PCR" (KASP) and sequence tagged site (STS) markers developed from the same loci fully verified the genotype calls of 23 markers. The NGS-based multiplexed assay developed in this study is suitable for rapid and high-throughput screening of SNPs and some indel-based markers in wheat. PMID:26625271
Development and application of a general plasmid reference material for GMO screening.

PubMed

Wu, Yuhua; Li, Jun; Wang, Yulei; Li, Xiaofei; Li, Yunjing; Zhu, Li; Li, Jun; Wu, Gang

The use of analytical controls is essential when performing GMO detection through screening tests. Additionally, the presence of taxon-specific sequences is analyzed mostly for quality control during GMO detection. In this study, 11 commonly used genetic elements involving three promoters (P-35S, P-FMV35S and P-NOS), four marker genes (Bar, NPTII, HPT and Pmi), and four terminators (T-NOS, T-35S, T-g7 and T-e9), together with the reference gene fragments from six major crops of maize, soybean, rapeseed, rice, cotton and wheat, were co-integrated into the same single plasmid to construct a general reference plasmid pBI121-Screening. The suitability test of pBI121-Screening plasmid as reference material indicated that the non-target sequence on the pBI121-Screening plasmid did not affect the PCR amplification efficiencies of screening methods and taxon-specific methods. The sensitivity of screening and taxon-specific assays ranged from 5 to 10 copies of pBI121-Screening plasmid, meeting the sensitivity requirement of GMO detection. The construction of pBI121-Screening solves the lack of a general positive control for screening tests, thereby reducing the workload and cost of preparing a plurality of the positive control. Copyright © 2016 Elsevier B.V. All rights reserved.
A Comprehensive Analysis of Transcript-Supported De Novo Genes in Saccharomyces sensu stricto Yeasts

PubMed Central

Lu, Tzu-Chiao; Leu, Jun-Yi; Lin, Wen-Chang

2017-01-01

Abstract Novel genes arising from random DNA sequences (de novo genes) have been suggested to be widespread in the genomes of different organisms. However, our knowledge about the origin and evolution of de novo genes is still limited. To systematically understand the general features of de novo genes, we established a robust pipeline to analyze >20,000 transcript-supported coding sequences (CDSs) from the budding yeast Saccharomyces cerevisiae. Our analysis pipeline combined phylogeny, synteny, and sequence alignment information to identify possible orthologs across 20 Saccharomycetaceae yeasts and discovered 4,340 S. cerevisiae-specific de novo genes and 8,871 S. sensu stricto-specific de novo genes. We further combine information on CDS positions and transcript structures to show that >65% of de novo genes arose from transcript isoforms of ancient genes, especially in the upstream and internal regions of ancient genes. Fourteen identified de novo genes with high transcript levels were chosen to verify their protein expressions. Ten of them, including eight transcript isoform-associated CDSs, showed translation signals and five proteins exhibited specific cytosolic localizations. Our results suggest that de novo genes frequently arise in the S. sensu stricto complex and have the potential to be quickly integrated into ancient cellular network. PMID:28981695
RapGene: a fast and accurate strategy for synthetic gene assembly in Escherichia coli

PubMed Central

Zampini, Massimiliano; Stevens, Pauline Rees; Pachebat, Justin A.; Kingston-Smith, Alison; Mur, Luis A. J.; Hayes, Finbarr

2015-01-01

The ability to assemble DNA sequences de novo through efficient and powerful DNA fabrication methods is one of the foundational technologies of synthetic biology. Gene synthesis, in particular, has been considered the main driver for the emergence of this new scientific discipline. Here we describe RapGene, a rapid gene assembly technique which was successfully tested for the synthesis and cloning of both prokaryotic and eukaryotic genes through a ligation independent approach. The method developed in this study is a complete bacterial gene synthesis platform for the quick, accurate and cost effective fabrication and cloning of gene-length sequences that employ the widely used host Escherichia coli. PMID:26062748
Next-generation sequencing for targeted discovery of rare mutations in rice

USDA-ARS?s Scientific Manuscript database

Advances in DNA sequencing (i.e., next-generation sequencing, NGS) have greatly increased the power and efficiency of detecting rare mutations in large mutant populations. Targeting Induced Local Lesions in Genomes (TILLING) is a reverse genetics approach for identifying gene mutations resulting fro...
RNA-guided genome editing for target gene mutations in wheat.

PubMed

Upadhyay, Santosh Kumar; Kumar, Jitesh; Alok, Anshu; Tuli, Rakesh

2013-12-09

The clustered, regularly interspaced, short palindromic repeats (CRISPR) and CRISPR-associated protein (Cas) system has been used as an efficient tool for genome editing. We report the application of CRISPR-Cas-mediated genome editing to wheat (Triticum aestivum), the most important food crop plant with a very large and complex genome. The mutations were targeted in the inositol oxygenase (inox) and phytoene desaturase (pds) genes using cell suspension culture of wheat and in the pds gene in leaves of Nicotiana benthamiana. The expression of chimeric guide RNAs (cgRNA) targeting single and multiple sites resulted in indel mutations in all the tested samples. The expression of Cas9 or sgRNA alone did not cause any mutation. The expression of duplex cgRNA with Cas9 targeting two sites in the same gene resulted in deletion of DNA fragment between the targeted sequences. Multiplexing the cgRNA could target two genes at one time. Target specificity analysis of cgRNA showed that mismatches at the 3' end of the target site abolished the cleavage activity completely. The mismatches at the 5' end reduced cleavage, suggesting that the off target effects can be abolished in vivo by selecting target sites with unique sequences at 3' end. This approach provides a powerful method for genome engineering in plants.
A High Quality Draft Consensus Sequence of the Genome of a Heterozygous Grapevine Variety

PubMed Central

Cartwright, Dustin A.; Cestaro, Alessandro; Pruss, Dmitry; Pindo, Massimo; FitzGerald, Lisa M.; Vezzulli, Silvia; Reid, Julia; Malacarne, Giulia; Iliev, Diana; Coppola, Giuseppina; Wardell, Bryan; Micheletti, Diego; Macalma, Teresita; Facci, Marco; Mitchell, Jeff T.; Perazzolli, Michele; Eldredge, Glenn; Gatto, Pamela; Oyzerski, Rozan; Moretto, Marco; Gutin, Natalia; Stefanini, Marco; Chen, Yang; Segala, Cinzia; Davenport, Christine; Demattè, Lorenzo; Mraz, Amy; Battilana, Juri; Stormo, Keith; Costa, Fabrizio; Tao, Quanzhou; Si-Ammour, Azeddine; Harkins, Tim; Lackey, Angie; Perbost, Clotilde; Taillon, Bruce; Stella, Alessandra; Solovyev, Victor; Fawcett, Jeffrey A.; Sterck, Lieven; Vandepoele, Klaas; Grando, Stella M.; Toppo, Stefano; Moser, Claudio; Lanchbury, Jerry; Bogden, Robert; Skolnick, Mark; Sgaramella, Vittorio; Bhatnagar, Satish K.; Fontana, Paolo; Gutin, Alexander; Van de Peer, Yves; Salamini, Francesco; Viola, Roberto

2007-01-01

Background Worldwide, grapes and their derived products have a large market. The cultivated grape species Vitis vinifera has potential to become a model for fruit trees genetics. Like many plant species, it is highly heterozygous, which is an additional challenge to modern whole genome shotgun sequencing. In this paper a high quality draft genome sequence of a cultivated clone of V. vinifera Pinot Noir is presented. Principal Findings We estimate the genome size of V. vinifera to be 504.6 Mb. Genomic sequences corresponding to 477.1 Mb were assembled in 2,093 metacontigs and 435.1 Mb were anchored to the 19 linkage groups (LGs). The number of predicted genes is 29,585, of which 96.1% were assigned to LGs. This assembly of the grape genome provides candidate genes implicated in traits relevant to grapevine cultivation, such as those influencing wine quality, via secondary metabolites, and those connected with the extreme susceptibility of grape to pathogens. Single nucleotide polymorphism (SNP) distribution was consistent with a diffuse haplotype structure across the genome. Of around 2,000,000 SNPs, 1,751,176 were mapped to chromosomes and one or more of them were identified in 86.7% of anchored genes. The relative age of grape duplicated genes was estimated and this made possible to reveal a relatively recent Vitis-specific large scale duplication event concerning at least 10 chromosomes (duplication not reported before). Conclusions Sanger shotgun sequencing and highly efficient sequencing by synthesis (SBS), together with dedicated assembly programs, resolved a complex heterozygous genome. A consensus sequence of the genome and a set of mapped marker loci were generated. Homologous chromosomes of Pinot Noir differ by 11.2% of their DNA (hemizygous DNA plus chromosomal gaps). SNP markers are offered as a tool with the potential of introducing a new era in the molecular breeding of grape. PMID:18094749

Application of CRISPR/Cas9 genome editing to the study and treatment of disease.

PubMed

Pellagatti, Andrea; Dolatshad, Hamid; Valletta, Simona; Boultwood, Jacqueline

2015-07-01

CRISPR/Cas is a microbial adaptive immune system that uses RNA-guided nucleases to cleave foreign genetic elements. The CRISPR/Cas9 method has been engineered from the type II prokaryotic CRISPR system and uses a single-guide RNA to target the Cas9 nuclease to a specific genomic sequence. Cas9 induces double-stranded DNA breaks which are repaired either by imperfect non-homologous end joining to generate insertions or deletions (indels) or, if a repair template is provided, by homology-directed repair. Due to its specificity, simplicity and versatility, the CRISPR/Cas9 system has recently emerged as a powerful tool for genome engineering in various species. This technology can be used to investigate the function of a gene of interest or to correct gene mutations in cells via genome editing, paving the way for future gene therapy approaches. Improvements to the efficiency of CRISPR repair, in particular to increase the rate of gene correction and to reduce undesired off-target effects, and the development of more effective delivery methods will be required for its broad therapeutic application.
Efficient modification of CCR5 in primary human hematopoietic cells using a megaTAL nuclease and AAV donor template.

PubMed

Sather, Blythe D; Romano Ibarra, Guillermo S; Sommer, Karen; Curinga, Gabrielle; Hale, Malika; Khan, Iram F; Singh, Swati; Song, Yumei; Gwiazda, Kamila; Sahni, Jaya; Jarjour, Jordan; Astrakhan, Alexander; Wagner, Thor A; Scharenberg, Andrew M; Rawlings, David J

2015-09-30

Genetic mutations or engineered nucleases that disrupt the HIV co-receptor CCR5 block HIV infection of CD4(+) T cells. These findings have motivated the engineering of CCR5-specific nucleases for application as HIV therapies. The efficacy of this approach relies on efficient biallelic disruption of CCR5, and the ability to efficiently target sequences that confer HIV resistance to the CCR5 locus has the potential to further improve clinical outcomes. We used RNA-based nuclease expression paired with adeno-associated virus (AAV)-mediated delivery of a CCR5-targeting donor template to achieve highly efficient targeted recombination in primary human T cells. This method consistently achieved 8 to 60% rates of homology-directed recombination into the CCR5 locus in T cells, with over 80% of cells modified with an MND-GFP expression cassette exhibiting biallelic modification. MND-GFP-modified T cells maintained a diverse repertoire and engrafted in immune-deficient mice as efficiently as unmodified cells. Using this method, we integrated sequences coding chimeric antigen receptors (CARs) into the CCR5 locus, and the resulting targeted CAR T cells exhibited antitumor or anti-HIV activity. Alternatively, we introduced the C46 HIV fusion inhibitor, generating T cell populations with high rates of biallelic CCR5 disruption paired with potential protection from HIV with CXCR4 co-receptor tropism. Finally, this protocol was applied to adult human mobilized CD34(+) cells, resulting in 15 to 20% homologous gene targeting. Our results demonstrate that high-efficiency targeted integration is feasible in primary human hematopoietic cells and highlight the potential of gene editing to engineer T cell products with myriad functional properties. Copyright © 2015, American Association for the Advancement of Science.
Whole-genome sequencing of Bacillus subtilis XF-1 reveals mechanisms for biological control and multiple beneficial properties in plants.

PubMed

Guo, Shengye; Li, Xingyu; He, Pengfei; Ho, Honhing; Wu, Yixin; He, Yueqiu

2015-06-01

Bacillus subtilis XF-1 is a gram-positive, plant-associated bacterium that stimulates plant growth and produces secondary metabolites that suppress soil-borne plant pathogens. In particular, it is especially highly efficient at controlling the clubroot disease of cruciferous crops. Its 4,061,186-bp genome contains an estimated 3853 protein-coding sequences and the 1155 genes of XF-1 are present in most genome-sequenced Bacillus strains: 3757 genes in B. subtilis 168, and 1164 in B. amyloliquefaciens FZB42. Analysis using the Cluster of Orthologous Groups database of proteins shows that 60 genes control bacterial mobility, 221 genes are related to cell wall and membrane biosynthesis, and more than 112 are genes associated with secondary metabolites. In addition, the genes contributed to the strain's plant colonization, bio-control and stimulation of plant growth. Sequencing of the genome is a fundamental step for developing a desired strain to serve as an efficient biological control agent and plant growth stimulator. Similar to other members of the taxon, XF-1 has a genome that contains giant gene clusters for the non-ribosomal synthesis of antifungal lipopeptides (surfactin and fengycin), the polyketides (macrolactin and bacillaene), the siderophore bacillibactin, and the dipeptide bacilysin. There are two synthesis pathways for volatile growth-promoting compounds. The expression of biosynthesized antibiotic peptides in XF-1 was revealed by matrix-assisted laser desorption/ionization-time of flight mass spectrometry.
Real-Time PCR Quantification Using A Variable Reaction Efficiency Model

PubMed Central

Platts, Adrian E.; Johnson, Graham D.; Linnemann, Amelia K.; Krawetz, Stephen A.

2008-01-01

Quantitative real-time PCR remains a cornerstone technique in gene expression analysis and sequence characterization. Despite the importance of the approach to experimental biology the confident assignment of reaction efficiency to the early cycles of real-time PCR reactions remains problematic. Considerable noise may be generated where few cycles in the amplification are available to estimate peak efficiency. An alternate approach that uses data from beyond the log-linear amplification phase is explored with the aim of reducing noise and adding confidence to efficiency estimates. PCR reaction efficiency is regressed to estimate the per-cycle profile of an asymptotically departed peak efficiency, even when this is not closely approximated in the measurable cycles. The process can be repeated over replicates to develop a robust estimate of peak reaction efficiency. This leads to an estimate of the maximum reaction efficiency that may be considered primer-design specific. Using a series of biological scenarios we demonstrate that this approach can provide an accurate estimate of initial template concentration. PMID:18570886
Rapid and efficient introduction of a foreign gene into bacterial artificial chromosome-cloned varicella vaccine by Tn7-mediated site-specific transposition

DOE Office of Scientific and Technical Information (OSTI.GOV)

Somboonthum, Pranee; Koshizuka, Tetsuo; Okamoto, Shigefumi

2010-06-20

Using a rapid and reliable system based on Tn7-mediated site-specific transposition, we have successfully constructed a recombinant Oka varicella vaccine (vOka) expressing the mumps virus (MuV) fusion protein (F). The backbone of the vector was our previously reported vOka-BAC (bacterial artificial chromosome) genome. We inserted the transposon Tn7 attachment sequence, LacZ{alpha}-mini-attTn7, into the region between ORF12 and ORF13 to generate a vOka-BAC-Tn genome. The MuV-F expressing cassette was transposed into the vOka-BAC genome at the mini-attTn7 transposition site. MuV-F protein was expressed in recombinant virus, rvOka-F infected cells. In addition, the MuV-F protein was cleaved in the rvOka-F infected cellsmore » as in MuV-infected cells. The growth of rvOka-F was similar to that of the original recombinant vOka without the F gene. Thus, we show that Tn7-mediated transposition is an efficient method for introducing a foreign gene expression cassette into the vOka-BAC genome as a live virus vector.« less
Natural gene expression variation studies in yeast.

PubMed

Thompson, Dawn A; Cubillos, Francisco A

2017-01-01

The rise of sequence information across different yeast species and strains is driving an increasing number of studies in the emerging field of genomics to associate polymorphic variants, mRNA abundance and phenotypic differences between individuals. Here, we gathered evidence from recent studies covering several layers that define the genotype-phenotype gap, such as mRNA abundance, allele-specific expression and translation efficiency to demonstrate how genetic variants co-evolve and define an individual's genome. Moreover, we exposed several antecedents where inter- and intra-specific studies led to opposite conclusions, probably owing to genetic divergence. Future studies in this area will benefit from the access to a massive array of well-annotated genomes and new sequencing technologies, which will allow the fine breakdown of the complex layers that delineate the genotype-phenotype map. Copyright © 2016 John Wiley & Sons, Ltd. Copyright © 2016 John Wiley & Sons, Ltd.
The histidine permease gene (HIP1) of Saccharomyces cerevisiae.

PubMed

Tanaka, J; Fink, G R

1985-01-01

The histidine-specific permease gene (HIP1) of Saccharomyces cerevisiae has been mapped, cloned, and sequenced. The HIP1 gene maps to the right arm of chromosome VII, approx. 11 cM distal to the ADE3 gene. The gene was isolated as an 8.6-kb BamHI-Sau3A fragment by complementation of the histidine-specific permease deficiency in recipient yeast cells. We sequenced a 2.4-kb subfragment of this BamHI-Sau3A fragment containing the HIP1 gene and identified a 1596-bp open reading frame (ORF). We confirmed the assignment of the 1596-bp ORF as the HIP1 coding sequence by sequencing a hip1 nonsense mutation. Analysis of the amino acid (aa) sequence of the HIP1 gene reveals several hydrophobic stretches, but shows no obvious N-terminal signal peptide. We have constructed a deletion of the HIP1 gene in vitro and replaced the wild-type copy of the gene with this deletion. The hip1 deletion mutant can grow when it is supplemented with 30 mM histidine, 50 times the amount required for the growth of HIP1 cells. Revertants of this deletion mutant able to grow on a normal level of histidine arise by mutation in unlinked genes. Both these observations suggest that there are additional, low-affinity pathways for histidine uptake.
Efficient engineering of chromosomal ribosome binding site libraries in mismatch repair proficient Escherichia coli.

PubMed

Oesterle, Sabine; Gerngross, Daniel; Schmitt, Steven; Roberts, Tania Michelle; Panke, Sven

2017-09-26

Multiplexed gene expression optimization via modulation of gene translation efficiency through ribosome binding site (RBS) engineering is a valuable approach for optimizing artificial properties in bacteria, ranging from genetic circuits to production pathways. Established algorithms design smart RBS-libraries based on a single partially-degenerate sequence that efficiently samples the entire space of translation initiation rates. However, the sequence space that is accessible when integrating the library by CRISPR/Cas9-based genome editing is severely restricted by DNA mismatch repair (MMR) systems. MMR efficiency depends on the type and length of the mismatch and thus effectively removes potential library members from the pool. Rather than working in MMR-deficient strains, which accumulate off-target mutations, or depending on temporary MMR inactivation, which requires additional steps, we eliminate this limitation by developing a pre-selection rule of genome-library-optimized-sequences (GLOS) that enables introducing large functional diversity into MMR-proficient strains with sequences that are no longer subject to MMR-processing. We implement several GLOS-libraries in Escherichia coli and show that GLOS-libraries indeed retain diversity during genome editing and that such libraries can be used in complex genome editing operations such as concomitant deletions. We argue that this approach allows for stable and efficient fine tuning of chromosomal functions with minimal effort.
Evaluation and rational design of guide RNAs for efficient CRISPR/Cas9-mediated mutagenesis in Ciona.

PubMed

Gandhi, Shashank; Haeussler, Maximilian; Razy-Krajka, Florian; Christiaen, Lionel; Stolfi, Alberto

2017-05-01

The CRISPR/Cas9 system has emerged as an important tool for various genome engineering applications. A current obstacle to high throughput applications of CRISPR/Cas9 is the imprecise prediction of highly active single guide RNAs (sgRNAs). We previously implemented the CRISPR/Cas9 system to induce tissue-specific mutations in the tunicate Ciona. In the present study, we designed and tested 83 single guide RNA (sgRNA) vectors targeting 23 genes expressed in the cardiopharyngeal progenitors and surrounding tissues of Ciona embryo. Using high-throughput sequencing of mutagenized alleles, we identified guide sequences that correlate with sgRNA mutagenesis activity and used this information for the rational design of all possible sgRNAs targeting the Ciona transcriptome. We also describe a one-step cloning-free protocol for the assembly of sgRNA expression cassettes. These cassettes can be directly electroporated as unpurified PCR products into Ciona embryos for sgRNA expression in vivo, resulting in high frequency of CRISPR/Cas9-mediated mutagenesis in somatic cells of electroporated embryos. We found a strong correlation between the frequency of an Ebf loss-of-function phenotype and the mutagenesis efficacies of individual Ebf-targeting sgRNAs tested using this method. We anticipate that our approach can be scaled up to systematically design and deliver highly efficient sgRNAs for the tissue-specific investigation of gene functions in Ciona. Copyright © 2017 Elsevier Inc. All rights reserved.
Programming Native CRISPR Arrays for the Generation of Targeted Immunity.

PubMed

Hynes, Alexander P; Labrie, Simon J; Moineau, Sylvain

2016-05-03

The adaptive immune system of prokaryotes, called CRISPR-Cas (clustered regularly interspaced short palindromic repeats and CRISPR-associated genes), results in specific cleavage of invading nucleic acid sequences recognized by the cell's "memory" of past encounters. Here, we exploited the properties of native CRISPR-Cas systems to program the natural "memorization" process, efficiently generating immunity not only to a bacteriophage or plasmid but to any specifically chosen DNA sequence. CRISPR-Cas systems have entered the public consciousness as genome editing tools due to their readily programmable nature. In industrial settings, natural CRISPR-Cas immunity is already exploited to generate strains resistant to potentially disruptive viruses. However, the natural process by which bacteria acquire new target specificities (adaptation) is difficult to study and manipulate. The target against which immunity is conferred is selected stochastically. By biasing the immunization process, we offer a means to generate customized immunity, as well as provide a new tool to study adaptation. Copyright © 2016 Hynes et al.
Cloning and expression of the gene for bacteriophage T7 RNA polymerase

DOEpatents

Studier, F. William; Davanloo, Parichehre; Rosenberg, Alan H.; Moffatt, Barbara A.; Dunn, John J.

1999-02-09

This application describes a means to clone a functional gene for bacteriophage T7 RNA polymerase. Active T7 RNA polymerase is produced from the cloned gene, and a plasmid has been constructed that can produce the active enzyme in large amounts. T7 RNA polymerase transcribes DNA very efficiently and is highly selective for a relatively long promoter sequence. This enzyme is useful for synthesizing large amounts of RNA in vivo or in vitro, and is capable of producing a single RNA selectively from a complex mixture of DNAs. The procedure used to obtain a clone of the R7 RNA polymerase gene can be applied to other T7-like phages to obtain clones that produce RNA polymerases having different promoter specificities, different bacterial hosts, or other desirable properties. T7 RNA polymerase is also used in a system for selective, high-level synthesis of RNAs and proteins in suitable host cells.
Cloning and expression of the gene for bacteriophage T7 RNA polymerase

DOEpatents

Studier, F. William; Davanloo, Parichehre; Rosenberg, Alan H.; Moffatt, Barbara A.; Dunn, John J.

1997-12-02

This application describes a means to clone a functional gene for bacteriophage T7 RNA polymerase. Active T7 RNA polymerase is produced from the cloned gene, and a plasmid has been constructed that can produce the active enzyme in large amounts. T7 RNA polymerase transcribes DNA very efficiently and is highly selective for a relatively long promoter sequence. This enzyme is useful for synthesizing large amounts of RNA in vivo or in vitro, and is capable of producing a single RNA selectively from a complex mixture of DNAs. The procedure used to obtain a clone of the R7 RNA polymerase gene can be applied to other T7-like phages to obtain clones that produce RNA polymerases having different promoter specificities, different bacterial hosts, or other desirable properties. T7 RNA polymerase is also used in a system for selective, high-level synthesis of RNAs and proteins in suitable host cells.
Cloning and expression of the gene for bacteriophage T7 RNA polymerase

DOEpatents

Studier, F. William; Davanloo, Parichehre; Rosenberg, Alan H.; Moffatt, Barbara A.; Dunn, John J.

1990-01-01

This application describes a means to clone a functional gene for bacteriophage T7 RNA polymerase. Active T7 RNA polymerase is produced from the cloned gene, and a plasmid has been constructed that can produce the active enzyme in large amounts. T7 RNA polymerase transcribes DNA very efficiently and is highly selective for a relatively long promoter sequence. This enzyme is useful for synthesizing large amounts of RNA in vivo or in vitro, and is capable of producing a single RNA selectively from a complex mixture of DNAs. The procedure used to obtain a clone of the T7 RNA polymerase gene can be applied to other T7-like phages to obtain clones that produce RNA polymerases having different promoter specificities, different bacterial hosts, or other desirable properties. T7 RNA polymerase is also used in a system for selective, high-level synthesis of RNAs and proteins in suitable host cells.
Design and assembly of new non-viral RNAi delivery agents by microwave-assisted quaternization (MAQ) of tertiary amines

PubMed Central

Ghosh, Animesh; Mukherjee, Koushik; Jiang, Xinpeng; Zhou, Ying; McCarroll, Joshua; Qu, James; Swain, Pamela M.; Baigude, Huricha; Rana, Tariq M.

2010-01-01

RNA interference (RNAi), a gene-silencing phenomenon whereby double-stranded RNA (dsRNA) triggers the sequence-specific degradation of homologous mRNA. RNAi has been quickly and widely applied to discover gene functions and holds great potential to provide a new class of therapeutic agents. However, new chemistry and delivery approaches are greatly needed to silence disease-causing genes without toxic effects. We reasoned that conjugation of the cholesterol moiety to cationic lipids would enhance RNAi efficiencies and lower the toxic effects of lipid-mediated RNAi delivery. Here, we report the first design and synthesis of new cholesterol-conjugated cationic lipids for RNAi delivery using microwave-assisted quaternization (MAQ) of tertiary amines. This strategy can be employed to develop new classes of non-viral gene delivery agents under safe and fast reaction conditions. PMID:20722369
Wheat-specific gene, ribosomal protein l21, used as the endogenous reference gene for qualitative and real-time quantitative polymerase chain reaction detection of transgenes.

PubMed

Liu, Yi-Ke; Li, He-Ping; Huang, Tao; Cheng, Wei; Gao, Chun-Sheng; Zuo, Dong-Yun; Zhao, Zheng-Xi; Liao, Yu-Cai

2014-10-29

Wheat-specific ribosomal protein L21 (RPL21) is an endogenous reference gene suitable for genetically modified (GM) wheat identification. This taxon-specific RPL21 sequence displayed high homogeneity in different wheat varieties. Southern blots revealed 1 or 3 copies, and sequence analyses showed one amplicon in common wheat. Combined analyses with sequences from common wheat (AABBDD) and three diploid ancestral species, Triticum urartu (AA), Aegilops speltoides (BB), and Aegilops tauschii (DD), demonstrated the presence of this amplicon in the AA genome. Using conventional qualitative polymerase chain reaction (PCR), the limit of detection was 2 copies of wheat haploid genome per reaction. In the quantitative real-time PCR assay, limits of detection and quantification were about 2 and 8 haploid genome copies, respectively, the latter of which is 2.5-4-fold lower than other reported wheat endogenous reference genes. Construct-specific PCR assays were developed using RPL21 as an endogenous reference gene, and as little as 0.5% of GM wheat contents containing Arabidopsis NPR1 were properly quantified.
[Comparison of the sensibility and specificity between single-stranded conformation polymorphism and denaturing high-performance liquid chromatography in screening hMSH2 and hMLH1 gene mutations in hereditary non-polyposis colorectal cancer].

PubMed

Wei, Guang-hui; Zhao, Bo; Wang, Zhen-jun

2008-09-01

To compare the sensibility and specificity between single-stranded conformation polymorphism (SSCP) and denaturing high-performance liquid chromatography (DHPLC) in screening hMSH2 and hMLH1 gene mutations for the diagnosis of hereditary non-polyposis colorectal cancer (HNPCC). Seven Chinese HNPCC kindreds were collected. PCR-SSCP and DHPLC were used to screen the coding regions of hMSH2 and hMLH1 genes and the abnormal profiles were sequenced by a 377 DNA sequencer. Seven gene sequence variations of hMSH2 or hMLH1 were found. Among them, 4 variations were not found by SSCP, but by DHPLC. The sensibility of SSCP and DHPLC were 51.6% and 100% respectively, and the specificity were 66.6% and 93.3% respectively. DHPLC has better sensibility and specificity in screening hMSH2 and hMLH1 gene mutation as compared to SSCP. DHPLC is an ideal method in the diagnosis of HNPCC.
Progress of targeted genome modification approaches in higher plants.

PubMed

Cardi, Teodoro; Neal Stewart, C

2016-07-01

Transgene integration in plants is based on illegitimate recombination between non-homologous sequences. The low control of integration site and number of (trans/cis)gene copies might have negative consequences on the expression of transferred genes and their insertion within endogenous coding sequences. The first experiments conducted to use precise homologous recombination for gene integration commenced soon after the first demonstration that transgenic plants could be produced. Modern transgene targeting categories used in plant biology are: (a) homologous recombination-dependent gene targeting; (b) recombinase-mediated site-specific gene integration; (c) oligonucleotide-directed mutagenesis; (d) nuclease-mediated site-specific genome modifications. New tools enable precise gene replacement or stacking with exogenous sequences and targeted mutagenesis of endogeneous sequences. The possibility to engineer chimeric designer nucleases, which are able to target virtually any genomic site, and use them for inducing double-strand breaks in host DNA create new opportunities for both applied plant breeding and functional genomics. CRISPR is the most recent technology available for precise genome editing. Its rapid adoption in biological research is based on its inherent simplicity and efficacy. Its utilization, however, depends on available sequence information, especially for genome-wide analysis. We will review the approaches used for genome modification, specifically those for affecting gene integration and modification in higher plants. For each approach, the advantages and limitations will be noted. We also will speculate on how their actual commercial development and implementation in plant breeding will be affected by governmental regulations.
Outbred genome sequencing and CRISPR/Cas9 gene editing in butterflies

PubMed Central

Li, Xueyan; Fan, Dingding; Zhang, Wei; Liu, Guichun; Zhang, Lu; Zhao, Li; Fang, Xiaodong; Chen, Lei; Dong, Yang; Chen, Yuan; Ding, Yun; Zhao, Ruoping; Feng, Mingji; Zhu, Yabing; Feng, Yue; Jiang, Xuanting; Zhu, Deying; Xiang, Hui; Feng, Xikan; Li, Shuaicheng; Wang, Jun; Zhang, Guojie; Kronforst, Marcus R.; Wang, Wen

2015-01-01

Butterflies are exceptionally diverse but their potential as an experimental system has been limited by the difficulty of deciphering heterozygous genomes and a lack of genetic manipulation technology. Here we use a hybrid assembly approach to construct high-quality reference genomes for Papilio xuthus (contig and scaffold N50: 492 kb, 3.4 Mb) and Papilio machaon (contig and scaffold N50: 81 kb, 1.15 Mb), highly heterozygous species that differ in host plant affiliations, and adult and larval colour patterns. Integrating comparative genomics and analyses of gene expression yields multiple insights into butterfly evolution, including potential roles of specific genes in recent diversification. To functionally test gene function, we develop an efficient (up to 92.5%) CRISPR/Cas9 gene editing method that yields obvious phenotypes with three genes, Abdominal-B, ebony and frizzled. Our results provide valuable genomic and technological resources for butterflies and unlock their potential as a genetic model system. PMID:26354079
Gene finding in metatranscriptomic sequences.

PubMed

Ismail, Wazim Mohammed; Ye, Yuzhen; Tang, Haixu

2014-01-01

Metatranscriptomic sequencing is a highly sensitive bioassay of functional activity in a microbial community, providing complementary information to the metagenomic sequencing of the community. The acquisition of the metatranscriptomic sequences will enable us to refine the annotations of the metagenomes, and to study the gene activities and their regulation in complex microbial communities and their dynamics. In this paper, we present TransGeneScan, a software tool for finding genes in assembled transcripts from metatranscriptomic sequences. By incorporating several features of metatranscriptomic sequencing, including strand-specificity, short intergenic regions, and putative antisense transcripts into a Hidden Markov Model, TranGeneScan can predict a sense transcript containing one or multiple genes (in an operon) or an antisense transcript. We tested TransGeneScan on a mock metatranscriptomic data set containing three known bacterial genomes. The results showed that TranGeneScan performs better than metagenomic gene finders (MetaGeneMark and FragGeneScan) on predicting protein coding genes in assembled transcripts, and achieves comparable or even higher accuracy than gene finders for microbial genomes (Glimmer and GeneMark). These results imply, with the assistance of metatranscriptomic sequencing, we can obtain a broad and precise picture about the genes (and their functions) in a microbial community. TransGeneScan is available as open-source software on SourceForge at https://sourceforge.net/projects/transgenescan/.
Developing a Novel Gene-Delivery Vector System Using the Recombinant Fusion Protein of Pseudomonas Exotoxin A and Hyperthermophilic Archaeal Histone HPhA

PubMed Central

Zhang, Ling; Feng, Yan; Li, Zehong; Wu, GuangMou; Yue, Yuhuan; Li, Gensong; Cao, Yu; Zhu, Ping

2015-01-01

Non-viral gene delivery system with many advantages has a great potential for the future of gene therapy. One inherent obstacle of such approach is the uptake by endocytosis into vesicular compartments. Receptor-mediated gene delivery method holds promise to overcome this obstacle. In this study, we developed a receptor-mediated gene delivery system based on a combination of the Pseudomonas exotoxin A (PE), which has a receptor binding and membrane translocation domain, and the hyperthermophilic archaeal histone (HPhA), which has the DNA binding ability. First, we constructed and expressed the rPE-HPhA fusion protein. We then examined the cytotoxicity and the DNA binding ability of rPE-HPhA. We further assessed the efficiency of transfection of the pEGF-C1 plasmid DNA to CHO cells by the rPE-HPhA system, in comparison to the cationic liposome method. The results showed that the transfection efficiency of rPE-HPhA was higher than that of cationic liposomes. In addition, the rPE-HPhA gene delivery system is non-specific to DNA sequence, topology or targeted cell type. Thus, the rPE-HPhA system can be used for delivering genes of interest into mammalian cells and has great potential to be applied for gene therapy. PMID:26556098

Single-Nucleotide-Specific Targeting of the Tf1 Retrotransposon Promoted by the DNA-Binding Protein Sap1 of Schizosaccharomyces pombe.

PubMed

Hickey, Anthony; Esnault, Caroline; Majumdar, Anasuya; Chatterjee, Atreyi Ghatak; Iben, James R; McQueen, Philip G; Yang, Andrew X; Mizuguchi, Takeshi; Grewal, Shiv I S; Levin, Henry L

2015-11-01

Transposable elements (TEs) constitute a substantial fraction of the eukaryotic genome and, as a result, have a complex relationship with their host that is both adversarial and dependent. To minimize damage to cellular genes, TEs possess mechanisms that target integration to sequences of low importance. However, the retrotransposon Tf1 of Schizosaccharomyces pombe integrates with a surprising bias for promoter sequences of stress-response genes. The clustering of integration in specific promoters suggests that Tf1 possesses a targeting mechanism that is important for evolutionary adaptation to changes in environment. We report here that Sap1, an essential DNA-binding protein, plays an important role in Tf1 integration. A mutation in Sap1 resulted in a 10-fold drop in Tf1 transposition, and measures of transposon intermediates support the argument that the defect occurred in the process of integration. Published ChIP-Seq data on Sap1 binding combined with high-density maps of Tf1 integration that measure independent insertions at single-nucleotide positions show that 73.4% of all integration occurs at genomic sequences bound by Sap1. This represents high selectivity because Sap1 binds just 6.8% of the genome. A genome-wide analysis of promoter sequences revealed that Sap1 binding and amounts of integration correlate strongly. More important, an alignment of the DNA-binding motif of Sap1 revealed integration clustered on both sides of the motif and showed high levels specifically at positions +19 and -9. These data indicate that Sap1 contributes to the efficiency and position of Tf1 integration. Copyright © 2015 by the Genetics Society of America.
Single-Nucleotide-Specific Targeting of the Tf1 Retrotransposon Promoted by the DNA-Binding Protein Sap1 of Schizosaccharomyces pombe

PubMed Central

Hickey, Anthony; Esnault, Caroline; Majumdar, Anasuya; Chatterjee, Atreyi Ghatak; Iben, James R.; McQueen, Philip G.; Yang, Andrew X.; Mizuguchi, Takeshi; Grewal, Shiv I. S.; Levin, Henry L.

2015-01-01

Transposable elements (TEs) constitute a substantial fraction of the eukaryotic genome and, as a result, have a complex relationship with their host that is both adversarial and dependent. To minimize damage to cellular genes, TEs possess mechanisms that target integration to sequences of low importance. However, the retrotransposon Tf1 of Schizosaccharomyces pombe integrates with a surprising bias for promoter sequences of stress-response genes. The clustering of integration in specific promoters suggests that Tf1 possesses a targeting mechanism that is important for evolutionary adaptation to changes in environment. We report here that Sap1, an essential DNA-binding protein, plays an important role in Tf1 integration. A mutation in Sap1 resulted in a 10-fold drop in Tf1 transposition, and measures of transposon intermediates support the argument that the defect occurred in the process of integration. Published ChIP-Seq data on Sap1 binding combined with high-density maps of Tf1 integration that measure independent insertions at single-nucleotide positions show that 73.4% of all integration occurs at genomic sequences bound by Sap1. This represents high selectivity because Sap1 binds just 6.8% of the genome. A genome-wide analysis of promoter sequences revealed that Sap1 binding and amounts of integration correlate strongly. More important, an alignment of the DNA-binding motif of Sap1 revealed integration clustered on both sides of the motif and showed high levels specifically at positions +19 and −9. These data indicate that Sap1 contributes to the efficiency and position of Tf1 integration. PMID:26358720
Terminal Duplex Stability and Nucleotide Identity Differentially Control siRNA Loading and Activity in RNA Interference

PubMed Central

Angart, Phillip A.; Carlson, Rebecca J.; Adu-Berchie, Kwasi

2016-01-01

Efficient short interfering RNA (siRNA)-mediated gene silencing requires selection of a sequence that is complementary to the intended target and possesses sequence and structural features that encourage favorable functional interactions with the RNA interference (RNAi) pathway proteins. In this study, we investigated how terminal sequence and structural characteristics of siRNAs contribute to siRNA strand loading and silencing activity and how these characteristics ultimately result in a functionally asymmetric duplex in cultured HeLa cells. Our results reiterate that the most important characteristic in determining siRNA activity is the 5′ terminal nucleotide identity. Our findings further suggest that siRNA loading is controlled principally by the hybridization stability of the 5′ terminus (Nucleotides: 1–2) of each siRNA strand, independent of the opposing terminus. Postloading, RNA-induced silencing complex (RISC)–specific activity was found to be improved by lower hybridization stability in the 5′ terminus (Nucleotides: 3–4) of the loaded siRNA strand and greater hybridization stability toward the 3′ terminus (Nucleotides: 17–18). Concomitantly, specific recognition of the 5′ terminal nucleotide sequence by human Argonaute 2 (Ago2) improves RISC half-life. These findings indicate that careful selection of siRNA sequences can maximize both the loading and the specific activity of the intended guide strand. PMID:27399870
Comparative Analysis of the Genomes of Two Field Isolates of the Rice Blast Fungus Magnaporthe oryzae

PubMed Central

Li, Zhigang; Hu, Songnian; Yao, Nan; Dean, Ralph A.; Zhao, Wensheng; Shen, Mi; Zhang, Haiwang; Li, Chao; Liu, Liyuan; Cao, Lei; Xu, Xiaowen; Xing, Yunfei; Hsiang, Tom; Zhang, Ziding; Xu, Jin-Rong; Peng, You-Liang

2012-01-01

Rice blast caused by Magnaporthe oryzae is one of the most destructive diseases of rice worldwide. The fungal pathogen is notorious for its ability to overcome host resistance. To better understand its genetic variation in nature, we sequenced the genomes of two field isolates, Y34 and P131. In comparison with the previously sequenced laboratory strain 70-15, both field isolates had a similar genome size but slightly more genes. Sequences from the field isolates were used to improve genome assembly and gene prediction of 70-15. Although the overall genome structure is similar, a number of gene families that are likely involved in plant-fungal interactions are expanded in the field isolates. Genome-wide analysis on asynonymous to synonymous nucleotide substitution rates revealed that many infection-related genes underwent diversifying selection. The field isolates also have hundreds of isolate-specific genes and a number of isolate-specific gene duplication events. Functional characterization of randomly selected isolate-specific genes revealed that they play diverse roles, some of which affect virulence. Furthermore, each genome contains thousands of loci of transposon-like elements, but less than 30% of them are conserved among different isolates, suggesting active transposition events in M. oryzae. A total of approximately 200 genes were disrupted in these three strains by transposable elements. Interestingly, transposon-like elements tend to be associated with isolate-specific or duplicated sequences. Overall, our results indicate that gain or loss of unique genes, DNA duplication, gene family expansion, and frequent translocation of transposon-like elements are important factors in genome variation of the rice blast fungus. PMID:22876203
Species specific identification of spore-producing microbes using the gene sequence of small acid-soluble spore coat proteins for amplification based diagnostics

DOEpatents

McKinney, Nancy

2002-01-01

PCR (polymerase chain reaction) primers for the detection of certain Bacillus species, such as Bacillus anthracis. The primers specifically amplify only DNA found in the target species and can distinguish closely related species. Species-specific PCR primers for Bacillus anthracis, Bacillus globigii and Clostridium perfringens are disclosed. The primers are directed to unique sequences within sasp (small acid soluble protein) genes.
Whole Genome SNP Genotyping and Exome Sequencing Reveal Novel Genetic Variants and Putative Causative Genes in Congenital Hyperinsulinism

PubMed Central

Proverbio, Maria Carla; Mangano, Eleonora; Gessi, Alessandra; Bordoni, Roberta; Spinelli, Roberta; Asselta, Rosanna; Valin, Paola Sogno; Di Candia, Stefania; Zamproni, Ilaria; Diceglie, Cecilia; Mora, Stefano; Caruso-Nicoletti, Manuela; Salvatoni, Alessandro; De Bellis, Gianluca; Battaglia, Cristina

2013-01-01

Congenital hyperinsulinism of infancy (CHI) is a rare disorder characterized by severe hypoglycemia due to inappropriate insulin secretion. The genetic causes of CHI have been found in genes regulating insulin secretion from pancreatic β-cells; recessive inactivating mutations in the ABCC8 and KCNJ11 genes represent the most common events. Despite the advances in understanding the molecular pathogenesis of CHI, specific genetic determinants in about 50 % of the CHI patients remain unknown, suggesting additional locus heterogeneity. In order to search for novel loci contributing to the pathogenesis of CHI, we combined a family-based association study, using the transmission disequilibrium test on 17 CHI patients lacking mutations in ABCC8/KCNJ11, with a whole-exome sequencing analysis performed on 10 probands. This strategy allowed the identification of the potential causative mutations in genes implicated in the regulation of insulin secretion such as transmembrane proteins (CACNA1A, KCNH6, KCNJ10, NOTCH2, RYR3, SCN8A, TRPV3, TRPC5), cytosolic (ACACB, CAMK2D, CDKAL1, GNAS, NOS2, PDE4C, PIK3R3) and mitochondrial enzymes (PC, SLC24A6), and in four genes (CSMD1, SLC37A3, SULF1, TLL1) suggested by TDT family-based association study. Moreover, the exome-sequencing approach resulted to be an efficient diagnostic tool for CHI, allowing the identification of mutations in three causative CHI genes (ABCC8, GLUD1, and HNF1A) in four out of 10 patients. Overall, the present study should be considered as a starting point to design further investigations: our results might indeed contribute to meta-analysis studies, aimed at the identification/confirmation of novel causative or modifier genes. PMID:23869231
Strain-Level Diversity of Secondary Metabolism in Streptomyces albus

PubMed Central

Seipke, Ryan F.

2015-01-01

Streptomyces spp. are robust producers of medicinally-, industrially- and agriculturally-important small molecules. Increased resistance to antibacterial agents and the lack of new antibiotics in the pipeline have led to a renaissance in natural product discovery. This endeavor has benefited from inexpensive high quality DNA sequencing technology, which has generated more than 140 genome sequences for taxonomic type strains and environmental Streptomyces spp. isolates. Many of the sequenced streptomycetes belong to the same species. For instance, Streptomyces albus has been isolated from diverse environmental niches and seven strains have been sequenced, consequently this species has been sequenced more than any other streptomycete, allowing valuable analyses of strain-level diversity in secondary metabolism. Bioinformatics analyses identified a total of 48 unique biosynthetic gene clusters harboured by Streptomyces albus strains. Eighteen of these gene clusters specify the core secondary metabolome of the species. Fourteen of the gene clusters are contained by one or more strain and are considered auxiliary, while 16 of the gene clusters encode the production of putative strain-specific secondary metabolites. Analysis of Streptomyces albus strains suggests that each strain of a Streptomyces species likely harbours at least one strain-specific biosynthetic gene cluster. Importantly, this implies that deep sequencing of a species will not exhaust gene cluster diversity and will continue to yield novelty. PMID:25635820
The "grep" command but not FusionMap, FusionFinder or ChimeraScan captures the CIC-DUX4 fusion gene from whole transcriptome sequencing data on a small round cell tumor with t(4;19)(q35;q13).

PubMed

Panagopoulos, Ioannis; Gorunova, Ludmila; Bjerkehagen, Bodil; Heim, Sverre

2014-01-01

Whole transcriptome sequencing was used to study a small round cell tumor in which a t(4;19)(q35;q13) was part of the complex karyotype but where the initial reverse transcriptase PCR (RT-PCR) examination did not detect a CIC-DUX4 fusion transcript previously described as the crucial gene-level outcome of this specific translocation. The RNA sequencing data were analysed using the FusionMap, FusionFinder, and ChimeraScan programs which are specifically designed to identify fusion genes. FusionMap, FusionFinder, and ChimeraScan identified 1017, 102, and 101 fusion transcripts, respectively, but CIC-DUX4 was not among them. Since the RNA sequencing data are in the fastq text-based format, we searched the files using the "grep" command-line utility. The "grep" command searches the text for specific expressions and displays, by default, the lines where matches occur. The "specific expression" was a sequence of 20 nucleotides from the coding part of the last exon 20 of CIC (Reference Sequence: NM_015125.3) chosen since all the so far reported CIC breakpoints have occurred here. Fifteen chimeric CIC-DUX4 cDNA sequences were captured and the fusion between the CIC and DUX4 genes was mapped precisely. New primer combinations were constructed based on these findings and were used together with a polymerase suitable for amplification of GC-rich DNA templates to amplify CIC-DUX4 cDNA fragments which had the same fusion point found with "grep". In conclusion, FusionMap, FusionFinder, and ChimeraScan generated a plethora of fusion transcripts but did not detect the biologically important CIC-DUX4 chimeric transcript; they are generally useful but evidently suffer from imperfect both sensitivity and specificity. The "grep" command is an excellent tool to capture chimeric transcripts from RNA sequencing data when the pathological and/or cytogenetic information strongly indicates the presence of a specific fusion gene.
High-Resolution Whole-Genome Sequencing Reveals That Specific Chromatin Domains from Most Human Chromosomes Associate with Nucleoli

PubMed Central

van Koningsbruggen, Silvana; Gierliński, Marek; Schofield, Pietá; Martin, David; Barton, Geoffey J.; Ariyurek, Yavuz; den Dunnen, Johan T.

2010-01-01

The nuclear space is mostly occupied by chromosome territories and nuclear bodies. Although this organization of chromosomes affects gene function, relatively little is known about the role of nuclear bodies in the organization of chromosomal regions. The nucleolus is the best-studied subnuclear structure and forms around the rRNA repeat gene clusters on the acrocentric chromosomes. In addition to rDNA, other chromatin sequences also surround the nucleolar surface and may even loop into the nucleolus. These additional nucleolar-associated domains (NADs) have not been well characterized. We present here a whole-genome, high-resolution analysis of chromatin endogenously associated with nucleoli. We have used a combination of three complementary approaches, namely fluorescence comparative genome hybridization, high-throughput deep DNA sequencing and photoactivation combined with time-lapse fluorescence microscopy. The data show that specific sequences from most human chromosomes, in addition to the rDNA repeat units, associate with nucleoli in a reproducible and heritable manner. NADs have in common a high density of AT-rich sequence elements, low gene density and a statistically significant enrichment in transcriptionally repressed genes. Unexpectedly, both the direct DNA sequencing and fluorescence photoactivation data show that certain chromatin loci can specifically associate with either the nucleolus, or the nuclear envelope. PMID:20826608
High-resolution whole-genome sequencing reveals that specific chromatin domains from most human chromosomes associate with nucleoli.

PubMed

van Koningsbruggen, Silvana; Gierlinski, Marek; Schofield, Pietá; Martin, David; Barton, Geoffey J; Ariyurek, Yavuz; den Dunnen, Johan T; Lamond, Angus I

2010-11-01

The nuclear space is mostly occupied by chromosome territories and nuclear bodies. Although this organization of chromosomes affects gene function, relatively little is known about the role of nuclear bodies in the organization of chromosomal regions. The nucleolus is the best-studied subnuclear structure and forms around the rRNA repeat gene clusters on the acrocentric chromosomes. In addition to rDNA, other chromatin sequences also surround the nucleolar surface and may even loop into the nucleolus. These additional nucleolar-associated domains (NADs) have not been well characterized. We present here a whole-genome, high-resolution analysis of chromatin endogenously associated with nucleoli. We have used a combination of three complementary approaches, namely fluorescence comparative genome hybridization, high-throughput deep DNA sequencing and photoactivation combined with time-lapse fluorescence microscopy. The data show that specific sequences from most human chromosomes, in addition to the rDNA repeat units, associate with nucleoli in a reproducible and heritable manner. NADs have in common a high density of AT-rich sequence elements, low gene density and a statistically significant enrichment in transcriptionally repressed genes. Unexpectedly, both the direct DNA sequencing and fluorescence photoactivation data show that certain chromatin loci can specifically associate with either the nucleolus, or the nuclear envelope.
Inhibition of hepatitis B virus replication with linear DNA sequences expressing antiviral micro-RNA shuttles

DOE Office of Scientific and Technical Information (OSTI.GOV)

Chattopadhyay, Saket; Ely, Abdullah; Bloom, Kristie

2009-11-20

RNA interference (RNAi) may be harnessed to inhibit viral gene expression and this approach is being developed to counter chronic infection with hepatitis B virus (HBV). Compared to synthetic RNAi activators, DNA expression cassettes that generate silencing sequences have advantages of sustained efficacy and ease of propagation in plasmid DNA (pDNA). However, the large size of pDNAs and inclusion of sequences conferring antibiotic resistance and immunostimulation limit delivery efficiency and safety. To develop use of alternative DNA templates that may be applied for therapeutic gene silencing, we assessed the usefulness of PCR-generated linear expression cassettes that produce anti-HBV micro-RNA (miR)more » shuttles. We found that silencing of HBV markers of replication was efficient (>75%) in cell culture and in vivo. miR shuttles were processed to form anti-HBV guide strands and there was no evidence of induction of the interferon response. Modification of terminal sequences to include flanking human adenoviral type-5 inverted terminal repeats was easily achieved and did not compromise silencing efficacy. These linear DNA sequences should have utility in the development of gene silencing applications where modifications of terminal elements with elimination of potentially harmful and non-essential sequences are required.« less
Low molecular weight polyethylenimine cross-linked by 2-hydroxypropyl-gamma-cyclodextrin coupled to peptide targeting HER2 as a gene delivery vector.

PubMed

Huang, Hongliang; Yu, Hai; Tang, Guping; Wang, Qingqing; Li, Jun

2010-03-01

Gene delivery is one of the critical steps for gene therapy. Non-viral vectors have many advantages but suffered from low gene transfection efficiency. Here, in order to develop new polymeric gene vectors with low cytotoxicity and high gene transfection efficiency, we synthesized a cationic polymer composed of low molecular weight polyethylenimine (PEI) of molecular weight of 600 Da cross-linked by 2-hydroxypropyl-gamma-cyclodextrin (HP gamma-CD) and then coupled to MC-10 oligopeptide containing a sequence of Met-Ala-Arg-Ala-Lys-Glu. The oligopeptide can target to HER2, the human epidermal growth factor receptor 2, which is often over expressed in many breast and ovary cancers. The new gene vector was expected to be able to target delivery of genes to HER2 positive cancer cells for gene therapy. The new gene vector was composed of chemically bonded HP gamma-CD, PEI (600 Da), and MC-10 peptide at a molar ratio of 1:3.3:1.2. The gene vector could condense plasmid DNA at an N/P ratio of 6 or above. The particle size of HP gamma-CD-PEI-P/DNA complexes at N/P ratios 40 was around 170-200 nm, with zeta potential of about 20 mV. The gene vector showed very low cytotoxicity, strong targeting specificity to HER2 receptor, and high efficiency of delivering DNA to target cells in vitro and in vivo with the reporter genes. The delivery of therapeutic IFN-alpha gene mediated by the new gene vector and the therapeutic efficiency were also studied in mice animal model. The animal study results showed that the new gene vector HP gamma-CD-PEI-P significantly enhanced the anti-tumor effect on tumor-bearing nude mice as compared to PEI (25 kDa), HP gamma-CD-PEI, and other controls, indicating that this new polymeric gene vector is a potential candidate for cancer gene therapy. (c) 2009 Elsevier Ltd. All rights reserved.
Influence of quasi-specific sites on kinetics of target DNA search by a sequence-specific DNA-binding protein.

PubMed

Kemme, Catherine A; Esadze, Alexandre; Iwahara, Junji

2015-11-10

Functions of transcription factors require formation of specific complexes at particular sites in cis-regulatory elements of genes. However, chromosomal DNA contains numerous sites that are similar to the target sequences recognized by transcription factors. The influence of such "quasi-specific" sites on functions of the transcription factors is not well understood at present by experimental means. In this work, using fluorescence methods, we have investigated the influence of quasi-specific DNA sites on the efficiency of target location by the zinc finger DNA-binding domain of the inducible transcription factor Egr-1, which recognizes a 9 bp sequence. By stopped-flow assays, we measured the kinetics of Egr-1's association with a target site on 143 bp DNA in the presence of various competitor DNAs, including nonspecific and quasi-specific sites. The presence of quasi-specific sites on competitor DNA significantly decelerated the target association by the Egr-1 protein. The impact of the quasi-specific sites depended strongly on their affinity, their concentration, and the degree of their binding to the protein. To quantitatively describe the kinetic impact of the quasi-specific sites, we derived an analytical form of the apparent kinetic rate constant for the target association and used it for fitting to the experimental data. Our kinetic data with calf thymus DNA as a competitor suggested that there are millions of high-affinity quasi-specific sites for Egr-1 among the 3 billion bp of genomic DNA. This study quantitatively demonstrates that naturally abundant quasi-specific sites on DNA can considerably impede the target search processes of sequence-specific DNA-binding proteins.
Identification of Candidate Genes Underlying an Iron Efficiency Quantitative Trait Locus in Soybean1

PubMed Central

Peiffer, Gregory A.; King, Keith E.; Severin, Andrew J.; May, Gregory D.; Cianzio, Silvia R.; Lin, Shun Fu; Lauter, Nicholas C.; Shoemaker, Randy C.

2012-01-01

Prevalent on calcareous soils in the United States and abroad, iron deficiency is among the most common and severe nutritional stresses in plants. In soybean (Glycine max) commercial plantings, the identification and use of iron-efficient genotypes has proven to be the best form of managing this soil-related plant stress. Previous studies conducted in soybean identified a significant iron efficiency quantitative trait locus (QTL) explaining more than 70% of the phenotypic variation for the trait. In this research, we identified candidate genes underlying this QTL through molecular breeding, mapping, and transcriptome sequencing. Introgression mapping was performed using two related near-isogenic lines in which a region located on soybean chromosome 3 required for iron efficiency was identified. The region corresponds to the previously reported iron efficiency QTL. The location was further confirmed through QTL mapping conducted in this study. Transcriptome sequencing and quantitative real-time-polymerase chain reaction identified two genes encoding transcription factors within the region that were significantly induced in soybean roots under iron stress. The two induced transcription factors were identified as homologs of the subgroup lb basic helix-loop-helix (bHLH) genes that are known to regulate the strategy I response in Arabidopsis (Arabidopsis thaliana). Resequencing of these differentially expressed genes unveiled a significant deletion within a predicted dimerization domain. We hypothesize that this deletion disrupts the Fe-DEFICIENCY-INDUCED TRANSCRIPTION FACTOR (FIT)/bHLH heterodimer that has been shown to induce known iron acquisition genes. PMID:22319075
MicroRNAs Form Triplexes with Double Stranded DNA at Sequence-Specific Binding Sites; a Eukaryotic Mechanism via which microRNAs Could Directly Alter Gene Expression

PubMed Central

Grace, Christy R.; Ferreira, Antonio M.; Waddell, M. Brett; Ridout, Granger; Naeve, Deanna; Leuze, Michael; LoCascio, Philip F.; Panetta, John C.; Wilkinson, Mark R.; Pui, Ching-Hon; Naeve, Clayton W.; Uberbacher, Edward C.; Bonten, Erik J.; Evans, William E.

2016-01-01

MicroRNAs are important regulators of gene expression, acting primarily by binding to sequence-specific locations on already transcribed messenger RNAs (mRNA) and typically down-regulating their stability or translation. Recent studies indicate that microRNAs may also play a role in up-regulating mRNA transcription levels, although a definitive mechanism has not been established. Double-helical DNA is capable of forming triple-helical structures through Hoogsteen and reverse Hoogsteen interactions in the major groove of the duplex, and we show physical evidence (i.e., NMR, FRET, SPR) that purine or pyrimidine-rich microRNAs of appropriate length and sequence form triple-helical structures with purine-rich sequences of duplex DNA, and identify microRNA sequences that favor triplex formation. We developed an algorithm (Trident) to search genome-wide for potential triplex-forming sites and show that several mammalian and non-mammalian genomes are enriched for strong microRNA triplex binding sites. We show that those genes containing sequences favoring microRNA triplex formation are markedly enriched (3.3 fold, p<2.2 × 10−16) for genes whose expression is positively correlated with expression of microRNAs targeting triplex binding sequences. This work has thus revealed a new mechanism by which microRNAs could interact with gene promoter regions to modify gene transcription. PMID:26844769
Development of PCR primers specific for the amplification and direct sequencing of gyrB genes from microbacteria, order Actinomycetales.

PubMed

Richert, Kathrin; Brambilla, Evelyne; Stackebrandt, Erko

2005-01-01

PCR primer sets were developed for the specific amplification and sequence analyses encoding the gyrase subunit B (gyrB) of members of the family Microbacteriaceae, class Actinobacteria. The family contains species highly related by 16S rRNA gene sequence analyses. In order to test if the gene sequence analysis of gyrB is appropriate to discriminate between closely related species, we evaluate the 16S rRNA gene phylogeny of its members. As the published universal primer set for gyrB failed to amplify the responding gene of the majority of the 80 type strains of the family, three new primer sets were identified that generated fragments with a composite sequence length of about 900 nt. However, the amplification of all three fragments was successful only in 25% of the 80 type strains. In this study, the substitution frequencies in genes encoding gyrase and 16S rDNA were compared for 10 strains of nine genera. The frequency of gyrB nucleotide substitution is significantly higher than that of the 16S rDNA, and no linear correlation exists between the similarities of both molecules among members of the Microbacteriaceae. The phylogenetic analyses using the gyrB sequences provide higher resolution than using 16S rDNA sequences and seem able to discriminate between closely related species.
Genome analysis and identification of gelatinase encoded gene in Enterobacter aerogenes

NASA Astrophysics Data System (ADS)

Shahimi, Safiyyah; Mutalib, Sahilah Abdul; Khalid, Rozida Abdul; Repin, Rul Aisyah Mat; Lamri, Mohd Fadly; Bakar, Mohd Faizal Abu; Isa, Mohd Noor Mat

2016-11-01

In this study, bioinformatic analysis towards genome sequence of E. aerogenes was done to determine gene encoded for gelatinase. Enterobacter aerogenes was isolated from hot spring water and gelatinase species-specific bacterium to porcine and fish gelatin. This bacterium offers the possibility of enzymes production which is specific to both species gelatine, respectively. Enterobacter aerogenes was partially genome sequenced resulting in 5.0 mega basepair (Mbp) total size of sequence. From pre-process pipeline, 87.6 Mbp of total reads, 68.8 Mbp of total high quality reads and 78.58 percent of high quality percentage was determined. Genome assembly produced 120 contigs with 67.5% of contigs over 1 kilo base pair (kbp), 124856 bp of N50 contig length and 55.17 % of GC base content percentage. About 4705 protein gene was identified from protein prediction analysis. Two candidate genes selected have highest similarity identity percentage against gelatinase enzyme available in Swiss-Prot and NCBI online database. They were NODE_9_length_26866_cov_148.013245_12 containing 1029 base pair (bp) sequence with 342 amino acid sequence and NODE_24_length_155103_cov_177.082458_62 which containing 717 bp sequence with 238 amino acid sequence, respectively. Thus, two paired of primers (forward and reverse) were designed, based on the open reading frame (ORF) of selected genes. Genome analysis of E. aerogenes resulting genes encoded gelatinase were identified.
DOE Office of Scientific and Technical Information (OSTI.GOV)

Stella, Stefano; University of Copenhagen, Blegdamsvej 3B, 2200 Copenhagen; Molina, Rafael

Crystal structures of BurrH and the BurrH–DNA complex are reported. DNA editing offers new possibilities in synthetic biology and biomedicine for modulation or modification of cellular functions to organisms. However, inaccuracy in this process may lead to genome damage. To address this important problem, a strategy allowing specific gene modification has been achieved through the addition, removal or exchange of DNA sequences using customized proteins and the endogenous DNA-repair machinery. Therefore, the engineering of specific protein–DNA interactions in protein scaffolds is key to providing ‘toolkits’ for precise genome modification or regulation of gene expression. In a search for putative DNA-bindingmore » domains, BurrH, a protein that recognizes a 19 bp DNA target, was identified. Here, its apo and DNA-bound crystal structures are reported, revealing a central region containing 19 repeats of a helix–loop–helix modular domain (BurrH domain; BuD), which identifies the DNA target by a single residue-to-nucleotide code, thus facilitating its redesign for gene targeting. New DNA-binding specificities have been engineered in this template, showing that BuD-derived nucleases (BuDNs) induce high levels of gene targeting in a locus of the human haemoglobin β (HBB) gene close to mutations responsible for sickle-cell anaemia. Hence, the unique combination of high efficiency and specificity of the BuD arrays can push forward diverse genome-modification approaches for cell or organism redesign, opening new avenues for gene editing.« less
Prospecting Metagenomic Enzyme Subfamily Genes for DNA Family Shuffling by a Novel PCR-based Approach*

PubMed Central

Wang, Qiuyan; Wu, Huili; Wang, Anming; Du, Pengfei; Pei, Xiaolin; Li, Haifeng; Yin, Xiaopu; Huang, Lifeng; Xiong, Xiaolong

2010-01-01

DNA family shuffling is a powerful method for enzyme engineering, which utilizes recombination of naturally occurring functional diversity to accelerate laboratory-directed evolution. However, the use of this technique has been hindered by the scarcity of family genes with the required level of sequence identity in the genome database. We describe here a strategy for collecting metagenomic homologous genes for DNA shuffling from environmental samples by truncated metagenomic gene-specific PCR (TMGS-PCR). Using identified metagenomic gene-specific primers, twenty-three 921-bp truncated lipase gene fragments, which shared 64–99% identity with each other and formed a distinct subfamily of lipases, were retrieved from 60 metagenomic samples. These lipase genes were shuffled, and selected active clones were characterized. The chimeric clones show extensive functional and genetic diversity, as demonstrated by functional characterization and sequence analysis. Our results indicate that homologous sequences of genes captured by TMGS-PCR can be used as suitable genetic material for DNA family shuffling with broad applications in enzyme engineering. PMID:20962349
Chromosomal location and gene paucity of the male specific region on papaya Y chromosome.

PubMed

Yu, Qingyi; Hou, Shaobin; Hobza, Roman; Feltus, F Alex; Wang, Xiue; Jin, Weiwei; Skelton, Rachel L; Blas, Andrea; Lemke, Cornelia; Saw, Jimmy H; Moore, Paul H; Alam, Maqsudul; Jiang, Jiming; Paterson, Andrew H; Vyskot, Boris; Ming, Ray

2007-08-01

Sex chromosomes in flowering plants evolved recently and many of them remain homomorphic, including those in papaya. We investigated the chromosomal location of papaya's small male specific region of the hermaphrodite Y (Yh) chromosome (MSY) and its genomic features. We conducted chromosome fluorescence in situ hybridization mapping of Yh-specific bacterial artificial chromosomes (BACs) and placed the MSY near the centromere of the papaya Y chromosome. Then we sequenced five MSY BACs to examine the genomic features of this specialized region, which resulted in the largest collection of contiguous genomic DNA sequences of a Y chromosome in flowering plants. Extreme gene paucity was observed in the papaya MSY with no functional gene identified in 715 kb MSY sequences. A high density of retroelements and local sequence duplications were detected in the MSY that is suppressed for recombination. Location of the papaya MSY near the centromere might have provided recombination suppression and fostered paucity of genes in the male specific region of the Y chromosome. Our findings provide critical information for deciphering the sex chromosomes in papaya and reference information for comparative studies of other sex chromosomes in animals and plants.

Computational Tools and Algorithms for Designing Customized Synthetic Genes

PubMed Central

Gould, Nathan; Hendy, Oliver; Papamichail, Dimitris

2014-01-01

Advances in DNA synthesis have enabled the construction of artificial genes, gene circuits, and genomes of bacterial scale. Freedom in de novo design of synthetic constructs provides significant power in studying the impact of mutations in sequence features, and verifying hypotheses on the functional information that is encoded in nucleic and amino acids. To aid this goal, a large number of software tools of variable sophistication have been implemented, enabling the design of synthetic genes for sequence optimization based on rationally defined properties. The first generation of tools dealt predominantly with singular objectives such as codon usage optimization and unique restriction site incorporation. Recent years have seen the emergence of sequence design tools that aim to evolve sequences toward combinations of objectives. The design of optimal protein-coding sequences adhering to multiple objectives is computationally hard, and most tools rely on heuristics to sample the vast sequence design space. In this review, we study some of the algorithmic issues behind gene optimization and the approaches that different tools have adopted to redesign genes and optimize desired coding features. We utilize test cases to demonstrate the efficiency of each approach, as well as identify their strengths and limitations. PMID:25340050
Direct and long-term detection of gene doping in conventional blood samples.

PubMed

Beiter, T; Zimmermann, M; Fragasso, A; Hudemann, J; Niess, A M; Bitzer, M; Lauer, U M; Simon, P

2011-03-01

The misuse of somatic gene therapy for the purpose of enhancing athletic performance is perceived as a coming threat to the world of sports and categorized as 'gene doping'. This article describes a direct detection approach for gene doping that gives a clear yes-or-no answer based on the presence or absence of transgenic DNA in peripheral blood samples. By exploiting a priming strategy to specifically amplify intronless DNA sequences, we developed PCR protocols allowing the detection of very small amounts of transgenic DNA in genomic DNA samples to screen for six prime candidate genes. Our detection strategy was verified in a mouse model, giving positive signals from minute amounts (20 μl) of blood samples for up to 56 days following intramuscular adeno-associated virus-mediated gene transfer, one of the most likely candidate vector systems to be misused for gene doping. To make our detection strategy amenable for routine testing, we implemented a robust sample preparation and processing protocol that allows cost-efficient analysis of small human blood volumes (200 μl) with high specificity and reproducibility. The practicability and reliability of our detection strategy was validated by a screening approach including 327 blood samples taken from professional and recreational athletes under field conditions.
The construction of cDNA library and the screening of related antigen of ascitic tumor cells of ovarian cancer.

PubMed

Hou, Q; Chen, K; Shan, Z

2015-01-01

To construct the cDNA library of the ascites tumor cells of ovarian cancer, which can be used to screen the related antigen for the early diagnosis of ovarian cancer and therapeutic targets of immune treatment. Four cases of ovarian serous cystadenocarcinoma, two cases of ovarian mucinous cystadenocarcinoma, and two cases of ovarian endometrial carcinoma in patients with ascitic tumor cells which were used to construct the cDNA library. To screen the ovarian cancer antigen gene, evaluate the enzyme, and analyze nucleotide sequence, serological analysis of recombinant tumor cDNA expression libraries (SEREX) and suppression subtractive hybridization technique (SSH) techniques were utilized. The detection method of recombinant expression-based serological mini-arrays (SMARTA) was used to detect the ovarian cancer antigen and the positive reaction of 105 cases of ovarian cancer patients and 105 normal women's autoantibodies correspondingly in serum. After two rounds of serologic screening and glycosides sequencing analysis, 59 candidates of ovarian cancer antigen gene fragments were finally identified, which corresponded to 50 genes. They were then divided into six categories: (1) the homologous genes which related to the known ovarian cancer genes, such as BARD 1 gene, etc; (2) the homologous genes which were associated with other tumors, such as TM4SFI gene, etc; (3) the genes which were expressed in a special organization, such as ILF3, FXR1 gene, etc; (4) the genes which were the same with some protein genes of special function, such as TIZ, ClD gene; (5) the homologous genes which possessed the same source with embryonic genes, such as PKHD1 gene, etc; (6) the remaining genes were the unknown genes without the homologous sequence in the gene pool, such as OV-189 genes. SEREX technology combined with SSH method is an effective research strategy which can filter tumor antigen with high specific character; the corresponding autoantibodies of TM4SFl, ClD, TIZ, BARDI, FXRI, and OV-189 gene's recombinant antigen in serum can be regarded as the biomarkers which are used to diagnose ovarian cancer. The combination of multiple antigen detection can improve diagnostic efficiency.
Web application for automatic prediction of gene translation elongation efficiency.

PubMed

Sokolov, Vladimir; Zuraev, Bulat; Lashin, Sergei; Matushkin, Yury

2015-09-03

Expression efficiency is one of the major characteristics describing genes in various modern investigations. Expression efficiency of genes is regulated at various stages: transcription, translation, posttranslational protein modification and others. In this study, a special EloE (Elongation Efficiency) web application is described. The EloE sorts the organism's genes in a descend order on their theoretical rate of the elongation stage of translation based on the analysis of their nucleotide sequences. Obtained theoretical data have a significant correlation with available experimental data of gene expression in various organisms. In addition, the program identifies preferential codons in organism's genes and defines distribution of potential secondary structures energy in 5´ and 3´ regions of mRNA. The EloE can be useful in preliminary estimation of translation elongation efficiency for genes for which experimental data are not available yet. Some results can be used, for instance, in other programs modeling artificial genetic structures in genetically engineered experiments.
Site-Specific Gene Editing of Human Hematopoietic Stem Cells for X-Linked Hyper-IgM Syndrome.

PubMed

Kuo, Caroline Y; Long, Joseph D; Campo-Fernandez, Beatriz; de Oliveira, Satiro; Cooper, Aaron R; Romero, Zulema; Hoban, Megan D; Joglekar, Alok V; Lill, Georgia R; Kaufman, Michael L; Fitz-Gibbon, Sorel; Wang, Xiaoyan; Hollis, Roger P; Kohn, Donald B

2018-05-29

X-linked hyper-immunoglobulin M (hyper-IgM) syndrome (XHIM) is a primary immunodeficiency due to mutations in CD40 ligand that affect immunoglobulin class-switch recombination and somatic hypermutation. The disease is amenable to gene therapy using retroviral vectors, but dysregulated gene expression results in abnormal lymphoproliferation in mouse models, highlighting the need for alternative strategies. Here, we demonstrate the ability of both the transcription activator-like effector nuclease (TALEN) and clustered regularly interspaced short palindromic repeats-associated protein 9 (CRISPR/Cas9) platforms to efficiently drive integration of a normal copy of the CD40L cDNA delivered by Adeno-Associated Virus. Site-specific insertion of the donor sequence downstream of the endogenous CD40L promoter maintained physiologic expression of CD40L while overriding all reported downstream mutations. High levels of gene modification were achieved in primary human hematopoietic stem cells (HSCs), as well as in cell lines and XHIM-patient-derived T cells. Notably, gene-corrected HSCs engrafted in immunodeficient mice at clinically relevant frequencies. These studies provide the foundation for a permanent curative therapy in XHIM. Copyright © 2018 The Author(s). Published by Elsevier Inc. All rights reserved.
Surface Diversity in Mycoplasma agalactiae Is Driven by Site-Specific DNA Inversions within the vpma Multigene Locus

PubMed Central

Glew, Michelle D.; Marenda, Marc; Rosengarten, Renate; Citti, Christine

2002-01-01

The ruminant pathogen Mycoplasma agalactiae possesses a family of abundantly expressed variable surface lipoproteins called Vpmas. Phenotypic switches between Vpma members have previously been correlated with DNA rearrangements within a locus of vpma genes and are proposed to play an important role in disease pathogenesis. In this study, six vpma genes were characterized in the M. agalactiae type strain PG2. All vpma genes clustered within an 8-kb region and shared highly conserved 5′ untranslated regions, lipoprotein signal sequences, and short N-terminal sequences. Analyses of the vpma loci from consecutive clonal isolates showed that vpma DNA rearrangements were site specific and that cleavage and strand exchange occurred within a minimal region of 21 bp located within the 5′ untranslated region of all vpma genes. This process controlled expression of vpma genes by effectively linking the open reading frame (ORF) of a silent gene to a unique active promoter sequence within the locus. An ORF (xer1) immediately adjacent to one end of the vpma locus did not undergo rearrangement and had significant homology to a distinct subset of genes belonging to the λ integrase family of site-specific xer recombinases. It is proposed that xer1 codes for a site-specific recombinase that is not involved in chromosome dimer resolution but rather is responsible for the observed vpma-specific recombination in M. agalactiae. PMID:12374833
Curated genome annotation of Oryza sativa ssp. japonica and comparative genome analysis with Arabidopsis thaliana

PubMed Central

Itoh, Takeshi; Tanaka, Tsuyoshi; Barrero, Roberto A.; Yamasaki, Chisato; Fujii, Yasuyuki; Hilton, Phillip B.; Antonio, Baltazar A.; Aono, Hideo; Apweiler, Rolf; Bruskiewich, Richard; Bureau, Thomas; Burr, Frances; Costa de Oliveira, Antonio; Fuks, Galina; Habara, Takuya; Haberer, Georg; Han, Bin; Harada, Erimi; Hiraki, Aiko T.; Hirochika, Hirohiko; Hoen, Douglas; Hokari, Hiroki; Hosokawa, Satomi; Hsing, Yue; Ikawa, Hiroshi; Ikeo, Kazuho; Imanishi, Tadashi; Ito, Yukiyo; Jaiswal, Pankaj; Kanno, Masako; Kawahara, Yoshihiro; Kawamura, Toshiyuki; Kawashima, Hiroaki; Khurana, Jitendra P.; Kikuchi, Shoshi; Komatsu, Setsuko; Koyanagi, Kanako O.; Kubooka, Hiromi; Lieberherr, Damien; Lin, Yao-Cheng; Lonsdale, David; Matsumoto, Takashi; Matsuya, Akihiro; McCombie, W. Richard; Messing, Joachim; Miyao, Akio; Mulder, Nicola; Nagamura, Yoshiaki; Nam, Jongmin; Namiki, Nobukazu; Numa, Hisataka; Nurimoto, Shin; O’Donovan, Claire; Ohyanagi, Hajime; Okido, Toshihisa; OOta, Satoshi; Osato, Naoki; Palmer, Lance E.; Quetier, Francis; Raghuvanshi, Saurabh; Saichi, Naomi; Sakai, Hiroaki; Sakai, Yasumichi; Sakata, Katsumi; Sakurai, Tetsuya; Sato, Fumihiko; Sato, Yoshiharu; Schoof, Heiko; Seki, Motoaki; Shibata, Michie; Shimizu, Yuji; Shinozaki, Kazuo; Shinso, Yuji; Singh, Nagendra K.; Smith-White, Brian; Takeda, Jun-ichi; Tanino, Motohiko; Tatusova, Tatiana; Thongjuea, Supat; Todokoro, Fusano; Tsugane, Mika; Tyagi, Akhilesh K.; Vanavichit, Apichart; Wang, Aihui; Wing, Rod A.; Yamaguchi, Kaori; Yamamoto, Mayu; Yamamoto, Naoyuki; Yu, Yeisoo; Zhang, Hao; Zhao, Qiang; Higo, Kenichi; Burr, Benjamin; Gojobori, Takashi; Sasaki, Takuji

2007-01-01

We present here the annotation of the complete genome of rice Oryza sativa L. ssp. japonica cultivar Nipponbare. All functional annotations for proteins and non-protein-coding RNA (npRNA) candidates were manually curated. Functions were identified or inferred in 19,969 (70%) of the proteins, and 131 possible npRNAs (including 58 antisense transcripts) were found. Almost 5000 annotated protein-coding genes were found to be disrupted in insertional mutant lines, which will accelerate future experimental validation of the annotations. The rice loci were determined by using cDNA sequences obtained from rice and other representative cereals. Our conservative estimate based on these loci and an extrapolation suggested that the gene number of rice is ∼32,000, which is smaller than previous estimates. We conducted comparative analyses between rice and Arabidopsis thaliana and found that both genomes possessed several lineage-specific genes, which might account for the observed differences between these species, while they had similar sets of predicted functional domains among the protein sequences. A system to control translational efficiency seems to be conserved across large evolutionary distances. Moreover, the evolutionary process of protein-coding genes was examined. Our results suggest that natural selection may have played a role for duplicated genes in both species, so that duplication was suppressed or favored in a manner that depended on the function of a gene. PMID:17210932
Deep targeted sequencing in pediatric acute lymphoblastic leukemia unveils distinct mutational patterns between genetic subtypes and novel relapse-associated genes.

PubMed

Lindqvist, C Mårten; Lundmark, Anders; Nordlund, Jessica; Freyhult, Eva; Ekman, Diana; Carlsson Almlöf, Jonas; Raine, Amanda; Övernäs, Elin; Abrahamsson, Jonas; Frost, Britt-Marie; Grandér, Dan; Heyman, Mats; Palle, Josefine; Forestier, Erik; Lönnerholm, Gudmar; Berglund, Eva C; Syvänen, Ann-Christine

2016-09-27

To characterize the mutational patterns of acute lymphoblastic leukemia (ALL) we performed deep next generation sequencing of 872 cancer genes in 172 diagnostic and 24 relapse samples from 172 pediatric ALL patients. We found an overall greater mutational burden and more driver mutations in T-cell ALL (T-ALL) patients compared to B-cell precursor ALL (BCP-ALL) patients. In addition, the majority of the mutations in T-ALL had occurred in the original leukemic clone, while most of the mutations in BCP-ALL were subclonal. BCP-ALL patients carrying any of the recurrent translocations ETV6-RUNX1, BCR-ABL or TCF3-PBX1 harbored few mutations in driver genes compared to other BCP-ALL patients. Specifically in BCP-ALL, we identified ATRX as a novel putative driver gene and uncovered an association between somatic mutations in the Notch signaling pathway at ALL diagnosis and increased risk of relapse. Furthermore, we identified EP300, ARID1A and SH2B3 as relapse-associated genes. The genes highlighted in our study were frequently involved in epigenetic regulation, associated with germline susceptibility to ALL, and present in minor subclones at diagnosis that became dominant at relapse. We observed a high degree of clonal heterogeneity and evolution between diagnosis and relapse in both BCP-ALL and T-ALL, which could have implications for the treatment efficiency.
[Establishment of L-periaxin gene knock-out RSC96 cell line].

PubMed

Liang, Min; Peng, Tingting; Shi, Yawei

2016-12-25

Periaxin, a protein of noncompact myelin, is specifically expressed in the peripheral nervous system (PNS). There are two protein isoform L-periaxin and S-Periaxin by alternative splicing of periaxin gene, playing an important role in the initiation of myelin formation. So far, 18 different mutation sites in L-periaxin gene have been found to induce the peripheral demyelinating neurological charcot-marie-tooth diseases subtype 4F (CMT4F). The technique of activation of transcription activator-like effector nucleases (TALENS) was used to knock out the L-periaxin gene in RSC 96 cell line of Rattus. According to the design principle, the knock-out site of L-periaxin was assured to NLS domain of L-periaxin, which is target sequence of left and right arms of TALEN. The knock-out vectors of TALEN-L and TALEN-R were established and transfected into RSC96 cell. After puromycin screening, L-periaxin was knocked out successfully in RSC96 cell, which is confirmed by DNA sequence. The mutation efficiency is 21.6%. S-periaxin, not L-periaxin can be detected by Western blotting in L-periaxin gene knock-out RSC96 cell. The cell growth rate was decreased and the number of cells in G1 increased and decreased in S phase in L-periaxin gene knock-out RSC96 cell by flow cytometry and MTT assay.
Targeted Re-Sequencing Emulsion PCR Panel for Myopathies: Results in 94 Cases.

PubMed

Punetha, Jaya; Kesari, Akanchha; Uapinyoying, Prech; Giri, Mamta; Clarke, Nigel F; Waddell, Leigh B; North, Kathryn N; Ghaoui, Roula; O'Grady, Gina L; Oates, Emily C; Sandaradura, Sarah A; Bönnemann, Carsten G; Donkervoort, Sandra; Plotz, Paul H; Smith, Edward C; Tesi-Rocha, Carolina; Bertorini, Tulio E; Tarnopolsky, Mark A; Reitter, Bernd; Hausmanowa-Petrusewicz, Irena; Hoffman, Eric P

2016-05-27

Molecular diagnostics in the genetic myopathies often requires testing of the largest and most complex transcript units in the human genome (DMD, TTN, NEB). Iteratively targeting single genes for sequencing has traditionally entailed high costs and long turnaround times. Exome sequencing has begun to supplant single targeted genes, but there are concerns regarding coverage and needed depth of the very large and complex genes that frequently cause myopathies. To evaluate efficiency of next-generation sequencing technologies to provide molecular diagnostics for patients with previously undiagnosed myopathies. We tested a targeted re-sequencing approach, using a 45 gene emulsion PCR myopathy panel, with subsequent sequencing on the Illumina platform in 94 undiagnosed patients. We compared the targeted re-sequencing approach to exome sequencing for 10 of these patients studied. We detected likely pathogenic mutations in 33 out of 94 patients with a molecular diagnostic rate of approximately 35%. The remaining patients showed variants of unknown significance (35/94 patients) or no mutations detected in the 45 genes tested (26/94 patients). Mutation detection rates for targeted re-sequencing vs. whole exome were similar in both methods; however exome sequencing showed better distribution of reads and fewer exon dropouts. Given that costs of highly parallel re-sequencing and whole exome sequencing are similar, and that exome sequencing now takes considerably less laboratory processing time than targeted re-sequencing, we recommend exome sequencing as the standard approach for molecular diagnostics of myopathies.
Selective DNA demethylation by fusion of TDG with a sequence-specific DNA-binding domain

PubMed Central

Gregory, David J.; Mikhaylova, Lyudmila; Fedulov, Alexey V.

2012-01-01

Our ability to selectively manipulate gene expression by epigenetic means is limited, as there is no approach for targeted reactivation of epigenetically silenced genes, in contrast to what is available for selective gene silencing. We aimed to develop a tool for selective transcriptional activation by DNA demethylation. Here we present evidence that direct targeting of thymine-DNA-glycosylase (TDG) to specific sequences in the DNA can result in local DNA demethylation at potential regulatory sequences and lead to enhanced gene induction. When TDG was fused to a well-characterized DNA-binding domain [the Rel-homology domain (RHD) of NFκB], we observed decreased DNA methylation and increased transcriptional response to unrelated stimulus of inducible nitric oxide synthase (NOS2). The effect was not seen for control genes lacking either RHD-binding sites or high levels of methylation, nor in control mock-transduced cells. Specific reactivation of epigenetically silenced genes may thus be achievable by this approach, which provides a broadly useful strategy to further our exploration of biological mechanisms and to improve control over the epigenome. PMID:22419066
Fluorescence In situ Hybridization: Cell-Based Genetic Diagnostic and Research Applications.

PubMed

Cui, Chenghua; Shu, Wei; Li, Peining

2016-01-01

Fluorescence in situ hybridization (FISH) is a macromolecule recognition technology based on the complementary nature of DNA or DNA/RNA double strands. Selected DNA strands incorporated with fluorophore-coupled nucleotides can be used as probes to hybridize onto the complementary sequences in tested cells and tissues and then visualized through a fluorescence microscope or an imaging system. This technology was initially developed as a physical mapping tool to delineate genes within chromosomes. Its high analytical resolution to a single gene level and high sensitivity and specificity enabled an immediate application for genetic diagnosis of constitutional common aneuploidies, microdeletion/microduplication syndromes, and subtelomeric rearrangements. FISH tests using panels of gene-specific probes for somatic recurrent losses, gains, and translocations have been routinely applied for hematologic and solid tumors and are one of the fastest-growing areas in cancer diagnosis. FISH has also been used to detect infectious microbias and parasites like malaria in human blood cells. Recent advances in FISH technology involve various methods for improving probe labeling efficiency and the use of super resolution imaging systems for direct visualization of intra-nuclear chromosomal organization and profiling of RNA transcription in single cells. Cas9-mediated FISH (CASFISH) allowed in situ labeling of repetitive sequences and single-copy sequences without the disruption of nuclear genomic organization in fixed or living cells. Using oligopaint-FISH and super-resolution imaging enabled in situ visualization of chromosome haplotypes from differentially specified single-nucleotide polymorphism loci. Single molecule RNA FISH (smRNA-FISH) using combinatorial labeling or sequential barcoding by multiple round of hybridization were applied to measure mRNA expression of multiple genes within single cells. Research applications of these single molecule single cells DNA and RNA FISH techniques have visualized intra-nuclear genomic structure and sub-cellular transcriptional dynamics of many genes and revealed their functions in various biological processes.
[Hydrophidae identification through analysis on Cyt b gene barcode].

PubMed

Liao, Li-xi; Zeng, Ke-wu; Tu, Peng-fei

2015-08-01

Hydrophidae, one of the precious traditional Chinese medicines, is generally drily preserved to prevent corruption, but it is hard to identify the species of Hydrophidae through the appearance because of the change due to the drying process. The identification through analysis on gene barcode, a new technique in species identification, can avoid the problem. The gene barcodes of the 6 species of Hydrophidae like Lapemis hardwickii were aquired through DNA extraction and gene sequencing. These barcodes were then in sequence alignment and test the identification efficency by BLAST. Our results revealed that the barcode sequences performed high identification efficiency, and had obvious difference between intra- and inter-species. These all indicated that Cyt b DNA barcoding can confirm the Hydrophidae identification.
Influence of Quasi-Specific Sites on Kinetics of Target DNA Search by a Sequence-Specific DNA-Binding Protein

PubMed Central

2015-01-01

Functions of transcription factors require formation of specific complexes at particular sites in cis-regulatory elements of genes. However, chromosomal DNA contains numerous sites that are similar to the target sequences recognized by transcription factors. The influence of such “quasi-specific” sites on functions of the transcription factors is not well understood at present by experimental means. In this work, using fluorescence methods, we have investigated the influence of quasi-specific DNA sites on the efficiency of target location by the zinc finger DNA-binding domain of the inducible transcription factor Egr-1, which recognizes a 9 bp sequence. By stopped-flow assays, we measured the kinetics of Egr-1’s association with a target site on 143 bp DNA in the presence of various competitor DNAs, including nonspecific and quasi-specific sites. The presence of quasi-specific sites on competitor DNA significantly decelerated the target association by the Egr-1 protein. The impact of the quasi-specific sites depended strongly on their affinity, their concentration, and the degree of their binding to the protein. To quantitatively describe the kinetic impact of the quasi-specific sites, we derived an analytical form of the apparent kinetic rate constant for the target association and used it for fitting to the experimental data. Our kinetic data with calf thymus DNA as a competitor suggested that there are millions of high-affinity quasi-specific sites for Egr-1 among the 3 billion bp of genomic DNA. This study quantitatively demonstrates that naturally abundant quasi-specific sites on DNA can considerably impede the target search processes of sequence-specific DNA-binding proteins. PMID:26502071
Bifunctional isocitrate-homoisocitrate dehydrogenase: a missing link in the evolution of beta-decarboxylating dehydrogenase.

PubMed

Miyazaki, Kentaro

2005-05-27

Beta-decarboxylating dehydrogenases comprise 3-isopropylmalate dehydrogenase, isocitrate dehydrogenase, and homoisocitrate dehydrogenase. They share a high degree of amino acid sequence identity and occupy equivalent positions in the amino acid biosynthetic pathways for leucine, glutamate, and lysine, respectively. Therefore, not only the enzymes but also the whole pathways should have evolved from a common ancestral pathway. In Pyrococcus horikoshii, only one pathway of the three has been identified in the genomic sequence, and PH1722 is the sole beta-decarboxylating dehydrogenase gene. The organism does not require leucine, glutamate, or lysine for growth; the single pathway might play multiple (i.e., ancestral) roles in amino acid biosynthesis. The PH1722 gene was cloned and expressed in Escherichia coli and the substrate specificity of the recombinant enzyme was investigated. It exhibited activities on isocitrate and homoisocitrate at near equal efficiency, but not on 3-isopropylmalate. PH1722 is thus a novel, bifunctional beta-decarboxylating dehydrogenase, which likely plays a dual role in glutamate and lysine biosynthesis in vivo.
SGP-1: Prediction and Validation of Homologous Genes Based on Sequence Alignments

PubMed Central

Wiehe, Thomas; Gebauer-Jung, Steffi; Mitchell-Olds, Thomas; Guigó, Roderic

2001-01-01

Conventional methods of gene prediction rely on the recognition of DNA-sequence signals, the coding potential or the comparison of a genomic sequence with a cDNA, EST, or protein database. Reasons for limited accuracy in many circumstances are species-specific training and the incompleteness of reference databases. Lately, comparative genome analysis has attracted increasing attention. Several analysis tools that are based on human/mouse comparisons are already available. Here, we present a program for the prediction of protein-coding genes, termed SGP-1 (Syntenic Gene Prediction), which is based on the similarity of homologous genomic sequences. In contrast to most existing tools, the accuracy of SGP-1 depends little on species-specific properties such as codon usage or the nucleotide distribution. SGP-1 may therefore be applied to nonstandard model organisms in vertebrates as well as in plants, without the need for extensive parameter training. In addition to predicting genes in large-scale genomic sequences, the program may be useful to validate gene structure annotations from databases. To this end, SGP-1 output also contains comparisons between predicted and annotated gene structures in HTML format. The program can be accessed via a Web server at http://soft.ice.mpg.de/sgp-1. The source code, written in ANSI C, is available on request from the authors. PMID:11544202
AMPLISAS: a web server for multilocus genotyping using next-generation amplicon sequencing data.

PubMed

Sebastian, Alvaro; Herdegen, Magdalena; Migalska, Magdalena; Radwan, Jacek

2016-03-01

Next-generation sequencing (NGS) technologies are revolutionizing the fields of biology and medicine as powerful tools for amplicon sequencing (AS). Using combinations of primers and barcodes, it is possible to sequence targeted genomic regions with deep coverage for hundreds, even thousands, of individuals in a single experiment. This is extremely valuable for the genotyping of gene families in which locus-specific primers are often difficult to design, such as the major histocompatibility complex (MHC). The utility of AS is, however, limited by the high intrinsic sequencing error rates of NGS technologies and other sources of error such as polymerase amplification or chimera formation. Correcting these errors requires extensive bioinformatic post-processing of NGS data. Amplicon Sequence Assignment (AMPLISAS) is a tool that performs analysis of AS results in a simple and efficient way, while offering customization options for advanced users. AMPLISAS is designed as a three-step pipeline consisting of (i) read demultiplexing, (ii) unique sequence clustering and (iii) erroneous sequence filtering. Allele sequences and frequencies are retrieved in excel spreadsheet format, making them easy to interpret. AMPLISAS performance has been successfully benchmarked against previously published genotyped MHC data sets obtained with various NGS technologies. © 2015 John Wiley & Sons Ltd.
A novel sgRNA selection system for CRISPR-Cas9 in mammalian cells.

PubMed

Zhang, Haiwei; Zhang, Xixi; Fan, Cunxian; Xie, Qun; Xu, Chengxian; Zhao, Qun; Liu, Yongbo; Wu, Xiaoxia; Zhang, Haibing

2016-03-18

CRISPR-Cas9 mediated genome editing system has been developed as a powerful tool for elucidating the function of genes through genetic engineering in multiple cells and organisms. This system takes advantage of a single guide RNA (sgRNA) to direct the Cas9 endonuclease to a specific DNA site to generate mutant alleles. Since the targeting efficiency of sgRNAs to distinct DNA loci can vary widely, there remains a need for a rapid, simple and efficient sgRNA selection method to overcome this limitation of the CRISPR-Cas9 system. Here we report a novel system to select sgRNA with high efficacy for DNA sequence modification by a luciferase assay. Using this sgRNAs selection system, we further demonstrated successful examples of one sgRNA for generating one gene knockout cell lines where the targeted genes are shown to be functionally defective. This system provides a potential application to optimize the sgRNAs in different species and to generate a powerful CRISPR-Cas9 genome-wide screening system with minimum amounts of sgRNAs. Copyright © 2016 Elsevier Inc. All rights reserved.
The Conjugative Relaxase TrwC Promotes Integration of Foreign DNA in the Human Genome.

PubMed

González-Prieto, Coral; Gabriel, Richard; Dehio, Christoph; Schmidt, Manfred; Llosa, Matxalen

2017-06-15

Bacterial conjugation is a mechanism of horizontal DNA transfer. The relaxase TrwC of the conjugative plasmid R388 cleaves one strand of the transferred DNA at the oriT gene, covalently attaches to it, and leads the single-stranded DNA (ssDNA) into the recipient cell. In addition, TrwC catalyzes site-specific integration of the transferred DNA into its target sequence present in the genome of the recipient bacterium. Here, we report the analysis of the efficiency and specificity of the integrase activity of TrwC in human cells, using the type IV secretion system of the human pathogen Bartonella henselae to introduce relaxase-DNA complexes. Compared to Mob relaxase from plasmid pBGR1, we found that TrwC mediated a 10-fold increase in the rate of plasmid DNA transfer to human cells and a 100-fold increase in the rate of chromosomal integration of the transferred DNA. We used linear amplification-mediated PCR and plasmid rescue to characterize the integration pattern in the human genome. DNA sequence analysis revealed mostly reconstituted oriT sequences, indicating that TrwC is active and recircularizes transferred DNA in human cells. One TrwC-mediated site-specific integration event was detected, proving that TrwC is capable of mediating site-specific integration in the human genome, albeit with very low efficiency compared to the rate of random integration. Our results suggest that TrwC may stabilize the plasmid DNA molecules in the nucleus of the human cell, probably by recircularization of the transferred DNA strand. This stabilization would increase the opportunities for integration of the DNA by the host machinery. IMPORTANCE Different biotechnological applications, including gene therapy strategies, require permanent modification of target cells. Long-term expression is achieved either by extrachromosomal persistence or by integration of the introduced DNA. Here, we studied the utility of conjugative relaxase TrwC, a bacterial protein with site-specific integrase activity in bacteria, as an integrase in human cells. Although it is not efficient as a site-specific integrase, we found that TrwC is active in human cells and promotes random integration of the transferred DNA in the human genome, probably acting as a DNA chaperone until it is integrated by host mechanisms. TrwC-DNA complexes can be delivered to human cells through a type IV secretion system involved in pathogenesis. Thus, TrwC could be used in vivo to transfer the DNA of interest into the appropriate cell and promote its integration. If used in combination with a site-specific nuclease, it could lead to site-specific integration of the incoming DNA by homologous recombination. Copyright © 2017 American Society for Microbiology.
The Conjugative Relaxase TrwC Promotes Integration of Foreign DNA in the Human Genome

PubMed Central

González-Prieto, Coral; Gabriel, Richard; Dehio, Christoph; Schmidt, Manfred

2017-01-01

ABSTRACT Bacterial conjugation is a mechanism of horizontal DNA transfer. The relaxase TrwC of the conjugative plasmid R388 cleaves one strand of the transferred DNA at the oriT gene, covalently attaches to it, and leads the single-stranded DNA (ssDNA) into the recipient cell. In addition, TrwC catalyzes site-specific integration of the transferred DNA into its target sequence present in the genome of the recipient bacterium. Here, we report the analysis of the efficiency and specificity of the integrase activity of TrwC in human cells, using the type IV secretion system of the human pathogen Bartonella henselae to introduce relaxase-DNA complexes. Compared to Mob relaxase from plasmid pBGR1, we found that TrwC mediated a 10-fold increase in the rate of plasmid DNA transfer to human cells and a 100-fold increase in the rate of chromosomal integration of the transferred DNA. We used linear amplification-mediated PCR and plasmid rescue to characterize the integration pattern in the human genome. DNA sequence analysis revealed mostly reconstituted oriT sequences, indicating that TrwC is active and recircularizes transferred DNA in human cells. One TrwC-mediated site-specific integration event was detected, proving that TrwC is capable of mediating site-specific integration in the human genome, albeit with very low efficiency compared to the rate of random integration. Our results suggest that TrwC may stabilize the plasmid DNA molecules in the nucleus of the human cell, probably by recircularization of the transferred DNA strand. This stabilization would increase the opportunities for integration of the DNA by the host machinery. IMPORTANCE Different biotechnological applications, including gene therapy strategies, require permanent modification of target cells. Long-term expression is achieved either by extrachromosomal persistence or by integration of the introduced DNA. Here, we studied the utility of conjugative relaxase TrwC, a bacterial protein with site-specific integrase activity in bacteria, as an integrase in human cells. Although it is not efficient as a site-specific integrase, we found that TrwC is active in human cells and promotes random integration of the transferred DNA in the human genome, probably acting as a DNA chaperone until it is integrated by host mechanisms. TrwC-DNA complexes can be delivered to human cells through a type IV secretion system involved in pathogenesis. Thus, TrwC could be used in vivo to transfer the DNA of interest into the appropriate cell and promote its integration. If used in combination with a site-specific nuclease, it could lead to site-specific integration of the incoming DNA by homologous recombination. PMID:28411218

HMGB1-mediated DNA bending: Distinct roles in increasing p53 binding to DNA and the transactivation of p53-responsive gene promoters.

PubMed

Štros, Michal; Kučírek, Martin; Sani, Soodabeh Abbasi; Polanská, Eva

2018-03-01

HMGB1 is a chromatin-associated protein that has been implicated in many important biological processes such as transcription, recombination, DNA repair, and genome stability. These functions include the enhancement of binding of a number of transcription factors, including the tumor suppressor protein p53, to their specific DNA-binding sites. HMGB1 is composed of two highly conserved HMG boxes, linked to an intrinsically disordered acidic C-terminal tail. Previous reports have suggested that the ability of HMGB1 to bend DNA may explain the in vitro HMGB1-mediated increase in sequence-specific DNA binding by p53. The aim of this study was to reinvestigate the importance of HMGB1-induced DNA bending in relationship to the ability of the protein to promote the specific binding of p53 to short DNA duplexes in vitro, and to transactivate two major p53-regulated human genes: Mdm2 and p21/WAF1. Using a number of HMGB1 mutants, we report that the HMGB1-mediated increase in sequence-specific p53 binding to DNA duplexes in vitro depends very little on HMGB1-mediated DNA bending. The presence of the acidic C-terminal tail of HMGB1 and/or the oxidation of the protein can reduce the HMGB1-mediated p53 binding. Interestingly, the induction of transactivation of p53-responsive gene promoters by HMGB1 requires both the ability of the protein to bend DNA and the acidic C-terminal tail, and is promoter-specific. We propose that the efficient transactivation of p53-responsive gene promoters by HMGB1 depends on complex events, rather than solely on the promotion of p53 binding to its DNA cognate sites. Copyright © 2018 Elsevier B.V. All rights reserved.
Identification of mediator complex 26 (Crsp7) gametologs on platypus X1 and Y5 sex chromosomes: a candidate testis-determining gene in monotremes?

PubMed

Tsend-Ayush, Enkhjargal; Kortschak, R Daniel; Bernard, Pascal; Lim, Shu Ly; Ryan, Janelle; Rosenkranz, Ruben; Borodina, Tatiana; Dohm, Juliane C; Himmelbauer, Heinz; Harley, Vincent R; Grützner, Frank

2012-01-01

The basal lineage of monotremes features an extraordinarily complex sex chromosome system which has provided novel insights into the evolution of mammalian sex chromosomes. Recently, sequence information from autosomes, X chromosomes, and XY-shared pseudoautosomal regions has become available. However, no gene has so far been described on any of the Y chromosome-specific regions. We analyzed sequences derived from Y-specific BAC clones to identify genes with potentially male-specific function. Here, we report the identification and characterization of the mediator complex protein gametologs on platypus Y5 (Crspy). We also identified the X-chromosomal copy which unexpectedly maps to X1 (Crspx). Sequence comparison shows extensive divergence between the X and Y copy, but we found no significant positive selection on either gametolog. Expression analysis shows widespread expression of Crspx. Crspy is expressed exclusively in males with particularly strong expression in testis and kidney. Reporter gene assays to investigate whether Crspx/y can act on the recently discovered mouse Sox9 testis-specific enhancer element did reveal a modest effect together with mouse Sox9 + Sf1, but showed overall no significant upregulation of the reporter gene. This is the first report of a differentiated functional male-specific gene on platypus Y chromosomes, providing new insights into sex chromosome evolution and a candidate gene for male-specific function in monotremes.
Gluconate as suitable potential reduction supplier in Corynebacterium glutamicum: cloning and expression of gntP and gntK in Escherichia coli.

PubMed

Porco, Antonietta; Gamero, Elida E; Mylonás, Elena; Istúriz, Tomás

2008-01-01

Corynebacterium glutamicum is widely used in the industrial production of amino acids. We have found that this bacterium grows exponentially on a mineral medium supplemented with gluconate. Gluconate permease and Gluconokinase are expressed in an inducible form and, 6-phosphogluconate dehydrogenase, although constitutively expressed, shows a 3-fold higher specific level in gluconate grown cells than those grown in fructose under similar conditions. Interestingly, these activities are lower than those detected in the strain Escherichia coli M1-8, cultivated under similar conditions. Additionally, here we also confirmed that this bacterium lacks 6-phosphogluconate dehydratase activity. Thus, gluconate must be metabolized through the pentose phosphate pathway. Genes encoding gluconate transport and its phosphorylation were cloned from C. glutamicum, and expressed in suitable E. coli mutants. Sequence analysis revealed that the amino acid sequences obtained from these genes, denoted as gntP and gntK, were similar to those found in other bacteria. Analysis of both genes by RT-PCR suggested constitutive expression, in disagreement with the inducible character of their corresponding activities. The results suggest that gluconate might be a suitable source of reduction potential for improving the efficiency in cultures engaged in amino acids production. This is the first time that gluconate specific enzymatic activities are reported in C. glutamicum.
Thermodynamically optimal whole-genome tiling microarray design and validation.

PubMed

Cho, Hyejin; Chou, Hui-Hsien

2016-06-13

Microarray is an efficient apparatus to interrogate the whole transcriptome of species. Microarray can be designed according to annotated gene sets, but the resulted microarrays cannot be used to identify novel transcripts and this design method is not applicable to unannotated species. Alternatively, a whole-genome tiling microarray can be designed using only genomic sequences without gene annotations, and it can be used to detect novel RNA transcripts as well as known genes. The difficulty with tiling microarray design lies in the tradeoff between probe-specificity and coverage of the genome. Sequence comparison methods based on BLAST or similar software are commonly employed in microarray design, but they cannot precisely determine the subtle thermodynamic competition between probe targets and partially matched probe nontargets during hybridizations. Using the whole-genome thermodynamic analysis software PICKY to design tiling microarrays, we can achieve maximum whole-genome coverage allowable under the thermodynamic constraints of each target genome. The resulted tiling microarrays are thermodynamically optimal in the sense that all selected probes share the same melting temperature separation range between their targets and closest nontargets, and no additional probes can be added without violating the specificity of the microarray to the target genome. This new design method was used to create two whole-genome tiling microarrays for Escherichia coli MG1655 and Agrobacterium tumefaciens C58 and the experiment results validated the design.
Genotyping of single spore isolates of a Pasteuria penetrans population occurring in Florida using SNP-based markers.

PubMed

Joseph, S; Schmidt, L M; Danquah, W B; Timper, P; Mekete, T

2017-02-01

To generate single spore lines of a population of bacterial parasite of root-knot nematode (RKN), Pasteuria penetrans, isolated from Florida and examine genotypic variation and virulence characteristics exist within the population. Six single spore lines (SSP), 16SSP, 17SSP, 18SSP, 25SSP, 26SSP and 30SSP were generated. Genetic variability was evaluated by comparing single-nucleotide polymorphisms (SNPs) in six protein-coding genes and the 16S rRNA gene. An average of one SNP was observed for every 69 bp in the 16S rRNA, whereas no SNPs were observed in the protein-coding sequences. Hierarchical cluster analysis of 16S rRNA sequences placed the clones into three distinct clades. Bio-efficacy analysis revealed significant heterogeneity in the level virulence and host specificity between the individual clones. The SNP markers developed to the 5' hypervariable region of the 16S rRNA gene may be useful in biotype differentiation within a population of P. penetrans. This study demonstrates an efficient method for generating single spore lines of P. penetrans and gives a deep insight into genetic heterogeneity and varying level of virulence exists within a population parasitizing a specific Meloidogyne sp. host. The results also suggest that the application of generalist spore lines in nematode management may achieve broad RKN control. © 2016 The Society for Applied Microbiology.
Characteristics of the Lotus japonicus gene repertoire deduced from large-scale expressed sequence tag (EST) analysis.

PubMed

Asamizu, Erika; Nakamura, Yasukazu; Sato, Shusei; Tabata, Satoshi

2004-02-01

To perform a comprehensive analysis of genes expressed in a model legume, Lotus japonicus, a total of 74472 3'-end expressed sequence tags (EST) were generated from cDNA libraries produced from six different organs. Clustering of sequences was performed with an identity criterion of 95% for 50 bases, and a total of 20457 non-redundant sequences, 8503 contigs and 11954 singletons were generated. EST sequence coverage was analyzed by using the annotated L. japonicus genomic sequence and 1093 of the 1889 predicted protein-encoding genes (57.9%) were hit by the EST sequence(s). Gene content was compared to several plant species. Among the 8503 contigs, 471 were identified as sequences conserved only in leguminous species and these included several disease resistance-related genes. This suggested that in legumes, these genes may have evolved specifically to resist pathogen attack. The rate of gene sequence divergence was assessed by comparing similarity level and functional category based on the Gene Ontology (GO) annotation of Arabidopsis genes. This revealed that genes encoding ribosomal proteins, as well as those related to translation, photosynthesis, and cellular structure were more abundantly represented in the highly conserved class, and that genes encoding transcription factors and receptor protein kinases were abundantly represented in the less conserved class. To make the sequence information and the cDNA clones available to the research community, a Web database with useful services was created at http://www.kazusa.or.jp/en/plant/lotus/EST/.
On the presence and role of human gene-body DNA methylation

PubMed Central

Jjingo, Daudi; Conley, Andrew B.; Yi, Soojin V.; Lunyak, Victoria V.; Jordan, I. King

2012-01-01

DNA methylation of promoter sequences is a repressive epigenetic mark that down-regulates gene expression. However, DNA methylation is more prevalent within gene-bodies than seen for promoters, and gene-body methylation has been observed to be positively correlated with gene expression levels. This paradox remains unexplained, and accordingly the role of DNA methylation in gene-bodies is poorly understood. We addressed the presence and role of human gene-body DNA methylation using a meta-analysis of human genome-wide methylation, expression and chromatin data sets. Methylation is associated with transcribed regions as genic sequences have higher levels of methylation than intergenic or promoter sequences. We also find that the relationship between gene-body DNA methylation and expression levels is non-monotonic and bell-shaped. Mid-level expressed genes have the highest levels of gene-body methylation, whereas the most lowly and highly expressed sets of genes both have low levels of methylation. While gene-body methylation can be seen to efficiently repress the initiation of intragenic transcription, the vast majority of methylated sites within genes are not associated with intragenic promoters. In fact, highly expressed genes initiate the most intragenic transcription, which is inconsistent with the previously held notion that gene-body methylation serves to repress spurious intragenic transcription to allow for efficient transcriptional elongation. These observations lead us to propose a model to explain the presence of human gene-body methylation. This model holds that the repression of intragenic transcription by gene-body methylation is largely epiphenomenal, and suggests that gene-body methylation levels are predominantly shaped via the accessibility of the DNA to methylating enzyme complexes. PMID:22577155
Computational sequence analysis of predicted long dsRNA transcriptomes of major crops reveals sequence complementarity with human genes.

PubMed

Jensen, Peter D; Zhang, Yuanji; Wiggins, B Elizabeth; Petrick, Jay S; Zhu, Jin; Kerstetter, Randall A; Heck, Gregory R; Ivashuta, Sergey I

2013-01-01

Long double-stranded RNAs (long dsRNAs) are precursors for the effector molecules of sequence-specific RNA-based gene silencing in eukaryotes. Plant cells can contain numerous endogenous long dsRNAs. This study demonstrates that such endogenous long dsRNAs in plants have sequence complementarity to human genes. Many of these complementary long dsRNAs have perfect sequence complementarity of at least 21 nucleotides to human genes; enough complementarity to potentially trigger gene silencing in targeted human cells if delivered in functional form. However, the number and diversity of long dsRNA molecules in plant tissue from crops such as lettuce, tomato, corn, soy and rice with complementarity to human genes that have a long history of safe consumption supports a conclusion that long dsRNAs do not present a significant dietary risk.
Genetic modifications of pigs for medicine and agriculture

PubMed Central

Whyte, Jeffrey J.; Prather, Randall S.

2011-01-01

SUMMARY Genetically modified swine hold great promise in the fields of agriculture and medicine. Currently, these swine are being used to optimize production of quality meat, to improve our understanding of the biology of disease resistance, and to reduced waste. In the field of biomedicine, swine are anatomically and physiologically analogous to humans. Alterations of key swine genes in disease pathways provide model animals to improve our understanding of the causes and potential treatments of many human genetic disorders. The completed sequencing of the swine genome will significantly enhance the specificity of genetic modifications, and allow for more accurate representations of human disease based on syntenic genes between the two species. Improvements in both methods of gene alteration and efficiency of model animal production are key to enabling routine use of these swine models in medicine and agriculture. PMID:21671302
Properties of Cells Carrying the Herpes Simplex Virus Type 2 Thymidine Kinase Gene: Mechanisms of Reversion to a Thymidine Kinase-Negative Phenotype

PubMed Central

Bastow, K. F.; Darby, G.; Wildy, P.; Minson, A. C.

1980-01-01

We have isolated cells with a thymidine kinase-negative (tk−) phenotype from cells which carry the herpes simplex virus type 2 tk gene by selection in 5-bromodeoxyuridine or 9-(2-hydroxyethoxymethyl)guanine. Both selection routines generated revertants with a frequency of 10−3 to 10−4, and resistance to either compound conferred simultaneous resistance to the other. tk− revertants fell into three classes: (i) cells that arose by deletion of all virus sequences, (ii) cells that had lost the virus tk gene but retained a nonselected virus-specific function and arose by deletion of part of the virus-specific sequence, and (iii) cells that retained the potential to express all of the virus-specific functions of the parental cells and retained all of the virus-specific DNA sequences. Images PMID:16789205
Tissue- and Time-Specific Expression of Otherwise Identical tRNA Genes

PubMed Central

Adir, Idan; Dahan, Orna; Broday, Limor; Pilpel, Yitzhak; Rechavi, Oded

2016-01-01

Codon usage bias affects protein translation because tRNAs that recognize synonymous codons differ in their abundance. Although the current dogma states that tRNA expression is exclusively regulated by intrinsic control elements (A- and B-box sequences), we revealed, using a reporter that monitors the levels of individual tRNA genes in Caenorhabditis elegans, that eight tryptophan tRNA genes, 100% identical in sequence, are expressed in different tissues and change their expression dynamically. Furthermore, the expression levels of the sup-7 tRNA gene at day 6 were found to predict the animal’s lifespan. We discovered that the expression of tRNAs that reside within introns of protein-coding genes is affected by the host gene’s promoter. Pairing between specific Pol II genes and the tRNAs that are contained in their introns is most likely adaptive, since a genome-wide analysis revealed that the presence of specific intronic tRNAs within specific orthologous genes is conserved across Caenorhabditis species. PMID:27560950
A small and efficient dimerization/packaging signal of rat VL30 RNA and its use in murine leukemia virus-VL30-derived vectors for gene transfer.

PubMed

Torrent, C; Gabus, C; Darlix, J L

1994-02-01

Retroviral genomes consist of two identical RNA molecules associated at their 5' ends by the dimer linkage structure located in the packaging element (Psi or E) necessary for RNA dimerization in vitro and packaging in vivo. In murine leukemia virus (MLV)-derived vectors designed for gene transfer, the Psi + sequence of 600 nucleotides directs the packaging of recombinant RNAs into MLV virions produced by helper cells. By using in vitro RNA dimerization as a screening system, a sequence of rat VL30 RNA located next to the 5' end of the Harvey mouse sarcoma virus genome and as small as 67 nucleotides was found to form stable dimeric RNA. In addition, a purine-rich sequence located at the 5' end of this VL30 RNA seems to be critical for RNA dimerization. When this VL30 element was extended by 107 nucleotides at its 3' end and inserted into an MLV-derived vector lacking MLV Psi +, it directed the efficient encapsidation of recombinant RNAs into MLV virions. Because this VL30 packaging signal is smaller and more efficient in packaging recombinant RNAs than the MLV Psi + and does not contain gag or glyco-gag coding sequences, its use in MLV-derived vectors should render even more unlikely recombinations which could generate replication-competent viruses. Therefore, utilization of the rat VL30 packaging sequence should improve the biological safety of MLV vectors for human gene transfer.
Next-generation sequencing of mixed genomic DNA allows efficient assembly of rearranged mitochondrial genomes in Amolops chunganensis and Quasipaa boulengeri

PubMed Central

Yuan, Siqi; Zheng, Yuchi; Zeng, Xiaomao

2016-01-01

Recent improvements in next-generation sequencing (NGS) technologies can facilitate the obtainment of mitochondrial genomes. However, it is not clear whether NGS could be effectively used to reconstruct the mitogenome with high gene rearrangement. These high rearrangements would cause amplification failure, and/or assembly and alignment errors. Here, we choose two frogs with rearranged gene order, Amolops chunganensis and Quasipaa boulengeri, to test whether gene rearrangements affect the mitogenome assembly and alignment by using NGS. The mitogenomes with gene rearrangements are sequenced through Illumina MiSeq genomic sequencing and assembled effectively by Trinity v2.1.0 and SOAPdenovo2. Gene order and contents in the mitogenome of A. chunganensis and Q. boulengeri are typical neobatrachian pattern except for rearrangements at the position of “WANCY” tRNA genes cluster. Further, the mitogenome of Q. boulengeri is characterized with a tandem duplication of trnM. Moreover, we utilize 13 protein-coding genes of A. chunganensis, Q. boulengeri and other neobatrachians to reconstruct the phylogenetic tree for evaluating mitochondrial sequence authenticity of A. chunganensis and Q. boulengeri. In this work, we provide nearly complete mitochondrial genomes of A. chunganensis and Q. boulengeri. PMID:27994980
Genetic Characterization of a Panel of Diverse HIV-1 Isolates at Seven International Sites

PubMed Central

Chen, Yue; Sanchez, Ana M.; Sabino, Ester; Hunt, Gillian; Ledwaba, Johanna; Hackett, John; Swanson, Priscilla; Hewlett, Indira; Ragupathy, Viswanath; Vikram Vemula, Sai; Zeng, Peibin; Tee, Kok-Keng; Chow, Wei Zhen; Ji, Hezhao; Sandstrom, Paul; Denny, Thomas N.; Busch, Michael P.; Gao, Feng

2016-01-01

HIV-1 subtypes and drug resistance are routinely tested by many international surveillance groups. However, results from different sites often vary. A systematic comparison of results from multiple sites is needed to determine whether a standardized protocol is required for consistent and accurate data analysis. A panel of well-characterized HIV-1 isolates (N = 50) from the External Quality Assurance Program Oversight Laboratory (EQAPOL) was assembled for evaluation at seven international sites. This virus panel included seven subtypes, six circulating recombinant forms (CRFs), nine unique recombinant forms (URFs) and three group O viruses. Seven viruses contained 10 major drug resistance mutations (DRMs). HIV-1 isolates were prepared at a concentration of 107 copies/ml and compiled into blinded panels. Subtypes and DRMs were determined with partial or full pol gene sequences by conventional Sanger sequencing and/or Next Generation Sequencing (NGS). Subtype and DRM results were reported and decoded for comparison with full-length genome sequences generated by EQAPOL. The partial pol gene was amplified by RT-PCR and sequenced for 89.4%-100% of group M viruses at six sites. Subtyping results of majority of the viruses (83%-97.9%) were correctly determined for the partial pol sequences. All 10 major DRMs in seven isolates were detected at these six sites. The complete pol gene sequence was also obtained by NGS at one site. However, this method missed six group M viruses and sequences contained host chromosome fragments. Three group O viruses were only characterized with additional group O-specific RT-PCR primers employed by one site. These results indicate that PCR protocols and subtyping tools should be standardized to efficiently amplify diverse viruses and more consistently assign virus genotypes, which is critical for accurate global subtype and drug resistance surveillance. Targeted NGS analysis of partial pol sequences can serve as an alternative approach, especially for detection of low-abundance DRMs. PMID:27314585
Genetic Characterization of a Panel of Diverse HIV-1 Isolates at Seven International Sites.

PubMed

Hora, Bhavna; Keating, Sheila M; Chen, Yue; Sanchez, Ana M; Sabino, Ester; Hunt, Gillian; Ledwaba, Johanna; Hackett, John; Swanson, Priscilla; Hewlett, Indira; Ragupathy, Viswanath; Vikram Vemula, Sai; Zeng, Peibin; Tee, Kok-Keng; Chow, Wei Zhen; Ji, Hezhao; Sandstrom, Paul; Denny, Thomas N; Busch, Michael P; Gao, Feng

2016-01-01

HIV-1 subtypes and drug resistance are routinely tested by many international surveillance groups. However, results from different sites often vary. A systematic comparison of results from multiple sites is needed to determine whether a standardized protocol is required for consistent and accurate data analysis. A panel of well-characterized HIV-1 isolates (N = 50) from the External Quality Assurance Program Oversight Laboratory (EQAPOL) was assembled for evaluation at seven international sites. This virus panel included seven subtypes, six circulating recombinant forms (CRFs), nine unique recombinant forms (URFs) and three group O viruses. Seven viruses contained 10 major drug resistance mutations (DRMs). HIV-1 isolates were prepared at a concentration of 107 copies/ml and compiled into blinded panels. Subtypes and DRMs were determined with partial or full pol gene sequences by conventional Sanger sequencing and/or Next Generation Sequencing (NGS). Subtype and DRM results were reported and decoded for comparison with full-length genome sequences generated by EQAPOL. The partial pol gene was amplified by RT-PCR and sequenced for 89.4%-100% of group M viruses at six sites. Subtyping results of majority of the viruses (83%-97.9%) were correctly determined for the partial pol sequences. All 10 major DRMs in seven isolates were detected at these six sites. The complete pol gene sequence was also obtained by NGS at one site. However, this method missed six group M viruses and sequences contained host chromosome fragments. Three group O viruses were only characterized with additional group O-specific RT-PCR primers employed by one site. These results indicate that PCR protocols and subtyping tools should be standardized to efficiently amplify diverse viruses and more consistently assign virus genotypes, which is critical for accurate global subtype and drug resistance surveillance. Targeted NGS analysis of partial pol sequences can serve as an alternative approach, especially for detection of low-abundance DRMs.
Parallel human genome analysis: microarray-based expression monitoring of 1000 genes.

PubMed Central

Schena, M; Shalon, D; Heller, R; Chai, A; Brown, P O; Davis, R W

1996-01-01

Microarrays containing 1046 human cDNAs of unknown sequence were printed on glass with high-speed robotics. These 1.0-cm2 DNA "chips" were used to quantitatively monitor differential expression of the cognate human genes using a highly sensitive two-color hybridization assay. Array elements that displayed differential expression patterns under given experimental conditions were characterized by sequencing. The identification of known and novel heat shock and phorbol ester-regulated genes in human T cells demonstrates the sensitivity of the assay. Parallel gene analysis with microarrays provides a rapid and efficient method for large-scale human gene discovery. Images Fig. 1 Fig. 2 Fig. 3 PMID:8855227
The mutant vasopressin gene from diabetes insipidus (Brattleboro) rats is transcribed but the message is not efficiently translated.

PubMed Central

Schmale, H; Ivell, R; Breindl, M; Darmer, D; Richter, D

1984-01-01

The vasopressin gene from normal and diabetes insipidus (Brattleboro) rats has been isolated and sequenced. Except for a single deletion of a G residue in region coding for the neurophysin carrier protein the approximately 2300 nucleotides of both genes are identical. Blot analysis of hypothalamic RNA as well as transfection and microinjection experiments indicate that the mutant gene is correctly transcribed and spliced, however the resulting mRNA is not efficiently translated. Images Fig. 2. Fig. 3. PMID:6526016
Leveraging long sequencing reads to investigate R-gene clustering and variation in sugar beet

USDA-ARS?s Scientific Manuscript database

Host-pathogen interactions are of prime importance to modern agriculture. Plants utilize various types of resistance genes to mitigate pathogen damage. Identification of the specific gene responsible for a specific resistance can be difficult due to duplication and clustering within R-gene families....
An efficient and comprehensive strategy for genetic diagnostics of polycystic kidney disease.

PubMed

Eisenberger, Tobias; Decker, Christian; Hiersche, Milan; Hamann, Ruben C; Decker, Eva; Neuber, Steffen; Frank, Valeska; Bolz, Hanno J; Fehrenbach, Henry; Pape, Lars; Toenshoff, Burkhard; Mache, Christoph; Latta, Kay; Bergmann, Carsten

2015-01-01

Renal cysts are clinically and genetically heterogeneous conditions. Autosomal dominant polycystic kidney disease (ADPKD) is the most frequent life-threatening genetic disease and mainly caused by mutations in PKD1. The presence of six PKD1 pseudogenes and tremendous allelic heterogeneity make molecular genetic testing challenging requiring laborious locus-specific amplification. Increasing evidence suggests a major role for PKD1 in early and severe cases of ADPKD and some patients with a recessive form. Furthermore it is becoming obvious that clinical manifestations can be mimicked by mutations in a number of other genes with the necessity for broader genetic testing. We established and validated a sequence capture based NGS testing approach for all genes known for cystic and polycystic kidney disease including PKD1. Thereby, we demonstrate that the applied standard mapping algorithm specifically aligns reads to the PKD1 locus and overcomes the complication of unspecific capture of pseudogenes. Employing careful and experienced assessment of NGS data, the method is shown to be very specific and equally sensitive as established methods. An additional advantage over conventional Sanger sequencing is the detection of copy number variations (CNVs). Sophisticated bioinformatic read simulation increased the high analytical depth of the validation study and further demonstrated the strength of the approach. We further raise some awareness of limitations and pitfalls of common NGS workflows when applied in complex regions like PKD1 demonstrating that quality of NGS needs more than high coverage of the target region. By this, we propose a time- and cost-efficient diagnostic strategy for comprehensive molecular genetic testing of polycystic kidney disease which is highly automatable and will be of particular value when therapeutic options for PKD emerge and genetic testing is needed for larger numbers of patients.
Flanking sequence determination and specific PCR identification of transgenic wheat B102-1-2.

PubMed

Cao, Jijuan; Xu, Junyi; Zhao, Tongtong; Cao, Dongmei; Huang, Xin; Zhang, Piqiao; Luan, Fengxia

2014-01-01

The exogenous fragment sequence and flanking sequence between the exogenous fragment and recombinant chromosome of transgenic wheat B102-1-2 were successfully acquired using genome walking technology. The newly acquired exogenous fragment encoded the full-length sequence of transformed genes with transformed plasmid and corresponding functional genes including ubi, vector pBANF-bar, vector pUbiGUSPlus, vector HSP, reporter vector pUbiGUSPlus, promoter ubiquitin, and coli DH1. A specific polymerase chain reaction (PCR) identification method for transgenic wheat B102-1-2 was established on the basis of designed primers according to flanking sequence. This established specific PCR strategy was validated by using transgenic wheat, transgenic corn, transgenic soybean, transgenic rice, and non-transgenic wheat. A specifically amplified target band was observed only in transgenic wheat B102-1-2. Therefore, this method is characterized by high specificity, high reproducibility, rapid identification, and excellent accuracy for the identification of transgenic wheat B102-1-2.

The construction of a synthetic Escherichia coli trp promoter and its use in the expression of a synthetic interferon gene.

PubMed Central

Windass, J D; Newton, C R; De Maeyer-Guignard, J; Moore, V E; Markham, A F; Edge, M D

1982-01-01

An 82 base pair DNA fragment has been synthesised which contains the E. coli trp promoter and operator sequences and also encodes the first Shine Dalgarno sequence of the trp operon. This DNA fragment is flanked by EcoRI and ClaI/TaqI cohesive ends and is thus easy to clone, transfer between vector systems and couple to genes to drive their expression. It has been cloned into plasmid pAT153, producing a convenient trp promoter vector. We have also joined the fragment to a synthetic IFN-alpha 1 gene, using synthetic oligonucleotides to generate a completely natural, highly efficient bacterial translation initiation signal on the promoter proximal side of the IFN gene. Plasmids carrying this construction enable E. coli cells to express IFN-alpha 1 almost constitutively and with significantly higher efficiency than from a lacUV5 promoter based system. Images PMID:6184675
Target Site Recognition by a Diversity-Generating Retroelement

PubMed Central

Guo, Huatao; Tse, Longping V.; Nieh, Angela W.; Czornyj, Elizabeth; Williams, Steven; Oukil, Sabrina; Liu, Vincent B.; Miller, Jeff F.

2011-01-01

Diversity-generating retroelements (DGRs) are in vivo sequence diversification machines that are widely distributed in bacterial, phage, and plasmid genomes. They function to introduce vast amounts of targeted diversity into protein-encoding DNA sequences via mutagenic homing. Adenine residues are converted to random nucleotides in a retrotransposition process from a donor template repeat (TR) to a recipient variable repeat (VR). Using the Bordetella bacteriophage BPP-1 element as a prototype, we have characterized requirements for DGR target site function. Although sequences upstream of VR are dispensable, a 24 bp sequence immediately downstream of VR, which contains short inverted repeats, is required for efficient retrohoming. The inverted repeats form a hairpin or cruciform structure and mutational analysis demonstrated that, while the structure of the stem is important, its sequence can vary. In contrast, the loop has a sequence-dependent function. Structure-specific nuclease digestion confirmed the existence of a DNA hairpin/cruciform, and marker coconversion assays demonstrated that it influences the efficiency, but not the site of cDNA integration. Comparisons with other phage DGRs suggested that similar structures are a conserved feature of target sequences. Using a kanamycin resistance determinant as a reporter, we found that transplantation of the IMH and hairpin/cruciform-forming region was sufficient to target the DGR diversification machinery to a heterologous gene. In addition to furthering our understanding of DGR retrohoming, our results suggest that DGRs may provide unique tools for directed protein evolution via in vivo DNA diversification. PMID:22194701
Gene discovery in Eimeria tenella by immunoscreening cDNA expression libraries of sporozoites and schizonts with chicken intestinal antibodies.

PubMed

Réfega, Susana; Girard-Misguich, Fabienne; Bourdieu, Christiane; Péry, Pierre; Labbé, Marie

2003-04-02

Specific antibodies were produced ex vivo from intestinal culture of Eimeria tenella infected chickens. The specificity of these intestinal antibodies was tested against different parasite stages. These antibodies were used to immunoscreen first generation schizont and sporozoite cDNA libraries permitting the identification of new E. tenella antigens. We obtained a total of 119 cDNA clones which were subjected to sequence analysis. The sequences coding for the proteins inducing local immune responses were compared with nucleotide or protein databases and with expressed sequence tags (ESTs) databases. We identified new Eimeria genes coding for heat shock proteins, a ribosomal protein, a pyruvate kinase and a pyridoxine kinase. Specific features of other sequences are discussed.
Molecular mechanisms influencing efficiency of RNA interference in insects.

PubMed

Cooper, Anastasia M W; Silver, Kristopher; Jianzhen, Zhang; Park, Yoonseong; Zhu, Kun Yan

2018-06-21

RNA interference (RNAi) is an endogenous, sequence-specific gene silencing mechanism elicited by small RNA molecules. RNAi is a powerful reverse genetic tool, and is currently being utilized for managing insects and viruses. Widespread implementation of RNAi-based pest management strategies is currently hindered by inefficient and highly variable results when different insect species, strains, developmental stages, tissues, and genes are targeted. Mechanistic studies have shown that double-stranded ribonucleases (dsRNases), endosomal entrapment, deficient function of the core machinery, and inadequate immune stimulation contribute to limited RNAi efficiency. However, a comprehensive understanding of the molecular mechanisms limiting RNAi efficiency remains elusive. The recent advances in dsRNA stability in physiological tissues, dsRNA internalization into cells, the composition and function of the core RNAi machinery, as well as small-interfering RNA/double-stranded RNA amplification and spreading mechanisms are reviewed to establish a global understanding of the obstacles impeding wider understanding of RNAi mechanisms in insects. This article is protected by copyright. All rights reserved. This article is protected by copyright. All rights reserved.
Genomics of gene banks: A case study in rice.

PubMed

McCouch, Susan R; McNally, Kenneth L; Wang, Wen; Sackville Hamilton, Ruaraidh

2012-02-01

Only a small fraction of the naturally occurring genetic diversity available in the world's germplasm repositories has been explored to date, but this is expected to change with the advent of affordable, high-throughput genotyping and sequencing technology. It is now possible to examine genome-wide patterns of natural variation and link sequence polymorphisms with downstream phenotypic consequences. In this paper, we discuss how dramatic changes in the cost and efficiency of sequencing and genotyping are revolutionizing the way gene bank scientists approach the responsibilities of their job. Sequencing technology provides a set of tools that can be used to enhance the quality, efficiency, and cost-effectiveness of gene bank operations, the depth of scientific knowledge of gene bank holdings, and the level of public interest in natural variation. As a result, gene banks have the chance to take on new life. Previously seen as "warehouses" where seeds were diligently maintained, but evolutionarily frozen in time, gene banks could transform into vibrant research centers that actively investigate the genetic potential of their holdings. In this paper, we will discuss how genotyping and sequencing can be integrated into the activities of a modern gene bank to revolutionize the way scientists document the genetic identity of their accessions; track seed lots, varieties, and alleles; identify duplicates; and rationalize active collections, and how the availability of genomics data are likely to motivate innovative collaborations with the larger research and breeding communities to engage in systematic and rigorous phenotyping and multilocation evaluation of the genetic resources in gene banks around the world. The objective is to understand and eventually predict how variation at the DNA level helps determine the phenotypic potential of an individual or population. Leadership and vision are needed to coordinate the characterization of collections and to integrate genotypic and phenotypic information in ways that will illuminate the value of these resources. Genotyping of collections represents a powerful starting point that will enable gene banks to become more effective as stewards of crop biodiversity.
Design and Validation of CRISPR/Cas9 Systems for Targeted Gene Modification in Induced Pluripotent Stem Cells.

PubMed

Lee, Ciaran M; Zhu, Haibao; Davis, Timothy H; Deshmukh, Harshahardhan; Bao, Gang

2017-01-01

The CRISPR/Cas9 system is a powerful tool for precision genome editing. The ability to accurately modify genomic DNA in situ with single nucleotide precision opens up new possibilities for not only basic research but also biotechnology applications and clinical translation. In this chapter, we outline the procedures for design, screening, and validation of CRISPR/Cas9 systems for targeted modification of coding sequences in the human genome and how to perform genome editing in induced pluripotent stem cells with high efficiency and specificity.
Pervasive sequence patents cover the entire human genome.

PubMed

Rosenfeld, Jeffrey A; Mason, Christopher E

2013-01-01

The scope and eligibility of patents for genetic sequences have been debated for decades, but a critical case regarding gene patents (Association of Molecular Pathologists v. Myriad Genetics) is now reaching the US Supreme Court. Recent court rulings have supported the assertion that such patents can provide intellectual property rights on sequences as small as 15 nucleotides (15mers), but an analysis of all current US patent claims and the human genome presented here shows that 15mer sequences from all human genes match at least one other gene. The average gene matches 364 other genes as 15mers; the breast-cancer-associated gene BRCA1 has 15mers matching at least 689 other genes. Longer sequences (1,000 bp) still showed extensive cross-gene matches. Furthermore, 15mer-length claims from bovine and other animal patents could also claim as much as 84% of the genes in the human genome. In addition, when we expanded our analysis to full-length patent claims on DNA from all US patents to date, we found that 41% of the genes in the human genome have been claimed. Thus, current patents for both short and long nucleotide sequences are extraordinarily non-specific and create an uncertain, problematic liability for genomic medicine, especially in regard to targeted re-sequencing and other sequence diagnostic assays.
Looking into flowering time in almond (Prunus dulcis (Mill) D. A. Webb): the candidate gene approach.

PubMed

Silva, C; Garcia-Mas, J; Sánchez, A M; Arús, P; Oliveira, M M

2005-03-01

Blooming time is one of the most important agronomic traits in almond. Biochemical and molecular events underlying flowering regulation must be understood before methods to stimulate late flowering can be developed. Attempts to elucidate the genetic control of this process have led to the identification of a major gene (Lb) and quantitative trait loci (QTLs) linked to observed phenotypic differences, but although this gene and these QTLs have been placed on the Prunus reference genetic map, their sequences and specific functions remain unknown. The aim of our investigation was to associate these loci with known genes using a candidate gene approach. Two almond cDNAs and eight Prunus expressed sequence tags were selected as candidate genes (CGs) since their sequences were highly identical to those of flowering regulatory genes characterized in other species. The CGs were amplified from both parental lines of the mapping population using specific primers. Sequence comparison revealed DNA polymorphisms between the parental lines, mainly of the single nucleotide type. Polymorphisms were used to develop co-dominant cleaved amplified polymorphic sequence markers or length polymorphisms based on insertion/deletion events for mapping the candidate genes on the Prunus reference map. Ten candidate genes were assigned to six linkage groups in the Prunus genome. The positions of two of these were compatible with the regions where two QTLs for blooming time were detected. One additional candidate was localized close to the position of the Evergrowing gene, which determines a non-deciduous behaviour in peach.
The genome sequence of taurine cattle: a window to ruminant biology and evolution.

PubMed

Elsik, Christine G; Tellam, Ross L; Worley, Kim C; Gibbs, Richard A; Muzny, Donna M; Weinstock, George M; Adelson, David L; Eichler, Evan E; Elnitski, Laura; Guigó, Roderic; Hamernik, Debora L; Kappes, Steve M; Lewin, Harris A; Lynn, David J; Nicholas, Frank W; Reymond, Alexandre; Rijnkels, Monique; Skow, Loren C; Zdobnov, Evgeny M; Schook, Lawrence; Womack, James; Alioto, Tyler; Antonarakis, Stylianos E; Astashyn, Alex; Chapple, Charles E; Chen, Hsiu-Chuan; Chrast, Jacqueline; Câmara, Francisco; Ermolaeva, Olga; Henrichsen, Charlotte N; Hlavina, Wratko; Kapustin, Yuri; Kiryutin, Boris; Kitts, Paul; Kokocinski, Felix; Landrum, Melissa; Maglott, Donna; Pruitt, Kim; Sapojnikov, Victor; Searle, Stephen M; Solovyev, Victor; Souvorov, Alexandre; Ucla, Catherine; Wyss, Carine; Anzola, Juan M; Gerlach, Daniel; Elhaik, Eran; Graur, Dan; Reese, Justin T; Edgar, Robert C; McEwan, John C; Payne, Gemma M; Raison, Joy M; Junier, Thomas; Kriventseva, Evgenia V; Eyras, Eduardo; Plass, Mireya; Donthu, Ravikiran; Larkin, Denis M; Reecy, James; Yang, Mary Q; Chen, Lin; Cheng, Ze; Chitko-McKown, Carol G; Liu, George E; Matukumalli, Lakshmi K; Song, Jiuzhou; Zhu, Bin; Bradley, Daniel G; Brinkman, Fiona S L; Lau, Lilian P L; Whiteside, Matthew D; Walker, Angela; Wheeler, Thomas T; Casey, Theresa; German, J Bruce; Lemay, Danielle G; Maqbool, Nauman J; Molenaar, Adrian J; Seo, Seongwon; Stothard, Paul; Baldwin, Cynthia L; Baxter, Rebecca; Brinkmeyer-Langford, Candice L; Brown, Wendy C; Childers, Christopher P; Connelley, Timothy; Ellis, Shirley A; Fritz, Krista; Glass, Elizabeth J; Herzig, Carolyn T A; Iivanainen, Antti; Lahmers, Kevin K; Bennett, Anna K; Dickens, C Michael; Gilbert, James G R; Hagen, Darren E; Salih, Hanni; Aerts, Jan; Caetano, Alexandre R; Dalrymple, Brian; Garcia, Jose Fernando; Gill, Clare A; Hiendleder, Stefan G; Memili, Erdogan; Spurlock, Diane; Williams, John L; Alexander, Lee; Brownstein, Michael J; Guan, Leluo; Holt, Robert A; Jones, Steven J M; Marra, Marco A; Moore, Richard; Moore, Stephen S; Roberts, Andy; Taniguchi, Masaaki; Waterman, Richard C; Chacko, Joseph; Chandrabose, Mimi M; Cree, Andy; Dao, Marvin Diep; Dinh, Huyen H; Gabisi, Ramatu Ayiesha; Hines, Sandra; Hume, Jennifer; Jhangiani, Shalini N; Joshi, Vandita; Kovar, Christie L; Lewis, Lora R; Liu, Yih-Shin; Lopez, John; Morgan, Margaret B; Nguyen, Ngoc Bich; Okwuonu, Geoffrey O; Ruiz, San Juana; Santibanez, Jireh; Wright, Rita A; Buhay, Christian; Ding, Yan; Dugan-Rocha, Shannon; Herdandez, Judith; Holder, Michael; Sabo, Aniko; Egan, Amy; Goodell, Jason; Wilczek-Boney, Katarzyna; Fowler, Gerald R; Hitchens, Matthew Edward; Lozado, Ryan J; Moen, Charles; Steffen, David; Warren, James T; Zhang, Jingkun; Chiu, Readman; Schein, Jacqueline E; Durbin, K James; Havlak, Paul; Jiang, Huaiyang; Liu, Yue; Qin, Xiang; Ren, Yanru; Shen, Yufeng; Song, Henry; Bell, Stephanie Nicole; Davis, Clay; Johnson, Angela Jolivet; Lee, Sandra; Nazareth, Lynne V; Patel, Bella Mayurkumar; Pu, Ling-Ling; Vattathil, Selina; Williams, Rex Lee; Curry, Stacey; Hamilton, Cerissa; Sodergren, Erica; Wheeler, David A; Barris, Wes; Bennett, Gary L; Eggen, André; Green, Ronnie D; Harhay, Gregory P; Hobbs, Matthew; Jann, Oliver; Keele, John W; Kent, Matthew P; Lien, Sigbjørn; McKay, Stephanie D; McWilliam, Sean; Ratnakumar, Abhirami; Schnabel, Robert D; Smith, Timothy; Snelling, Warren M; Sonstegard, Tad S; Stone, Roger T; Sugimoto, Yoshikazu; Takasuga, Akiko; Taylor, Jeremy F; Van Tassell, Curtis P; Macneil, Michael D; Abatepaulo, Antonio R R; Abbey, Colette A; Ahola, Virpi; Almeida, Iassudara G; Amadio, Ariel F; Anatriello, Elen; Bahadue, Suria M; Biase, Fernando H; Boldt, Clayton R; Carroll, Jeffery A; Carvalho, Wanessa A; Cervelatti, Eliane P; Chacko, Elsa; Chapin, Jennifer E; Cheng, Ye; Choi, Jungwoo; Colley, Adam J; de Campos, Tatiana A; De Donato, Marcos; Santos, Isabel K F de Miranda; de Oliveira, Carlo J F; Deobald, Heather; Devinoy, Eve; Donohue, Kaitlin E; Dovc, Peter; Eberlein, Annett; Fitzsimmons, Carolyn J; Franzin, Alessandra M; Garcia, Gustavo R; Genini, Sem; Gladney, Cody J; Grant, Jason R; Greaser, Marion L; Green, Jonathan A; Hadsell, Darryl L; Hakimov, Hatam A; Halgren, Rob; Harrow, Jennifer L; Hart, Elizabeth A; Hastings, Nicola; Hernandez, Marta; Hu, Zhi-Liang; Ingham, Aaron; Iso-Touru, Terhi; Jamis, Catherine; Jensen, Kirsty; Kapetis, Dimos; Kerr, Tovah; Khalil, Sari S; Khatib, Hasan; Kolbehdari, Davood; Kumar, Charu G; Kumar, Dinesh; Leach, Richard; Lee, Justin C-M; Li, Changxi; Logan, Krystin M; Malinverni, Roberto; Marques, Elisa; Martin, William F; Martins, Natalia F; Maruyama, Sandra R; Mazza, Raffaele; McLean, Kim L; Medrano, Juan F; Moreno, Barbara T; Moré, Daniela D; Muntean, Carl T; Nandakumar, Hari P; Nogueira, Marcelo F G; Olsaker, Ingrid; Pant, Sameer D; Panzitta, Francesca; Pastor, Rosemeire C P; Poli, Mario A; Poslusny, Nathan; Rachagani, Satyanarayana; Ranganathan, Shoba; Razpet, Andrej; Riggs, Penny K; Rincon, Gonzalo; Rodriguez-Osorio, Nelida; Rodriguez-Zas, Sandra L; Romero, Natasha E; Rosenwald, Anne; Sando, Lillian; Schmutz, Sheila M; Shen, Libing; Sherman, Laura; Southey, Bruce R; Lutzow, Ylva Strandberg; Sweedler, Jonathan V; Tammen, Imke; Telugu, Bhanu Prakash V L; Urbanski, Jennifer M; Utsunomiya, Yuri T; Verschoor, Chris P; Waardenberg, Ashley J; Wang, Zhiquan; Ward, Robert; Weikard, Rosemarie; Welsh, Thomas H; White, Stephen N; Wilming, Laurens G; Wunderlich, Kris R; Yang, Jianqi; Zhao, Feng-Qi

2009-04-24

To understand the biology and evolution of ruminants, the cattle genome was sequenced to about sevenfold coverage. The cattle genome contains a minimum of 22,000 genes, with a core set of 14,345 orthologs shared among seven mammalian species of which 1217 are absent or undetected in noneutherian (marsupial or monotreme) genomes. Cattle-specific evolutionary breakpoint regions in chromosomes have a higher density of segmental duplications, enrichment of repetitive elements, and species-specific variations in genes associated with lactation and immune responsiveness. Genes involved in metabolism are generally highly conserved, although five metabolic genes are deleted or extensively diverged from their human orthologs. The cattle genome sequence thus provides a resource for understanding mammalian evolution and accelerating livestock genetic improvement for milk and meat production.
Pressure-Mediated Oligonucleotide Transfection of Rat and Human Cardiovascular Tissues

NASA Astrophysics Data System (ADS)

Mann, Michael J.; Gibbons, Gary H.; Hutchinson, Howard; Poston, Robert S.; Hoyt, E. Grant; Robbins, Robert C.; Dzau, Victor J.

1999-05-01

The application of gene therapy to human disease is currently restricted by the relatively low efficiency and potential hazards of methods of oligonucleotide or gene delivery. Antisense or transcription factor decoy oligonucleotides have been shown to be effective at altering gene expression in cell culture expreriments, but their in vivo application is limited by the efficiency of cellular delivery, the intracellular stability of the compounds, and their duration of activity. We report herein the development of a highly efficient method for naked oligodeoxynucleotide (ODN) transfection into cardiovascular tissues by using controlled, nondistending pressure without the use of viral vectors, lipid formulations, or exposure to other adjunctive, potentially hazardous substances. In this study, we have documented the ability of ex vivo, pressure-mediated transfection to achieve nuclear localization of fluorescent (FITC)-labeled ODN in approximately 90% and 50% of cells in intact human saphenous vein and rat myocardium, respectively. We have further documented that pressure-mediated delivery of antisense ODN can functionally inhibited target gene expression in both of these tissues in a sequence-specific manner at the mRNA and protein levels. This oligonucleotide transfection system may represent a safe means of achieving the intraoperative genetic engineering of failure-resistant human bypass grafts and may provide an avenue for the genetic manipulation of cardiac allograft rejection, allograft vasculopathy, or other transplant diseases.
A novel site-specific recombination system derived from bacteriophage phiMR11.

PubMed

Rashel, Mohammad; Uchiyama, Jumpei; Ujihara, Takako; Takemura, Iyo; Hoshiba, Hiroshi; Matsuzaki, Shigenobu

2008-04-04

We report identification of a novel site-specific DNA recombination system that functions in both in vivo and in vitro, derived from lysogenic Staphylococcus aureus phage phiMR11. In silico analysis of the phiMR11 genome indicated orf1 as a putative integrase gene. Phage and bacterial attachment sites (attP and attB, respectively) and attachment junctions were determined and their nucleotide sequences decoded. Sequences of attP and attB were mostly different to each other except for a two bp common core that was the crossover point. We found several inverted repeats adjacent to the core sequence of attP as potential protein binding sites. The precise and efficient integration properties of phiMR11 integrase were shown on attP and attB in Escherichia coli and the minimum size of attP was found to be 34bp. In in vitro assays using crude or purified integrase, only buffer and substrate DNAs were required for the recombination reaction, indicating that other bacterially encoded factors are not essential for activity.
Mining biological databases for candidate disease genes

NASA Astrophysics Data System (ADS)

Braun, Terry A.; Scheetz, Todd; Webster, Gregg L.; Casavant, Thomas L.

2001-07-01

The publicly-funded effort to sequence the complete nucleotide sequence of the human genome, the Human Genome Project (HGP), has currently produced more than 93% of the 3 billion nucleotides of the human genome into a preliminary `draft' format. In addition, several valuable sources of information have been developed as direct and indirect results of the HGP. These include the sequencing of model organisms (rat, mouse, fly, and others), gene discovery projects (ESTs and full-length), and new technologies such as expression analysis and resources (micro-arrays or gene chips). These resources are invaluable for the researchers identifying the functional genes of the genome that transcribe and translate into the transcriptome and proteome, both of which potentially contain orders of magnitude more complexity than the genome itself. Preliminary analyses of this data identified approximately 30,000 - 40,000 human `genes.' However, the bulk of the effort still remains -- to identify the functional and structural elements contained within the transcriptome and proteome, and to associate function in the transcriptome and proteome to genes. A fortuitous consequence of the HGP is the existence of hundreds of databases containing biological information that may contain relevant data pertaining to the identification of disease-causing genes. The task of mining these databases for information on candidate genes is a commercial application of enormous potential. We are developing a system to acquire and mine data from specific databases to aid our efforts to identify disease genes. A high speed cluster of Linux of workstations is used to analyze sequence and perform distributed sequence alignments as part of our data mining and processing. This system has been used to mine GeneMap99 sequences within specific genomic intervals to identify potential candidate disease genes associated with Bardet-Biedle Syndrome (BBS).
Species-Specific Exon Loss in Human Transcriptomes

PubMed Central

Wang, Jinkai; Lu, Zhi-xiang; Tokheim, Collin J.; Miller, Sara E.; Xing, Yi

2015-01-01

Changes in exon–intron structures and splicing patterns represent an important mechanism for the evolution of gene functions and species-specific regulatory networks. Although exon creation is widespread during primate and human evolution and has been studied extensively, much less is known about the scope and potential impact of human-specific exon loss events. Historically, transcriptome data and exon annotations are significantly biased toward humans over nonhuman primates. This ascertainment bias makes it challenging to discover human-specific exon loss events. We carried out a transcriptome-wide search of human-specific exon loss events, by taking advantage of RNA sequencing (RNA-seq) as a powerful and unbiased tool for exon discovery and annotation. Using RNA-seq data of humans, chimpanzees, and other primates, we reconstructed and compared transcript structures across the primate phylogeny. We discovered 33 candidate human-specific exon loss events, among which six exons passed stringent experimental filters for the complete loss of splicing activities in diverse human tissues. These events may result from human-specific deletion of genomic DNA, or small-scale sequence changes that inactivated splicing signals. The impact of human-specific exon loss events is predominantly regulatory. Three of the six events occurred in the 5′ untranslated region (5′-UTR) and affected cis-regulatory elements of mRNA translation. In SLC7A6, a gene encoding an amino acid transporter, luciferase reporter assays suggested that both a human-specific exon loss event and an independent human-specific single nucleotide substitution in the 5′-UTR increased mRNA translational efficiency. Our study provides novel insights into the molecular mechanisms and evolutionary consequences of exon loss during human evolution. PMID:25398629
Applications of Gene Editing Technologies to Cellular Therapies.

PubMed

Rein, Lindsay A M; Yang, Haeyoon; Chao, Nelson J

2018-03-27

Hematologic malignancies are characterized by genetic heterogeneity, making classic gene therapy with a goal of correcting 1 genetic defect ineffective in many of these diseases. Despite initial tribulations, gene therapy, as a field, has grown by leaps and bounds with the recent development of gene editing techniques including zinc finger nucleases, transcription activator-like effector nucleases, and clustered regularly interspaced short palindromic repeat (CRISPR) sequences and CRISPR-associated protein-9 (Cas9) nuclease or CRISPR/Cas9. These novel technologies have been applied to efficiently and specifically modify genetic information in target and effector cells. In particular, CRISPR/Cas9 technology has been applied to various hematologic malignancies and has also been used to modify and improve chimeric antigen receptor-modified T cells for the purpose of providing effective cellular therapies. Although gene editing is in its infancy in malignant hematologic diseases, there is much room for growth and application in the future. Copyright © 2018 The American Society for Blood and Marrow Transplantation. Published by Elsevier Inc. All rights reserved.
Development of a Targeted anti-HER2 scFv Chimeric Peptide for Gene Delivery into HER2-Positive Breast Cancer Cells.

PubMed

Cheraghi, Roya; Nazari, Mahboobeh; Alipour, Mohsen; Majidi, Asia; Hosseinkhani, Saman

2016-12-30

Chimeric polymers are known as suitable carriers for gene delivery. Certain properties are critical for a polymer to be used as a gene delivery vector. A new polymer was designed for the targeted delivery of genes into breast cancer cell lines, based on MPG peptide. It is composed of different functional domains, including HIV gp41, nuclear localization sequence of SV40 T-antigen, two C-terminus repeats of histone H1, and the scFv of anti-HER2 antibody. The results demonstrated that the vector can effectively condense plasmid DNA into nanoparticles with an average size of 250nm. Moreover, fusion of the scFv portion to the carrier brought about the specific recognition of HER2. Overall, the transfection efficiency of the vector demonstrated that it could deliver the desired gene into BT-474 HER2-positive breast cancer cells. Copyright Â© 2016 Elsevier B.V. All rights reserved.
Novel Mutations in pncA Gene of Pyrazinamide Resistant Clinical Isolates of Mycobacterium tuberculosis.

PubMed

Kahbazi, Manijeh; Sarmadian, Hossein; Ahmadi, Azam; Didgar, Farshideh; Sadrnia, Maryam; Poolad, Toktam; Arjomandzadegan, Mohammad

2018-04-16

In clinical isolates of Mycobacterium tuberculosis (MTB), resistance to pyrazinamide occurs by mutations in any positions of the pncA gene (NC_000962.3) especially in nucleotides 359 and 374. In this study we examined the pncA gene sequence in clinical isolates of MTB. Genomic DNA of 33 clinical isolates of MTB was extracted by the Chelex100 method. The polymerase chain reactions (PCR) were performed using specific primers for amplification of 744 bp amplicon comprising the coding sequences (CDS) of the pncA gene. PCR products were sequenced by an automated sequencing Bioscience system. Additionally, semi Nested-allele specific (sNASP) and polymerase chain reaction-restriction fragment length polymorphism (PCR-RFLP) methods were carried out for verification of probable mutations in nucleotides 359 and 374. Sequencing results showed that from 33 MTB clinical isolates, nine pyrazinamide-resistant isolates have mutations. Furthermore, no mutation was detected in 24 susceptible strains in the entire 561 bp of the pncA gene. Moreover, new mutations of G→A at position 3 of the pncA gene were identified in some of the resistant isolates. Results showed that the sNASP method could detect mutations in nucleotide 359 and 374 of the pncA gene, but the PCR-RFLP method by the SacII enzyme could not detect these mutations. In conclusion, the identification of new mutations in the pncA gene confirmed the probable occurrence of mutations in any nucleotides of the pncA gene sequence in resistant isolates of MTB.
Genome editing in sea urchin embryos by using a CRISPR/Cas9 system.

PubMed

Lin, Che-Yi; Su, Yi-Hsien

2016-01-15

Sea urchin embryos are a useful model system for investigating early developmental processes and the underlying gene regulatory networks. Most functional studies using sea urchin embryos rely on antisense morpholino oligonucleotides to knockdown gene functions. However, major concerns related to this technique include off-target effects, variations in morpholino efficiency, and potential morpholino toxicity; furthermore, such problems are difficult to discern. Recent advances in genome editing technologies have introduced the prospect of not only generating sequence-specific knockouts, but also providing genome-engineering applications. Two genome editing tools, zinc-finger nuclease (ZFN) and transcription activator-like effector nucleases (TALENs), have been utilized in sea urchin embryos, but the resulting efficiencies are far from satisfactory. The CRISPR (clustered regularly interspaced short palindromic repeat)-Cas9 (CRISPR-associated nuclease 9) system serves as an easy and efficient method with which to edit the genomes of several established and emerging model organisms in the field of developmental biology. Here, we apply the CRISPR/Cas9 system to the sea urchin embryo. We designed six guide RNAs (gRNAs) against the well-studied nodal gene and discovered that five of the gRNAs induced the expected phenotype in 60-80% of the injected embryos. In addition, we developed a simple method for isolating genomic DNA from individual embryos, enabling phenotype to be precisely linked to genotype, and revealed that the mutation rates were 67-100% among the sequenced clones. Of the two potential off-target sites we examined, no off-target effects were observed. The detailed procedures described herein promise to accelerate the usage of CRISPR/Cas9 system for genome editing in sea urchin embryos. Copyright © 2015 Elsevier Inc. All rights reserved.
DOE Office of Scientific and Technical Information (OSTI.GOV)

Klesmith, Justin R.; Bacik, John -Paul; Michalczyk, Ryszard

Synthetic metabolic pathways often suffer from low specific productivity, and new methods that quickly assess pathway functionality for many thousands of variants are urgently needed. Here we present an approach that enables the rapid and parallel determination of sequence effects on flux for complete gene-encoding sequences. We show that this method can be used to determine the effects of over 8000 single point mutants of a pyrolysis oil catabolic pathway implanted in Escherichia coli. Experimental sequence-function data sets predicted whether fitness-enhancing mutations to the enzyme levoglucosan kinase resulted from enhanced catalytic efficiency or enzyme stability. A structure of one designmore » incorporating 38 mutations elucidated the structural basis of high fitness mutations. One design incorporating 15 beneficial mutations supported a 15-fold improvement in growth rate and greater than 24-fold improvement in enzyme activity relative to the starting pathway. Lastly, this technique can be extended to improve a wide variety of designed pathways.« less
Advances in the application of genetic manipulation methods to apicomplexan parasites.

PubMed

Suarez, C E; Bishop, R P; Alzan, H F; Poole, W A; Cooke, B M

2017-10-01

Apicomplexan parasites such as Babesia, Theileria, Eimeria, Cryptosporidium and Toxoplasma greatly impact animal health globally, and improved, cost-effective measures to control them are urgently required. These parasites have complex multi-stage life cycles including obligate intracellular stages. Major gaps in our understanding of the biology of these relatively poorly characterised parasites and the diseases they cause severely limit options for designing novel control methods. Here we review potentially important shared aspects of the biology of these parasites, such as cell invasion, host cell modification, and asexual and sexual reproduction, and explore the potential of the application of relatively well-established or newly emerging genetic manipulation methods, such as classical transfection or gene editing, respectively, for closing important gaps in our knowledge of the function of specific genes and proteins, and the biology of these parasites. In addition, genetic manipulation methods impact the development of novel methods of control of the diseases caused by these economically important parasites. Transient and stable transfection methods, in conjunction with whole and deep genome sequencing, were initially instrumental in improving our understanding of the molecular biology of apicomplexan parasites and paved the way for the application of the more recently developed gene editing methods. The increasingly efficient and more recently developed gene editing methods, in particular those based on the CRISPR/Cas9 system and previous conceptually similar techniques, are already contributing to additional gene function discovery using reverse genetics and related approaches. However, gene editing methods are only possible due to the increasing availability of in vitro culture, transfection, and genome sequencing and analysis techniques. We envisage that rapid progress in the development of novel gene editing techniques applied to apicomplexan parasites of veterinary interest will ultimately lead to the development of novel and more efficient methods for disease control. Published by Elsevier Ltd.
Mob/oriT, a mobilizable site-specific recombination system for unmarked genetic manipulation in Bacillus thuringiensis and Bacillus cereus.

PubMed

Wang, Pengxia; Zhu, Yiguang; Zhang, Yuyang; Zhang, Chunyi; Xu, Jianyi; Deng, Yun; Peng, Donghai; Ruan, Lifang; Sun, Ming

2016-06-10

Bacillus thuringiensis and Bacillus cereus are two important species in B. cereus group. The intensive study of these strains at the molecular level and construction of genetically modified bacteria requires the development of efficient genetic tools. To insert genes into or delete genes from bacterial chromosomes, marker-less manipulation methods were employed. We present a novel genetic manipulation method for B. thuringiensis and B. cereus strains that does not leave selection markers. Our approach takes advantage of the relaxase Mob02281 encoded by plasmid pBMB0228 from Bacillus thuringiensis. In addition to its mobilization function, this Mob protein can mediate recombination between oriT sites. The Mob02281 mobilization module was associated with a spectinomycin-resistance gene to form a Mob-Spc cassette, which was flanked by the core 24-bp oriT sequences from pBMB0228. A strain in which the wild-type chromosome was replaced with the modified copy containing the Mob-Spc cassette at the target locus was obtained via homologous recombination. Thus, the spectinomycin-resistance gene can be used to screen for Mob-Spc cassette integration mutants. Recombination between the two oriT sequences mediated by Mob02281, encoded by the Mob-Spc cassette, resulted in the excision of the Mob-Spc cassette, producing the desired chromosomal alteration without introducing unwanted selection markers. We used this system to generate an in-frame deletion of a target gene in B. thuringiensis as well as a gene located in an operon of B. cereus. Moreover, we demonstrated that this system can be used to introduce a single gene or an expression cassette of interest in B. thuringiensis. The Mob/oriT recombination system provides an efficient method for unmarked genetic manipulation and for constructing genetically modified bacteria of B. thuringiensis and B. cereus. Our method extends the available genetic tools for B. thuringiensis and B. cereus strains.

Some links on this page may take you to non-federal websites. Their policies may differ from this site.